spaCy/spacy/gold
Matthew Honnibal cc477be952
Improve gold-standard alignment (#5711)
* Remove previous alignment

* Implement better alignment, using ragged data structure

* Use pytokenizations for alignment

* Fixes

* Fixes

* Fix overlapping entities in alignment

* Fix align split_sents

* Update test

* Commit align.py

* Try to appease setuptools

* Fix flake8

* use realistic entities for testing

* Update tests for better alignment

* Improve alignment heuristic

Co-authored-by: svlandeg <sofie.vanlandeghem@gmail.com>
2020-07-06 17:39:31 +02:00
..
converters Auto-format 2020-07-04 16:25:34 +02:00
__init__.pxd Improve spacy.gold (no GoldParse, no json format!) (#5555) 2020-06-26 19:34:12 +02:00
__init__.py Improve gold-standard alignment (#5711) 2020-07-06 17:39:31 +02:00
align.py Improve gold-standard alignment (#5711) 2020-07-06 17:39:31 +02:00
augment.py Improve spacy.gold (no GoldParse, no json format!) (#5555) 2020-06-26 19:34:12 +02:00
corpus.py Auto-format and update URL 2020-07-04 14:23:44 +02:00
example.pxd Improve gold-standard alignment (#5711) 2020-07-06 17:39:31 +02:00
example.pyx Improve gold-standard alignment (#5711) 2020-07-06 17:39:31 +02:00
gold_io.pyx Make docs_to_json backwards-compatible with v2 (#5714) 2020-07-06 14:15:00 +02:00
iob_utils.py Auto-format 2020-07-04 16:25:34 +02:00