spaCy/spacy/gold/converters
Matthew Honnibal a902b5f217
Record whether Doc objects are built from known spacing (#5697)
* Tell convert CLI to store user data for Doc

* Remove assert

* Add has_unknwon_spaces flag on Doc

* Do not tokenize docs with unknown spaces in Corpus

* Handle conversion of unknown spaces in Example

* Fixes

* Fixes

* Draft has_known_spaces support in DocBin

* Add test for serialize has_unknown_spaces

* Fix DocBin serialization when has_unknown_spaces

* Use serialization in test
2020-07-03 12:58:16 +02:00
..
__init__.py Add conllu2docs converter (#5704) 2020-07-03 12:54:32 +02:00
conll_ner2docs.py Improve spacy.gold (no GoldParse, no json format!) (#5555) 2020-06-26 19:34:12 +02:00
conllu2docs.py Add conllu2docs converter (#5704) 2020-07-03 12:54:32 +02:00
iob2docs.py Improve spacy.gold (no GoldParse, no json format!) (#5555) 2020-06-26 19:34:12 +02:00
json2docs.py Record whether Doc objects are built from known spacing (#5697) 2020-07-03 12:58:16 +02:00