spaCy/spacy/tests/serialize
Matthew Honnibal a902b5f217
Record whether Doc objects are built from known spacing (#5697)
* Tell convert CLI to store user data for Doc

* Remove assert

* Add has_unknwon_spaces flag on Doc

* Do not tokenize docs with unknown spaces in Corpus

* Handle conversion of unknown spaces in Example

* Fixes

* Fixes

* Draft has_known_spaces support in DocBin

* Add test for serialize has_unknown_spaces

* Fix DocBin serialization when has_unknown_spaces

* Use serialization in test
2020-07-03 12:58:16 +02:00
..
__init__.py Revert #4334 2019-09-29 17:32:12 +02:00
test_serialize_config.py Remove F841 [ci skip] 2020-06-20 14:47:17 +02:00
test_serialize_doc.py Record whether Doc objects are built from known spacing (#5697) 2020-07-03 12:58:16 +02:00
test_serialize_extension_attrs.py Merge branch 'master' into develop 2020-02-18 14:47:23 +01:00
test_serialize_kb.py Drop Python 2.7 and 3.5 (#4828) 2019-12-22 01:53:56 +01:00
test_serialize_language.py Default settings to configurations (#4995) 2020-02-27 18:42:27 +01:00
test_serialize_pipeline.py Tidy up and auto-format 2020-06-20 14:15:04 +02:00
test_serialize_tokenizer.py Merge branch 'master' into tmp/sync 2020-03-26 13:38:14 +01:00
test_serialize_vocab_strings.py Tidy up and auto-format 2020-06-20 14:15:04 +02:00