mirror of
https://github.com/explosion/spaCy.git
synced 2024-11-11 04:08:09 +03:00
a902b5f217
* Tell convert CLI to store user data for Doc * Remove assert * Add has_unknwon_spaces flag on Doc * Do not tokenize docs with unknown spaces in Corpus * Handle conversion of unknown spaces in Example * Fixes * Fixes * Draft has_known_spaces support in DocBin * Add test for serialize has_unknown_spaces * Fix DocBin serialization when has_unknown_spaces * Use serialization in test |
||
---|---|---|
.. | ||
__init__.py | ||
test_serialize_config.py | ||
test_serialize_doc.py | ||
test_serialize_extension_attrs.py | ||
test_serialize_kb.py | ||
test_serialize_language.py | ||
test_serialize_pipeline.py | ||
test_serialize_tokenizer.py | ||
test_serialize_vocab_strings.py |