spaCy/spacy/cli
Matthew Honnibal a902b5f217
Record whether Doc objects are built from known spacing (#5697)
* Tell convert CLI to store user data for Doc

* Remove assert

* Add has_unknwon_spaces flag on Doc

* Do not tokenize docs with unknown spaces in Corpus

* Handle conversion of unknown spaces in Example

* Fixes

* Fixes

* Draft has_known_spaces support in DocBin

* Add test for serialize has_unknown_spaces

* Fix DocBin serialization when has_unknown_spaces

* Use serialization in test
2020-07-03 12:58:16 +02:00
..
__init__.py Import project_run_all function 2020-06-29 16:54:19 +02:00
_app.py Update with DVC WIP 2020-06-27 13:02:10 +02:00
convert.py Record whether Doc objects are built from known spacing (#5697) 2020-07-03 12:58:16 +02:00
debug_data.py refactor fixes (#5664) 2020-06-29 14:33:00 +02:00
download.py Start updating website for v3 [ci skip] 2020-07-01 21:26:39 +02:00
evaluate.py Output more stats in evaluate 2020-06-28 15:34:28 +02:00
info.py Tidy up info 2020-06-22 01:17:11 +02:00
init_model.py bugfixing prune_vectors and vectors_loc 2020-07-01 21:00:47 +02:00
package.py Fix package command and add version option 2020-06-27 20:36:08 +02:00
pretrain.py fix to pretrain script (#5699) 2020-07-02 21:48:01 +02:00
profile.py Remove ml_datasets from install dependencies 2020-06-22 12:14:51 +02:00
project.py Merge pull request #5681 from svlandeg/bugfix/exec-cwd 2020-07-01 14:13:19 +02:00
train.py Fix max_steps 2020-07-01 18:08:14 +02:00
validate.py Refactor CLI 2020-06-21 21:35:01 +02:00