spaCy/spacy/cli
adrianeboyd a58cb023d7 WIP: Extending debug-data (#4114)
* Extending debug-data with dependency checks, etc.

* Modify debug-data to load with GoldCorpus to iterate over .json/.jsonl
files within directories

* Add GoldCorpus iterator train_docs_without_preprocessing to load
original train docs without shuffling and projectivizing

* Report number of misaligned tokens

* Add more dependency checks and messages

* Update spacy/cli/debug_data.py

Co-Authored-By: Ines Montani <ines@ines.io>

* Fixed conflict

* Move counts to _compile_gold()

* Move all dependency nonproj/sent/head/cycle counting to
_compile_gold()

* Unclobber previous merges

* Update variable names

* Update more variable names, fix misspelling

* Don't clobber loading error messages

* Only warn about misaligned tokens if present
2019-08-16 10:52:46 +02:00
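
The commit message above mentions counting non-projective trees, heads and cycles. As a rough illustration only, the sketch below shows one way such checks could be written over a plain list of head indices. It is not the code in debug_data.py or _compile_gold(). The function names and the convention that a root token is its own head are assumptions made for this example, and crossing arcs are used as a simplified stand-in for non-projectivity.

```python
# Hypothetical helpers, not spaCy's implementation.
# Assumed convention: heads[i] is the index of token i's head,
# and a root token points at itself (heads[i] == i).

def count_roots(heads):
    """Count tokens that are their own head."""
    return sum(1 for i, h in enumerate(heads) if h == i)


def has_cycle(heads):
    """Return True if following head links from any token revisits a token
    before reaching a root."""
    for start in range(len(heads)):
        seen = set()
        node = start
        while heads[node] != node:
            if node in seen:
                return True
            seen.add(node)
            node = heads[node]
    return False


def count_crossing_arc_pairs(heads):
    """Count pairs of arcs whose spans partially overlap (cross)."""
    # Represent each non-root arc by its (left, right) token span.
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads) if h != d]
    crossing = 0
    for i, (a1, b1) in enumerate(arcs):
        for a2, b2 in arcs[i + 1:]:
            if a1 < a2 < b1 < b2 or a2 < a1 < b2 < b1:
                crossing += 1
    return crossing
```

For example, under these assumptions heads = [2, 3, 2, 2] has a single root and one crossing pair of arcs, while heads = [1, 0, 2] contains a cycle between the first two tokens.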
converters     Replace cytoolz.partition_all with util.minibatch  2019-05-11 21:12:09 +02:00
__init__.py    Move UD scripts to bin  2019-03-20 01:19:34 +01:00
_schemas.py    Store JSON schemas in Python and tidy up (#3235)  2019-02-07 19:44:31 +11:00
convert.py     Change default output format from jsonl to json for cli convert (#3583) (closes #3523)  2019-04-12 11:31:23 +02:00
debug_data.py  WIP: Extending debug-data (#4114)  2019-08-16 10:52:46 +02:00
download.py    Require downloaded model in pkg_resources (#4090)  2019-08-07 13:18:11 +02:00
evaluate.py    Make flag shortcut consistent and document  2019-04-22 14:23:44 +02:00
info.py        Small CLI improvements (#3030)  2018-12-08 11:49:43 +01:00
init_model.py  Fix init_model if there's no vocab (closes #4048) (#4049)  2019-08-01 17:26:09 +02:00
link.py        Small CLI improvements (#3030)  2018-12-08 11:49:43 +01:00
package.py     Also support "requirements" in model.json  2019-07-27 13:34:57 +02:00
pretrain.py    Merge branch 'master' into feature/nel-wiki  2019-07-09 21:57:47 +02:00
profile.py     Fix cytoolz import cytoolz  2018-12-06 16:04:12 +01:00
train.py       Call rmtree and copytree with strings (closes #3713)  2019-05-11 15:48:35 +02:00
validate.py    Strip out .dev versions in spacy validate [ci skip]  2019-03-17 12:16:53 +01:00