spaCy/spacy/cli
adrianeboyd b71a11ff6d
Update morphologizer (#5108)
* Add pos and morph scoring to Scorer

Add pos, morph, and morph_per_type to `Scorer`. Report pos and morph
accuracy in `spacy evaluate`.

* Update morphologizer for v3

  * switch to tagger-based morphologizer
  * use `spacy.HashCharEmbedCNN` for morphologizer defaults
  * add `Doc.is_morphed` flag

* Add morphologizer to train CLI

* Add basic morphologizer pipeline tests

* Add simple morphologizer training example

* Remove subword_features from CharEmbed models

Remove `subword_features` argument from `spacy.HashCharEmbedCNN.v1` and
`spacy.HashCharEmbedBiLSTM.v1` since in these cases `subword_features`
is always `False`.

* Rename setting in morphologizer example

Use `with_pos_tags` instead of `without_pos_tags`.

* Fix kwargs for spacy.HashCharEmbedBiLSTM.v1

* Remove defaults for spacy.HashCharEmbedBiLSTM.v1

Remove default `nM/nC` for `spacy.HashCharEmbedBiLSTM.v1`.

* Set random seed for textcat overfitting test
2020-04-02 14:46:32 +02:00
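
The commit message above mentions three new scores: pos, morph, and morph_per_type. As a rough illustration of what such scores could measure, here is a minimal, self-contained sketch. It is not spaCy's `Scorer` implementation. The helper names `parse_feats` and `score_pos_and_morph`, the input format (aligned lists of `(pos, morph)` pairs with UD-style `Feature=Value|...` strings), and the choice to return fractions are all assumptions made for this example.

```python
from collections import defaultdict


def parse_feats(morph):
    """Split a UD-style string such as 'Case=Nom|Number=Sing' into a dict."""
    if not morph or morph == "_":
        return {}
    return dict(item.split("=", 1) for item in morph.split("|"))


def prf(tp, fp, fn):
    """Precision, recall and F-score from raw counts (0.0 when undefined)."""
    p = tp / (tp + fp) if tp + fp else 0.0
    r = tp / (tp + fn) if tp + fn else 0.0
    f = 2 * p * r / (p + r) if p + r else 0.0
    return {"p": p, "r": r, "f": f}


def score_pos_and_morph(gold, pred):
    """Hypothetical helper: gold and pred are aligned lists of (pos, morph) pairs."""
    if len(gold) != len(pred):
        raise ValueError("gold and pred must be aligned token by token")
    n = len(gold)
    pos_correct = 0
    morph_correct = 0
    counts = defaultdict(lambda: {"tp": 0, "fp": 0, "fn": 0})
    for (g_pos, g_morph), (p_pos, p_morph) in zip(gold, pred):
        g_feats, p_feats = parse_feats(g_morph), parse_feats(p_morph)
        pos_correct += g_pos == p_pos
        # A token's morphology counts as correct only if every feature matches.
        morph_correct += g_feats == p_feats
        for field in set(g_feats) | set(p_feats):
            if field in g_feats and g_feats[field] == p_feats.get(field):
                counts[field]["tp"] += 1
            else:
                # A wrong value is both a spurious prediction and a missed gold value.
                if field in p_feats:
                    counts[field]["fp"] += 1
                if field in g_feats:
                    counts[field]["fn"] += 1
    return {
        "pos_acc": pos_correct / n if n else 0.0,
        "morph_acc": morph_correct / n if n else 0.0,
        "morph_per_type": {field: prf(**c) for field, c in counts.items()},
    }


# Toy data invented for this example.
gold = [("NOUN", "Number=Sing"), ("VERB", "Tense=Past|VerbForm=Fin"), ("PUNCT", "_")]
pred = [("NOUN", "Number=Plur"), ("VERB", "Tense=Past|VerbForm=Fin"), ("PUNCT", "_")]
scores = score_pos_and_morph(gold, pred)
print(scores)
```

Scoring each feature separately makes it visible which morphological categories a model gets wrong, something a single whole-token accuracy figure hides.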
| Name | Last commit | Date |
| --- | --- | --- |
| `converters` | Tidy up and fix issues | 2020-02-18 15:17:03 +01:00 |
| `__init__.py` | Remove symlinks, data dir and related stuff | 2020-02-18 17:20:17 +01:00 |
| `convert.py` | Add convert CLI option to merge CoNLL-U subtokens (#4722) | 2020-01-29 17:44:25 +01:00 |
| `debug_data.py` | Fix formatting and update docs for v2.2.4 | 2020-03-09 11:17:20 +01:00 |
| `download.py` | Remove symlinks, data dir and related stuff | 2020-02-18 17:20:17 +01:00 |
| `evaluate.py` | Update morphologizer (#5108) | 2020-04-02 14:46:32 +02:00 |
| `info.py` | Remove symlinks, data dir and related stuff | 2020-02-18 17:20:17 +01:00 |
| `init_model.py` | Simplify warnings | 2020-02-28 12:20:23 +01:00 |
| `package.py` | Modernize plac commands for Python 3 (#4836) | 2020-01-01 13:15:46 +01:00 |
| `pretrain.py` | Tidy up and auto-format | 2020-02-28 11:57:41 +01:00 |
| `profile.py` | Update spaCy for thinc 8.0.0 (#4920) | 2020-01-29 17:06:46 +01:00 |
| `train_from_config.py` | Tidy up and auto-format | 2020-02-28 11:57:41 +01:00 |
| `train.py` | Update morphologizer (#5108) | 2020-04-02 14:46:32 +02:00 |
| `validate.py` | Remove symlinks, data dir and related stuff | 2020-02-18 17:20:17 +01:00 |