spaCy/spacy
Matthew Honnibal a5ebfb20f5 Serialize all attrs by default
Move converters under spacy.gold

Move things around

Fix naming

Fix name

Update converter to produce DocBin

Update converters

Make spacy convert output docbin

Fix import

Fix docbin

Fix import

Update converter

Remove jsonl converter

Add json2docs converter
2020-06-22 00:46:08 +02:00
..
cli Serialize all attrs by default 2020-06-22 00:46:08 +02:00
displacy unicode -> str consistency [ci skip] 2020-05-24 18:51:10 +02:00
gold Serialize all attrs by default 2020-06-22 00:46:08 +02:00
lang Merge branch 'develop' into master-tmp 2020-05-21 18:39:06 +02:00
matcher Merge branch 'develop' into master-tmp 2020-06-03 14:36:59 +02:00
ml train is from-config by default (#5575) 2020-06-12 02:02:07 +02:00
pipeline fix renames and simple_ner labels 2020-06-18 15:33:28 +02:00
syntax clean up 2020-06-22 00:46:08 +02:00
tests fix test checking for variants 2020-06-22 00:46:08 +02:00
tokens Serialize all attrs by default 2020-06-22 00:46:08 +02:00
__init__.pxd * Seems to be working after refactor. Need to wire up more POS tag features, and wire up save/load of POS tags. 2014-10-24 02:23:42 +11:00
__init__.py Simplify warnings 2020-04-28 13:37:37 +02:00
__main__.py add discard_oversize parameter, move optimizer to training subsection 2020-06-03 10:04:16 +02:00
about.py Set version to v3.0.0.dev9 2020-05-21 20:47:52 +02:00
attrs.pxd Merge branch 'develop' into master-tmp 2020-05-21 18:39:06 +02:00
attrs.pyx Merge branch 'develop' into master-tmp 2020-05-21 18:39:06 +02:00
compat.py Merge branch 'develop' into refactor/remove-symlinks 2020-02-18 17:22:20 +01:00
errors.py throw informative error when running the components with the wrong type of objects 2020-06-18 10:36:05 +02:00
glossary.py unicode -> str consistency 2020-05-24 17:20:58 +02:00
kb.pxd Tidy up compiler flags and imports (#5071) 2020-03-02 11:48:10 +01:00
kb.pyx unicode -> str consistency 2020-05-24 17:20:58 +02:00
language.py clean up 2020-06-22 00:46:08 +02:00
lemmatizer.py Merge branch 'develop' into master-tmp 2020-06-03 14:36:59 +02:00
lexeme.pxd Merge branch 'develop' into master-tmp 2020-05-21 18:39:06 +02:00
lexeme.pyx Merge branch 'develop' into master-tmp 2020-06-03 14:36:59 +02:00
lookups.py Merge branch 'develop' into master-tmp 2020-06-03 14:36:59 +02:00
morphology.pxd Tidy up compiler flags and imports (#5071) 2020-03-02 11:48:10 +01:00
morphology.pyx Fix typo 2020-06-03 14:42:39 +02:00
parts_of_speech.pxd Add support for Universal Dependencies v2.0 2017-03-03 13:17:34 +01:00
parts_of_speech.pyx Drop Python 2.7 and 3.5 (#4828) 2019-12-22 01:53:56 +01:00
pipe_analysis.py unicode -> str consistency 2020-05-24 17:20:58 +02:00
schemas.py Fix test and schemas 2020-05-21 19:01:02 +02:00
scorer.py fixing language and scoring tests 2020-06-15 15:02:05 +02:00
strings.pxd Tidy up compiler flags and imports (#5071) 2020-03-02 11:48:10 +01:00
strings.pyx unicode -> str consistency [ci skip] 2020-05-24 18:51:10 +02:00
structs.pxd Merge branch 'develop' into master-tmp 2020-05-21 18:39:06 +02:00
symbols.pxd Merge branch 'develop' into master-tmp 2020-05-21 18:39:06 +02:00
symbols.pyx Merge branch 'develop' into master-tmp 2020-05-21 18:39:06 +02:00
tokenizer.pxd Tidy up compiler flags and imports (#5071) 2020-03-02 11:48:10 +01:00
tokenizer.pyx various fixes in scripts - needs to be further tested 2020-06-17 12:05:58 +02:00
typedefs.pxd Update spaCy for thinc 8.0.0 (#4920) 2020-01-29 17:06:46 +01:00
typedefs.pyx Tidy up rest 2017-10-27 21:07:59 +02:00
util.py minibatch utiltiy can deal with strings, docs or examples 2020-06-16 21:35:55 +02:00
vectors.pyx Merge branch 'develop' into master-tmp 2020-06-03 14:36:59 +02:00
vocab.pxd Merge branch 'develop' into master-tmp 2020-05-21 18:39:06 +02:00
vocab.pyx unicode -> str consistency [ci skip] 2020-05-24 18:51:10 +02:00