spaCy/spacy
2015-11-18 18:09:46 +01:00
data add data dir 2015-11-18 11:48:55 +01:00
de
en change index server 2015-11-18 18:09:46 +01:00
fi
it
munge
serialize * Rename Doc.data to Doc.c 2015-11-04 00:17:13 +11:00
syntax * Adjust NER rules, so that U entries in gazetteer don't become B moves to the model 2015-11-12 04:48:23 +11:00
tests * Add test for matcher end-point problem 2015-11-12 05:00:40 +11:00
tokens * Add .rank property to Token and Lexeme, for frequency rank 2015-11-08 16:18:25 +01:00
__init__.pxd
__init__.py
_nn.py
_nn.pyx
_theano.pxd
_theano.pyx
attrs.pxd
attrs.pyx
cfile.pxd
cfile.pyx
gold.pxd
gold.pyx
language.py * Refactor away from the _ml module, to use thinc 4.0. Still some work needs to be done, e.g. to add __reduce__ to the models, more testing, etc. 2015-11-07 03:24:30 +11:00
lemmatizer.py Fixing encoding issue #4 2015-10-21 20:45:56 +03:00
lexeme.pxd
lexeme.pyx * Add .rank property to Token and Lexeme, for frequency rank 2015-11-08 16:18:25 +01:00
matcher.pyx * Rename Doc.data to Doc.c 2015-11-04 00:15:14 +11:00
morphology.pxd
morphology.pyx * Fix capitalization in lemmatizer 2015-11-06 05:44:35 +11:00
multi_words.py
orth.pxd
orth.pyx
parts_of_speech.pxd
parts_of_speech.pyx
scorer.py
senses.pxd
senses.pyx
strings.pxd * Use unicode in StringStore.intern, instead of unreliably casting to bytes. 2015-11-05 11:32:19 +00:00
strings.pyx * Refactor away from the _ml module, to use thinc 4.0. Still some work needs to be done, e.g. to add __reduce__ to the models, more testing, etc. 2015-11-07 03:24:30 +11:00
structs.pxd * Clean up unused Constituent struct. 2015-11-03 23:48:21 +11:00
symbols.pxd
symbols.pyx
tagger.pxd * Refactor away from the _ml module, to use thinc 4.0. Still some work needs to be done, e.g. to add __reduce__ to the models, more testing, etc. 2015-11-07 03:24:30 +11:00
tagger.pyx * Make constructor of ParserModel and TaggerModel the same as AveragedPerceptron, for each pickling. 2015-11-07 18:25:17 +11:00
tokenizer.pxd Add __reduce__ to Tokenizer so that English pickles. 2015-10-23 22:24:03 -07:00
tokenizer.pyx * Rename Doc.data to Doc.c 2015-11-04 00:15:14 +11:00
typedefs.pxd
typedefs.pyx
util.py
vocab.pxd
vocab.pyx * Merge vocab changes 2015-11-06 00:48:08 +11:00