spaCy/spacy
2015-10-09 13:26:17 +02:00
..
de * Add rule to ensure ordinals are preserved as single tokens 2015-09-22 12:26:05 +10:00
en * Increment data version 2015-10-09 13:26:17 +02:00
fi * More work on language-generic parsing 2015-08-28 02:02:33 +02:00
it * Delete extra wordnets 2015-09-13 10:31:37 +10:00
munge * Fix Python3 problem in align_raw 2015-07-28 16:06:53 +02:00
serialize
syntax * Fix L/R edge bug by ensuring l_edge and r_edge are preset, and by fixing the way the edges are updated in del_arc. Bugs keep arising here because the edges are absolute positions, whereas everything else is relative. I'm also not 100% convinced that del_arc is handled correctly. Do we need to update the parents? 2015-09-09 03:40:44 +02:00
tokens * Rename _seq to doc attribute in Span 2015-09-29 23:03:55 +10:00
__init__.pxd
__init__.py
_ml.pxd
_ml.pyx * Tagger training now working. Still need to test load/save of model. Morphology still broken. 2015-08-27 09:16:11 +02:00
_nn.py
_nn.pyx
_theano.pxd
_theano.pyx
attrs.pxd * Add PROB attribute in attrs.pxd 2015-08-26 19:14:19 +02:00
attrs.pyx
cfile.pxd
cfile.pyx
gold.pxd
gold.pyx
language.py * Fix Issue #116: Misleading handling of True value in Language.__init__. 2015-09-29 20:54:12 +10:00
lemmatizer.py * Add support for punctuation lemmatization, to handle unicode characters. This should help in addressing Issue #130 2015-10-09 18:54:40 +11:00
lexeme.pxd * Fix ugly py_check_flag and py_set_flag functions in Lexeme 2015-09-15 13:06:18 +10:00
lexeme.pyx * Fix vectors bugs for OOV words 2015-09-22 02:10:25 +02:00
matcher.pyx * Fix phrase matcher 2015-10-09 02:00:45 +11:00
morphology.pxd * More work on language independent parsing 2015-08-28 03:44:54 +02:00
morphology.pyx * Allow punctuation to be lemmatized 2015-10-09 19:02:42 +11:00
multi_words.py
orth.pxd
orth.pyx * Fix type declaration in asciied function 2015-10-09 13:46:57 +11:00
parts_of_speech.pxd * Tagger training now working. Still need to test load/save of model. Morphology still broken. 2015-08-27 09:16:11 +02:00
parts_of_speech.pyx * Tagger training now working. Still need to test load/save of model. Morphology still broken. 2015-08-27 09:16:11 +02:00
scorer.py
senses.pxd * Set up an array POS_SENSES that denotes the set of valid senses for each POS tag. This way, we can do bitwise & between a lexeme's senses and the ones available for its POS tag, to get the allowable senses for the token. 2015-07-01 20:12:13 +02:00
senses.pyx
strings.pxd
strings.pyx * Work on language-independent refactoring 2015-08-23 20:49:18 +02:00
structs.pxd * Remove const qualifier on LexemeC.repvec 2015-09-15 14:42:51 +10:00
tagger.pxd * Tagger training now working. Still need to test load/save of model. Morphology still broken. 2015-08-27 09:16:11 +02:00
tagger.pyx * More work on language independent parsing 2015-08-28 03:44:54 +02:00
tokenizer.pxd * More work on language-generic parsing 2015-08-28 02:02:33 +02:00
tokenizer.pyx * More work on language-generic parsing 2015-08-28 02:02:33 +02:00
typedefs.pxd
typedefs.pyx
util.py
vocab.pxd * Rename vectors_length attribute 2015-09-15 14:43:31 +10:00
vocab.pyx * Add LookupError for better error reporting in Vocab 2015-10-06 10:34:59 +11:00
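
The commit message on senses.pxd describes a POS_SENSES array that holds the set of valid senses for each POS tag, combined with a lexeme's senses via bitwise & to get the senses allowed for a token. A minimal sketch of that idea in plain Python follows. The sense flags, the POS ids, and the `allowed_senses` helper are hypothetical names chosen for illustration and are not taken from the repository; only the name POS_SENSES comes from the commit message.

```python
# Hypothetical sense flags, one bit per sense (illustrative only).
N_ANIMAL = 1 << 0
N_ARTIFACT = 1 << 1
V_MOTION = 1 << 2
V_COMMUNICATION = 1 << 3

# Hypothetical POS ids used as indices into POS_SENSES.
NOUN = 0
VERB = 1

# POS_SENSES[pos] is a bitmask of the senses that are valid for that POS tag.
POS_SENSES = [
    N_ANIMAL | N_ARTIFACT,       # NOUN
    V_MOTION | V_COMMUNICATION,  # VERB
]


def allowed_senses(lexeme_senses, pos):
    """Keep only the lexeme's senses that are valid for the given POS tag."""
    return lexeme_senses & POS_SENSES[pos]


# A lexeme such as "duck" might carry both a noun sense and a verb sense.
duck_senses = N_ANIMAL | V_MOTION
noun_reading = allowed_senses(duck_senses, NOUN)  # only the N_ANIMAL bit survives
verb_reading = allowed_senses(duck_senses, VERB)  # only the V_MOTION bit survives
```

Storing the per-tag masks in an array keeps the lookup to one index and one AND per token, which is presumably why the commit frames it this way.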