| de | * Add rule to ensure ordinals are preserved as single tokens | 2015-09-22 12:26:05 +10:00 |
| en | * Increment data version | 2015-10-09 13:26:17 +02:00 |
| fi | * More work on language-generic parsing | 2015-08-28 02:02:33 +02:00 |
| it | * Delete extra wordnets | 2015-09-13 10:31:37 +10:00 |
| munge | * Fix Python3 problem in align_raw | 2015-07-28 16:06:53 +02:00 |
| serialize | | |
| syntax | * Fix L/R edge bug, by ensuring l_edge and r_edge are preset, and fixing the way the edge update in del_arc. Bugs keep arising here because the edges are absolute positions, where everything else is relative. I'm also not 100% convinced that del_arc is handled correctly. Do we need to update the parents? | 2015-09-09 03:40:44 +02:00 |
| tokens | * Rename _seq to doc attribute in Span | 2015-09-29 23:03:55 +10:00 |
| __init__.pxd | | |
| __init__.py | | |
| _ml.pxd | | |
| _ml.pyx | * Tagger training now working. Still need to test load/save of model. Morphology still broken. | 2015-08-27 09:16:11 +02:00 |
| _nn.py | | |
| _nn.pyx | | |
| _theano.pxd | | |
| _theano.pyx | | |
| attrs.pxd | * Add PROB attribute in attrs.pxd | 2015-08-26 19:14:19 +02:00 |
| attrs.pyx | | |
| cfile.pxd | | |
| cfile.pyx | | |
| gold.pxd | | |
| gold.pyx | | |
| language.py | * Fix Issue #116: Misleading handling of True value in Language.__init__. | 2015-09-29 20:54:12 +10:00 |
| lemmatizer.py | * Add support for punctuation lemmatization, to handle unicode characters. This should help in addressing Issue #130 | 2015-10-09 18:54:40 +11:00 |
| lexeme.pxd | * Fix ugly py_check_flag and py_set_flag functions in Lexeme | 2015-09-15 13:06:18 +10:00 |
| lexeme.pyx | * Fix vectors bugs for OOV words | 2015-09-22 02:10:25 +02:00 |
| matcher.pyx | * Fix phrase matcher | 2015-10-09 02:00:45 +11:00 |
| morphology.pxd | * More work on language independent parsing | 2015-08-28 03:44:54 +02:00 |
| morphology.pyx | * Allow punctuation to be lemmatized | 2015-10-09 19:02:42 +11:00 |
| multi_words.py | | |
| orth.pxd | | |
| orth.pyx | * Fix type declaration in asciied function | 2015-10-09 13:46:57 +11:00 |
| parts_of_speech.pxd | * Tagger training now working. Still need to test load/save of model. Morphology still broken. | 2015-08-27 09:16:11 +02:00 |
| parts_of_speech.pyx | * Tagger training now working. Still need to test load/save of model. Morphology still broken. | 2015-08-27 09:16:11 +02:00 |
| scorer.py | | |
| senses.pxd | * Set up an array POS_SENSES that denotes the set of valid senses for each POS tag. This way, we can do bitwise & between a lexeme's senses and the ones available for its POS tag, to get the allowable senses for the token. | 2015-07-01 20:12:13 +02:00 |
| senses.pyx | | |
| strings.pxd | | |
| strings.pyx | * Work on language-independent refactoring | 2015-08-23 20:49:18 +02:00 |
| structs.pxd | * Remove const qualifier on LexemeC.repvec | 2015-09-15 14:42:51 +10:00 |
| tagger.pxd | * Tagger training now working. Still need to test load/save of model. Morphology still broken. | 2015-08-27 09:16:11 +02:00 |
| tagger.pyx | * More work on language independent parsing | 2015-08-28 03:44:54 +02:00 |
| tokenizer.pxd | * More work on language-generic parsing | 2015-08-28 02:02:33 +02:00 |
| tokenizer.pyx | * More work on language-generic parsing | 2015-08-28 02:02:33 +02:00 |
| typedefs.pxd | | |
| typedefs.pyx | | |
| util.py | | |
| vocab.pxd | * Rename vectors_length attribute | 2015-09-15 14:43:31 +10:00 |
| vocab.pyx | * Add LookupError for better error reporting in Vocab | 2015-10-06 10:34:59 +11:00 |
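
Note on the senses.pxd entry: the commit message describes keeping, for each POS tag, a bitmask of the senses that are valid for it, and intersecting that mask with a lexeme's own sense bits. The sketch below illustrates that idea in plain Python. It is not the code from senses.pxd. Only the name POS_SENSES comes from the commit message; the sense flags, the POS ids, and the `allowed_senses` helper are hypothetical and chosen for illustration.

```python
# Hypothetical sense flags, one bit per sense (not the real sense inventory).
NOUN_ANIMAL = 1 << 0
NOUN_ARTIFACT = 1 << 1
VERB_MOTION = 1 << 2
VERB_COGNITION = 1 << 3

# Hypothetical POS ids, used as indices into POS_SENSES.
NOUN = 0
VERB = 1

# POS_SENSES[pos] is the bitmask of senses that are valid for that POS tag.
POS_SENSES = [
    NOUN_ANIMAL | NOUN_ARTIFACT,   # senses allowed for NOUN
    VERB_MOTION | VERB_COGNITION,  # senses allowed for VERB
]


def allowed_senses(lexeme_senses: int, pos: int) -> int:
    """Return the lexeme's senses that are also valid for the given POS tag."""
    return lexeme_senses & POS_SENSES[pos]


# A lexeme such as "duck" might carry both a noun sense and a verb sense.
duck_senses = NOUN_ANIMAL | VERB_MOTION

# Tagged as a noun, only the noun sense survives the bitwise AND.
noun_reading = allowed_senses(duck_senses, NOUN)
# Tagged as a verb, only the verb sense survives.
verb_reading = allowed_senses(duck_senses, VERB)
```

Because the lookup is a single array index followed by one AND, it costs the same however many senses a lexeme carries, which is presumably why the commit chose a bitmask over a set.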