| .. |
|
data
|
add data dir
|
2015-11-18 11:48:55 +01:00 |
|
de
|
add baseclass DocIterator for iterators over documents
|
2016-03-16 15:53:35 +01:00 |
|
en
|
add baseclass DocIterator for iterators over documents
|
2016-03-16 15:53:35 +01:00 |
|
fi
|
access model via sputnik
|
2015-12-07 06:01:28 +01:00 |
|
it
|
access model via sputnik
|
2015-12-07 06:01:28 +01:00 |
|
munge
|
* Fix Python3 problem in align_raw
|
2015-07-28 16:06:53 +02:00 |
|
serialize
|
* Whitespace
|
2016-01-29 03:59:22 +01:00 |
|
syntax
|
add baseclass DocIterator for iterators over documents
|
2016-03-16 15:53:35 +01:00 |
|
tests
|
* Add missing __contains__ method to vocab
|
2016-03-08 15:49:10 +00:00 |
|
tokens
|
make error messages language independent
|
2016-03-24 11:47:09 +01:00 |
|
__init__.pxd
|
* Seems to be working after refactor. Need to wire up more POS tag features, and wire up save/load of POS tags.
|
2014-10-24 02:23:42 +11:00 |
|
__init__.py
|
cleanup api
|
2016-03-08 12:59:18 +01:00 |
|
about.py
|
* Increment version
|
2016-03-08 15:58:45 +00:00 |
|
attrs.pxd
|
introduce lang field for LexemeC to hold language id
|
2016-03-10 13:01:34 +01:00 |
|
attrs.pyx
|
introduce lang field for LexemeC to hold language id
|
2016-03-10 13:01:34 +01:00 |
|
cfile.pxd
|
* Add cfile.pyx
|
2015-07-23 01:10:36 +02:00 |
|
cfile.pyx
|
* Fix CFile for Python2
|
2015-07-25 22:55:53 +02:00 |
|
gold.pxd
|
* Remove unused import
|
2015-07-25 18:11:16 +02:00 |
|
gold.pyx
|
adjust train.py to train both english and german models
|
2016-03-03 15:21:00 +01:00 |
|
language.py
|
introduce lang field for LexemeC to hold language id
|
2016-03-10 13:01:34 +01:00 |
|
lemmatizer.py
|
distinct load() and from_package() methods
|
2016-01-16 10:00:57 +01:00 |
|
lexeme.pxd
|
introduce lang field for LexemeC to hold language id
|
2016-03-10 13:01:34 +01:00 |
|
lexeme.pyx
|
make error messages language independent
|
2016-03-24 11:47:09 +01:00 |
|
matcher.pyx
|
* Fix Matcher.pipe
|
2016-02-05 19:46:02 +01:00 |
|
morphology.pxd
|
* Ensure Morphology can be pickled, to address Issue #125.
|
2015-10-13 13:44:41 +11:00 |
|
morphology.pyx
|
* Fix imports
|
2016-01-19 03:36:51 +01:00 |
|
multi_words.py
|
* Fix Issue #50: Python 3 compatibility of v0.80
|
2015-04-13 05:59:43 +02:00 |
|
orth.pxd
|
remove text-unidecode dependency
|
2016-02-24 08:01:59 +01:00 |
|
orth.pyx
|
introduce lang field for LexemeC to hold language id
|
2016-03-10 13:01:34 +01:00 |
|
parts_of_speech.pxd
|
* Fix parts_of_speech now that symbols list has been reformed
|
2015-10-13 13:44:40 +11:00 |
|
parts_of_speech.pyx
|
* Fix NAMES list in spacy/parts_of_speech.pyx
|
2015-10-13 14:18:45 +11:00 |
|
scorer.py
|
* Accept punct_labels as an argument to the scorer
|
2016-02-02 22:59:06 +01:00 |
|
strings.pxd
|
* Use unicode in StringStore.intern, instead of unreliably casting to bytes.
|
2015-11-05 11:32:19 +00:00 |
|
strings.pyx
|
* Add missing __contains__ method to vocab
|
2016-03-08 15:49:10 +00:00 |
|
structs.pxd
|
introduce lang field for LexemeC to hold language id
|
2016-03-10 13:01:34 +01:00 |
|
symbols.pxd
|
* Add placeholders for the new flags in attrs and symbols
|
2016-02-04 15:49:45 +01:00 |
|
symbols.pyx
|
* Add placeholders for the new flags in attrs and symbols
|
2016-02-04 15:49:45 +01:00 |
|
tagger.pxd
|
* Move to thinc 5.0
|
2016-01-29 03:58:55 +01:00 |
|
tagger.pyx
|
adjust train.py to train both english and german models
|
2016-03-03 15:21:00 +01:00 |
|
tokenizer.pxd
|
Add __reduce__ to Tokenizer so that English pickles.
|
2015-10-23 22:24:03 -07:00 |
|
tokenizer.pyx
|
* Add pipe() method to tokenizer
|
2016-02-03 02:32:37 +01:00 |
|
typedefs.pxd
|
* Fix type declarations for attr_t. Remove unused id_t.
|
2015-07-18 22:39:57 +02:00 |
|
typedefs.pyx
|
* Move POS tag definitions to parts_of_speech.pxd
|
2015-01-25 16:31:07 +11:00 |
|
util.py
|
cleanup api
|
2016-03-08 12:59:18 +01:00 |
|
vocab.pxd
|
* Start trying to pickle Vocab
|
2015-10-13 13:44:41 +11:00 |
|
vocab.pyx
|
add baseclass DocIterator for iterators over documents
|
2016-03-16 15:53:35 +01:00 |