spaCy/spacy
Wolfgang Seeker 5e2e8e951a add baseclass DocIterator for iterators over documents
add classes for English and German noun chunks

the respective iterators are set for the document when created by the parser
as they depend on the annotation scheme of the parsing model
2016-03-16 15:53:35 +01:00
..
data add data dir 2015-11-18 11:48:55 +01:00
de add baseclass DocIterator for iterators over documents 2016-03-16 15:53:35 +01:00
en add baseclass DocIterator for iterators over documents 2016-03-16 15:53:35 +01:00
fi access model via sputnik 2015-12-07 06:01:28 +01:00
it access model via sputnik 2015-12-07 06:01:28 +01:00
munge * Fix Python3 problem in align_raw 2015-07-28 16:06:53 +02:00
serialize * Whitespace 2016-01-29 03:59:22 +01:00
syntax add baseclass DocIterator for iterators over documents 2016-03-16 15:53:35 +01:00
tests * Add missing __contains__ method to vocab 2016-03-08 15:49:10 +00:00
tokens add baseclass DocIterator for iterators over documents 2016-03-16 15:53:35 +01:00
__init__.pxd * Seems to be working after refactor. Need to wire up more POS tag features, and wire up save/load of POS tags. 2014-10-24 02:23:42 +11:00
__init__.py cleanup api 2016-03-08 12:59:18 +01:00
about.py * Increment version 2016-03-08 15:58:45 +00:00
attrs.pxd introduce lang field for LexemeC to hold language id 2016-03-10 13:01:34 +01:00
attrs.pyx introduce lang field for LexemeC to hold language id 2016-03-10 13:01:34 +01:00
cfile.pxd * Add cfile.pyx 2015-07-23 01:10:36 +02:00
cfile.pyx * Fix CFile for Python2 2015-07-25 22:55:53 +02:00
gold.pxd * Remove unused import 2015-07-25 18:11:16 +02:00
gold.pyx adjust train.py to train both english and german models 2016-03-03 15:21:00 +01:00
language.py introduce lang field for LexemeC to hold language id 2016-03-10 13:01:34 +01:00
lemmatizer.py distinct load() and from_package() methods 2016-01-16 10:00:57 +01:00
lexeme.pxd introduce lang field for LexemeC to hold language id 2016-03-10 13:01:34 +01:00
lexeme.pyx introduce lang field for LexemeC to hold language id 2016-03-10 13:01:34 +01:00
matcher.pyx * Fix Matcher.pipe 2016-02-05 19:46:02 +01:00
morphology.pxd * Ensure Morphology can be pickled, to address Issue #125. 2015-10-13 13:44:41 +11:00
morphology.pyx * Fix imports 2016-01-19 03:36:51 +01:00
multi_words.py * Fix Issue #50: Python 3 compatibility of v0.80 2015-04-13 05:59:43 +02:00
orth.pxd remove text-unidecode dependency 2016-02-24 08:01:59 +01:00
orth.pyx introduce lang field for LexemeC to hold language id 2016-03-10 13:01:34 +01:00
parts_of_speech.pxd * Fix parts_of_speech now that symbols list has been reformed 2015-10-13 13:44:40 +11:00
parts_of_speech.pyx * Fix NAMES list in spacy/parts_of_speech.pyx 2015-10-13 14:18:45 +11:00
scorer.py * Accept punct_labels as an argument to the scorer 2016-02-02 22:59:06 +01:00
strings.pxd * Use unicode in StringStore.intern, instead of unreliably casting to bytes. 2015-11-05 11:32:19 +00:00
strings.pyx * Add missing __contains__ method to vocab 2016-03-08 15:49:10 +00:00
structs.pxd introduce lang field for LexemeC to hold language id 2016-03-10 13:01:34 +01:00
symbols.pxd * Add placeholders for the new flags in attrs and symbols 2016-02-04 15:49:45 +01:00
symbols.pyx * Add placeholders for the new flags in attrs and symbols 2016-02-04 15:49:45 +01:00
tagger.pxd * Move to thinc 5.0 2016-01-29 03:58:55 +01:00
tagger.pyx adjust train.py to train both english and german models 2016-03-03 15:21:00 +01:00
tokenizer.pxd Add __reduce__ to Tokenizer so that English pickles. 2015-10-23 22:24:03 -07:00
tokenizer.pyx * Add pipe() method to tokenizer 2016-02-03 02:32:37 +01:00
typedefs.pxd * Fix type declarations for attr_t. Remove unused id_t. 2015-07-18 22:39:57 +02:00
typedefs.pyx * Move POS tag definitions to parts_of_speech.pxd 2015-01-25 16:31:07 +11:00
util.py cleanup api 2016-03-08 12:59:18 +01:00
vocab.pxd * Start trying to pickle Vocab 2015-10-13 13:44:41 +11:00
vocab.pyx add baseclass DocIterator for iterators over documents 2016-03-16 15:53:35 +01:00