spaCy/spacy
2018-02-12 12:05:54 +01:00
..
cli Fix init_model issue 2018-02-03 17:21:34 +03:30
data Make spacy/data a package 2017-03-18 20:04:22 +01:00
displacy Don't use deprecated Doc.merge call in displaCy 2018-01-27 11:25:05 +01:00
lang Merge pull request #1913 from ohenrik/nb_syntax_iterator 2018-02-06 04:59:07 +01:00
syntax Fix #1929: Incorrect NER when pre-set sentence boundaries. 2018-02-08 15:25:41 +01:00
tests Make test for #1945 more precise 2018-02-07 02:06:11 +01:00
tokens fix sent_start in serialization 2018-01-28 19:50:42 +01:00
__init__.pxd * Seems to be working after refactor. Need to wire up more POS tag features, and wire up save/load of POS tags. 2014-10-24 02:23:42 +11:00
__init__.py Remove dummy variable from function calls 2018-01-05 09:37:05 +01:00
__main__.py Don't pass CLI command name as dummy argument 2018-01-04 21:33:47 +01:00
_matcher2_notes.py Add Python notes for rethinking matcher 2018-02-12 10:19:29 +01:00
_ml.py Create a preprocess function that gets bigrams 2017-11-12 00:43:41 +01:00
about.py Increment version to 2.0.7 2018-02-02 03:39:16 +01:00
attrs.pxd Fix cpdef enum in attrs.pyx 2017-09-17 12:28:53 -05:00
attrs.pyx Added tag map, fixed tests fails, added more exceptions 2017-11-26 20:54:48 +03:00
compat.py Add noqa to Python 2 compat variables of built-ins (see #1617) 2017-11-20 14:03:42 +01:00
glossary.py Update NER annotation scheme 2017-10-30 13:53:49 +01:00
gold.pxd Add support for sent_start to GoldParse 2017-08-25 20:03:14 -05:00
gold.pyx Add offsets_from_biluo_tags helper and tests (see #1626) 2017-11-26 16:38:01 +01:00
language.py make to sure pass in **cfg to each component when training 2018-01-30 18:29:54 -08:00
lemmatizer.py If no rules are set, lemmatize by lookup 2017-12-06 12:12:11 +01:00
lexeme.pxd WIP on stringstore change. 27 failures 2017-05-28 14:06:40 +02:00
lexeme.pyx Make .similarity() return 1.0 if all orth attrs match 2018-01-15 16:29:48 +01:00
matcher.pyx Merge pull request #1876 from GregDubbin/master 2018-01-24 16:38:11 +01:00
matcher2.pyx Move pattern_id out of TokenPattern 2018-02-12 12:05:54 +01:00
morphology.pxd Remove cpdef enum, to avoid too much code generation 2017-10-20 13:59:57 +02:00
morphology.pyx Fix non-clobbering lemmatization 2017-11-06 12:36:05 +01:00
parts_of_speech.pxd Add support for Universal Dependencies v2.0 2017-03-03 13:17:34 +01:00
parts_of_speech.pyx Tidy up rest 2017-10-27 21:07:59 +02:00
pipeline.pxd Fix names of pipeline components 2017-10-26 12:38:23 +02:00
pipeline.pyx Further model deserialization fixes re #1727 2018-01-23 19:16:05 +01:00
scorer.py Tidy up rest 2017-10-27 21:07:59 +02:00
strings.pxd Try to fix StringStore clean up (see #1506) 2017-11-11 03:11:27 +03:00
strings.pyx Use safer method to get string without hit 2017-11-14 22:58:46 +03:00
structs.pxd Make TokenC.sent_tart an int, to allow ternary value 2017-10-08 19:58:54 +02:00
symbols.pxd Update symbols and document missing token attributes (see #1439) 2017-10-20 13:08:44 +02:00
symbols.pyx Add PRON_LEMMA to spacy.symbols 2017-11-06 17:38:25 +01:00
tokenizer.pxd Disable tokenizer cache for special-cases. Fixes #1250 2017-10-24 16:08:05 +02:00
tokenizer.pyx Merge pull request #1611 from fsonntag/master 2017-11-29 23:11:23 +01:00
typedefs.pxd Work on changing StringStore to return hashes. 2017-05-28 12:36:27 +02:00
typedefs.pyx Tidy up rest 2017-10-27 21:07:59 +02:00
util.py Add missing import (fixes #1546) 2017-11-10 19:05:18 +01:00
vectors.pyx Allow vector loading to work on 1d data files. Fixes #1831 2018-01-22 19:18:26 +01:00
vocab.pxd Add Vocab.cfg attr, to hold stuff like oov probs 2017-10-30 16:08:50 +01:00
vocab.pyx Make Vocab.__contains__ work with ints. Fixes #1868 2018-01-23 23:26:47 +01:00