..
cli
Fix ud-train script
2018-03-11 01:26:45 +01:00
data
Make spacy/data a package
2017-03-18 20:04:22 +01:00
displacy
Don't use deprecated Doc.merge call in displaCy
2018-01-27 11:25:05 +01:00
lang
Merge pull request #2012 from alldefector/patch-1
2018-03-11 01:05:19 +01:00
syntax
Fix dropout bug in beam parser
2018-03-10 23:16:40 +01:00
tests
Add built-in factories for merge_entities and merge_noun_chunks
2018-03-15 00:18:51 +01:00
tokens
Fix array out of bounds error in Span
2018-02-28 12:27:09 +01:00
__init__.pxd
* Seems to be working after refactor. Need to wire up more POS tag features, and wire up save/load of POS tags.
2014-10-24 02:23:42 +11:00
__init__.py
Remove dummy variable from function calls
2018-01-05 09:37:05 +01:00
__main__.py
Link in spaCy CoNLL commands
2018-03-10 23:42:15 +01:00
_align.pyx
Fix many-to-one alignment
2018-02-24 16:03:50 +01:00
_matcher2_notes.py
Update notes on matcher2
2018-02-13 11:45:45 +01:00
_ml.py
Create a preprocess function that gets bigrams
2017-11-12 00:43:41 +01:00
about.py
Increment dev version
2018-03-11 01:58:21 +01:00
attrs.pxd
Fix LANG symbol
2018-02-17 18:10:50 +01:00
attrs.pyx
missing PrepCase attribute
2018-02-18 14:46:12 +00:00
compat.py
Drop six and related hacks as a dependency
2018-02-18 13:29:56 +01:00
glossary.py
Fix typo in glossary ( resolves #1964 )
2018-02-10 11:58:41 +01:00
gold.pxd
Add support for sent_start to GoldParse
2017-08-25 20:03:14 -05:00
gold.pyx
Stream the gold data during training, to reduce memory
2018-03-10 22:32:32 +01:00
language.py
Add built-in factories for merge_entities and merge_noun_chunks
2018-03-15 00:18:51 +01:00
lemmatizer.py
Don't lower-case lemmas of proper nouns
2018-02-21 16:01:16 +01:00
lexeme.pxd
WIP on stringstore change. 27 failures
2017-05-28 14:06:40 +02:00
lexeme.pyx
added new lexical feat to lexeme
2018-02-11 18:51:48 +01:00
matcher.pyx
Move cython declarations in matcher.pyx
2018-02-24 10:32:18 +01:00
morphology.pxd
fix typo/missing here too
2018-02-18 14:38:27 +00:00
morphology.pyx
fix typo/missing here too
2018-02-18 14:38:27 +00:00
parts_of_speech.pxd
Add support for Universal Dependencies v2.0
2017-03-03 13:17:34 +01:00
parts_of_speech.pyx
Tidy up rest
2017-10-27 21:07:59 +02:00
pipeline.pxd
Fix names of pipeline components
2017-10-26 12:38:23 +02:00
pipeline.pyx
Add built-in factories for merge_entities and merge_noun_chunks
2018-03-15 00:18:51 +01:00
scorer.py
Fix scoring of tokenization for punct
2018-02-24 10:32:32 +01:00
strings.pxd
Try to fix StringStore clean up (see #1506 )
2017-11-11 03:11:27 +03:00
strings.pyx
Use safer method to get string without hit
2017-11-14 22:58:46 +03:00
structs.pxd
Make TokenC.sent_tart an int, to allow ternary value
2017-10-08 19:58:54 +02:00
symbols.pxd
Fix inconsistencies in the symbols table
2018-02-18 13:51:31 +01:00
symbols.pyx
Fix inconsistencies in the symbols table
2018-02-18 13:51:31 +01:00
tokenizer.pxd
Disable tokenizer cache for special-cases. Fixes #1250
2017-10-24 16:08:05 +02:00
tokenizer.pyx
Merge pull request #1611 from fsonntag/master
2017-11-29 23:11:23 +01:00
typedefs.pxd
Work on changing StringStore to return hashes.
2017-05-28 12:36:27 +02:00
typedefs.pyx
Tidy up rest
2017-10-27 21:07:59 +02:00
util.py
Fix itershuffle
2018-03-10 22:32:59 +01:00
vectors.pyx
Fix Vectors pickling
2018-03-10 22:53:42 +01:00
vocab.pxd
Add Vocab.cfg attr, to hold stuff like oov probs
2017-10-30 16:08:50 +01:00
vocab.pyx
Make Vocab.__contains__ work with ints. Fixes #1868
2018-01-23 23:26:47 +01:00