spaCy

mirror of https://github.com/explosion/spaCy.git synced 2026-02-06 07:19:45 +03:00

History

Matthew Honnibal b00326a7fe Move pattern_id out of TokenPattern		2018-02-12 12:05:54 +01:00
..
cli	Fix init_model issue	2018-02-03 17:21:34 +03:30
data	Make spacy/data a package	2017-03-18 20:04:22 +01:00
displacy	Don't use deprecated Doc.merge call in displaCy	2018-01-27 11:25:05 +01:00
lang	Merge pull request #1913 from ohenrik/nb_syntax_iterator	2018-02-06 04:59:07 +01:00
syntax	Fix #1929 : Incorrect NER when pre-set sentence boundaries.	2018-02-08 15:25:41 +01:00
tests	Make test for #1945 more precise	2018-02-07 02:06:11 +01:00
tokens	fix sent_start in serialization	2018-01-28 19:50:42 +01:00
__init__.pxd	* Seems to be working after refactor. Need to wire up more POS tag features, and wire up save/load of POS tags.	2014-10-24 02:23:42 +11:00
__init__.py	Remove dummy variable from function calls	2018-01-05 09:37:05 +01:00
__main__.py	Don't pass CLI command name as dummy argument	2018-01-04 21:33:47 +01:00
_matcher2_notes.py	Add Python notes for rethinking matcher	2018-02-12 10:19:29 +01:00
_ml.py	Create a preprocess function that gets bigrams	2017-11-12 00:43:41 +01:00
about.py	Increment version to 2.0.7	2018-02-02 03:39:16 +01:00
attrs.pxd	Fix cpdef enum in attrs.pyx	2017-09-17 12:28:53 -05:00
attrs.pyx	Added tag map, fixed tests fails, added more exceptions	2017-11-26 20:54:48 +03:00
compat.py	Add noqa to Python 2 compat variables of built-ins (see #1617 )	2017-11-20 14:03:42 +01:00
glossary.py	Update NER annotation scheme	2017-10-30 13:53:49 +01:00
gold.pxd	Add support for sent_start to GoldParse	2017-08-25 20:03:14 -05:00
gold.pyx	Add offsets_from_biluo_tags helper and tests (see #1626 )	2017-11-26 16:38:01 +01:00
language.py	make to sure pass in **cfg to each component when training	2018-01-30 18:29:54 -08:00
lemmatizer.py	If no rules are set, lemmatize by lookup	2017-12-06 12:12:11 +01:00
lexeme.pxd	WIP on stringstore change. 27 failures	2017-05-28 14:06:40 +02:00
lexeme.pyx	Make .similarity() return 1.0 if all orth attrs match	2018-01-15 16:29:48 +01:00
matcher.pyx	Merge pull request #1876 from GregDubbin/master	2018-01-24 16:38:11 +01:00
matcher2.pyx	Move pattern_id out of TokenPattern	2018-02-12 12:05:54 +01:00
morphology.pxd	Remove cpdef enum, to avoid too much code generation	2017-10-20 13:59:57 +02:00
morphology.pyx	Fix non-clobbering lemmatization	2017-11-06 12:36:05 +01:00
parts_of_speech.pxd	Add support for Universal Dependencies v2.0	2017-03-03 13:17:34 +01:00
parts_of_speech.pyx	Tidy up rest	2017-10-27 21:07:59 +02:00
pipeline.pxd	Fix names of pipeline components	2017-10-26 12:38:23 +02:00
pipeline.pyx	Further model deserialization fixes re #1727	2018-01-23 19:16:05 +01:00
scorer.py	Tidy up rest	2017-10-27 21:07:59 +02:00
strings.pxd	Try to fix StringStore clean up (see #1506 )	2017-11-11 03:11:27 +03:00
strings.pyx	Use safer method to get string without hit	2017-11-14 22:58:46 +03:00
structs.pxd	Make TokenC.sent_tart an int, to allow ternary value	2017-10-08 19:58:54 +02:00
symbols.pxd	Update symbols and document missing token attributes (see #1439 )	2017-10-20 13:08:44 +02:00
symbols.pyx	Add PRON_LEMMA to spacy.symbols	2017-11-06 17:38:25 +01:00
tokenizer.pxd	Disable tokenizer cache for special-cases. Fixes #1250	2017-10-24 16:08:05 +02:00
tokenizer.pyx	Merge pull request #1611 from fsonntag/master	2017-11-29 23:11:23 +01:00
typedefs.pxd	Work on changing StringStore to return hashes.	2017-05-28 12:36:27 +02:00
typedefs.pyx	Tidy up rest	2017-10-27 21:07:59 +02:00
util.py	Add missing import (fixes #1546 )	2017-11-10 19:05:18 +01:00
vectors.pyx	Allow vector loading to work on 1d data files. Fixes #1831	2018-01-22 19:18:26 +01:00
vocab.pxd	Add Vocab.cfg attr, to hold stuff like oov probs	2017-10-30 16:08:50 +01:00
vocab.pyx	Make Vocab.__contains__ work with ints. Fixes #1868	2018-01-23 23:26:47 +01:00