spaCy

mirror of https://github.com/explosion/spaCy.git synced 2026-01-10 10:41:14 +03:00

History

Matthew Honnibal 38ca0c33f5 Merge branch 'neuralnet' into refactor Mostly refactors parser, to use new thinc3.2 Example class. Aim is to remove use of shared memory, so that we can parallelize over documents easily. Conflicts: setup.py spacy/syntax/parser.pxd spacy/syntax/parser.pyx spacy/syntax/stateclass.pyx		2015-07-14 14:13:47 +02:00
..
en	* Break up tokens.pyx into tokens/doc.pyx, tokens/token.pyx, tokens/spans.pyx	2015-07-13 20:20:58 +02:00
munge	* Fix head alignment in read_conll.parse, which was causing corrupt parses when strip_bad_periods=True. A similar problem may apply to other data readers.	2015-06-18 16:35:27 +02:00
ner	Remove trailing whitespace	2015-04-19 13:01:38 -07:00
syntax	Merge branch 'neuralnet' into refactor	2015-07-14 14:13:47 +02:00
tokens	* Extend count_by method	2015-07-14 03:20:09 +02:00
__init__.pxd	* Seems to be working after refactor. Need to wire up more POS tag features, and wire up save/load of POS tags.	2014-10-24 02:23:42 +11:00
__init__.py	* Basic punct tests updated and passing	2014-08-27 19:38:57 +02:00
_bu_nn.pyx	* Merge changes, and adjust Example to use memoryview	2015-06-28 11:36:11 +02:00
_ml.pxd	Merge branch 'neuralnet' into refactor	2015-07-14 14:13:47 +02:00
_ml.pyx	* Use new Example class	2015-06-28 22:36:03 +02:00
_nn.py	* Merge changes, and adjust Example to use memoryview	2015-06-28 11:36:11 +02:00
_nn.pyx	* Merge changes, and adjust Example to use memoryview	2015-06-28 11:36:11 +02:00
_theano.pxd	* Merge changes, and adjust Example to use memoryview	2015-06-28 11:36:11 +02:00
_theano.pyx	* Begin reorganizing neuralnet work	2015-06-30 14:26:32 +02:00
attrs.pxd	Remove trailing whitespace	2015-04-19 13:01:38 -07:00
gold.pxd	* Have oracle functions take a struct instead of a Python object	2015-06-02 20:01:06 +02:00
gold.pyx	* Fix space check in gold.pyx	2015-07-14 00:10:27 +02:00
lexeme.pxd	* Remove has_sense method from Lexeme declaration	2015-07-08 19:41:20 +02:00
lexeme.pyx	* Remove has_sense method from Lexeme	2015-07-08 19:28:29 +02:00
morphology.pxd	* Tmp commit. Refactoring to create a Python Lexeme class.	2015-01-12 10:26:22 +11:00
morphology.pyx	* Make PyPy work	2015-01-05 17:54:38 +11:00
multi_words.py	* Fix Issue #50 : Python 3 compatibility of v0.80	2015-04-13 05:59:43 +02:00
orth.pxd	* Make PyPy work	2015-01-05 17:54:38 +11:00
orth.pyx	* Work on word vectors, and other stuff	2015-01-17 16:21:17 +11:00
parts_of_speech.pxd	* Add SPACE part-of-speech tag, and train tagger to assign it. Also train tagger not to make whitespace an entity	2015-07-09 13:30:41 +02:00
parts_of_speech.pyx	* Add SPACE part-of-speech tag, and train tagger to assign it. Also train tagger not to make whitespace an entity	2015-07-09 13:30:41 +02:00
scorer.py	* Start scoring tokens	2015-06-28 06:21:38 +02:00
senses.pxd	* Set up an array POS_SENSES that denotes the set of valid senses for each POS tag. This way, we can do bitwise & between a lexeme's senses and the ones available for its POS tag, to get the allowable senses for the token.	2015-07-01 20:12:13 +02:00
senses.pyx	* Set up an array POS_SENSES that denotes the set of valid senses for each POS tag. This way, we can do bitwise & between a lexeme's senses and the ones available for its POS tag, to get the allowable senses for the token.	2015-07-01 20:12:13 +02:00
serialize.pxd	* Make .pxd file for huffman codec	2015-07-13 13:54:51 +02:00
serialize.pyx	* Round-trip for serialization finally working. Needs a lot of optimization.	2015-07-13 18:39:38 +02:00
strings.pxd	* Tmp commit. Refactoring to create a Python Lexeme class.	2015-01-12 10:26:22 +11:00
strings.pyx	* Add __len__ function to StringStore	2015-06-23 00:02:50 +02:00
structs.pxd	* Add TokenC.spacy attr	2015-07-13 19:48:07 +02:00
tokenizer.pxd	* Refactor tokenizer, to set the 'spacy' field on TokenC instead of passing a string	2015-07-13 21:46:02 +02:00
tokenizer.pyx	* Fix tokenizer	2015-07-14 00:10:51 +02:00
typedefs.pxd	Remove trailing whitespace	2015-04-19 13:01:38 -07:00
typedefs.pyx	* Move POS tag definitions to parts_of_speech.pxd	2015-01-25 16:31:07 +11:00
util.py	Remove trailing whitespace	2015-04-19 13:01:38 -07:00
vocab.pxd	* Add codec property to Vocab, to use the Huffman encoding	2015-07-13 13:55:14 +02:00
vocab.pyx	* Add codec property to Vocab, to use the Huffman encoding	2015-07-13 13:55:14 +02:00