spaCy

mirror of https://github.com/explosion/spaCy.git synced 2025-04-12 13:14:18 +03:00

History

Matthew Honnibal 92b6bd2977 Refinements to retokenize.split() function (#3282 ) * Change retokenize.split() API for heads * Pass lists as values for attrs in split * Fix test_doc_split filename * Add error for mismatched tokens after split * Raise error if new tokens don't match text * Fix doc test * Fix error * Move deps under attrs * Fix split tests * Fix retokenize.split		2019-02-15 17:32:31 +01:00
..
__init__.pxd	* Break up tokens.pyx into tokens/doc.pyx, tokens/token.pyx, tokens/spans.pyx	2015-07-13 20:20:58 +02:00
__init__.py	Tidy up and document Doc, Token and Span	2017-10-27 15:41:45 +02:00
_retokenize.pyx	Refinements to retokenize.split() function (#3282 )	2019-02-15 17:32:31 +01:00
_serialize.py	💫 Replace ujson, msgpack and dill/pickle/cloudpickle with srsly (#3003 )	2018-12-03 01:28:22 +01:00
doc.pxd	Fix issue 2396 (#3089 )	2018-12-29 18:05:52 +01:00
doc.pyx	💫 Replace {Doc,Span}.merge with Doc.retokenize (#3280 )	2019-02-15 10:29:44 +01:00
span.pxd	Add Span.to_array method	2017-08-19 12:20:45 +02:00
span.pyx	💫 Replace {Doc,Span}.merge with Doc.retokenize (#3280 )	2019-02-15 10:29:44 +01:00
token.pxd	Make NORM a token attribute (#3029 )	2018-12-08 10:49:10 +01:00
token.pyx	Raise better error if token is pickled (resolves #2833 ) (#3267 )	2019-02-13 11:27:04 +01:00
underscore.py	💫 Tidy up and auto-format .py files (#2983 )	2018-11-30 17:03:03 +01:00