spaCy/spacy/tests/doc
Grivaz 39815513e2 Add split one token into several (resolves #2838) (#3253)
* Add split one token into several (resolves #2838)

* Improve error message for token splitting

* Make retokenizer.split() tests use a Token object

Change retokenizer.split() to use a Token object, instead of an index.

* Pass Token into retokenize.split()

Tweak retokenize.split() API so that we pass the `Token` object, not the index.

* Fix token.idx in retokenize.split()

* Test that token.idx is correct after split

* Fix token.idx for split tokens

* Fix retokenize.split()

* Fix retokenize.split

* Fix retokenize.split() test
2019-02-15 01:27:13 +11:00
..
__init__.py Rename "tokens" tests to "doc" 2017-01-11 18:59:01 +01:00
test_add_entities.py raise error when setting overlapping entities as doc.ents (#2880) 2018-10-26 23:29:16 +02:00
test_array.py Support strings for attribute list in doc.to_array 2017-10-20 11:46:57 +05:30
test_creation.py Fix tests 2017-10-11 13:27:18 +02:00
test_doc_api.py Bugfix/get lca matrix (#3110) 2019-01-06 19:07:50 +01:00
test_doc_spilt.py Add split one token into several (resolves #2838) (#3253) 2019-02-15 01:27:13 +11:00
test_pickle_doc.py Test pickling hooks 2017-10-17 19:43:52 +02:00
test_span_merge.py Introduces a bulk merge function, in order to solve issue #653 (#2696) 2018-09-10 16:41:42 +02:00
test_span.py Bugfix/get lca matrix (#3110) 2019-01-06 19:07:50 +01:00
test_token_api.py Add sent property to token (#2521) 2018-07-06 15:54:15 +02:00