spaCy/spacy/tokens
Matthew Honnibal e5ad35787c WIP on adding split-token actions to parser
This patch starts getting the StateC object ready to split tokens. The
split function is implemented by pushing indices into the buffer that
indicate an out-of-length token.

Still to do:

* Update the oracles
* Update GoldParseC
* Interpret the parse once it's complete
* Add retokenizer.split() method
2018-03-31 20:05:27 +02:00
File             Last commit (message, then date)
__init__.pxd     * Break up tokens.pyx into tokens/doc.pyx, tokens/token.pyx, tokens/spans.pyx  2015-07-13 20:20:58 +02:00
__init__.py      Tidy up and document Doc, Token and Span  2017-10-27 15:41:45 +02:00
_retokenize.pyx  Add notes for adding retokenize.split()  2018-03-31 19:32:37 +02:00
doc.pxd          Export set_children_from_heads C function from doc.pxd  2018-03-31 15:17:23 +02:00
doc.pyx          WIP on adding split-token actions to parser  2018-03-31 20:05:27 +02:00
printers.py      Tidy up util and helpers  2017-10-27 14:39:09 +02:00
span.pxd         Add Span.to_array method  2017-08-19 12:20:45 +02:00
span.pyx         Revert "Merge branch 'develop' of https://github.com/explosion/spaCy into develop"  2018-03-27 19:23:02 +02:00
token.pxd        fix sent_start in serialization  2018-01-28 19:50:42 +01:00
token.pyx        Fix #2014: token.pos_ not writeable  2018-03-27 21:21:11 +02:00
underscore.py    Tidy up util and helpers  2017-10-27 14:39:09 +02:00