mirror of
https://github.com/explosion/spaCy.git
synced 2025-01-26 17:24:41 +03:00
d59f968d08
In the retokenizer, only reset sentence starts (with `set_children_from_head`) if the doc is parsed. If there is no parse, merged tokens keep the default unset value, `token.is_sent_start == None`, after retokenization.

__init__.pxd
__init__.py
_dict_proxies.py
_retokenize.pyx
_serialize.py
doc.pxd
doc.pyx
graph.pxd
graph.pyx
morphanalysis.pxd
morphanalysis.pyx
span_group.pxd
span_group.pyx
span.pxd
span.pyx
token.pxd
token.pyx
underscore.py
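
The behaviour described in the commit message can be illustrated with a toy model. This is a minimal sketch in plain Python, not spaCy's implementation: `ToyToken` and `merge_span` are hypothetical names, and the `parsed` branch simply treats the whole token list as one sentence as a stand-in for recomputing boundaries from a dependency parse.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ToyToken:
    """Hypothetical stand-in for a token; not a spaCy class."""
    text: str
    # Tri-state flag: True, False, or None when the value is unset.
    is_sent_start: Optional[bool] = None


def merge_span(tokens: List[ToyToken], start: int, end: int,
               parsed: bool) -> List[ToyToken]:
    """Merge tokens[start:end] into one token and return the new list."""
    merged = ToyToken(text=" ".join(t.text for t in tokens[start:end]))
    result = tokens[:start] + [merged] + tokens[end:]
    if parsed:
        # Assumption: this loop stands in for recomputing sentence starts
        # from the parse. The toy treats the whole list as one sentence.
        for i, tok in enumerate(result):
            tok.is_sent_start = i == 0
    # Without a parse nothing is reset, so the merged token stays unset.
    return result


def make_tokens() -> List[ToyToken]:
    return [ToyToken("New"), ToyToken("York"), ToyToken("rocks")]


unparsed = merge_span(make_tokens(), 0, 2, parsed=False)
parsed = merge_span(make_tokens(), 0, 2, parsed=True)
```

In this sketch the merged token in `unparsed` keeps `is_sent_start` as `None`, while the tokens in `parsed` get explicit `True`/`False` values. In spaCy itself the merge happens inside a `with doc.retokenize() as retokenizer:` block via `retokenizer.merge(span)`.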