spaCy/spacy/tokens
adrianeboyd 2d8c6e1124 Iterate over lr_edges until sents are correct (#4702)
Iterate over lr_edges until all heads are within the current sentence.
Instead of iterating over them for a fixed number of iterations, check
whether the sentence boundaries are correct for the heads and stop when
all are correct. Stop after a maximum of 10 iterations, providing a
warning in this case since the sentence boundaries may not be correct.
2019-11-25 13:06:36 +01:00
..
__init__.pxd * Break up tokens.pyx into tokens/doc.pyx, tokens/token.pyx, tokens/spans.pyx 2015-07-13 20:20:58 +02:00
__init__.py DocPallet -> DocBin 2019-09-18 15:15:37 +02:00
_retokenize.pyx Fix realloc in retokenizer.split() (#4606) 2019-11-11 16:26:46 +01:00
_serialize.py Fix serialization of extension attr values in DocBin (#4540) 2019-10-28 16:02:13 +01:00
doc.pxd Iterate over lr_edges until sents are correct (#4702) 2019-11-25 13:06:36 +01:00
doc.pyx Iterate over lr_edges until sents are correct (#4702) 2019-11-25 13:06:36 +01:00
morphanalysis.pxd Add header for morphanalysis 2019-03-07 17:24:57 +01:00
morphanalysis.pyx Add more docstrings for MorphAnalysis 2019-09-12 16:48:30 +02:00
span.pxd annotate kb_id through ents in doc 2019-03-22 11:36:44 +01:00
span.pyx Allow copying of user_data in as_doc (#4282) 2019-09-12 17:08:14 +02:00
token.pxd ensure Span.as_doc keeps the entity links + unit test 2019-06-25 15:28:51 +02:00
token.pyx Merge changes to test_ner 2019-09-18 21:41:24 +02:00
underscore.py 💫 Improve introspection of custom extension attributes (#3729) 2019-05-12 00:53:11 +02:00