spaCy/spacy
Raphael Mitsch 96b61d0671
Fix EL failure with sentence-crossing entities (#12398)
* Add test reproducing EL failure in sentence-crossing entities.

* Format.

* Draft fix.

* Format.

* Fix case for len(ent.sents) == 1.

* Format.

* Format.

* Format.

* Fix mypy error.

* Merge EL sentence crossing tests.

* Remove unneeded sentencizer component.

* Fix or ignore mypy issues in test.

* Simplify ent.sents handling.

* Format. Update assert in ent.sents handling.

* Small rewrite

---------

Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2023-03-14 22:02:49 +01:00
..
cli Fix --verbose for spacy find-threshold (#12418) 2023-03-14 17:16:49 +01:00
displacy Auto-format code with black (#12100) 2023-01-13 10:12:10 +01:00
kb rely on is_empty property instead of __len__ (#12347) 2023-03-01 12:06:07 +01:00
lang Bugfix/swedish tokenizer (#12315) 2023-02-27 10:53:45 +01:00
matcher Add new REL_OPs: >+, >-, <+, and <- (#12334) 2023-02-28 14:36:33 +01:00
ml Raise error for non-default vectors with PretrainVectors (#12366) 2023-03-06 18:06:31 +01:00
pipeline Fix EL failure with sentence-crossing entities (#12398) 2023-03-14 22:02:49 +01:00
tests Fix EL failure with sentence-crossing entities (#12398) 2023-03-14 22:02:49 +01:00
tokens Fix sentence indexing bug in Span.sents (#12405) 2023-03-14 10:21:53 +01:00
training Have logging calls use string formatting types (#12215) 2023-02-02 11:15:22 +01:00
__init__.pxd
__init__.py Simplify and clarify enable/disable behavior of spacy.load() (#11459) 2022-09-27 14:22:36 +02:00
__main__.py
about.py Set version to v3.5.0 2022-11-25 12:05:25 +01:00
attrs.pxd
attrs.pyx
compat.py
default_config_pretraining.cfg
default_config.cfg Add training.before_update callback (#11739) 2022-11-23 17:54:58 +01:00
errors.py Add spancat_singlelabel pipeline for multiclass and non-overlapping span labelling tasks (#11365) 2023-03-09 10:30:59 +01:00
glossary.py
language.py Have logging calls use string formatting types (#12215) 2023-02-02 11:15:22 +01:00
lexeme.pxd
lexeme.pyi fix types (#12365) 2023-03-07 13:29:08 +01:00
lexeme.pyx fix types (#12365) 2023-03-07 13:29:08 +01:00
lookups.py Fix issues for Mypy 0.950 and Pydantic 1.9.0 (#10786) 2022-05-25 09:33:54 +02:00
morphology.pxd
morphology.pyx
parts_of_speech.pxd
parts_of_speech.pyx
pipe_analysis.py
py.typed
schemas.py Auto-format code with black (#12100) 2023-01-13 10:12:10 +01:00
scorer.py Restore v2 token_acc score implementation (#12073) 2023-01-11 08:01:47 +01:00
strings.pxd StringStore-related optimizations (#10938) 2022-07-04 15:04:03 +02:00
strings.pyi
strings.pyx StringStore-related optimizations (#10938) 2022-07-04 15:04:03 +02:00
structs.pxd
symbols.pxd
symbols.pyx
tokenizer.pxd
tokenizer.pyx
ty.py
typedefs.pxd
typedefs.pyx
util.py Add tests for projects to master (#12303) 2023-02-23 10:22:57 +01:00
vectors.pyx Add equality definition for vectors (#11806) 2022-11-16 09:44:42 +01:00
vocab.pxd
vocab.pyi
vocab.pyx fix comparison of constants (#11834) 2022-11-21 08:12:03 +01:00