spaCy/spacy/cli
kadarakos c003aac29a
SpanFinder into spaCy from experimental (#12507)
* span finder integrated into spacy from experimental

* black

* isort

* black

* default spankey constant

* black

* Update spacy/pipeline/spancat.py

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>

* rename

* rename

* max_length and min_length as Optional[int] and strict checking

* black

* mypy fix for integer type infinity

* revert line order

* implement all comparison operators for inf int

* avoid two for loops over all docs by not precomputing

* interleave thresholding with span creation

* black

* revert to not interleaving (relized its faster)

* black

* Update spacy/errors.py

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>

* update dosctring

* enforce that the gold and predicted documents have the same text

* new error for ensuring reference and predicted texts are the same

* remove todo

* adjust test

* black

* handle misaligned tokenization

* return correct variable

* failing overfit test

* only use a single spans_key like in spancat

* black

* remove debug lines

* typo

* remove comment

* remove near duplicate reduntant method

* use the 'spans_key' variable name everywhere

* Update spacy/pipeline/span_finder.py

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>

* flaky test fix suggestion, hand set bias terms

* only test suggester and test result exhaustively

* make it clear that the span_finder_suggester is more general (not specific to span_finder)

* Update spacy/tests/pipeline/test_span_finder.py

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>

* Apply suggestions from code review

* remove question comment

* move preset_spans_suggester test to spancat tests

* Add docs and unify default configs for spancat and span finder

* Add `allow_overlap=True` to span finder scorer

* Fix offset bug in set_annotations

* Ignore labels in span finder scorer

* Format

* Add span_finder to quickstart template

* Move settings to self.cfg, store min/max unset as None

* Remove debugging

* Update docstrings and docs

* Update spacy/pipeline/span_finder.py

Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>

* Fix imports

---------

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2023-06-07 15:52:28 +02:00
..
project Partially work around pending deprecation of pkg_resources (#12368) 2023-03-06 14:48:57 +01:00
templates SpanFinder into spaCy from experimental (#12507) 2023-06-07 15:52:28 +02:00
__init__.py Add a spacy benchmark speed subcommand (#11902) 2023-01-12 11:55:21 +01:00
_util.py Have logging calls use string formatting types (#12215) 2023-02-02 11:15:22 +01:00
apply.py Auto-format code with black (#12019) 2022-12-23 12:44:07 +01:00
assemble.py Tidy up with flake8: imports, comparisons, etc. 2021-06-28 12:08:15 +02:00
benchmark_speed.py Add a spacy benchmark speed subcommand (#11902) 2023-01-12 11:55:21 +01:00
convert.py fix processing of "auto" in convert (#12050) 2023-01-05 10:21:00 +01:00
debug_config.py Fix references to config file in the docs & UX (#9961) 2022-01-04 14:31:26 +01:00
debug_data.py fix typo (#12543) 2023-04-19 10:59:33 +02:00
debug_diff.py Add debug diff command in spaCy CLI (#10502) 2022-04-07 10:48:45 +02:00
debug_model.py Tidy up with flake8: imports, comparisons, etc. 2021-06-28 12:08:15 +02:00
download.py Remove #egg from download URLs (#12567) 2023-05-04 17:13:12 +02:00
evaluate.py Add scorer option to return per-component scores (#12540) 2023-05-12 15:36:54 +02:00
find_threshold.py Fix --verbose for spacy find-threshold (#12418) 2023-03-14 17:16:49 +01:00
info.py Partially work around pending deprecation of pkg_resources (#12368) 2023-03-06 14:48:57 +01:00
init_config.py Workaround for Typer optional default values with Python calls (#10788) 2022-06-17 12:15:36 +02:00
init_pipeline.py Add support for floret vectors (#8909) 2021-10-27 14:08:31 +02:00
package.py Adjust return type of registry.find (#12227) 2023-02-15 12:32:53 +01:00
pretrain.py Add model-last saving mechanism to pretraining (#12459) 2023-04-03 15:24:03 +02:00
profile.py 🏷 Add Mypy check to CI and ignore all existing Mypy errors (#9167) 2021-10-14 15:21:40 +02:00
train.py Add docs section for spacy.cli.train.train (#9545) 2021-10-29 10:36:34 +02:00
validate.py 🏷 Add Mypy check to CI and ignore all existing Mypy errors (#9167) 2021-10-14 15:21:40 +02:00