spaCy/spacy
Peter Baumann 61b04a70d5
Run PhraseMatcher on Spans (#6918)
* Add regression test

* Run PhraseMatcher on Spans

* Add test for PhraseMatcher on Spans and Docs

* Add SCA

* Add test with 3 matches in Doc, 1 match in Span

* Update docs

* Use doc.length for find_matches in tokenizer

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2021-02-10 23:43:32 +11:00
..
cli add capture arg 2021-02-02 19:47:12 +01:00
displacy Replace links to nightly docs [ci skip] 2021-01-30 20:09:38 +11:00
lang reformatting 2021-01-30 17:29:33 +01:00
matcher Run PhraseMatcher on Spans (#6918) 2021-02-10 23:43:32 +11:00
ml fix: TransformerListener with TextCatEnsemble (#6951) 2021-02-06 13:44:51 +01:00
pipeline ensure the loss value is cast as float (#6928) 2021-02-07 07:51:56 +08:00
tests Run PhraseMatcher on Spans (#6918) 2021-02-10 23:43:32 +11:00
tokens Replace links to nightly docs [ci skip] 2021-01-30 20:09:38 +11:00
training reduce memory load when reading all vectors from file (#6945) 2021-02-07 08:05:43 +08:00
__init__.pxd
__init__.py Pass on vocab arg in spacy.blank() (#6924) 2021-02-04 15:09:01 +01:00
__main__.py Tidy up 2020-06-22 00:45:40 +02:00
about.py Set version to v3.0.0 2021-02-02 20:26:17 +11:00
attrs.pxd Merge branch 'develop' into master-tmp 2020-05-21 18:39:06 +02:00
attrs.pyx Merge branch 'develop' into master-tmp 2020-05-21 18:39:06 +02:00
compat.py Use Literal type for nr_feature_tokens 2020-09-23 16:00:03 +02:00
default_config_pretraining.cfg pretrain architectures (#6451) 2020-12-08 14:41:03 +08:00
default_config.cfg Add initialize.before_init and after_init callbacks 2021-01-12 13:07:44 +01:00
errors.py Rephrase error related to sample data initialization 2021-02-08 09:21:36 +01:00
glossary.py unicode -> str consistency 2020-05-24 17:20:58 +02:00
kb.pxd Revert added_strings change (#6236) 2020-10-10 18:55:07 +02:00
kb.pyx Replace links to nightly docs [ci skip] 2021-01-30 20:09:38 +11:00
language.py remove link_components flag again (#6883) 2021-02-02 10:08:40 +08:00
lexeme.pxd Fix Lexeme.from_ptr 2020-08-10 16:43:37 +02:00
lexeme.pyx reduce memory load when reading all vectors from file (#6945) 2021-02-07 08:05:43 +08:00
lookups.py Replace links to nightly docs [ci skip] 2021-01-30 20:09:38 +11:00
morphology.pxd Add Lemmatizer and simplify related components (#5848) 2020-08-07 15:27:13 +02:00
morphology.pyx Prevent 0-length mem alloc (#6653) 2021-01-06 12:50:17 +11:00
parts_of_speech.pxd
parts_of_speech.pyx Drop Python 2.7 and 3.5 (#4828) 2019-12-22 01:53:56 +01:00
pipe_analysis.py Tidy up and auto-format 2020-09-29 21:39:28 +02:00
schemas.py Add initialize.before_init and after_init callbacks 2021-01-12 13:07:44 +01:00
scorer.py Replace links to nightly docs [ci skip] 2021-01-30 20:09:38 +11:00
strings.pxd Remove 'cleanup' of strings (#6007) 2020-09-01 16:12:15 +02:00
strings.pyx Replace links to nightly docs [ci skip] 2021-01-30 20:09:38 +11:00
structs.pxd Add SpanGroup and Graph container types to represent arbitrary annotations (#6696) 2021-01-14 17:30:41 +11:00
symbols.pxd introduce token.has_head and refer to MISSING_DEP_ (WIP) 2021-01-12 17:17:06 +01:00
symbols.pyx introduce token.has_head and refer to MISSING_DEP_ (WIP) 2021-01-12 17:17:06 +01:00
tokenizer.pxd Simplify specials and cache checks (#6012) 2020-09-03 09:42:49 +02:00
tokenizer.pyx Run PhraseMatcher on Spans (#6918) 2021-02-10 23:43:32 +11:00
typedefs.pxd Merge remote-tracking branch 'upstream/master' into chore/update-develop-from-master 2020-11-25 11:49:34 +01:00
typedefs.pyx
util.py Update and add test 2021-02-10 14:12:00 +11:00
vectors.pyx Replace links to nightly docs [ci skip] 2021-01-30 20:09:38 +11:00
vocab.pxd Merge remote-tracking branch 'upstream/master' into chore/update-develop-from-master 2020-11-25 11:49:34 +01:00
vocab.pyx Replace links to nightly docs [ci skip] 2021-01-30 20:09:38 +11:00