spaCy/website/docs/usage
adrianeboyd 8fe7bdd0fa Improve token pattern checking without validation (#4105)
* Fix typo in rule-based matching docs

* Improve token pattern checking without validation

Add more detailed token pattern checks without full JSON pattern validation and
provide more detailed error messages.

Addresses #4070 (also related: #4063, #4100).

* Check whether top-level attributes in patterns and attr for PhraseMatcher are
  in token pattern schema

* Check whether attribute value types are supported in general (as opposed to
  per attribute with full validation)

* Report various internal error types (OverflowError, AttributeError, KeyError)
  as ValueError with standard error messages

* Check for tagger/parser in PhraseMatcher pipeline for attributes TAG, POS,
  LEMMA, and DEP

* Add error messages with relevant details on how to use validate=True or nlp()
  instead of nlp.make_doc()

* Support attr=TEXT for PhraseMatcher

* Add NORM to schema

* Expand tests for pattern validation, Matcher, PhraseMatcher, and EntityRuler

* Remove unnecessary .keys()

* Rephrase error messages

* Add another type check to Matcher

Add another type check to Matcher for more understandable error messages
in some rare cases.

* Support phrase_matcher_attr=TEXT for EntityRuler

* Don't use spacy.errors in examples and bin scripts

* Fix error code

* Auto-format

Also try to get Azure Pipelines to finally start a build :(

* Update errors.py
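
The first three checks listed above (top-level attributes against the schema, value types checked in general, and internal errors surfaced as ValueError) can be sketched roughly as follows. This is a simplified illustration, not the code from this commit: the names `SCHEMA_ATTRS`, `SUPPORTED_VALUE_TYPES`, `find_pattern_errors` and `check_token_pattern` are hypothetical, and the attribute set is an assumed subset chosen for the example.

```python
# Illustrative sketch only: a simplified, hypothetical version of the checks
# described in the commit message. It is NOT spaCy's implementation.

# Assumed subset of attribute names allowed at the top level of a token pattern.
SCHEMA_ATTRS = {"ORTH", "TEXT", "LOWER", "NORM", "LEMMA", "POS", "TAG", "DEP", "OP"}

# Assumed value types accepted "in general", regardless of the attribute.
SUPPORTED_VALUE_TYPES = (str, int, bool, dict)


def find_pattern_errors(pattern):
    """Return a list of human-readable problems found in a token pattern."""
    if not isinstance(pattern, list):
        return ["Pattern must be a list of dicts, got {}".format(type(pattern).__name__)]
    errors = []
    for i, token_spec in enumerate(pattern):
        if not isinstance(token_spec, dict):
            errors.append("Token {}: expected a dict, got {}".format(i, type(token_spec).__name__))
            continue
        for attr, value in token_spec.items():
            # Check the attribute name against the schema (case-insensitive).
            if not isinstance(attr, str) or attr.upper() not in SCHEMA_ATTRS:
                errors.append("Token {}: unknown attribute {!r}".format(i, attr))
            # Check the value type in general, not per attribute.
            elif not isinstance(value, SUPPORTED_VALUE_TYPES):
                errors.append(
                    "Token {}: unsupported value type {} for {!r}".format(
                        i, type(value).__name__, attr
                    )
                )
    return errors


def check_token_pattern(pattern):
    """Raise a single ValueError that summarizes every problem found."""
    errors = find_pattern_errors(pattern)
    if errors:
        raise ValueError("Invalid token pattern: " + "; ".join(errors))
```

Collecting all problems before raising lets one ValueError name every offending token, which is the kind of detail the commit message aims for without running full JSON schema validation.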


Co-authored-by: Ines Montani <ines@ines.io>
Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com>
2019-08-21 14:00:37 +02:00
101 Make visualized NER examples more clear (closes #4104) [ci skip] 2019-08-18 16:29:29 +02:00
_benchmarks-choi.md 💫 Update website (#3285) 2019-02-17 19:31:19 +01:00
adding-languages.md Fix missing ids 2019-03-14 17:56:53 +01:00
examples.md Remove non-existent example (closes #3533) 2019-04-03 09:59:17 +02:00
facts-figures.md Update landing and feature overview 2019-04-19 15:23:08 +02:00
index.md Merge branch 'spacy.io' [ci skip] 2019-03-22 15:17:11 +01:00
linguistic-features.md Make visualized NER examples more clear (closes #4104) [ci skip] 2019-08-18 16:29:29 +02:00
models.md Corrected example model URL in requirements.txt (#3786) 2019-05-29 10:51:55 +02:00
processing-pipelines.md Add "Processing text" section [ci skip] 2019-07-25 17:38:03 +02:00
rule-based-matching.md Improve token pattern checking without validation (#4105) 2019-08-21 14:00:37 +02:00
saving-loading.md Improve consistency of docs examples [ci skip] 2019-07-25 14:24:56 +02:00
spacy-101.md Remove dangling M (#3657) 2019-04-29 19:44:43 +02:00
training.md Document gold.docs_to_json [ci skip] 2019-07-10 10:27:33 +02:00
v2-1.md Merge branch 'spacy.io' [ci skip] 2019-03-23 16:46:34 +01:00
v2.md Fix small issues in the docs [ci skip] 2019-03-12 22:57:15 +01:00
vectors-similarity.md 💫 Update website (#3285) 2019-02-17 19:31:19 +01:00
visualizers.md Make enabling/disabling jupyter mode more explicit (#4144) 2019-08-19 11:53:34 +02:00