mirror of https://github.com/explosion/spaCy.git, synced 2025-10-26 05:31:15 +03:00

	Merge pull request #6063 from svlandeg/feature/doc_cleanup [ci skip]
commit 9afb1d9965

				|  | @ -183,7 +183,7 @@ will be overwritten. | |||
| | -------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------- | | ||||
| | `match_id`     | An ID for the patterns. ~~str~~                                                                                                                                      | | ||||
| | `patterns`     | A list of match patterns. A pattern consists of a list of dicts, where each dict describes a token in the tree. ~~List[List[Dict[str, Union[str, Dict]]]]~~          | | ||||
| | _keyword-only_ |                                                                                                                                                                      |  | | ||||
| | _keyword-only_ |                                                                                                                                                                      | | ||||
| | `on_match`     | Callback function to act on matches. Takes the arguments `matcher`, `doc`, `i` and `matches`. ~~Optional[Callable[[DependencyMatcher, Doc, int, List[Tuple], Any]]~~ | | ||||
| 
 | ||||
| ## DependencyMatcher.get {#get tag="method"} | ||||
|  |  | |||
|  | @ -217,7 +217,7 @@ model. Delegates to [`predict`](/api/dependencyparser#predict) and | |||
| | Name              | Description                                                                                                                        | | ||||
| | ----------------- | ---------------------------------------------------------------------------------------------------------------------------------- | | ||||
| | `examples`        | A batch of [`Example`](/api/example) objects to learn from. ~~Iterable[Example]~~                                                  | | ||||
| | _keyword-only_    |                                                                                                                                    |  | | ||||
| | _keyword-only_    |                                                                                                                                    | | ||||
| | `drop`            | The dropout rate. ~~float~~                                                                                                        | | ||||
| | `set_annotations` | Whether or not to update the `Example` objects with the predictions, delegating to [`set_annotations`](#set_annotations). ~~bool~~ | | ||||
| | `sgd`             | An optimizer. Will be created via [`create_optimizer`](#create_optimizer) if not set. ~~Optional[Optimizer]~~                      | | ||||
|  |  | |||
|  | @ -85,7 +85,7 @@ providing custom registered functions. | |||
| | `vocab`          | The shared vocabulary. ~~Vocab~~                                                                                                 | | ||||
| | `model`          | The [`Model`](https://thinc.ai/docs/api-model) powering the pipeline component. ~~Model~~                                        | | ||||
| | `name`           | String name of the component instance. Used to add entries to the `losses` during training. ~~str~~                              | | ||||
| | _keyword-only_   |                                                                                                                                  |  | | ||||
| | _keyword-only_   |                                                                                                                                  | | ||||
| | `kb_loader`      | Function that creates a [`KnowledgeBase`](/api/kb) from a `Vocab` instance. ~~Callable[[Vocab], KnowledgeBase]~~                 | | ||||
| | `get_candidates` | Function that generates plausible candidates for a given `Span` object. ~~Callable[[KnowledgeBase, Span], Iterable[Candidate]]~~ | | ||||
| | `labels_discard` | NER labels that will automatically get a `"NIL"` prediction. ~~Iterable[str]~~                                                   | | ||||
|  | @ -218,7 +218,7 @@ pipe's entity linking model and context encoder. Delegates to | |||
| | Name              | Description                                                                                                                        | | ||||
| | ----------------- | ---------------------------------------------------------------------------------------------------------------------------------- | | ||||
| | `examples`        | A batch of [`Example`](/api/example) objects to learn from. ~~Iterable[Example]~~                                                  | | ||||
| | _keyword-only_    |                                                                                                                                    |  | | ||||
| | _keyword-only_    |                                                                                                                                    | | ||||
| | `drop`            | The dropout rate. ~~float~~                                                                                                        | | ||||
| | `set_annotations` | Whether or not to update the `Example` objects with the predictions, delegating to [`set_annotations`](#set_annotations). ~~bool~~ | | ||||
| | `sgd`             | An optimizer. Will be created via [`create_optimizer`](#create_optimizer) if not set. ~~Optional[Optimizer]~~                      | | ||||
|  |  | |||
|  | @ -206,7 +206,7 @@ model. Delegates to [`predict`](/api/entityrecognizer#predict) and | |||
| | Name              | Description                                                                                                                        | | ||||
| | ----------------- | ---------------------------------------------------------------------------------------------------------------------------------- | | ||||
| | `examples`        | A batch of [`Example`](/api/example) objects to learn from. ~~Iterable[Example]~~                                                  | | ||||
| | _keyword-only_    |                                                                                                                                    |  | | ||||
| | _keyword-only_    |                                                                                                                                    | | ||||
| | `drop`            | The dropout rate. ~~float~~                                                                                                        | | ||||
| | `set_annotations` | Whether or not to update the `Example` objects with the predictions, delegating to [`set_annotations`](#set_annotations). ~~bool~~ | | ||||
| | `sgd`             | An optimizer. Will be created via [`create_optimizer`](#create_optimizer) if not set. ~~Optional[Optimizer]~~                      | | ||||
|  |  | |||
|  | @ -255,7 +255,7 @@ Get all patterns that were added to the entity ruler. | |||
| 
 | ||||
| | Name              | Description                                                                                                           | | ||||
| | ----------------- | --------------------------------------------------------------------------------------------------------------------- | | ||||
| | `matcher`         | The underlying matcher used to process token patterns. ~~Matcher~~                                                    |  | | ||||
| | `matcher`         | The underlying matcher used to process token patterns. ~~Matcher~~                                                    | | ||||
| | `phrase_matcher`  | The underlying phrase matcher, used to process phrase patterns. ~~PhraseMatcher~~                                     | | ||||
| | `token_patterns`  | The token patterns present in the entity ruler, keyed by label. ~~Dict[str, List[Dict[str, Union[str, List[dict]]]]~~ | | ||||
| | `phrase_patterns` | The phrase patterns present in the entity ruler, keyed by label. ~~Dict[str, List[Doc]]~~                             | | ||||
|  |  | |||
|  | @ -81,7 +81,7 @@ shortcut for this and instantiate the component using its string name and | |||
| | `vocab`        | The shared vocabulary. ~~Vocab~~                                                                                                                               | | ||||
| | `model`        | **Not yet implemented:** The model to use. ~~Model~~                                                                                                           | | ||||
| | `name`         | String name of the component instance. Used to add entries to the `losses` during training. ~~str~~                                                            | | ||||
| | _keyword-only_ |                                                                                                                                                                |  | | ||||
| | _keyword-only_ |                                                                                                                                                                | | ||||
| | mode           | The lemmatizer mode, e.g. `"lookup"` or `"rule"`. Defaults to `"lookup"`. ~~str~~                                                                              | | ||||
| | lookups        | A lookups object containing the tables such as `"lemma_rules"`, `"lemma_index"`, `"lemma_exc"` and `"lemma_lookup"`. Defaults to `None`. ~~Optional[Lookups]~~ | | ||||
| | overwrite      | Whether to overwrite existing lemmas. ~~bool~                                                                                                                  | | ||||
|  |  | |||
|  | @ -139,7 +139,7 @@ setting up the label scheme based on the data. | |||
| | Name           | Description                                                                                                                           | | ||||
| | -------------- | ------------------------------------------------------------------------------------------------------------------------------------- | | ||||
| | `get_examples` | Function that returns gold-standard annotations in the form of [`Example`](/api/example) objects. ~~Callable[[], Iterable[Example]]~~ | | ||||
| | _keyword-only_ |                                                                                                                                       |  | | ||||
| | _keyword-only_ |                                                                                                                                       | | ||||
| | `pipeline`     | Optional list of pipeline components that this component is part of. ~~Optional[List[Tuple[str, Callable[[Doc], Doc]]]]~~             | | ||||
| | `sgd`          | An optimizer. Will be created via [`create_optimizer`](#create_optimizer) if not set. ~~Optional[Optimizer]~~                         | | ||||
| | **RETURNS**    | The optimizer. ~~Optimizer~~                                                                                                          | | ||||
|  | @ -196,7 +196,7 @@ Delegates to [`predict`](/api/morphologizer#predict) and | |||
| | Name              | Description                                                                                                                        | | ||||
| | ----------------- | ---------------------------------------------------------------------------------------------------------------------------------- | | ||||
| | `examples`        | A batch of [`Example`](/api/example) objects to learn from. ~~Iterable[Example]~~                                                  | | ||||
| | _keyword-only_    |                                                                                                                                    |  | | ||||
| | _keyword-only_    |                                                                                                                                    | | ||||
| | `drop`            | The dropout rate. ~~float~~                                                                                                        | | ||||
| | `set_annotations` | Whether or not to update the `Example` objects with the predictions, delegating to [`set_annotations`](#set_annotations). ~~bool~~ | | ||||
| | `sgd`             | An optimizer. Will be created via [`create_optimizer`](#create_optimizer) if not set. ~~Optional[Optimizer]~~                      | | ||||
|  |  | |||
|  | @ -150,9 +150,9 @@ patterns = [nlp("health care reform"), nlp("healthcare reform")] | |||
| 
 | ||||
| | Name           | Description                                                                                                                                                | | ||||
| | -------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------- | | ||||
| | `match_id`     | str                                                                                                                                                        | An ID for the thing you're matching. ~~str~~ | | ||||
| | `match_id`     | An ID for the thing you're matching. ~~str~~ |                                                                                                                                                        |  | ||||
| | `docs`         | `Doc` objects of the phrases to match. ~~List[Doc]~~                                                                                                       | | ||||
| | _keyword-only_ |                                                                                                                                                            |  | | ||||
| | _keyword-only_ |                                                                                                                                                            | | ||||
| | `on_match`     | Callback function to act on matches. Takes the arguments `matcher`, `doc`, `i` and `matches`. ~~Optional[Callable[[Matcher, Doc, int, List[tuple], Any]]~~ | | ||||
| 
 | ||||
| ## PhraseMatcher.remove {#remove tag="method" new="2.2"} | ||||
|  |  | |||
|  | @ -187,7 +187,7 @@ predictions and gold-standard annotations, and update the component's model. | |||
| | Name              | Description                                                                                                                        | | ||||
| | ----------------- | ---------------------------------------------------------------------------------------------------------------------------------- | | ||||
| | `examples`        | A batch of [`Example`](/api/example) objects to learn from. ~~Iterable[Example]~~                                                  | | ||||
| | _keyword-only_    |                                                                                                                                    |  | | ||||
| | _keyword-only_    |                                                                                                                                    | | ||||
| | `drop`            | The dropout rate. ~~float~~                                                                                                        | | ||||
| | `set_annotations` | Whether or not to update the `Example` objects with the predictions, delegating to [`set_annotations`](#set_annotations). ~~bool~~ | | ||||
| | `sgd`             | An optimizer. Will be created via [`create_optimizer`](#create_optimizer) if not set. ~~Optional[Optimizer]~~                      | | ||||
|  | @ -211,7 +211,7 @@ the "catastrophic forgetting" problem. This feature is experimental. | |||
| | Name           | Description                                                                                                              | | ||||
| | -------------- | ------------------------------------------------------------------------------------------------------------------------ | | ||||
| | `examples`     | A batch of [`Example`](/api/example) objects to learn from. ~~Iterable[Example]~~                                        | | ||||
| | _keyword-only_ |                                                                                                                          |  | | ||||
| | _keyword-only_ |                                                                                                                          | | ||||
| | `drop`         | The dropout rate. ~~float~~                                                                                              | | ||||
| | `sgd`          | An optimizer. Will be created via [`create_optimizer`](#create_optimizer) if not set. ~~Optional[Optimizer]~~            | | ||||
| | `losses`       | Optional record of the loss during training. Updated using the component name as the key. ~~Optional[Dict[str, float]]~~ | | ||||
|  |  | |||
|  | @ -192,7 +192,7 @@ Delegates to [`predict`](/api/sentencerecognizer#predict) and | |||
| | Name              | Description                                                                                                                        | | ||||
| | ----------------- | ---------------------------------------------------------------------------------------------------------------------------------- | | ||||
| | `examples`        | A batch of [`Example`](/api/example) objects to learn from. ~~Iterable[Example]~~                                                  | | ||||
| | _keyword-only_    |                                                                                                                                    |  | | ||||
| | _keyword-only_    |                                                                                                                                    | | ||||
| | `drop`            | The dropout rate. ~~float~~                                                                                                        | | ||||
| | `set_annotations` | Whether or not to update the `Example` objects with the predictions, delegating to [`set_annotations`](#set_annotations). ~~bool~~ | | ||||
| | `sgd`             | An optimizer. Will be created via [`create_optimizer`](#create_optimizer) if not set. ~~Optional[Optimizer]~~                      | | ||||
|  | @ -216,7 +216,7 @@ the "catastrophic forgetting" problem. This feature is experimental. | |||
| | Name           | Description                                                                                                              | | ||||
| | -------------- | ------------------------------------------------------------------------------------------------------------------------ | | ||||
| | `examples`     | A batch of [`Example`](/api/example) objects to learn from. ~~Iterable[Example]~~                                        | | ||||
| | _keyword-only_ |                                                                                                                          |  | | ||||
| | _keyword-only_ |                                                                                                                          | | ||||
| | `drop`         | The dropout rate. ~~float~~                                                                                              | | ||||
| | `sgd`          | An optimizer. Will be created via [`create_optimizer`](#create_optimizer) if not set. ~~Optional[Optimizer]~~            | | ||||
| | `losses`       | Optional record of the loss during training. Updated using the component name as the key. ~~Optional[Dict[str, float]]~~ | | ||||
|  |  | |||
|  | @ -53,7 +53,7 @@ Initialize the sentencizer. | |||
| 
 | ||||
| | Name           | Description                                                                                                             | | ||||
| | -------------- | ----------------------------------------------------------------------------------------------------------------------- | | ||||
| | _keyword-only_ |                                                                                                                         |  | | ||||
| | _keyword-only_ |                                                                                                                         | | ||||
| | `punct_chars`  | Optional custom list of punctuation characters that mark sentence ends. See below for defaults. ~~Optional[List[str]]~~ | | ||||
| 
 | ||||
| ```python | ||||
|  |  | |||
|  | @ -190,7 +190,7 @@ Delegates to [`predict`](/api/tagger#predict) and | |||
| | Name              | Description                                                                                                                        | | ||||
| | ----------------- | ---------------------------------------------------------------------------------------------------------------------------------- | | ||||
| | `examples`        | A batch of [`Example`](/api/example) objects to learn from. ~~Iterable[Example]~~                                                  | | ||||
| | _keyword-only_    |                                                                                                                                    |  | | ||||
| | _keyword-only_    |                                                                                                                                    | | ||||
| | `drop`            | The dropout rate. ~~float~~                                                                                                        | | ||||
| | `set_annotations` | Whether or not to update the `Example` objects with the predictions, delegating to [`set_annotations`](#set_annotations). ~~bool~~ | | ||||
| | `sgd`             | An optimizer. Will be created via [`create_optimizer`](#create_optimizer) if not set. ~~Optional[Optimizer]~~                      | | ||||
|  | @ -214,7 +214,7 @@ the "catastrophic forgetting" problem. This feature is experimental. | |||
| | Name           | Description                                                                                                              | | ||||
| | -------------- | ------------------------------------------------------------------------------------------------------------------------ | | ||||
| | `examples`     | A batch of [`Example`](/api/example) objects to learn from. ~~Iterable[Example]~~                                        | | ||||
| | _keyword-only_ |                                                                                                                          |  | | ||||
| | _keyword-only_ |                                                                                                                          | | ||||
| | `drop`         | The dropout rate. ~~float~~                                                                                              | | ||||
| | `sgd`          | An optimizer. Will be created via [`create_optimizer`](#create_optimizer) if not set. ~~Optional[Optimizer]~~            | | ||||
| | `losses`       | Optional record of the loss during training. Updated using the component name as the key. ~~Optional[Dict[str, float]]~~ | | ||||
|  |  | |||
|  | @ -201,7 +201,7 @@ Delegates to [`predict`](/api/textcategorizer#predict) and | |||
| | Name              | Description                                                                                                                        | | ||||
| | ----------------- | ---------------------------------------------------------------------------------------------------------------------------------- | | ||||
| | `examples`        | A batch of [`Example`](/api/example) objects to learn from. ~~Iterable[Example]~~                                                  | | ||||
| | _keyword-only_    |                                                                                                                                    |  | | ||||
| | _keyword-only_    |                                                                                                                                    |  | ||||
| | `drop`            | The dropout rate. ~~float~~                                                                                                        | | ||||
| | `set_annotations` | Whether or not to update the `Example` objects with the predictions, delegating to [`set_annotations`](#set_annotations). ~~bool~~ | | ||||
| | `sgd`             | An optimizer. Will be created via [`create_optimizer`](#create_optimizer) if not set. ~~Optional[Optimizer]~~                      | | ||||
|  | @ -225,7 +225,7 @@ the "catastrophic forgetting" problem. This feature is experimental. | |||
| | Name           | Description                                                                                                              | | ||||
| | -------------- | ------------------------------------------------------------------------------------------------------------------------ | | ||||
| | `examples`     | A batch of [`Example`](/api/example) objects to learn from. ~~Iterable[Example]~~                                        | | ||||
| | _keyword-only_ |                                                                                                                          |  | | ||||
| | _keyword-only_ |                                                                                                                          |  | ||||
| | `drop`         | The dropout rate. ~~float~~                                                                                              | | ||||
| | `sgd`          | An optimizer. Will be created via [`create_optimizer`](#create_optimizer) if not set. ~~Optional[Optimizer]~~            | | ||||
| | `losses`       | Optional record of the loss during training. Updated using the component name as the key. ~~Optional[Dict[str, float]]~~ | | ||||
|  | @ -263,7 +263,7 @@ Score a batch of examples. | |||
| | Name             | Description                                                                                                          | | ||||
| | ---------------- | -------------------------------------------------------------------------------------------------------------------- | | ||||
| | `examples`       | The examples to score. ~~Iterable[Example]~~                                                                         | | ||||
| | _keyword-only_   |                                                                                                                      |  | | ||||
| | _keyword-only_   |                                                                                                                      | | ||||
| | `positive_label` | Optional positive label. ~~Optional[str]~~                                                                           | | ||||
| | **RETURNS**      | The scores, produced by [`Scorer.score_cats`](/api/scorer#score_cats). ~~Dict[str, Union[float, Dict[str, float]]]~~ | | ||||
| 
 | ||||
|  |  | |||
|  | @ -144,7 +144,7 @@ setting up the label scheme based on the data. | |||
| | Name           | Description                                                                                                                           | | ||||
| | -------------- | ------------------------------------------------------------------------------------------------------------------------------------- | | ||||
| | `get_examples` | Function that returns gold-standard annotations in the form of [`Example`](/api/example) objects. ~~Callable[[], Iterable[Example]]~~ | | ||||
| | _keyword-only_ |                                                                                                                                       |  | | ||||
| | _keyword-only_ |                                                                                                                                       | | ||||
| | `pipeline`     | Optional list of pipeline components that this component is part of. ~~Optional[List[Tuple[str, Callable[[Doc], Doc]]]]~~             | | ||||
| | `sgd`          | An optimizer. Will be created via [`create_optimizer`](#create_optimizer) if not set. ~~Optional[Optimizer]~~                         | | ||||
| | **RETURNS**    | The optimizer. ~~Optimizer~~                                                                                                          | | ||||
|  | @ -200,7 +200,7 @@ Delegates to [`predict`](/api/tok2vec#predict). | |||
| | Name              | Description                                                                                                                        | | ||||
| | ----------------- | ---------------------------------------------------------------------------------------------------------------------------------- | | ||||
| | `examples`        | A batch of [`Example`](/api/example) objects to learn from. ~~Iterable[Example]~~                                                  | | ||||
| | _keyword-only_    |                                                                                                                                    |  | | ||||
| | _keyword-only_    |                                                                                                                                    | | ||||
| | `drop`            | The dropout rate. ~~float~~                                                                                                        | | ||||
| | `set_annotations` | Whether or not to update the `Example` objects with the predictions, delegating to [`set_annotations`](#set_annotations). ~~bool~~ | | ||||
| | `sgd`             | An optimizer. Will be created via [`create_optimizer`](#create_optimizer) if not set. ~~Optional[Optimizer]~~                      | | ||||
|  |  | |||
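
The `_keyword-only_` row that this commit tidies up in each table marks the point after which arguments have to be passed by name. A minimal sketch of that convention in plain Python follows. The `update` function and the `"demo_component"` key are hypothetical stand-ins for illustration and are not spaCy code; only the bare `*` in the signature is the mechanism being shown.

```python
from typing import Dict, Iterable, Optional


def update(
    examples: Iterable[str],
    *,
    drop: float = 0.0,
    losses: Optional[Dict[str, float]] = None,
) -> Dict[str, float]:
    """Hypothetical stand-in for a component's update method (not spaCy code)."""
    if losses is None:
        losses = {}
    # Record a dummy "loss" under a hypothetical component name,
    # mirroring the "component name as the key" wording in the tables.
    losses.setdefault("demo_component", 0.0)
    losses["demo_component"] += float(len(list(examples)))
    return losses


# Arguments listed after the bare `*` must be passed by name.
result = update(["a", "b"], drop=0.2)

# Passing them positionally raises a TypeError.
try:
    update(["a", "b"], 0.2)
    positional_ok = True
except TypeError:
    positional_ok = False
```

Here `result` holds the dummy loss dict and `positional_ok` ends up `False`, which is the behaviour the `_keyword-only_` separator row documents for `drop`, `sgd`, `losses` and similar arguments.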