mirror of
				https://github.com/explosion/spaCy.git
				synced 2025-10-25 13:11:03 +03:00 
			
		
		
		
	Merge pull request #8665 from rynoV/patch-1 [ci skip]
This commit is contained in:
		
						commit
						d4fecdfb82
					
				|  | @ -63,7 +63,7 @@ another token that's at least 10 characters long. | ||||||
| 
 | 
 | ||||||
| spaCy features a rule-matching engine, the [`Matcher`](/api/matcher), that | spaCy features a rule-matching engine, the [`Matcher`](/api/matcher), that | ||||||
| operates over tokens, similar to regular expressions. The rules can refer to | operates over tokens, similar to regular expressions. The rules can refer to | ||||||
| token annotations (e.g. the token `text` or `tag_`, and flags (e.g. `IS_PUNCT`). | token annotations (e.g. the token `text` or `tag_`, and flags like `IS_PUNCT`). | ||||||
| The rule matcher also lets you pass in a custom callback to act on matches – for | The rule matcher also lets you pass in a custom callback to act on matches – for | ||||||
| example, to merge entities and apply custom labels. You can also associate | example, to merge entities and apply custom labels. You can also associate | ||||||
| patterns with entity IDs, to allow some basic entity linking or disambiguation. | patterns with entity IDs, to allow some basic entity linking or disambiguation. | ||||||
|  | @ -1552,7 +1552,7 @@ doc = nlp("Dr. Alex Smith chaired first board meeting of Acme Corp Inc.") | ||||||
| print([(ent.text, ent.label_) for ent in doc.ents]) | print([(ent.text, ent.label_) for ent in doc.ents]) | ||||||
| ``` | ``` | ||||||
| 
 | 
 | ||||||
| An alternative approach would be to an | An alternative approach would be to use an | ||||||
| [extension attribute](/usage/processing-pipelines/#custom-components-attributes) | [extension attribute](/usage/processing-pipelines/#custom-components-attributes) | ||||||
| like `._.person_title` and add it to `Span` objects (which includes entity spans | like `._.person_title` and add it to `Span` objects (which includes entity spans | ||||||
| in `doc.ents`). The advantage here is that the entity text stays intact and can | in `doc.ents`). The advantage here is that the entity text stays intact and can | ||||||
|  |  | ||||||
		Loading…
	
		Reference in New Issue
	
	Block a user