Apple ORG is looking at buying U.K. GPE startup for $1 billion MONEY
+
diff --git a/website/docs/usage/101/_pos-deps.mdx b/website/docs/usage/101/_pos-deps.mdx
index bedb6ce2c..db1e12885 100644
--- a/website/docs/usage/101/_pos-deps.mdx
+++ b/website/docs/usage/101/_pos-deps.mdx
@@ -56,8 +56,7 @@ for token in doc:
Using spaCy's built-in [displaCy visualizer](/usage/visualizers), here's what
our example sentence and its dependencies look like:
-
diff --git a/website/docs/usage/layers-architectures.mdx b/website/docs/usage/layers-architectures.mdx
index 8f6bf3a20..03b85f5af 100644
--- a/website/docs/usage/layers-architectures.mdx
+++ b/website/docs/usage/layers-architectures.mdx
@@ -153,8 +153,9 @@ maxout_pieces = 3
depth = 2
[components.textcat.model.linear_model]
-@architectures = "spacy.TextCatBOW.v2"
+@architectures = "spacy.TextCatBOW.v3"
exclusive_classes = true
+length = 262144
ngram_size = 1
no_output_layer = false
```
@@ -170,8 +171,9 @@ factory = "textcat"
labels = []
[components.textcat.model]
-@architectures = "spacy.TextCatBOW.v2"
+@architectures = "spacy.TextCatBOW.v3"
exclusive_classes = true
+length = 262144
ngram_size = 1
no_output_layer = false
nO = null
diff --git a/website/docs/usage/linguistic-features.mdx b/website/docs/usage/linguistic-features.mdx
index a58e8a241..21cedd1ef 100644
--- a/website/docs/usage/linguistic-features.mdx
+++ b/website/docs/usage/linguistic-features.mdx
@@ -290,11 +290,7 @@ for token in doc:
| toward | `prep` | shift | `NOUN` | manufacturers |
| manufacturers | `pobj` | toward | `ADP` | |
-
+
Because the syntactic relations form a tree, every word has **exactly one
head**. You can therefore iterate over the arcs in the tree by iterating over
@@ -709,11 +705,9 @@ doc = nlp(text)
displacy.serve(doc, style="ent")
```
-
+
+
When Sebastian Thrun PERSON started working on self-driving cars at Google ORG in 2007 DATE, few people outside of the company took him seriously.
+
## Entity Linking {id="entity-linking"}
@@ -723,6 +717,10 @@ identifier from a knowledge base (KB). You can create your own
[`KnowledgeBase`](/api/kb) and [train](/usage/training) a new
[`EntityLinker`](/api/entitylinker) using that custom knowledge base.
+As an example on how to define a KnowledgeBase and train an entity linker model,
+see [`this tutorial`](https://github.com/explosion/projects/blob/v3/tutorials/nel_emerson)
+using [spaCy projects](/usage/projects).
+
### Accessing entity identifiers {id="entity-linking-accessing",model="entity linking"}
The annotated KB identifier is accessible as either a hash value or as a string,
@@ -733,6 +731,7 @@ object, or the `ent_kb_id` and `ent_kb_id_` attributes of a
```python
import spacy
+# "my_custom_el_pipeline" is assumed to be a custom NLP pipeline that was trained and serialized to disk
nlp = spacy.load("my_custom_el_pipeline")
doc = nlp("Ada Lovelace was born in London")
diff --git a/website/docs/usage/processing-pipelines.mdx b/website/docs/usage/processing-pipelines.mdx
index 6ec8a0513..3e58b251d 100644
--- a/website/docs/usage/processing-pipelines.mdx
+++ b/website/docs/usage/processing-pipelines.mdx
@@ -1328,8 +1328,9 @@ labels = []
# This function is created and then passed to the "textcat" component as
# the argument "model"
[components.textcat.model]
-@architectures = "spacy.TextCatBOW.v2"
+@architectures = "spacy.TextCatBOW.v3"
exclusive_classes = true
+length = 262144
ngram_size = 1
no_output_layer = false
diff --git a/website/docs/usage/rule-based-matching.mdx b/website/docs/usage/rule-based-matching.mdx
index d01107ea2..e5b98da3a 100644
--- a/website/docs/usage/rule-based-matching.mdx
+++ b/website/docs/usage/rule-based-matching.mdx
@@ -1144,10 +1144,9 @@ relations and tokens we want to match:
> displacy.serve(doc)
> ```
-
The relations we're interested in are:
diff --git a/website/docs/usage/saving-loading.mdx b/website/docs/usage/saving-loading.mdx
index aad8ea353..9a6791d5e 100644
--- a/website/docs/usage/saving-loading.mdx
+++ b/website/docs/usage/saving-loading.mdx
@@ -405,7 +405,7 @@ available to spaCy, all you need to do is install the package in your
environment:
```bash
-$ python setup.py develop
+$ python -m pip install .
```
spaCy is now able to create the pipeline component `"snek"` – even though you
@@ -586,11 +586,9 @@ After installing the package, the custom colors will be used when visualizing
text with `displacy`. Whenever the label `SNEK` is assigned, it will be
displayed in `#3dff74`.
-
+
+
🌱🌿 🐍 SNEK ____ 🌳🌲 ____ 👨🌾 HUMAN 🏘️
+
## Saving, loading and distributing trained pipelines {id="models"}
@@ -675,7 +673,7 @@ $ python -m spacy package ./en_example_pipeline ./packages
```
This command will create a pipeline package directory and will run
-`python setup.py sdist` in that directory to create a binary `.whl` file or
+`python -m build` in that directory to create a binary `.whl` file or
`.tar.gz` archive of your package that can be installed using `pip install`.
Installing the binary wheel is usually more efficient.
diff --git a/website/docs/usage/v3-3.mdx b/website/docs/usage/v3-3.mdx
index d692475de..fd211e6d2 100644
--- a/website/docs/usage/v3-3.mdx
+++ b/website/docs/usage/v3-3.mdx
@@ -77,11 +77,9 @@ doc.spans["custom"] = [Span(doc, 3, 6, "ORG"), Span(doc, 5, 6, "GPE")]
displacy.serve(doc, style="span", options={"spans_key": "custom"})
```
-
+
+
Welcome to the BankORGof ChinaGPE.
+
## Additional features and improvements
diff --git a/website/docs/usage/visualizers.mdx b/website/docs/usage/visualizers.mdx
index e73c4a16a..2905ba2bd 100644
--- a/website/docs/usage/visualizers.mdx
+++ b/website/docs/usage/visualizers.mdx
@@ -119,11 +119,9 @@ doc = nlp(text)
displacy.serve(doc, style="ent")
```
-
+
+
When Sebastian Thrun PERSON started working on self-driving cars at Google ORG in 2007 DATE, few people outside of the company took him seriously.
+
The entity visualizer lets you customize the following `options`:
@@ -148,11 +146,9 @@ use the `colors` setting to add your own colors for them.
> displacy.serve(doc, style="ent", options=options)
> ```
-
+
+
But Google ORG is starting from behind. The company made a late push into hardware, and Apple ORG’s Siri, available on iPhones, and Amazon ORG’s Alexa software, which runs on its Echo and Dot devices, have clear leads in consumer adoption.
+
The above example uses a little trick: Since the background color values are
added as the `background` style attribute, you can use any
@@ -197,11 +193,9 @@ doc.spans["sc"] = [
displacy.serve(doc, style="span")
```
-
+
+
Welcome to the BankORGof ChinaGPE.
+
The span visualizer lets you customize the following `options`:
@@ -223,11 +217,9 @@ specify which one displaCy should use with `spans_key` (`sc` is the default).
> displacy.serve(doc, style="span", options=options)
> ```
-
+
+
Welcome to the BankBANKof China.
+
## Using displaCy in Jupyter notebooks {id="jupyter"}
diff --git a/website/meta/languages.json b/website/meta/languages.json
index 3305b840b..d6a078097 100644
--- a/website/meta/languages.json
+++ b/website/meta/languages.json
@@ -103,6 +103,10 @@
"has_examples": true,
"models": ["fi_core_news_sm", "fi_core_news_md", "fi_core_news_lg"]
},
+ {
+ "code": "fo",
+ "name": "Faroese"
+ },
{
"code": "fr",
"name": "French",
@@ -290,6 +294,12 @@
"example": "Dit is een zin.",
"has_examples": true
},
+ {
+ "code": "nn",
+ "name": "Norwegian Nynorsk",
+ "example": "Det er ein meir enn i same periode i fjor.",
+ "has_examples": true
+ },
{
"code": "pl",
"name": "Polish",
diff --git a/website/meta/sidebars.json b/website/meta/sidebars.json
index 24213ed12..2df120ffa 100644
--- a/website/meta/sidebars.json
+++ b/website/meta/sidebars.json
@@ -9,14 +9,9 @@
{ "text": "Models & Languages", "url": "/usage/models" },
{ "text": "Facts & Figures", "url": "/usage/facts-figures" },
{ "text": "spaCy 101", "url": "/usage/spacy-101" },
- { "text": "New in v3.0", "url": "/usage/v3" },
- { "text": "New in v3.1", "url": "/usage/v3-1" },
- { "text": "New in v3.2", "url": "/usage/v3-2" },
- { "text": "New in v3.3", "url": "/usage/v3-3" },
- { "text": "New in v3.4", "url": "/usage/v3-4" },
- { "text": "New in v3.5", "url": "/usage/v3-5" },
+ { "text": "New in v3.7", "url": "/usage/v3-7" },
{ "text": "New in v3.6", "url": "/usage/v3-6" },
- { "text": "New in v3.7", "url": "/usage/v3-7" }
+ { "text": "New in v3.5", "url": "/usage/v3-5" }
]
},
{
diff --git a/website/meta/site.json b/website/meta/site.json
index a07d131d3..f1d318071 100644
--- a/website/meta/site.json
+++ b/website/meta/site.json
@@ -66,6 +66,10 @@
{
"text": "Stack Overflow",
"url": "http://stackoverflow.com/questions/tagged/spacy"
+ },
+ {
+ "text": "Merchandise",
+ "url": "https://explosion.ai/merch"
}
]
},
diff --git a/website/meta/universe.json b/website/meta/universe.json
index b2868c084..6278dd489 100644
--- a/website/meta/universe.json
+++ b/website/meta/universe.json
@@ -4500,6 +4500,23 @@
"website": "https://nlp.unibuc.ro/people/snisioi.html"
},
"category": ["pipeline", "training", "models"]
+ },
+ {
+ "id": "redfield-spacy-nodes",
+ "title": "Redfield NLP Nodes for KNIME",
+ "slogan": "Makes the functionality of the spaCy library available in KNIME Analytics Platform.",
+ "description": "This extension provides nodes that make the functionality of the spaCy library available in the [KNIME Analytics Platform](https://www.knime.com/).",
+ "github": "Redfield-AB/Spacy-Nodes",
+ "url": "https://redfield.ai/spacy-redfield/",
+ "thumb": "https://raw.githubusercontent.com/Redfield-AB/Spacy-Nodes/master/resource/redfield_logo_100x100.png",
+ "image": "https://raw.githubusercontent.com/Redfield-AB/Spacy-Nodes/master/resource/screen1.png",
+ "author": "Redfield AB",
+ "author_links": {
+ "twitter": "Redfield_AB",
+ "github": "Redfield-AB",
+ "website": "https://redfield.ai"
+ },
+ "category": ["standalone"]
}
],
diff --git a/website/public/images/displacy-dep-founded.html b/website/public/images/displacy-dep-founded.svg
similarity index 100%
rename from website/public/images/displacy-dep-founded.html
rename to website/public/images/displacy-dep-founded.svg
diff --git a/website/public/images/displacy-ent-custom.html b/website/public/images/displacy-ent-custom.html
deleted file mode 100644
index 5da472fdb..000000000
--- a/website/public/images/displacy-ent-custom.html
+++ /dev/null
@@ -1,80 +0,0 @@
-
But
- Google
- ORGis starting from behind. The company made a late push into hardware, and
- Apple
- ORG’s Siri, available on iPhones, and
- Amazon
- ORG’s Alexa software, which runs on its Echo and Dot devices, have clear leads in consumer
- adoption.
- When
-
- Sebastian Thrun
- PERSON
-
- started working on self-driving cars at
-
- Google
- ORG
-
- in
-
- 2007
- DATE
-
- , few people outside of the company took him seriously.
-
diff --git a/website/public/images/displacy-long.html b/website/public/images/displacy-long.svg
similarity index 100%
rename from website/public/images/displacy-long.html
rename to website/public/images/displacy-long.svg
diff --git a/website/public/images/displacy-long2.svg b/website/public/images/displacy-long2.svg
new file mode 100644
index 000000000..c428bd2cb
--- /dev/null
+++ b/website/public/images/displacy-long2.svg
@@ -0,0 +1,212 @@
+
diff --git a/website/public/images/displacy-span-custom.html b/website/public/images/displacy-span-custom.html
deleted file mode 100644
index 10cb6dd2d..000000000
--- a/website/public/images/displacy-span-custom.html
+++ /dev/null
@@ -1,84 +0,0 @@
-
- Welcome to the
-
- Bank
-
-
-
-
- BANK
-
-
-
-
- of
-
-
-
-
- China
-
-
-
-
- .
-