mirror of https://github.com/explosion/spaCy.git (synced 2025-04-27 04:13:41 +03:00)
Update docs [ci skip]
parent c8bda92243
commit 02008e9a55
@@ -4,21 +4,16 @@ import { Help } from 'components/typography'; import Link from 'components/link'
 
 <figure>
 
 | System | Parser | Tagger | NER | WPS<br />CPU <Help>words per second on CPU, higher is better</Help> | WPS<br/>GPU <Help>words per second on GPU, higher is better</Help> |
-| ------------------------------------------------------------------------- | ----------------: | ----------------: | ---: | ------------------------------------------------------------------: | -----------------------------------------------------------------: |
+| ---------------------------------------------------------- | -----: | -----: | ---: | ------------------------------------------------------------------: | -----------------------------------------------------------------: |
 | [`en_core_web_trf`](/models/en#en_core_web_trf) (spaCy v3) | | | | | 6k |
 | [`en_core_web_lg`](/models/en#en_core_web_lg) (spaCy v3) | | | | | |
 | `en_core_web_lg` (spaCy v2) | 91.9 | 97.2 | 85.9 | 10k | |
-| [Stanza](https://stanfordnlp.github.io/stanza/) (StanfordNLP)<sup>1</sup> | _n/a_<sup>2</sup> | _n/a_<sup>2</sup> | 88.8 | 234 | 2k |
-| <Link to="https://github.com/flairNLP/flair" hideIcon>Flair</Link> | - | 97.9 | 89.3 | | |
 
 <figcaption class="caption">
 
 **Accuracy and speed on the
-[OntoNotes 5.0](https://catalog.ldc.upenn.edu/LDC2013T19) corpus.**<br />**1. **
-[Qi et al. (2020)](https://arxiv.org/pdf/2003.07082.pdf). **2. ** _Coming soon_:
-Qi et al. don't report parsing and tagging results on OntoNotes. We're working
-on training Stanza on this corpus to allow direct comparison.
+[OntoNotes 5.0](https://catalog.ldc.upenn.edu/LDC2013T19) corpus.**
 
 </figcaption>
 
@@ -26,19 +21,22 @@ on training Stanza on this corpus to allow direct comparison.
 
 <figure>
 
-| System | POS | UAS | LAS |
-| ------------------------------------------------------------------------------ | ---: | ---: | ---: |
-| spaCy RoBERTa (2020) | 98.0 | 96.8 | 95.0 |
-| spaCy CNN (2020) | | | |
-| [Mrini et al.](https://khalilmrini.github.io/Label_Attention_Layer.pdf) (2019) | 97.3 | 97.4 | 96.3 |
-| [Zhou and Zhao](https://www.aclweb.org/anthology/P19-1230/) (2019) | 97.3 | 97.2 | 95.7 |
+| Named Entity Recognition Model | OntoNotes | CoNLL '03 |
+| ------------------------------------------------------------------------------ | --------: | --------- |
+| spaCy RoBERTa (2020) |
+| spaCy CNN (2020) | |
+| spaCy CNN (2017) | 86.4 |
+| [Stanza](https://stanfordnlp.github.io/stanza/) (StanfordNLP)<sup>1</sup> | 88.8 |
+| <Link to="https://github.com/flairNLP/flair" hideIcon>Flair</Link><sup>2</sup> | 89.7 |
 
 <figcaption class="caption">
 
-**Accuracy on the Penn Treebank.** See
-[NLP-progress](http://nlpprogress.com/english/dependency_parsing.html) for more
-results. For spaCy's evaluation, see the
-[project template](https://github.com/explosion/projects/tree/v3/benchmarks/parsing_penn_treebank).
+**Named entity recognition accuracy** on the
+[OntoNotes 5.0](https://catalog.ldc.upenn.edu/LDC2013T19) and
+[CoNLL-2003](https://www.aclweb.org/anthology/W03-0419.pdf) corpora. See
+[NLP-progress](http://nlpprogress.com/english/named_entity_recognition.html) for
+more results. **1. ** [Qi et al. (2020)](https://arxiv.org/pdf/2003.07082.pdf).
+**2. ** [Akbik et al. (2018)](https://www.aclweb.org/anthology/C18-1139/)
 
 </figcaption>
 
@@ -61,6 +61,25 @@ import Benchmarks from 'usage/\_benchmarks-models.md'
 
 <Benchmarks />
 
+<figure>
+
+| System | UAS | LAS |
+| ------------------------------------------------------------------------------ | ---: | ---: |
+| spaCy RoBERTa (2020) | 96.8 | 95.0 |
+| spaCy CNN (2020) | 93.7 | 91.8 |
+| [Mrini et al.](https://khalilmrini.github.io/Label_Attention_Layer.pdf) (2019) | 97.4 | 96.3 |
+| [Zhou and Zhao](https://www.aclweb.org/anthology/P19-1230/) (2019) | 97.2 | 95.7 |
+
+<figcaption class="caption">
+
+**Accuracy on the Penn Treebank.** See
+[NLP-progress](http://nlpprogress.com/english/dependency_parsing.html) for more
+results.
+
+</figcaption>
+
+</figure>
+
 <Project id="benchmarks/parsing_penn_treebank">
 
 The easiest way to reproduce spaCy's benchmarks on the Penn Treebank is to clone
@@ -297,7 +297,7 @@ const Landing = ({ data }) => {
 to run.
 </p>
 <p>
-<Button to="/usage/facts-figures#benchmarks">See details</Button>
+<Button to="/usage/facts-figures#benchmarks">More results</Button>
 </p>
 </LandingCol>
 