spaCy/website/docs/usage/_benchmarks-models.md

43 lines
1.6 KiB
Markdown
Raw Normal View History

2020-09-12 18:05:10 +03:00
import { Help } from 'components/typography'; import Link from 'components/link'
<figure>
2020-10-15 18:27:24 +03:00
| Pipeline | Parser | Tagger | NER |
| ---------------------------------------------------------- | -----: | -----: | ---: |
2021-02-09 23:28:33 +03:00
| [`en_core_web_trf`](/models/en#en_core_web_trf) (spaCy v3) | 95.1 | 97.8 | 89.8 |
| [`en_core_web_lg`](/models/en#en_core_web_lg) (spaCy v3) | 92.0 | 97.4 | 85.5 |
2020-10-15 18:27:24 +03:00
| `en_core_web_lg` (spaCy v2) | 91.9 | 97.2 | 85.5 |
2020-09-12 18:05:10 +03:00
<figcaption class="caption">
2021-01-23 02:57:00 +03:00
**Full pipeline accuracy** on the
2020-10-15 12:16:06 +03:00
[OntoNotes 5.0](https://catalog.ldc.upenn.edu/LDC2013T19) corpus (reported on
the development set).
2020-09-12 18:05:10 +03:00
</figcaption>
</figure>
<figure>
2020-10-15 18:27:24 +03:00
| Named Entity Recognition System | OntoNotes | CoNLL '03 |
| -------------------------------- | --------: | --------: |
2021-02-09 23:28:33 +03:00
| spaCy RoBERTa (2020) | 89.8 | 91.6 |
2020-10-15 18:27:24 +03:00
| Stanza (StanfordNLP)<sup>1</sup> | 88.8 | 92.1 |
| Flair<sup>2</sup> | 89.7 | 93.1 |
2020-09-12 18:05:10 +03:00
<figcaption class="caption">
2020-09-23 23:02:31 +03:00
**Named entity recognition accuracy** on the
[OntoNotes 5.0](https://catalog.ldc.upenn.edu/LDC2013T19) and
[CoNLL-2003](https://www.aclweb.org/anthology/W03-0419.pdf) corpora. See
[NLP-progress](http://nlpprogress.com/english/named_entity_recognition.html) for
2020-10-15 09:58:30 +03:00
more results. Project template:
[`benchmarks/ner_conll03`](%%GITHUB_PROJECTS/benchmarks/ner_conll03). **1. **
[Qi et al. (2020)](https://arxiv.org/pdf/2003.07082.pdf). **2. **
2020-10-15 18:27:24 +03:00
[Akbik et al. (2018)](https://www.aclweb.org/anthology/C18-1139/).
2020-09-12 18:05:10 +03:00
</figcaption>
</figure>