---
title: Trained Models & Pipelines
teaser: Downloadable trained pipelines and weights for spaCy
menu:
- ['Quickstart', 'quickstart']
- ['Conventions', 'conventions']
---
<!-- TODO: include interactive demo -->

### Quickstart {hidden="true"}

> #### 📖 Installation and usage
>
> For more details on how to use trained pipelines with spaCy, see the
> [usage guide](/usage/models).

import QuickstartModels from 'widgets/quickstart-models.js'

<QuickstartModels id="quickstart" />

## Package naming conventions {#conventions}

In general, spaCy expects all pipeline packages to follow the naming convention
of `[lang]_[name]`. For spaCy's pipelines, we also chose to divide the name
into three components:

1. **Type:** Capabilities (e.g. `core` for a general-purpose pipeline with
   vocabulary, syntax, entities and word vectors, or `dep` for only vocabulary
   and syntax).
2. **Genre:** Type of text the pipeline is trained on, e.g. `web` or `news`.
3. **Size:** Package size indicator, `sm`, `md` or `lg`.

For example, [`en_core_web_sm`](/models/en#en_core_web_sm) is a small English
pipeline trained on written web text (blogs, news, comments) that includes
vocabulary, vectors, syntax and entities.
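
As an illustration of the naming convention above, the following is a minimal
sketch that splits a package name into its language, type, genre and size. The
`PackageName` class and `parse_package_name` helper are hypothetical and not
part of spaCy's API, and the size check only covers the three sizes listed in
this section.

```python
from typing import NamedTuple


class PackageName(NamedTuple):
    """Components of a name following [lang]_[type]_[genre]_[size]."""

    lang: str
    pipeline_type: str
    genre: str
    size: str


# Sizes named in this section; treating them as exhaustive is an assumption.
KNOWN_SIZES = {"sm", "md", "lg"}


def parse_package_name(name: str) -> PackageName:
    """Split a package name such as 'en_core_web_sm' into its components.

    Hypothetical helper for illustration only, not a spaCy function.
    """
    parts = name.split("_")
    if len(parts) != 4:
        raise ValueError(
            f"expected [lang]_[type]_[genre]_[size], got {name!r}"
        )
    lang, pipeline_type, genre, size = parts
    if size not in KNOWN_SIZES:
        raise ValueError(f"unknown size {size!r} in {name!r}")
    return PackageName(lang, pipeline_type, genre, size)


parsed = parse_package_name("en_core_web_sm")
print(parsed.lang, parsed.pipeline_type, parsed.genre, parsed.size)
```

Custom packages only need to follow `[lang]_[name]`, so a stricter four-part
check like this one would apply to spaCy's own pipelines rather than to
arbitrary packages.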
### Package versioning {#model-versioning}

Additionally, the pipeline package version reflects both compatibility with
spaCy and the package's own major and minor version. A package version `a.b.c`
translates to:

- `a`: **spaCy major version**. For example, `2` for spaCy v2.x.
- `b`: **Package major version**. Pipelines with a different major version can't
  be loaded by the same code. For example, changing the width of the model,
  adding hidden layers or changing the activation changes the major version.
- `c`: **Package minor version**. Same pipeline structure, but different
  parameter values, e.g. from being trained on different data, for different
  numbers of iterations, etc.

For a detailed compatibility overview, see
[`compatibility.json`](https://github.com/explosion/spacy-models/tree/master/compatibility.json).
This is also the source of spaCy's internal compatibility check, performed when
you run the [`download`](/api/cli#download) command.
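
To make the `a.b.c` scheme above concrete, here is a minimal sketch that splits
a package version into its three parts and checks whether two versions differ
only in their parameter values. The `PackageVersion` class and both helper
functions are hypothetical names chosen for illustration. This is not how
spaCy performs its own compatibility check, which relies on
`compatibility.json`.

```python
from typing import NamedTuple


class PackageVersion(NamedTuple):
    """The three parts of a pipeline package version a.b.c."""

    spacy_major: int    # a: spaCy major version
    package_major: int  # b: changes when the pipeline structure changes
    package_minor: int  # c: same structure, different parameter values


def parse_package_version(version: str) -> PackageVersion:
    """Parse a version string such as '2.3.1' (hypothetical helper)."""
    parts = version.split(".")
    if len(parts) != 3:
        raise ValueError(f"expected a version of the form a.b.c, got {version!r}")
    a, b, c = (int(part) for part in parts)
    return PackageVersion(a, b, c)


def same_structure(first: str, second: str) -> bool:
    """Return True if both versions share `a` and `b`, so only `c` may differ."""
    v1 = parse_package_version(first)
    v2 = parse_package_version(second)
    return (v1.spacy_major, v1.package_major) == (v2.spacy_major, v2.package_major)


print(parse_package_version("2.3.1"))
print(same_structure("2.3.0", "2.3.1"))
print(same_structure("2.2.5", "2.3.0"))
```

The sketch assumes purely numeric version parts. Versions with suffixes would
need a more forgiving parser.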