💫 Update website (#3285)
<!--- Provide a general summary of your changes in the title. -->
## Description
The new website is implemented using [Gatsby](https://www.gatsbyjs.org) with [Remark](https://github.com/remarkjs/remark) and [MDX](https://mdxjs.com/). This allows authoring content in **straightforward Markdown** without the usual limitations. Standard elements can be overwritten with powerful [React](http://reactjs.org/) components and wherever Markdown syntax isn't enough, JSX components can be used. Hopefully, this update will also make it much easier to contribute to the docs. Once this PR is merged, I'll implement auto-deployment via [Netlify](https://netlify.com) on a specific branch (to avoid building the website on every PR). There's a bunch of other cool stuff that the new setup will allow us to do – including writing front-end tests, service workers, offline support, implementing a search and so on.
This PR also includes various new docs pages and content.
Resolves #3270. Resolves #3222. Resolves #2947. Resolves #2837.
### Types of change
enhancement
## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2019-02-17 21:31:19 +03:00
|
|
|
|
import React from 'react'
|
|
|
|
|
import PropTypes from 'prop-types'
|
|
|
|
|
import { StaticQuery, graphql } from 'gatsby'
|
|
|
|
|
|
2019-03-12 17:21:58 +03:00
|
|
|
|
import {
|
|
|
|
|
LandingHeader,
|
|
|
|
|
LandingTitle,
|
|
|
|
|
LandingSubtitle,
|
|
|
|
|
LandingGrid,
|
|
|
|
|
LandingCard,
|
2019-03-18 18:24:52 +03:00
|
|
|
|
LandingCol,
|
2019-03-12 17:21:58 +03:00
|
|
|
|
LandingButton,
|
|
|
|
|
LandingDemo,
|
|
|
|
|
LandingBannerGrid,
|
|
|
|
|
LandingBanner,
|
|
|
|
|
LandingLogos,
|
|
|
|
|
} from '../components/landing'
|
💫 Update website (#3285)
<!--- Provide a general summary of your changes in the title. -->
## Description
The new website is implemented using [Gatsby](https://www.gatsbyjs.org) with [Remark](https://github.com/remarkjs/remark) and [MDX](https://mdxjs.com/). This allows authoring content in **straightforward Markdown** without the usual limitations. Standard elements can be overwritten with powerful [React](http://reactjs.org/) components and wherever Markdown syntax isn't enough, JSX components can be used. Hopefully, this update will also make it much easier to contribute to the docs. Once this PR is merged, I'll implement auto-deployment via [Netlify](https://netlify.com) on a specific branch (to avoid building the website on every PR). There's a bunch of other cool stuff that the new setup will allow us to do – including writing front-end tests, service workers, offline support, implementing a search and so on.
This PR also includes various new docs pages and content.
Resolves #3270. Resolves #3222. Resolves #2947. Resolves #2837.
### Types of change
enhancement
## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2019-02-17 21:31:19 +03:00
|
|
|
|
import { H2 } from '../components/typography'
|
|
|
|
|
import { Ul, Li } from '../components/list'
|
|
|
|
|
import Button from '../components/button'
|
|
|
|
|
import Link from '../components/link'
|
|
|
|
|
|
|
|
|
|
import BenchmarksChoi from 'usage/_benchmarks-choi.md'
|
|
|
|
|
|
|
|
|
|
const CODE_EXAMPLE = `# pip install spacy
|
2019-03-22 21:02:15 +03:00
|
|
|
|
# python -m spacy download en_core_web_sm
|
💫 Update website (#3285)
<!--- Provide a general summary of your changes in the title. -->
## Description
The new website is implemented using [Gatsby](https://www.gatsbyjs.org) with [Remark](https://github.com/remarkjs/remark) and [MDX](https://mdxjs.com/). This allows authoring content in **straightforward Markdown** without the usual limitations. Standard elements can be overwritten with powerful [React](http://reactjs.org/) components and wherever Markdown syntax isn't enough, JSX components can be used. Hopefully, this update will also make it much easier to contribute to the docs. Once this PR is merged, I'll implement auto-deployment via [Netlify](https://netlify.com) on a specific branch (to avoid building the website on every PR). There's a bunch of other cool stuff that the new setup will allow us to do – including writing front-end tests, service workers, offline support, implementing a search and so on.
This PR also includes various new docs pages and content.
Resolves #3270. Resolves #3222. Resolves #2947. Resolves #2837.
### Types of change
enhancement
## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2019-02-17 21:31:19 +03:00
|
|
|
|
|
|
|
|
|
import spacy
|
|
|
|
|
|
|
|
|
|
# Load English tokenizer, tagger, parser, NER and word vectors
|
2019-03-22 21:02:15 +03:00
|
|
|
|
nlp = spacy.load("en_core_web_sm")
|
💫 Update website (#3285)
<!--- Provide a general summary of your changes in the title. -->
## Description
The new website is implemented using [Gatsby](https://www.gatsbyjs.org) with [Remark](https://github.com/remarkjs/remark) and [MDX](https://mdxjs.com/). This allows authoring content in **straightforward Markdown** without the usual limitations. Standard elements can be overwritten with powerful [React](http://reactjs.org/) components and wherever Markdown syntax isn't enough, JSX components can be used. Hopefully, this update will also make it much easier to contribute to the docs. Once this PR is merged, I'll implement auto-deployment via [Netlify](https://netlify.com) on a specific branch (to avoid building the website on every PR). There's a bunch of other cool stuff that the new setup will allow us to do – including writing front-end tests, service workers, offline support, implementing a search and so on.
This PR also includes various new docs pages and content.
Resolves #3270. Resolves #3222. Resolves #2947. Resolves #2837.
### Types of change
enhancement
## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2019-02-17 21:31:19 +03:00
|
|
|
|
|
|
|
|
|
# Process whole documents
|
2019-03-22 21:02:15 +03:00
|
|
|
|
text = ("When Sebastian Thrun started working on self-driving cars at "
|
|
|
|
|
"Google in 2007, few people outside of the company took him "
|
|
|
|
|
"seriously. “I can tell you very senior CEOs of major American "
|
|
|
|
|
"car companies would shake my hand and turn away because I wasn’t "
|
|
|
|
|
"worth talking to,” said Thrun, in an interview with Recode earlier "
|
|
|
|
|
"this week.")
|
💫 Update website (#3285)
<!--- Provide a general summary of your changes in the title. -->
## Description
The new website is implemented using [Gatsby](https://www.gatsbyjs.org) with [Remark](https://github.com/remarkjs/remark) and [MDX](https://mdxjs.com/). This allows authoring content in **straightforward Markdown** without the usual limitations. Standard elements can be overwritten with powerful [React](http://reactjs.org/) components and wherever Markdown syntax isn't enough, JSX components can be used. Hopefully, this update will also make it much easier to contribute to the docs. Once this PR is merged, I'll implement auto-deployment via [Netlify](https://netlify.com) on a specific branch (to avoid building the website on every PR). There's a bunch of other cool stuff that the new setup will allow us to do – including writing front-end tests, service workers, offline support, implementing a search and so on.
This PR also includes various new docs pages and content.
Resolves #3270. Resolves #3222. Resolves #2947. Resolves #2837.
### Types of change
enhancement
## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2019-02-17 21:31:19 +03:00
|
|
|
|
doc = nlp(text)
|
|
|
|
|
|
2019-03-22 21:02:15 +03:00
|
|
|
|
# Analyze syntax
|
|
|
|
|
print("Noun phrases:", [chunk.text for chunk in doc.noun_chunks])
|
|
|
|
|
print("Verbs:", [token.lemma_ for token in doc if token.pos_ == "VERB"])
|
|
|
|
|
|
💫 Update website (#3285)
<!--- Provide a general summary of your changes in the title. -->
## Description
The new website is implemented using [Gatsby](https://www.gatsbyjs.org) with [Remark](https://github.com/remarkjs/remark) and [MDX](https://mdxjs.com/). This allows authoring content in **straightforward Markdown** without the usual limitations. Standard elements can be overwritten with powerful [React](http://reactjs.org/) components and wherever Markdown syntax isn't enough, JSX components can be used. Hopefully, this update will also make it much easier to contribute to the docs. Once this PR is merged, I'll implement auto-deployment via [Netlify](https://netlify.com) on a specific branch (to avoid building the website on every PR). There's a bunch of other cool stuff that the new setup will allow us to do – including writing front-end tests, service workers, offline support, implementing a search and so on.
This PR also includes various new docs pages and content.
Resolves #3270. Resolves #3222. Resolves #2947. Resolves #2837.
### Types of change
enhancement
## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2019-02-17 21:31:19 +03:00
|
|
|
|
# Find named entities, phrases and concepts
|
|
|
|
|
for entity in doc.ents:
|
|
|
|
|
print(entity.text, entity.label_)
|
|
|
|
|
`
|
|
|
|
|
|
|
|
|
|
/**
|
|
|
|
|
* Compute the overall total counts of models and languages
|
|
|
|
|
*/
|
|
|
|
|
function getCounts(langs = []) {
|
|
|
|
|
return {
|
|
|
|
|
langs: langs.length,
|
|
|
|
|
modelLangs: langs.filter(({ models }) => models && !!models.length).length,
|
|
|
|
|
models: langs.map(({ models }) => (models ? models.length : 0)).reduce((a, b) => a + b, 0),
|
|
|
|
|
}
|
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
const Landing = ({ data }) => {
|
|
|
|
|
const counts = getCounts(data.languages)
|
|
|
|
|
return (
|
|
|
|
|
<>
|
|
|
|
|
<LandingHeader>
|
|
|
|
|
<LandingTitle>
|
|
|
|
|
Industrial-Strength
|
|
|
|
|
<br />
|
|
|
|
|
Natural Language
|
|
|
|
|
<br />
|
|
|
|
|
Processing
|
|
|
|
|
</LandingTitle>
|
|
|
|
|
<LandingSubtitle>in Python</LandingSubtitle>
|
|
|
|
|
</LandingHeader>
|
|
|
|
|
<LandingGrid blocks>
|
|
|
|
|
<LandingCard title="Fastest in the world">
|
|
|
|
|
<p>
|
|
|
|
|
spaCy excels at large-scale information extraction tasks. It's written from
|
|
|
|
|
the ground up in carefully memory-managed Cython. Independent research has
|
|
|
|
|
confirmed that spaCy is the fastest in the world. If your application needs
|
|
|
|
|
to process entire web dumps, spaCy is the library you want to be using.
|
|
|
|
|
</p>
|
|
|
|
|
<LandingButton to="/usage/facts-figures">Facts & Figures</LandingButton>
|
|
|
|
|
</LandingCard>
|
|
|
|
|
|
|
|
|
|
<LandingCard title="Get things done">
|
|
|
|
|
<p>
|
|
|
|
|
spaCy is designed to help you do real work — to build real products, or
|
|
|
|
|
gather real insights. The library respects your time, and tries to avoid
|
|
|
|
|
wasting it. It's easy to install, and its API is simple and productive. We
|
|
|
|
|
like to think of spaCy as the Ruby on Rails of Natural Language Processing.
|
|
|
|
|
</p>
|
|
|
|
|
<LandingButton to="/usage">Get started</LandingButton>
|
|
|
|
|
</LandingCard>
|
|
|
|
|
|
|
|
|
|
<LandingCard title="Deep learning">
|
|
|
|
|
<p>
|
|
|
|
|
spaCy is the best way to prepare text for deep learning. It interoperates
|
|
|
|
|
seamlessly with TensorFlow, PyTorch, scikit-learn, Gensim and the rest of
|
|
|
|
|
Python's awesome AI ecosystem. With spaCy, you can easily construct
|
|
|
|
|
linguistically sophisticated statistical models for a variety of NLP
|
|
|
|
|
problems.
|
|
|
|
|
</p>
|
|
|
|
|
<LandingButton to="/usage/training">Read more</LandingButton>
|
|
|
|
|
</LandingCard>
|
|
|
|
|
</LandingGrid>
|
|
|
|
|
|
|
|
|
|
<LandingGrid>
|
|
|
|
|
<LandingDemo title="Edit the code & try spaCy">{CODE_EXAMPLE}</LandingDemo>
|
|
|
|
|
|
2019-03-18 18:24:52 +03:00
|
|
|
|
<LandingCol>
|
💫 Update website (#3285)
<!--- Provide a general summary of your changes in the title. -->
## Description
The new website is implemented using [Gatsby](https://www.gatsbyjs.org) with [Remark](https://github.com/remarkjs/remark) and [MDX](https://mdxjs.com/). This allows authoring content in **straightforward Markdown** without the usual limitations. Standard elements can be overwritten with powerful [React](http://reactjs.org/) components and wherever Markdown syntax isn't enough, JSX components can be used. Hopefully, this update will also make it much easier to contribute to the docs. Once this PR is merged, I'll implement auto-deployment via [Netlify](https://netlify.com) on a specific branch (to avoid building the website on every PR). There's a bunch of other cool stuff that the new setup will allow us to do – including writing front-end tests, service workers, offline support, implementing a search and so on.
This PR also includes various new docs pages and content.
Resolves #3270. Resolves #3222. Resolves #2947. Resolves #2837.
### Types of change
enhancement
## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2019-02-17 21:31:19 +03:00
|
|
|
|
<H2>Features</H2>
|
|
|
|
|
<Ul>
|
|
|
|
|
<Li>
|
|
|
|
|
Non-destructive <strong>tokenization</strong>
|
|
|
|
|
</Li>
|
|
|
|
|
<Li>
|
|
|
|
|
<strong>Named entity</strong> recognition
|
|
|
|
|
</Li>
|
|
|
|
|
<Li>
|
|
|
|
|
Support for <strong>{counts.langs}+ languages</strong>
|
|
|
|
|
</Li>
|
|
|
|
|
<Li>
|
|
|
|
|
<strong>{counts.models} statistical models</strong> for{' '}
|
|
|
|
|
{counts.modelLangs} languages
|
|
|
|
|
</Li>
|
|
|
|
|
<Li>
|
|
|
|
|
Pre-trained <strong>word vectors</strong>
|
|
|
|
|
</Li>
|
|
|
|
|
<Li>
|
|
|
|
|
Easy <strong>deep learning</strong> integration
|
|
|
|
|
</Li>
|
|
|
|
|
<Li>Part-of-speech tagging</Li>
|
|
|
|
|
<Li>Labelled dependency parsing</Li>
|
|
|
|
|
<Li>Syntax-driven sentence segmentation</Li>
|
|
|
|
|
<Li>
|
|
|
|
|
Built in <strong>visualizers</strong> for syntax and NER
|
|
|
|
|
</Li>
|
|
|
|
|
<Li>Convenient string-to-hash mapping</Li>
|
|
|
|
|
<Li>Export to numpy data arrays</Li>
|
|
|
|
|
<Li>Efficient binary serialization</Li>
|
|
|
|
|
<Li>
|
|
|
|
|
Easy <strong>model packaging</strong> and deployment
|
|
|
|
|
</Li>
|
|
|
|
|
<Li>State-of-the-art speed</Li>
|
|
|
|
|
<Li>Robust, rigorously evaluated accuracy</Li>
|
|
|
|
|
</Ul>
|
2019-03-18 18:24:52 +03:00
|
|
|
|
</LandingCol>
|
💫 Update website (#3285)
<!--- Provide a general summary of your changes in the title. -->
## Description
The new website is implemented using [Gatsby](https://www.gatsbyjs.org) with [Remark](https://github.com/remarkjs/remark) and [MDX](https://mdxjs.com/). This allows authoring content in **straightforward Markdown** without the usual limitations. Standard elements can be overwritten with powerful [React](http://reactjs.org/) components and wherever Markdown syntax isn't enough, JSX components can be used. Hopefully, this update will also make it much easier to contribute to the docs. Once this PR is merged, I'll implement auto-deployment via [Netlify](https://netlify.com) on a specific branch (to avoid building the website on every PR). There's a bunch of other cool stuff that the new setup will allow us to do – including writing front-end tests, service workers, offline support, implementing a search and so on.
This PR also includes various new docs pages and content.
Resolves #3270. Resolves #3222. Resolves #2947. Resolves #2837.
### Types of change
enhancement
## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2019-02-17 21:31:19 +03:00
|
|
|
|
</LandingGrid>
|
|
|
|
|
|
|
|
|
|
<LandingBannerGrid>
|
|
|
|
|
<LandingBanner
|
|
|
|
|
title="BERT-style language model pretraining and more"
|
|
|
|
|
label="New in v2.1"
|
|
|
|
|
to="/usage/v2-1"
|
|
|
|
|
button="Read more"
|
|
|
|
|
small
|
|
|
|
|
>
|
|
|
|
|
Learn more from small training corpora by initializing your models with{' '}
|
|
|
|
|
<strong>knowledge from raw text</strong>. The new pretrain command teaches
|
|
|
|
|
spaCy's CNN model to predict words based on their context, producing
|
|
|
|
|
representations of words in contexts. If you've seen Google's BERT system or
|
|
|
|
|
fast.ai's ULMFiT, spaCy's pretraining is similar – but much more efficient. It's
|
|
|
|
|
still experimental, but users are already reporting good results, so give it a
|
|
|
|
|
try!
|
|
|
|
|
</LandingBanner>
|
|
|
|
|
|
|
|
|
|
<LandingBanner
|
|
|
|
|
title="Prodigy: Radically efficient machine teaching"
|
|
|
|
|
label="From the makers of spaCy"
|
|
|
|
|
to="https://prodi.gy"
|
|
|
|
|
button="Try it out"
|
|
|
|
|
background="#eee"
|
|
|
|
|
color="#252a33"
|
|
|
|
|
small
|
|
|
|
|
>
|
|
|
|
|
Prodigy is an <strong>annotation tool</strong> so efficient that data scientists
|
|
|
|
|
can do the annotation themselves, enabling a new level of rapid iteration.
|
|
|
|
|
Whether you're working on entity recognition, intent detection or image
|
|
|
|
|
classification, Prodigy can help you <strong>train and evaluate</strong> your
|
|
|
|
|
models faster. Stream in your own examples or real-world data from live APIs,
|
|
|
|
|
update your model in real-time and chain models together to build more complex
|
|
|
|
|
systems.
|
|
|
|
|
</LandingBanner>
|
|
|
|
|
</LandingBannerGrid>
|
|
|
|
|
|
|
|
|
|
<LandingLogos title="spaCy is trusted by" logos={data.logosUsers}>
|
|
|
|
|
<Button to={`https://github.com/${data.repo}/stargazers`}>and many more</Button>
|
|
|
|
|
</LandingLogos>
|
|
|
|
|
<LandingLogos title="Featured on" logos={data.logosPublications} />
|
|
|
|
|
|
|
|
|
|
<LandingBanner
|
|
|
|
|
title="Convolutional neural network models"
|
|
|
|
|
label="New in v2.0"
|
|
|
|
|
button="Download models"
|
|
|
|
|
to="/models"
|
|
|
|
|
>
|
|
|
|
|
spaCy v2.0 features new neural models for <strong>tagging</strong>,{' '}
|
|
|
|
|
<strong>parsing</strong> and <strong>entity recognition</strong>. The models have
|
|
|
|
|
been designed and implemented from scratch specifically for spaCy, to give you an
|
|
|
|
|
unmatched balance of speed, size and accuracy. A novel bloom embedding strategy with
|
|
|
|
|
subword features is used to support huge vocabularies in tiny tables. Convolutional
|
|
|
|
|
layers with residual connections, layer normalization and maxout non-linearity are
|
|
|
|
|
used, giving much better efficiency than the standard BiLSTM solution. Finally, the
|
|
|
|
|
parser and NER use an imitation learning objective to deliver accuracy in-line with
|
|
|
|
|
the latest research systems, even when evaluated from raw text. With these
|
|
|
|
|
innovations, spaCy v2.0's models are <strong>10× smaller</strong>,{' '}
|
|
|
|
|
<strong>20% more accurate</strong>, and
|
|
|
|
|
<strong>even cheaper to run</strong> than the previous generation.
|
|
|
|
|
</LandingBanner>
|
|
|
|
|
|
|
|
|
|
<LandingGrid cols={2}>
|
2019-03-18 18:24:52 +03:00
|
|
|
|
<LandingCol>
|
💫 Update website (#3285)
<!--- Provide a general summary of your changes in the title. -->
## Description
The new website is implemented using [Gatsby](https://www.gatsbyjs.org) with [Remark](https://github.com/remarkjs/remark) and [MDX](https://mdxjs.com/). This allows authoring content in **straightforward Markdown** without the usual limitations. Standard elements can be overwritten with powerful [React](http://reactjs.org/) components and wherever Markdown syntax isn't enough, JSX components can be used. Hopefully, this update will also make it much easier to contribute to the docs. Once this PR is merged, I'll implement auto-deployment via [Netlify](https://netlify.com) on a specific branch (to avoid building the website on every PR). There's a bunch of other cool stuff that the new setup will allow us to do – including writing front-end tests, service workers, offline support, implementing a search and so on.
This PR also includes various new docs pages and content.
Resolves #3270. Resolves #3222. Resolves #2947. Resolves #2837.
### Types of change
enhancement
## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2019-02-17 21:31:19 +03:00
|
|
|
|
<H2>Benchmarks</H2>
|
|
|
|
|
<p>
|
|
|
|
|
In 2015, independent researchers from Emory University and Yahoo! Labs
|
2019-03-22 17:17:35 +03:00
|
|
|
|
showed that spaCy offered the{' '}
|
💫 Update website (#3285)
<!--- Provide a general summary of your changes in the title. -->
## Description
The new website is implemented using [Gatsby](https://www.gatsbyjs.org) with [Remark](https://github.com/remarkjs/remark) and [MDX](https://mdxjs.com/). This allows authoring content in **straightforward Markdown** without the usual limitations. Standard elements can be overwritten with powerful [React](http://reactjs.org/) components and wherever Markdown syntax isn't enough, JSX components can be used. Hopefully, this update will also make it much easier to contribute to the docs. Once this PR is merged, I'll implement auto-deployment via [Netlify](https://netlify.com) on a specific branch (to avoid building the website on every PR). There's a bunch of other cool stuff that the new setup will allow us to do – including writing front-end tests, service workers, offline support, implementing a search and so on.
This PR also includes various new docs pages and content.
Resolves #3270. Resolves #3222. Resolves #2947. Resolves #2837.
### Types of change
enhancement
## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2019-02-17 21:31:19 +03:00
|
|
|
|
<strong>fastest syntactic parser in the world</strong> and that its accuracy
|
|
|
|
|
was <strong>within 1% of the best</strong> available (
|
|
|
|
|
<Link to="https://aclweb.org/anthology/P/P15/P15-1038.pdf">
|
|
|
|
|
Choi et al., 2015
|
|
|
|
|
</Link>
|
|
|
|
|
). spaCy v2.0, released in 2017, is more accurate than any of the systems
|
|
|
|
|
Choi et al. evaluated.
|
|
|
|
|
</p>
|
|
|
|
|
<p>
|
|
|
|
|
<Button to="/usage/facts-figures#benchmarks" large>
|
|
|
|
|
See details
|
|
|
|
|
</Button>
|
|
|
|
|
</p>
|
2019-03-18 18:24:52 +03:00
|
|
|
|
</LandingCol>
|
💫 Update website (#3285)
<!--- Provide a general summary of your changes in the title. -->
## Description
The new website is implemented using [Gatsby](https://www.gatsbyjs.org) with [Remark](https://github.com/remarkjs/remark) and [MDX](https://mdxjs.com/). This allows authoring content in **straightforward Markdown** without the usual limitations. Standard elements can be overwritten with powerful [React](http://reactjs.org/) components and wherever Markdown syntax isn't enough, JSX components can be used. Hopefully, this update will also make it much easier to contribute to the docs. Once this PR is merged, I'll implement auto-deployment via [Netlify](https://netlify.com) on a specific branch (to avoid building the website on every PR). There's a bunch of other cool stuff that the new setup will allow us to do – including writing front-end tests, service workers, offline support, implementing a search and so on.
This PR also includes various new docs pages and content.
Resolves #3270. Resolves #3222. Resolves #2947. Resolves #2837.
### Types of change
enhancement
## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2019-02-17 21:31:19 +03:00
|
|
|
|
|
2019-03-18 18:24:52 +03:00
|
|
|
|
<LandingCol>
|
💫 Update website (#3285)
<!--- Provide a general summary of your changes in the title. -->
## Description
The new website is implemented using [Gatsby](https://www.gatsbyjs.org) with [Remark](https://github.com/remarkjs/remark) and [MDX](https://mdxjs.com/). This allows authoring content in **straightforward Markdown** without the usual limitations. Standard elements can be overwritten with powerful [React](http://reactjs.org/) components and wherever Markdown syntax isn't enough, JSX components can be used. Hopefully, this update will also make it much easier to contribute to the docs. Once this PR is merged, I'll implement auto-deployment via [Netlify](https://netlify.com) on a specific branch (to avoid building the website on every PR). There's a bunch of other cool stuff that the new setup will allow us to do – including writing front-end tests, service workers, offline support, implementing a search and so on.
This PR also includes various new docs pages and content.
Resolves #3270. Resolves #3222. Resolves #2947. Resolves #2837.
### Types of change
enhancement
## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2019-02-17 21:31:19 +03:00
|
|
|
|
<BenchmarksChoi />
|
2019-03-18 18:24:52 +03:00
|
|
|
|
</LandingCol>
|
💫 Update website (#3285)
<!--- Provide a general summary of your changes in the title. -->
## Description
The new website is implemented using [Gatsby](https://www.gatsbyjs.org) with [Remark](https://github.com/remarkjs/remark) and [MDX](https://mdxjs.com/). This allows authoring content in **straightforward Markdown** without the usual limitations. Standard elements can be overwritten with powerful [React](http://reactjs.org/) components and wherever Markdown syntax isn't enough, JSX components can be used. Hopefully, this update will also make it much easier to contribute to the docs. Once this PR is merged, I'll implement auto-deployment via [Netlify](https://netlify.com) on a specific branch (to avoid building the website on every PR). There's a bunch of other cool stuff that the new setup will allow us to do – including writing front-end tests, service workers, offline support, implementing a search and so on.
This PR also includes various new docs pages and content.
Resolves #3270. Resolves #3222. Resolves #2947. Resolves #2837.
### Types of change
enhancement
## Checklist
<!--- Before you submit the PR, go over this checklist and make sure you can
tick off all the boxes. [] -> [x] -->
- [x] I have submitted the spaCy Contributor Agreement.
- [x] I ran the tests, and all new and existing tests passed.
- [x] My changes don't require a change to the documentation, or if they do, I've added all required information.
2019-02-17 21:31:19 +03:00
|
|
|
|
</LandingGrid>
|
|
|
|
|
</>
|
|
|
|
|
)
|
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
Landing.propTypes = {
|
|
|
|
|
data: PropTypes.shape({
|
|
|
|
|
repo: PropTypes.string,
|
|
|
|
|
languages: PropTypes.arrayOf(
|
|
|
|
|
PropTypes.shape({
|
|
|
|
|
models: PropTypes.arrayOf(PropTypes.string),
|
|
|
|
|
})
|
|
|
|
|
),
|
|
|
|
|
logosUsers: PropTypes.arrayOf(
|
|
|
|
|
PropTypes.shape({
|
|
|
|
|
id: PropTypes.string.isRequired,
|
|
|
|
|
url: PropTypes.string.isRequired,
|
|
|
|
|
})
|
|
|
|
|
),
|
|
|
|
|
logosPublications: PropTypes.arrayOf(
|
|
|
|
|
PropTypes.shape({
|
|
|
|
|
id: PropTypes.string.isRequired,
|
|
|
|
|
url: PropTypes.string.isRequired,
|
|
|
|
|
})
|
|
|
|
|
),
|
|
|
|
|
}),
|
|
|
|
|
}
|
|
|
|
|
|
|
|
|
|
export default () => (
|
|
|
|
|
<StaticQuery query={landingQuery} render={({ site }) => <Landing data={site.siteMetadata} />} />
|
|
|
|
|
)
|
|
|
|
|
|
|
|
|
|
const landingQuery = graphql`
|
|
|
|
|
query LandingQuery {
|
|
|
|
|
site {
|
|
|
|
|
siteMetadata {
|
|
|
|
|
repo
|
|
|
|
|
languages {
|
|
|
|
|
models
|
|
|
|
|
}
|
|
|
|
|
logosUsers {
|
|
|
|
|
id
|
|
|
|
|
url
|
|
|
|
|
}
|
|
|
|
|
logosPublications {
|
|
|
|
|
id
|
|
|
|
|
url
|
|
|
|
|
}
|
|
|
|
|
}
|
|
|
|
|
}
|
|
|
|
|
}
|
|
|
|
|
`
|