2014-09-25 20:42:20 +04:00
|
|
|
.. spaCy documentation master file, created by
|
|
|
|
sphinx-quickstart on Tue Aug 19 16:27:38 2014.
|
|
|
|
You can adapt this file completely to your liking, but it should at least
|
|
|
|
contain the root `toctree` directive.
|
|
|
|
|
2015-01-15 23:08:35 +03:00
|
|
|
==============================
|
|
|
|
spaCy: Industrial-strength NLP
|
|
|
|
==============================
|
2014-09-25 20:42:20 +04:00
|
|
|
|
2015-07-29 23:34:10 +03:00
|
|
|
`spaCy`_ is a library for building tomorrow's language technology products.
|
|
|
|
It's like Stanford's CoreNLP for Python, but with a fundamentally different
|
|
|
|
objective. While CoreNLP is primarily built for conducting research, spaCy is
|
|
|
|
designed for application.
|
2015-01-23 15:11:16 +03:00
|
|
|
|
|
|
|
If you're a small company doing NLP, I think spaCy will seem like a minor miracle.
|
2015-01-24 17:06:14 +03:00
|
|
|
It's by far the fastest NLP software ever released.
|
2015-07-29 23:34:10 +03:00
|
|
|
The full processing pipeline completes in under 50ms per document, including accurate
|
|
|
|
tagging, entity recognition and parsing. All strings are mapped to integer IDs,
|
|
|
|
tokens are linked to embedded word representations, and a range of useful features
|
|
|
|
are pre-calculated and cached. The full analysis can be exported to numpy
|
|
|
|
arrays, or losslessly serialized into binary data smaller than the raw text.
|
2015-01-15 23:08:35 +03:00
|
|
|
|
|
|
|
If none of that made any sense to you, here's the gist of it. Computers don't
|
2015-04-19 11:43:46 +03:00
|
|
|
understand text. This is unfortunate, because that's what the web almost entirely
|
2015-01-15 23:08:35 +03:00
|
|
|
consists of. We want to recommend people text based on other text they liked.
|
|
|
|
We want to shorten text to display it on a mobile screen. We want to aggregate
|
|
|
|
it, link it, filter it, categorise it, generate it and correct it.
|
|
|
|
|
2015-01-23 15:11:16 +03:00
|
|
|
spaCy provides a library of utility functions that help programmers build such
|
|
|
|
products. It's commercial open source software: you can either use it under
|
2015-04-19 11:31:31 +03:00
|
|
|
the AGPL, or you can `buy a commercial license`_ for a one-time fee.
|
2015-01-23 15:11:16 +03:00
|
|
|
|
2015-01-23 00:22:00 +03:00
|
|
|
|
2015-07-29 23:34:10 +03:00
|
|
|
.. _spaCy: https://github.com/honnibal/spaCy/
|
2015-01-15 23:08:35 +03:00
|
|
|
|
2015-07-29 23:34:10 +03:00
|
|
|
.. _Issue Tracker: https://github.com/honnibal/spaCy/issues
|
2015-01-15 23:08:35 +03:00
|
|
|
|
2015-07-29 23:34:10 +03:00
|
|
|
**2015-07-08**: `Version 0.89 released`_
|
2015-01-15 23:08:35 +03:00
|
|
|
|
2015-07-29 23:34:10 +03:00
|
|
|
.. _Version 0.89 released: updates.html
|
2015-01-26 01:31:34 +03:00
|
|
|
|
2015-07-29 23:34:10 +03:00
|
|
|
.. _buy a commercial license: license.html
|
2014-12-30 13:20:34 +03:00
|
|
|
|
2014-09-25 20:42:20 +04:00
|
|
|
.. toctree::
|
2015-07-08 18:58:49 +03:00
|
|
|
:maxdepth: 4
|
|
|
|
:hidden:
|
2014-12-01 14:55:13 +03:00
|
|
|
|
2015-01-15 23:08:35 +03:00
|
|
|
quickstart.rst
|
2015-07-08 18:58:49 +03:00
|
|
|
reference/index.rst
|
2015-04-19 11:31:31 +03:00
|
|
|
license.rst
|
2015-01-31 15:05:17 +03:00
|
|
|
updates.rst
|