mirror of
https://github.com/explosion/spaCy.git
synced 2024-12-25 17:36:30 +03:00
* Announce v0.87 in docs
This commit is contained in:
parent
d8ef2d6b61
commit
17792bb4e3
|
@ -10,11 +10,9 @@ spaCy: Industrial-strength NLP
|
|||
|
||||
.. _Issue Tracker: https://github.com/honnibal/spaCy/issues
|
||||
|
||||
**2015-06-24**: `Version 0.86 released`_
|
||||
**2015-07-01**: `Version 0.87 released`_
|
||||
|
||||
.. _Version 0.86 released: updates.html
|
||||
|
||||
`spaCy`_
|
||||
.. _Version 0.87 released: updates.html
|
||||
|
||||
`spaCy`_ is a new library for text processing in Python and Cython.
|
||||
I wrote it because I think small companies are terrible at
|
||||
|
|
|
@ -10,6 +10,19 @@ To update your installation:
|
|||
|
||||
Most updates ship a new model, so you will usually have to redownload the data.
|
||||
|
||||
2015-07-01 v0.87
|
||||
----------------
|
||||
|
||||
* Changed weights data structure. Memory use should be reduced 30-40%.
|
||||
* Fixed speed regressions introduced in the last few versions.
|
||||
* Models should now be slightly more robust to noise in the input text, as I'm
|
||||
now training on data with a small amount of noise added, e.g. I randomly corrupt
|
||||
capitalization, swap spaces for newlines, etc. This is bringing a small
|
||||
benefit on out-of-domain data. I think this strategy could yield better
|
||||
results with a better noise-generation function. If you think you have a good
|
||||
way to make clean text resemble the kind of noisy input you're seeing in your
|
||||
domain, get in touch.
|
||||
|
||||
2015-06-24 v0.86
|
||||
----------------
|
||||
|
||||
|
|
Loading…
Reference in New Issue
Block a user