From 17792bb4e3c911e6f99a5356452b296a3f8a001a Mon Sep 17 00:00:00 2001
From: Matthew Honnibal
Date: Wed, 1 Jul 2015 15:36:41 +0200
Subject: [PATCH] * Announce v0.87 in docs

---
 docs/source/index.rst   |  6 ++----
 docs/source/updates.rst | 13 +++++++++++++
 2 files changed, 15 insertions(+), 4 deletions(-)

diff --git a/docs/source/index.rst b/docs/source/index.rst
index cba745ba1..d323a576e 100644
--- a/docs/source/index.rst
+++ b/docs/source/index.rst
@@ -10,11 +10,9 @@ spaCy: Industrial-strength NLP
 
 .. _Issue Tracker: https://github.com/honnibal/spaCy/issues
 
-**2015-06-24**: `Version 0.86 released`_
+**2015-07-01**: `Version 0.87 released`_
 
-.. _Version 0.86 released: updates.html
-
-`spaCy`_
+.. _Version 0.87 released: updates.html
 
 `spaCy`_ is a new library for text processing in Python and Cython.
 I wrote it because I think small companies are terrible at
diff --git a/docs/source/updates.rst b/docs/source/updates.rst
index 43be93428..c21b9c984 100644
--- a/docs/source/updates.rst
+++ b/docs/source/updates.rst
@@ -10,6 +10,19 @@ To update your installation:
 
 Most updates ship a new model, so you will usually have to redownload the data.
 
+2015-07-01 v0.87
+----------------
+
+* Changed weights data structure. Memory use should be reduced 30-40%.
+* Fixed speed regressions introduced in the last few versions.
+* Models should now be slightly more robust to noise in the input text, as I'm
+  now training on data with a small amount of noise added, e.g. I randomly corrupt
+  capitalization, swap spaces for newlines, etc. This is bringing a small
+  benefit on out-of-domain data. I think this strategy could yield better
+  results with a better noise-generation function. If you think you have a good
+  way to make clean text resemble the kind of noisy input you're seeing in your
+  domain, get in touch.
+
 2015-06-24 v0.86
 ----------------
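
Note on the noise strategy described in the v0.87 entry: the patch does not include the noise-generation function itself, so the following is only a rough sketch of what "randomly corrupt capitalization, swap spaces for newlines" might look like. The name ``add_noise``, the ``rate`` parameter and its default value are all assumptions made for illustration; this is not spaCy's training code.

```python
import random


def add_noise(text, rate=0.05, rng=None):
    """Return a copy of `text` with a small amount of random noise.

    Hypothetical helper for illustration only, not part of spaCy.
    Each character is corrupted independently with probability `rate`:
    spaces become newlines and letters have their case flipped.
    """
    if rng is None:
        rng = random.Random()
    out = []
    for ch in text:
        if rng.random() < rate:
            if ch == " ":
                ch = "\n"           # swap a space for a newline
            elif ch.isalpha():
                ch = ch.swapcase()  # corrupt capitalization
        out.append(ch)
    return "".join(out)


if __name__ == "__main__":
    # Seeded generator so repeated runs corrupt the same positions.
    sample = "Most updates ship a new model."
    print(repr(add_noise(sample, rate=0.2, rng=random.Random(0))))
```

Passing a seeded ``random.Random`` keeps the corruption reproducible between runs. A domain-specific version would presumably replace the two branches with whatever errors actually occur in the target text, which is the kind of better noise-generation function the entry asks readers to suggest.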