Sofie Van Landeghem
9d3ce7cba2
Ensure training doesn't crash with empty batches (#4360)
* unit test for previously resolved unflatten issue
* prevent batch of empty docs from causing problems
2019-10-02 12:50:47 +02:00
Ines Montani
52b5912dbf
Tidy up [ci skip]
2019-10-02 12:05:59 +02:00
adrianeboyd
d82241218a
Make the default NER labels less model-specific [ci skip] (#4361)
2019-10-02 12:05:17 +02:00
adrianeboyd
dda86118bd
Update Ukrainian lemmatizer with new lookups (#4359)
* Update Ukrainian lemmatizer with new lookups
* Add missing import
Co-authored-by: Ines Montani <ines@ines.io>
2019-10-02 12:04:06 +02:00
Ines Montani
b6670bf0c2
Use consistent spelling
2019-10-02 10:37:39 +02:00
Ines Montani
208629615d
Auto-format
2019-10-02 10:37:04 +02:00
Ines Montani
867e93aae2
Add Streamlit example [ci skip]
2019-10-02 01:21:20 +02:00
Matthew Honnibal
38b6e69389
Merge branch 'master' of https://github.com/explosion/spaCy
2019-10-01 22:28:25 +02:00
Matthew Honnibal
d4b63bb6dd
Set version to v2.2.0
2019-10-01 22:28:13 +02:00
Ines Montani
9885b5ae68
Update spacy_lookups_data version [ci skip]
2019-10-01 22:21:21 +02:00
Ines Montani
475e3188ce
Add docs on filtering overlapping spans for merging (resolves #4352) [ci skip]
2019-10-01 21:59:50 +02:00
Matthew Honnibal
667f294627
Merge branch 'master' of https://github.com/explosion/spaCy
2019-10-01 21:37:25 +02:00
Ines Montani
0dd127bb00
Update v2-2.md [ci skip]
2019-10-01 21:37:06 +02:00
Matthew Honnibal
64a9577d43
Set version to v2.2.0.dev17
2019-10-01 21:36:59 +02:00
Ines Montani
cf65a80f36
Refactor lemmatizer and data table integration (#4353)
* Move test
* Allow default in Lookups.get_table
* Start with blank tables in Lookups.from_bytes
* Refactor lemmatizer to hold instance of Lookups
* Get lookups table within the lemmatization methods to make sure it references the correct table (even if the table was replaced or modified, e.g. when loading a model from disk)
* Deprecate other arguments on Lemmatizer.__init__ and expect Lookups for consistency
* Remove old and unsupported Lemmatizer.load classmethod
* Refactor language-specific lemmatizers to inherit as much as possible from base class and override only what they need
* Update tests and docs
* Fix more tests
* Fix lemmatizer
* Upgrade pytest to try and fix weird CI errors
* Try pytest 4.6.5
2019-10-01 21:36:03 +02:00
Ines Montani
3297a19545
Warn in Tagger.begin_training if no lemma tables are available (#4351)
2019-10-01 15:13:55 +02:00
Ines Montani
bc7e7db208
Fix wording [ci skip]
2019-10-01 14:20:44 +02:00
Ines Montani
2a3a4565cd
Update infobox [ci skip]
2019-10-01 14:19:34 +02:00
Ines Montani
66aa0d479f
Update v2.2 page [ci skip]
2019-10-01 14:11:05 +02:00
Ines Montani
a8a1800f2a
Update lemma data documentation [ci skip]
2019-10-01 13:22:13 +02:00
Ines Montani
932ad9cb91
Fix typos and formatting [ci skip]
2019-10-01 12:30:04 +02:00
Ines Montani
ca0b20ae8b
Make prereleases less verbose [ci skip]
2019-10-01 12:29:14 +02:00
Matthew Honnibal
2fb05482dd
Set version to v2.2.0
2019-10-01 03:50:13 +02:00
Matthew Honnibal
dc22ec0aad
Set version to v2.2.0.dev17
2019-10-01 03:26:53 +02:00
Matthew Honnibal
377008bae2
Fix sdist for fabfile
2019-10-01 02:44:10 +02:00
Matthew Honnibal
91978a4de0
Merge branch 'master' of https://github.com/explosion/spaCy
2019-10-01 00:31:08 +02:00
Matthew Honnibal
aedfba867a
Set version to v2.2.0.dev16
2019-10-01 00:31:00 +02:00
Ines Montani
30d872011d
Merge branch 'master' of https://github.com/explosion/spaCy
2019-10-01 00:25:48 +02:00
Ines Montani
75b8021a86
Move setup requirements to setup.cfg
2019-10-01 00:25:46 +02:00
Ines Montani
e0cf4796a5
Move lookup tables out of the core library (#4346)
* Add default to util.get_entry_point
* Tidy up entry points
* Read lookups from entry points
* Remove lookup tables and related tests
* Add lookups install option
* Remove lemmatizer tests
* Remove logic to process language data files
* Update setup.cfg
2019-10-01 00:01:27 +02:00
Rahul Soni
ed620daa5c
Fix example sentences in Hindi for grammatical errors (#4343)
* Fix grammar for Hindi
* Fix grammar for Hindi
* Submit contributor agreement
2019-09-30 23:32:49 +02:00
Ines Montani
ba186299e1
Tidy up and modernize setup and config (#4344)
* Tidy up and modernize setup and config
* Update setup.cfg
* Re-add pyproject.toml
* Delete .flake8
* Move static meta from about to setup.cfg
* Update setup.cfg
Co-Authored-By: Matthew Honnibal <honnibal+gh@gmail.com>
2019-09-30 20:10:55 +02:00
Ines Montani
4f905ac9e6
Add test for ASCII filenames (#4345)
2019-09-30 18:45:30 +02:00
Ines Montani
69c674bdbf
Update MANIFEST.in
2019-09-30 16:33:07 +02:00
Ines Montani
b8eca6cd11
Update MANIFEST.in
2019-09-30 16:27:12 +02:00
Matthew Honnibal
512e2208dc
Include .txt files
2019-09-30 15:24:25 +02:00
Matthew Honnibal
7735b7ec59
Merge branch 'master' of https://github.com/explosion/spaCy
2019-09-30 14:56:59 +02:00
Matthew Honnibal
84837c1680
Use include_package_data in setup.py
2019-09-30 14:56:44 +02:00
Ines Montani
31cebf66a8
Update universe.json
2019-09-30 13:50:08 +02:00
Ines Montani
61263e2fbc
Update universe.json [ci skip]
2019-09-30 13:49:44 +02:00
Ines Montani
88fee1a768
Update models.js
2019-09-30 13:22:17 +02:00
Ines Montani
7c701784e5
Update models.js
2019-09-30 13:17:48 +02:00
Ines Montani
06d8c3a20f
Revert "Merge branch 'master' into spacy.io"
This reverts commit c8bb08b545, reversing changes made to b6a509a8d1.
2019-09-30 13:14:48 +02:00
Matthew Honnibal
b5c775dd42
Set version to v2.2.0
2019-09-30 12:47:08 +02:00
Ines Montani
f7d1736241
Skip duplicate spans in Doc.retokenize (#4339)
2019-09-30 12:43:48 +02:00
Ines Montani
c8bb08b545
Merge branch 'master' into spacy.io
2019-09-30 12:01:18 +02:00
Ines Montani
71bd040834
Update models.js [ci skip]
2019-09-30 12:01:09 +02:00
Ines Montani
9cbaf9b965
Update azure-pipelines.yml
2019-09-29 20:19:29 +02:00
Ines Montani
6a5433236b
Update azure-pipelines.yml
2019-09-29 19:22:09 +02:00
Ines Montani
c72c8794dd
Update CONTRIBUTING.md [ci skip]
2019-09-29 18:37:22 +02:00