Commit Graph

3888 Commits

Author SHA1 Message Date
Gyorgy Orosz
ab2f6ea46c Removed data files from tests.. 2016-12-21 20:22:09 +01:00
Gyorgy Orosz
3d5306acb9 Added further testcases. 2016-12-20 23:49:35 +01:00
Gyorgy Orosz
23956e72ff Improved partial support for tokenizing Hungarian numbers 2016-12-20 23:36:59 +01:00
Gyorgy Orosz
6add156075 Refactored language data structure 2016-12-20 22:28:20 +01:00
Gyorgy Orosz
366b3f8685 Merge branch 'master' into hu_tokenizer 2016-12-20 20:53:31 +01:00
Gyorgy Orosz
c035928156 Partial Hungarian number tokenization is added. 2016-12-20 20:46:20 +01:00
Matthew Honnibal
f3c61ae246 Merge pull request #697 from jmizgajski/patch-1
Fixed missing vec_path declaration that was failing if 'add_vectors' …
2016-12-21 04:32:43 +11:00
JM
70ff0639b5 Fixed missing vec_path declaration that was failing if 'add_vectors' was set
Added vec_path variable declaration to avoid accessing it before assignment in case 'add_vectors' is in overrides.
2016-12-20 18:21:05 +01:00
Ines Montani
9a0748b12d Update CONTRIBUTORS.md 2016-12-20 12:01:33 +01:00
Ines Montani
68abc16c9a Merge pull request #694 from magnusburton/master
Initial commit for Swedish
2016-12-20 12:00:55 +01:00
Magnus Burton
db5a077d2b Initial commit for Swedish 2016-12-20 11:05:06 +01:00
Ines Montani
a2525c76ee Reformat word frequencies section in "adding languages" workflow 2016-12-19 17:18:38 +01:00
Ines Montani
ddf5c5bb61 Generalise dependency parsing annotation specs beyond English (closes #657) 2016-12-19 13:42:44 +01:00
Ines Montani
6a793251c8 Add aside on spaCy's custom pronoun lemma 2016-12-19 13:41:47 +01:00
Ines Montani
d0c15730c4 Fix link 2016-12-19 13:09:45 +01:00
Ines Montani
a9c0e77b80 Fix typo 2016-12-19 13:09:45 +01:00
Matthew Honnibal
0c7720e162 Remove unit and integration test packages 2016-12-19 00:26:56 +01:00
Matthew Honnibal
6c0c43c267 Add comment 2016-12-19 00:20:16 +01:00
Matthew Honnibal
6c7ae55084 Merge branch 'master' of ssh://github.com/explosion/spaCy 2016-12-19 00:15:26 +01:00
Matthew Honnibal
b2cebdcca7 List more test packages in the setup.py 2016-12-19 00:15:11 +01:00
Matthew Honnibal
97521c95b3 List the language_data package in the setup.py 2016-12-19 00:14:09 +01:00
Ines Montani
c773258ebf Update README.rst 2016-12-18 23:54:39 +01:00
Ines Montani
fa65c6b54c Add "Adding languages" workflow (closes #562) 2016-12-18 23:54:19 +01:00
Ines Montani
1cddb7da36 Add "Part-of-speech tagging" workflow (closes #581) 2016-12-18 23:54:19 +01:00
Ines Montani
89398ca57b Bump version 2016-12-18 23:54:19 +01:00
Ines Montani
ac597b58f6 Update showcase 2016-12-18 23:54:18 +01:00
Matthew Honnibal
3f5747a9b2 Merge branch 'master' of ssh://github.com/explosion/spaCy 2016-12-18 23:44:22 +01:00
Matthew Honnibal
40e71586d6 Fix Issue #683: Add 'SP' to tag_map, if it's not there already, within the Morphology class. 2016-12-18 23:44:05 +01:00
Matthew Honnibal
fa1d23e10d Merge branch 'master' of https://github.com/explosion/spaCy 2016-12-18 23:32:03 +01:00
Matthew Honnibal
f38eb25fe1 Fix test for word vector 2016-12-18 23:31:55 +01:00
Matthew Honnibal
4e68abebc4 Merge branch 'master' of ssh://github.com/explosion/spaCy 2016-12-18 23:19:45 +01:00
Matthew Honnibal
5a6328a5a4 Increment version 2016-12-18 23:19:19 +01:00
Matthew Honnibal
13a0b31279 Another tweak to GloVe path hackery. 2016-12-18 23:12:49 +01:00
Matthew Honnibal
2c6228565e Fix vector loading re glove hack 2016-12-18 23:06:44 +01:00
Matthew Honnibal
618b50a064 Fix issue #684: GloVe vectors not loaded in spacy.en.English. 2016-12-18 22:46:31 +01:00
Matthew Honnibal
404019ad2f Fix issue #672: ent_iob_ was a string, not unicode, due to missing unicode_literals statement. 2016-12-18 22:33:53 +01:00
Matthew Honnibal
29fb9db4e8 Merge branch 'master' of ssh://github.com/explosion/spaCy 2016-12-18 22:29:52 +01:00
Matthew Honnibal
2ef9d53117 Untested fix for issue #684: GloVe vectors hack should be inserted in English, not in spacy.load. 2016-12-18 22:29:31 +01:00
Matthew Honnibal
c065359459 Fix path-override bug in spacy.load 2016-12-18 22:15:29 +01:00
Ines Montani
592a1398ed Update CONTRIBUTING.md 2016-12-18 18:09:33 +01:00
Matthew Honnibal
9a7b406b30 Repair travis test script 2016-12-18 18:07:59 +01:00
Matthew Honnibal
a8e1edb85d Disable compilation-based test 2016-12-18 18:06:22 +01:00
Matthew Honnibal
162bca5727 Fix travis script 2016-12-18 18:02:44 +01:00
Matthew Honnibal
5e970307f5 Disable init_model.py test, which will be replaced with new model creation logic. 2016-12-18 17:53:45 +01:00
Ines Montani
614ca6fb41 Split annotation specs into files so they can be included in different places 2016-12-18 17:42:10 +01:00
Ines Montani
ac95779a75 Wrap src mixin in nowrap to prevent line break between text and icon 2016-12-18 17:41:03 +01:00
Ines Montani
6f8b555ab0 Add nowrap utility class 2016-12-18 17:40:30 +01:00
Ines Montani
ce8bf08223 Fix formatting 2016-12-18 17:40:20 +01:00
Ines Montani
5dfeefcb25 Merge pull request #692 from explosion/organize-language-data
Reorganize language data
2016-12-18 17:38:42 +01:00
Matthew Honnibal
813249f826 Work on morphology class. Still not fully consistent with rest of library. 2016-12-18 17:35:22 +01:00