292 Commits

Author SHA1 Message Date
Aniruddha Adhikary
5a4fc09576 add basic Bengali support 2017-02-28 07:48:37 +06:00
Matthew Honnibal
c744ce4b6d Fix bad change to cythonize.py script, re subprocess call 2017-02-16 19:01:25 +01:00
Matthew Honnibal
0836cbe064 Pass shell to cythonize.py. See Issue #791 2017-02-17 01:06:06 +11:00
Michael Wallin
73f66ec570 Add preliminary support for Finnish 2017-02-04 13:54:10 +02:00
Raphaël Bournhonesque
0c2e5539ce Specify version number for ujson and plac
The required version was specified for plac in requirements.txt but not in setup.py, which could cause a conflicting version error.
Similarly, set the version of ujson in requirements.txt to be the same as in setup.py
2017-01-28 18:38:14 +01:00
Matthew Honnibal
48c712f1c1 Merge branch 'master' of ssh://github.com/explosion/spaCy 2017-01-16 13:18:06 +01:00
Matthew Honnibal
d4e6d4c1c4 Use new thinc 2017-01-16 13:17:14 +01:00
Ines Montani
a308703f47 Remove old tests 2017-01-13 01:34:48 +01:00
Ines Montani
f8803808ce Remove old unused tests and conftest files 2017-01-12 15:09:05 +01:00
Ines Montani
26d018d874 Add tests for StringStore 2017-01-12 15:07:31 +01:00
Ines Montani
ffcaba9017 Remove old and/or redundant tests 2017-01-12 02:10:18 +01:00
Ines Montani
33800c9367 Rename "tokens" tests to "doc" 2017-01-11 18:59:01 +01:00
Matthew Honnibal
c9fdd9917c Require older thinc 2017-01-09 10:12:41 -06:00
Matthew Honnibal
7108ad9d80 Require thinc 6.1 2017-01-09 14:37:00 +01:00
Matthew Honnibal
e4862d1dab Merge branch 'develop' 2017-01-09 13:36:01 +01:00
Ines Montani
d87ca84028 Remove old website example tests from setup.py 2017-01-08 22:42:54 +01:00
Matthew Honnibal
af81ac8bb0 Use thinc 6.0 2016-12-29 11:58:42 +01:00
Gyorgy Orosz
35aa54765d Hungarian module is exposed in spacy. 2016-12-21 20:45:36 +01:00
Magnus Burton
db5a077d2b Initial commit for Swedish 2016-12-20 11:05:06 +01:00
Matthew Honnibal
0c7720e162 Remove unit and integration test packages 2016-12-19 00:26:56 +01:00
Matthew Honnibal
6c0c43c267 Add comment 2016-12-19 00:20:16 +01:00
Matthew Honnibal
b2cebdcca7 List more test packages in the setup.py 2016-12-19 00:15:11 +01:00
Matthew Honnibal
97521c95b3 List the language_data package in the setup.py 2016-12-19 00:14:09 +01:00
dafnevk
d8c7ac203a Added nl module for dutch 2016-11-24 16:39:49 +01:00
Matthew Honnibal
36bcd46244 Integrate patch from @mikepb re building OpenMP-supporting wheels for macOS / OSX. I'm running blind on this, so this commit might not be 100%. Rollback if there are any problems. See Issue #267. 2016-11-06 11:58:50 +01:00
Matthew Honnibal
bc8d04abc0 Package alpha es, fr, it and pt directories. 2016-11-04 20:02:53 +01:00
Adam Ever Hadani
452b766d82 added ujson dependency to setup.py 2016-10-20 14:57:18 -07:00
Matthew Honnibal
b5a74f8ad2 Don't automatically include a data/ directory. 2016-10-20 20:50:32 +02:00
Matthew Honnibal
811dc4da75 Fix setup.py script 2016-10-19 00:27:57 +02:00
Matthew Honnibal
818dc83e26 Fix encoding error in setup.py 2016-10-19 00:05:53 +02:00
Matthew Honnibal
509b30834f Add a pipeline module, to collect and wrap processes for annotation 2016-10-16 01:47:12 +02:00
Matthew Honnibal
53d5bd62ee Add the data/ directory as package data 2016-10-15 14:34:33 +02:00
Matthew Honnibal
2f998f8ed0 Require pathlib 2016-10-13 14:19:57 +02:00
Matthew Honnibal
7c5fe84b80 Require older preshed, for thinc compatibility. 2016-10-09 12:25:53 +02:00
Matthew Honnibal
d61feffe24 Require new preshed 2016-09-30 18:41:01 +02:00
Matthew Honnibal
24337175df * Register zh package in setup.py 2016-05-03 14:36:59 +02:00
Henning Peters
2bf34687ea add stdint.h fallback (vs 2008) 2016-04-28 22:10:43 +02:00
Henning Peters
bb3238bcdd pin numpy to >=1.7, ship headers 2016-04-19 19:50:42 +02:00
Henning Peters
6215272786 remove ujson as default non-dev dependency (still works as fallback if installed), because ujson doesn't ship wheels 2016-04-12 11:28:07 +02:00
Henning Peters
5f699883dd make openmp on windows optional 2016-04-12 10:12:57 +02:00
SJ
91b3f1c12f Enable OpenMP compiler option for MSVC
Enable OpenMP compiler option for MSVC to support Multi-Threading for nlp.pipe()
2016-04-09 15:22:17 -07:00
Henning Peters
29ad621825 add de 2016-04-08 14:52:29 +02:00
Matthew Honnibal
872695759d Merge pull request #306 from wbwseeker/german_noun_chunks
add German noun chunk functionality
2016-04-08 00:54:24 +10:00
Wolfgang Seeker
5e2e8e951a add baseclass DocIterator for iterators over documents
add classes for English and German noun chunks

the respective iterators are set for the document when created by the parser
as they depend on the annotation scheme of the parsing model
2016-03-16 15:53:35 +01:00
Henning Peters
54f3447b5f cleanup 2016-03-14 01:46:33 +01:00
Henning Peters
1fe29c6919 cleanup 2016-03-13 18:12:32 +01:00
Henning Peters
49f499ca1c cleanup 2016-03-12 14:30:24 +01:00
Henning Peters
5701686272 cleanup 2016-03-12 13:47:10 +01:00
Wolfgang Seeker
03fb498dbe introduce lang field for LexemeC to hold language id
put noun_chunk logic into iterators.py for each language separately
2016-03-10 13:01:34 +01:00
Wolfgang Seeker
d9312bc9ea add new files npchunks.{pyx,pxd} to hold noun phrase chunk generators 2016-03-09 16:18:48 +01:00