Ines Montani
26d018d874
Add tests for StringStore
2017-01-12 15:07:31 +01:00
Ines Montani
ffcaba9017
Remove old and/or redundant tests
2017-01-12 02:10:18 +01:00
Ines Montani
33800c9367
Rename "tokens" tests to "doc"
2017-01-11 18:59:01 +01:00
Matthew Honnibal
c9fdd9917c
Require older thinc
2017-01-09 10:12:41 -06:00
Matthew Honnibal
7108ad9d80
Require thinc 6.1
2017-01-09 14:37:00 +01:00
Matthew Honnibal
e4862d1dab
Merge branch 'develop'
2017-01-09 13:36:01 +01:00
Ines Montani
d87ca84028
Remove old website example tests from setup.py
2017-01-08 22:42:54 +01:00
Matthew Honnibal
af81ac8bb0
Use thinc 6.0
2016-12-29 11:58:42 +01:00
Gyorgy Orosz
35aa54765d
Hungarian module is exposed in spacy.
2016-12-21 20:45:36 +01:00
Magnus Burton
db5a077d2b
Initial commit for Swedish
2016-12-20 11:05:06 +01:00
Matthew Honnibal
0c7720e162
Remove unit and integration test packages
2016-12-19 00:26:56 +01:00
Matthew Honnibal
6c0c43c267
Add comment
2016-12-19 00:20:16 +01:00
Matthew Honnibal
b2cebdcca7
List more test packages in the setup.py
2016-12-19 00:15:11 +01:00
Matthew Honnibal
97521c95b3
List the language_data package in the setup.py
2016-12-19 00:14:09 +01:00
dafnevk
d8c7ac203a
Added nl module for dutch
2016-11-24 16:39:49 +01:00
Matthew Honnibal
36bcd46244
Integrate patch from @mikepb re building OpenMP-supporting wheels for macOS / OSX. I'm running blind on this, so this commit might not be 100%. Rollback if there are any problems. See Issue #267 .
2016-11-06 11:58:50 +01:00
Matthew Honnibal
bc8d04abc0
Package alpha es, fr, it and pt directories.
2016-11-04 20:02:53 +01:00
Adam Ever Hadani
452b766d82
added ujson dependency to setup.py
2016-10-20 14:57:18 -07:00
Matthew Honnibal
b5a74f8ad2
Don't automatically include a data/ directory.
2016-10-20 20:50:32 +02:00
Matthew Honnibal
811dc4da75
Fix setup.py script
2016-10-19 00:27:57 +02:00
Matthew Honnibal
818dc83e26
Fix encoding error in setup.py
2016-10-19 00:05:53 +02:00
Matthew Honnibal
509b30834f
Add a pipeline module, to collect and wrap processes for annotation
2016-10-16 01:47:12 +02:00
Matthew Honnibal
53d5bd62ee
Add the data/ directory as package data
2016-10-15 14:34:33 +02:00
Matthew Honnibal
2f998f8ed0
Require pathlib
2016-10-13 14:19:57 +02:00
Matthew Honnibal
7c5fe84b80
Require older preshed, for thinc compatibility.
2016-10-09 12:25:53 +02:00
Matthew Honnibal
d61feffe24
Require new preshed
2016-09-30 18:41:01 +02:00
Matthew Honnibal
24337175df
* Register zh package in setup.py
2016-05-03 14:36:59 +02:00
Henning Peters
2bf34687ea
add stdint.h fallback (vs 2008)
2016-04-28 22:10:43 +02:00
Henning Peters
bb3238bcdd
pin numpy to >=1.7, ship headers
2016-04-19 19:50:42 +02:00
Henning Peters
6215272786
remove ujson as default non-dev dependency (still works as fallback if installed), because ujson doesn't ship wheels
2016-04-12 11:28:07 +02:00
Henning Peters
5f699883dd
make openmp on windows optional
2016-04-12 10:12:57 +02:00
SJ
91b3f1c12f
Enable OpenMP compiler option for MSVC
...
Enable OpenMP compiler option for MSVC to support Multi-Threading for nlp.pipe()
2016-04-09 15:22:17 -07:00
Henning Peters
29ad621825
add de
2016-04-08 14:52:29 +02:00
Matthew Honnibal
872695759d
Merge pull request #306 from wbwseeker/german_noun_chunks
...
add German noun chunk functionality
2016-04-08 00:54:24 +10:00
Wolfgang Seeker
5e2e8e951a
add baseclass DocIterator for iterators over documents
...
add classes for English and German noun chunks
the respective iterators are set for the document when created by the parser
as they depend on the annotation scheme of the parsing model
2016-03-16 15:53:35 +01:00
Henning Peters
54f3447b5f
cleanup
2016-03-14 01:46:33 +01:00
Henning Peters
1fe29c6919
cleanup
2016-03-13 18:12:32 +01:00
Henning Peters
49f499ca1c
cleanup
2016-03-12 14:30:24 +01:00
Henning Peters
5701686272
cleanup
2016-03-12 13:47:10 +01:00
Wolfgang Seeker
03fb498dbe
introduce lang field for LexemeC to hold language id
...
put noun_chunk logic into iterators.py for each language separately
2016-03-10 13:01:34 +01:00
Wolfgang Seeker
d9312bc9ea
add new files npchunks.{pyx,pxd} to hold noun phrase chunk generators
2016-03-09 16:18:48 +01:00
Henning Peters
5b3b3ebc8e
upgrade to latest sputnik
2016-03-08 15:30:17 +01:00
Matthew Honnibal
fcaa0ad7ce
Merge pull request #280 from wbwseeker/german_parser
...
German parser
2016-03-04 03:27:42 +11:00
Wolfgang Seeker
3448cb40a4
integrated pseudo-projective parsing into parser
...
- nonproj.pyx holds a class PseudoProjectivity which currently holds
all functionality to implement Nivre & Nilsson 2005's pseudo-projective
parsing using the HEAD decoration scheme
- changed lefts/rights in Token to account for possible non-projective
structures
2016-03-01 10:09:08 +01:00
Henning Peters
12d58a7099
remove text-unidecode dependency
2016-02-24 08:01:59 +01:00
Henning Peters
9cc4f8d5b3
avoid shadowing __name__
2016-02-15 01:33:39 +01:00
Henning Peters
4c9e3c7911
upgrade spuntik, enforce data api via model version constraints
2016-02-14 16:03:17 +01:00
Henning Peters
3b5f1e753b
py26 compatibility
2016-02-10 14:32:54 +01:00
Henning Peters
c00dd43fe0
add sun data
2016-02-09 16:42:55 +01:00
Matthew Honnibal
860fd11e98
* Don't import include files --- use the repository
2016-02-06 23:59:47 +01:00