Commit Graph

303 Commits

Author SHA1 Message Date
Henning Peters
5f699883dd make openmp on windows optional 2016-04-12 10:12:57 +02:00
SJ
91b3f1c12f Enable OpenMP compiler option for MSVC
Enable OpenMP compiler option for MSVC to support Multi-Threading for nlp.pipe()
2016-04-09 15:22:17 -07:00
Henning Peters
29ad621825 add de 2016-04-08 14:52:29 +02:00
Matthew Honnibal
872695759d Merge pull request #306 from wbwseeker/german_noun_chunks
add German noun chunk functionality
2016-04-08 00:54:24 +10:00
Wolfgang Seeker
5e2e8e951a add baseclass DocIterator for iterators over documents
add classes for English and German noun chunks

the respective iterators are set for the document when created by the parser
as they depend on the annotation scheme of the parsing model
2016-03-16 15:53:35 +01:00
Henning Peters
54f3447b5f cleanup 2016-03-14 01:46:33 +01:00
Henning Peters
1fe29c6919 cleanup 2016-03-13 18:12:32 +01:00
Henning Peters
49f499ca1c cleanup 2016-03-12 14:30:24 +01:00
Henning Peters
5701686272 cleanup 2016-03-12 13:47:10 +01:00
Wolfgang Seeker
03fb498dbe introduce lang field for LexemeC to hold language id
put noun_chunk logic into iterators.py for each language separately
2016-03-10 13:01:34 +01:00
Wolfgang Seeker
d9312bc9ea add new files npchunks.{pyx,pxd} to hold noun phrase chunk generators 2016-03-09 16:18:48 +01:00
Henning Peters
5b3b3ebc8e upgrade to latest sputnik 2016-03-08 15:30:17 +01:00
Matthew Honnibal
fcaa0ad7ce Merge pull request #280 from wbwseeker/german_parser
German parser
2016-03-04 03:27:42 +11:00
Wolfgang Seeker
3448cb40a4 integrated pseudo-projective parsing into parser
- nonproj.pyx holds a class PseudoProjectivity which currently holds
  all functionality to implement Nivre & Nilsson 2005's pseudo-projective
  parsing using the HEAD decoration scheme
- changed lefts/rights in Token to account for possible non-projective
  structures
2016-03-01 10:09:08 +01:00
Henning Peters
12d58a7099 remove text-unidecode dependency 2016-02-24 08:01:59 +01:00
Henning Peters
9cc4f8d5b3 avoid shadowing __name__ 2016-02-15 01:33:39 +01:00
Henning Peters
4c9e3c7911 upgrade spuntik, enforce data api via model version constraints 2016-02-14 16:03:17 +01:00
Henning Peters
3b5f1e753b py26 compatibility 2016-02-10 14:32:54 +01:00
Henning Peters
c00dd43fe0 add sun data 2016-02-09 16:42:55 +01:00
Matthew Honnibal
860fd11e98 * Don't import include files --- use the repository 2016-02-06 23:59:47 +01:00
Matthew Honnibal
8bd16ce8f7 * Try to fix win32 compilation 2016-02-05 14:43:52 +01:00
Matthew Honnibal
add8f07f61 * Conditionally link against openmp, on not-darwin 2016-02-05 12:19:51 +01:00
Matthew Honnibal
c9aa91041d * Don't expect openmp in options 2016-02-02 13:50:25 +01:00
Matthew Honnibal
490ba65398 * Use openmp in parser 2016-02-01 03:08:42 +01:00
Matthew Honnibal
9c34ca9e5d * Add _stack to mod_names 2016-02-01 03:00:53 +01:00
Matthew Honnibal
bc0f0d284c * Require different thinc version 2016-01-30 20:29:24 +01:00
Henning Peters
65aeac24cb remove package version constraint 2016-01-21 17:40:51 +01:00
Henning Peters
211913d689 add about.py, adapt setup.py 2016-01-15 18:57:01 +01:00
Henning Peters
ccd87ad7fb add default_model to about 2016-01-15 18:12:01 +01:00
Henning Peters
780cb847c9 add default_model to about 2016-01-15 18:07:15 +01:00
Henning Peters
788f734513 refactored data_dir->via, add zip_safe, add spacy.load() 2016-01-15 18:01:02 +01:00
Henning Peters
bc229790ac integrate with sputnik 2016-01-13 19:46:17 +01:00
Matthew Honnibal
e38205a838 * Pin versions to ranges, to escape version lock 2015-12-31 02:09:55 +01:00
Henning Peters
1c4352c42e bump version 2015-12-28 13:53:26 +01:00
Henning Peters
a404bfec38 bump preshed version 2015-12-22 22:38:25 +01:00
Henning Peters
46fe3a7327 bump thinc version 2015-12-22 13:21:24 +01:00
Henning Peters
1643e63c31 bump preshed version 2015-12-22 11:23:25 +01:00
Henning Peters
4a1d843682 bump murmurhash version 2015-12-21 21:59:11 +01:00
Henning Peters
74dc02a0e6 fix windows readme 2015-12-21 21:58:53 +01:00
Henning Peters
c17ce6c119 (re-)include cython sources, murmurhash header discovery 2015-12-21 12:40:44 +01:00
Henning Peters
b667020e81 refactor setup.py 2015-12-13 23:39:29 +01:00
Henning Peters
4f4b1d8f3d refactor setup.py 2015-12-13 23:32:23 +01:00
Henning Peters
eaadca2bf2 get buildbot running 2015-12-13 14:13:46 +01:00
Henning Peters
73674a4afb try using system-wide headers 2015-12-13 12:51:23 +01:00
Henning Peters
b2f66f7b8d try using system-wide headers 2015-12-13 12:45:30 +01:00
Henning Peters
63d74ae8f3 try using system-wide headers 2015-12-13 12:41:46 +01:00
Henning Peters
92fabd0114 wrap virtualenv around cythonize 2015-12-13 12:32:22 +01:00
Henning Peters
ac318b568c new approach to dependency headers 2015-12-13 11:49:17 +01:00
Matthew Honnibal
65413ad7b3 Merge pull request #186 from henningpeters/master
website build was broken for me, fixed it
2015-11-29 15:36:52 +11:00
Henning Peters
abe6162e7b avoid redirect 2015-11-24 20:01:43 +01:00
Henning Peters
4e98ea4e41 bump version 2015-11-21 19:04:57 +01:00
Matthew Honnibal
d8c52560d1 Merge branch 'master' of ssh://github.com/honnibal/spaCy 2015-11-19 11:00:11 +01:00
Matthew Honnibal
44e563d4e5 * Pin version of murmurhash 2015-11-19 10:59:51 +01:00
Matthew Honnibal
73d47c3010 Merge pull request #185 from henningpeters/sputnik
integrate sputnik
2015-11-19 20:59:09 +11:00
Matthew Honnibal
1e166eb9cd * Upgrade spacy version 2015-11-18 17:42:56 +01:00
Henning Peters
919a4f0b04 change data path, add repository 2015-11-18 11:40:46 +01:00
Henning Peters
12de895e60 fix version 2015-11-15 16:38:16 +01:00
Matthew Honnibal
6dd37c5ee4 * Fix requirement of preshed 2015-11-08 18:09:21 +01:00
Matthew Honnibal
f9d20b1318 * Require updated thinc 2015-11-08 21:32:21 +11:00
Matthew Honnibal
3c162dcac3 * Refactor away from the _ml module, to use thinc 4.0. Still some work needs to be done, e.g. to add __reduce__ to the models, more testing, etc. 2015-11-07 03:24:30 +11:00
Matthew Honnibal
c339783bbe * Fix reference to tests.span in setup 2015-11-07 03:23:14 +11:00
Matthew Honnibal
802ad3d71a * Avoid compiling theano module for now 2015-11-06 00:24:43 +11:00
Matthew Honnibal
3ddea19b2b * Rename spans.pyx to span.pyx 2015-11-04 00:14:40 +11:00
Matthew Honnibal
9482d616bc * Rename spans.pyx to span.pyx 2015-11-03 23:51:05 +11:00
Matthew Honnibal
f81389abe0 * Pin to specific cymem, preshed and thinc versions. 2015-11-03 23:12:13 +11:00
Matthew Honnibal
7adef3f831 * Increment version 2015-11-03 07:58:59 +01:00
Matthew Honnibal
64531d5a3a * Define package_data in one place 2015-11-03 17:07:43 +11:00
Matthew Honnibal
5ca31e05fb * Prune down package data, as models are distributed entirely within the data download. 2015-11-03 13:30:37 +11:00
Matthew Honnibal
f56209ef2e * Update requirements 2015-11-03 02:40:01 +11:00
Matthew Honnibal
09e0b15629 * Package tests, for distriution in PyPi 2015-10-26 00:30:33 +11:00
Matthew Honnibal
b0ba534d4a * Fix license descriptor in setup.py 2015-10-26 00:16:37 +11:00
Matthew Honnibal
9ee1ddab7e * Increment version 2015-10-23 02:04:48 +02:00
Matthew Honnibal
108138366f * Ensure .pxd files are packaged 2015-10-23 01:57:03 +02:00
Matthew Honnibal
2348a08481 * Load/dump strings with a json file, instead of the hacky strings file we were using. 2015-10-22 21:13:03 +11:00
Matthew Honnibal
579670e4c7 * Fix uget 2015-10-19 17:23:33 +11:00
Matthew Honnibal
984775e5e2 * Fix setup of uget 2015-10-19 17:19:05 +11:00
Matthew Honnibal
e25adce54d Merge branch 'master' of ssh://github.com/honnibal/spaCy 2015-10-19 17:17:33 +11:00
Matthew Honnibal
382cbc8cab * Add uget to setup.py 2015-10-19 17:15:40 +11:00
Matthew Honnibal
a43777cef8 * Inc version 2015-10-19 07:46:42 +02:00
Henning Peters
bfde91fa49 add custom download tool (uget), replace wget with uget 2015-10-18 12:35:04 +02:00
Matthew Honnibal
fc261195f7 * Fix compilation for OSX 2015-10-18 17:19:07 +11:00
Matthew Honnibal
710e8fb168 * Fix platform condition re Issue #138 2015-10-15 20:46:08 +11:00
maxirmx
1b8fd329b8 Merge remote-tracking branch 'refs/remotes/honnibal/master' 2015-10-13 11:28:17 +03:00
Matthew Honnibal
d74a1e51d7 * Add cloudpickle requirement 2015-10-13 19:05:20 +11:00
maxirmx
3dbec0902f Merge remote-tracking branch 'refs/remotes/honnibal/master'
Conflicts -- pushing preshed 0.42
	requirements.txt
	setup.py
2015-10-13 10:16:16 +03:00
maxirmx
237db7f519 Appveyor build #5
Added Wordnet download
2015-10-13 10:11:56 +03:00
Matthew Honnibal
41cbbdefe3 Merge branch 'attrs' 2015-10-13 05:03:25 +02:00
Matthew Honnibal
1ca1beff4b * Allow preshed v0.42 in setup.py 2015-10-13 13:55:50 +11:00
Matthew Honnibal
b866f1443e Merge branch 'master' of https://github.com/honnibal/spaCy into attrs 2015-10-13 04:52:27 +02:00
Matthew Honnibal
6c2da06c18 * Package tag_map.json 2015-10-13 13:52:10 +11:00
Matthew Honnibal
e886e6a406 * Inc version 2015-10-13 13:46:17 +11:00
maxirmx
bf963c3cce Merging Windows\Linux versions of setup.py
Python 3.0 compatibility fix
2015-10-13 02:11:21 +03:00
maxirmx
ccf6156261 Merging Windows\Linux versions of setup.py #2 2015-10-13 01:46:52 +03:00
maxirmx
7c5bfc5916 Merging Windows/Linux versions of setup.py 2015-10-13 01:31:59 +03:00
maxirmx
9d949c857b More dirty Windows stuff - just for now 2015-10-10 20:11:20 +03:00
maxirmx
8e03239ac5 Merge remote-tracking branch 'refs/remotes/honnibal/master'
Conflicts:
	setup.py
2015-10-10 17:38:06 +03:00
maxirmx
815994a212 MSVC x86-64 Pyton 2.7 dirty build 2015-10-10 17:32:44 +03:00
Matthew Honnibal
064bd69ad0 * Refactor symbols, so that frequency rank can be derived from the orth id of a word. 2015-10-10 16:03:48 +11:00
Matthew Honnibal
8b8d048385 Merge pull request #135 from henningpeters/patch-1
remove compile warning noise
2015-10-10 01:40:15 +11:00
Matthew Honnibal
af8d0a2a09 * Increment version 2015-10-09 12:42:41 +02:00