Matthew Honnibal | 01ab464383 | * Prevent Begin and In moves from applying in NER if we're at the last token of a sentence, as this would mean the entity would span over a sentence boundary. Re Issue #169 | 2015-11-07 05:30:44 +11:00 |
Matthew Honnibal | b65633f270 | * Fix function that returns nth entity in StateClass. Was only returning the first. | 2015-11-07 05:29:11 +11:00 |
Matthew Honnibal | 410b6f9ec1 | * Remove deprecated _ml.pyx. We now use the nicer APIs provided by thinc 4.0, and subclass the AveragedPerceptron class. | 2015-11-07 05:13:10 +11:00 |
Matthew Honnibal | 3c162dcac3 | * Refactor away from the _ml module, to use thinc 4.0. Still some work needs to be done, e.g. to add __reduce__ to the models, more testing, etc. | 2015-11-07 03:24:30 +11:00 |
Matthew Honnibal | c339783bbe | * Fix reference to tests.span in setup | 2015-11-07 03:23:14 +11:00 |
Matthew Honnibal | 9d1b2a103a | * Fix capitalization in lemmatizer | 2015-11-06 05:44:35 +11:00 |
Matthew Honnibal | 6ed3aedf79 | * Merge vocab changes | 2015-11-06 00:48:08 +11:00 |
Matthew Honnibal | 72abbb43fb | * Add type declarations in strings.pyx | 2015-11-06 00:47:26 +11:00 |
Matthew Honnibal | 5b2af4864f | * When lemmatizing non-noun, non-verb, non-adj words, output lower-case | 2015-11-06 00:45:09 +11:00 |
Matthew Honnibal | 754bf04162 | * Remove declaration of Model.update | 2015-11-06 00:31:15 +11:00 |
Matthew Honnibal | e18bdff23a | Merge branch 'master' of ssh://github.com/honnibal/spaCy | 2015-11-06 00:26:15 +11:00 |
Matthew Honnibal | b9991fbd20 | * Update to use thinc 3.0 | 2015-11-06 00:25:59 +11:00 |
Matthew Honnibal | 802ad3d71a | * Avoid compiling theano module for now | 2015-11-06 00:24:43 +11:00 |
Matthew Honnibal | 864a8f45d8 | * Use unicode in StringStore.intern, instead of unreliably casting to bytes. | 2015-11-05 11:32:19 +00:00 |
Matthew Honnibal | b18204cd52 | * Fix StringStore._realloc, re Issue #155 | 2015-11-05 11:28:26 +00:00 |
Matthew Honnibal | f8004c5f65 | * Begin upgrading to improved thinc API | 2015-11-05 03:53:03 +11:00 |
Matthew Honnibal | adc7bbd6cf | * Fix name of like_num in default_lex_attrs | 2015-11-04 22:02:47 +11:00 |
Matthew Honnibal | e96faf29e7 | * Rename like_number to like_num, to fix inconsistency re Issue #166 | 2015-11-04 22:01:44 +11:00 |
Matthew Honnibal | 65934b7cd4 | * Enforce import of ujson in strings.pyx, because otherwise it's too slow | 2015-11-04 00:32:02 +11:00 |
Matthew Honnibal | 1ce5d5602d | * Rename Doc.data to Doc.c | 2015-11-04 00:17:13 +11:00 |
Matthew Honnibal | 68f479e821 | * Rename Doc.data to Doc.c | 2015-11-04 00:15:14 +11:00 |
Matthew Honnibal | 3ddea19b2b | * Rename spans.pyx to span.pyx | 2015-11-04 00:14:40 +11:00 |
Matthew Honnibal | 9482d616bc | * Rename spans.pyx to span.pyx | 2015-11-03 23:51:05 +11:00 |
Matthew Honnibal | 116da5990a | * Clean up setting of tag in doc.from_bytes | 2015-11-03 23:48:57 +11:00 |
Matthew Honnibal | 9ec7b9c454 | * Clean up unused Constituent struct. | 2015-11-03 23:48:21 +11:00 |
Matthew Honnibal | 1e99fcd413 | * Rename .repvec to .vector in C API | 2015-11-03 23:47:59 +11:00 |
Matthew Honnibal | f81389abe0 | * Pin to specific cymem, preshed and thinc versions. | 2015-11-03 23:12:13 +11:00 |
Matthew Honnibal | ee3f9ba581 | * Fix test of serializer | 2015-11-03 19:45:16 +11:00 |
Matthew Honnibal | d06ba26371 | * Fix test of serializer | 2015-11-03 19:43:27 +11:00 |
Matthew Honnibal | 4083059650 | Merge branch 'master' of https://github.com/honnibal/spaCy | 2015-11-03 09:07:19 +01:00 |
Matthew Honnibal | 9e37437ba8 | * Fix assign_tag in doc.merge | 2015-11-03 19:07:02 +11:00 |
Matthew Honnibal | dde9e1357c | * Add todo to morphology.lemmatize | 2015-11-03 18:54:35 +11:00 |
Matthew Honnibal | ffedff9e6c | * Remove the archive after download, to save disk space | 2015-11-03 18:54:05 +11:00 |
Matthew Honnibal | 85372468e3 | * Fix serialize test | 2015-11-03 08:51:33 +01:00 |
Matthew Honnibal | 833eb35c57 | * Fix tag assignment in doc.from_array | 2015-11-03 18:45:54 +11:00 |
Matthew Honnibal | 09664177d7 | * Fix tag handling in doc.merge, and assign sent_start when setting heads. | 2015-11-03 18:15:52 +11:00 |
Matthew Honnibal | 068222c09a | Merge branch 'master' of https://github.com/honnibal/spaCy | 2015-11-03 08:07:38 +01:00 |
Matthew Honnibal | 389a373807 | Merge branch 'master' of ssh://github.com/honnibal/spaCy | 2015-11-03 18:07:25 +11:00 |
Matthew Honnibal | 3f44b3e43f | * Mark serializer test as requiring models | 2015-11-03 18:07:08 +11:00 |
Matthew Honnibal | 7adef3f831 | * Increment version | 2015-11-03 07:58:59 +01:00 |
Matthew Honnibal | 25ed7be8f8 | Merge branch 'master' of https://github.com/honnibal/spaCy | 2015-11-03 07:58:17 +01:00 |
Matthew Honnibal | 604ceac4c6 | * Fix morphological assignment in doc.merge() | 2015-11-03 17:57:51 +11:00 |
Matthew Honnibal | 5e040855a5 | * Ensure morphological features and lemmas are loaded in from_array, re Issue #152 | 2015-11-03 17:56:50 +11:00 |
Matthew Honnibal | 2714fb4733 | * Fix prebuild command | 2015-11-03 07:30:33 +01:00 |
Matthew Honnibal | 8bde2bba58 | * Fiddle with prebuild command | 2015-11-03 07:11:59 +01:00 |
Matthew Honnibal | cb1376465f | Merge branch 'master' of https://github.com/honnibal/spaCy | 2015-11-03 07:07:57 +01:00 |
Matthew Honnibal | 64531d5a3a | * Define package_data in one place | 2015-11-03 17:07:43 +11:00 |
Matthew Honnibal | 3656f06e35 | * Don't use models in fab test | 2015-11-03 06:39:30 +01:00 |
Matthew Honnibal | bb5598b816 | * Fix test command in fabfile | 2015-11-03 05:32:18 +01:00 |
Matthew Honnibal | 5668feb235 | * Fix pickle test for python3 | 2015-11-03 04:57:02 +01:00 |