Henning Peters
|
73e5650be5
|
change index server
|
2015-11-18 18:09:46 +01:00 |
|
Henning Peters
|
50d15ea5d2
|
fix
|
2015-11-18 17:35:21 +01:00 |
|
Henning Peters
|
02a1dcec76
|
add data dir
|
2015-11-18 11:48:55 +01:00 |
|
Henning Peters
|
919a4f0b04
|
change data path, add repository
|
2015-11-18 11:40:46 +01:00 |
|
Henning Peters
|
12de895e60
|
fix version
|
2015-11-15 16:38:16 +01:00 |
|
Henning Peters
|
03d2f98cd5
|
add sputnik
|
2015-11-15 15:58:21 +01:00 |
|
Matthew Honnibal
|
ec7d36c3a4
|
* Add test for matcher end-point problem
|
2015-11-12 05:00:40 +11:00 |
|
Matthew Honnibal
|
d309622a27
|
* Add test for matcher end-point problem
|
2015-11-12 04:59:11 +11:00 |
|
Matthew Honnibal
|
56ea20a886
|
* Add test for matcher end-point problem
|
2015-11-12 04:58:53 +11:00 |
|
Matthew Honnibal
|
cfa4062147
|
* Add test for matcher end-point problem
|
2015-11-12 04:56:07 +11:00 |
|
Matthew Honnibal
|
5623242b3e
|
* Adjust NER rules, so that U entries in gazetteer don't become B moves to the model
|
2015-11-12 04:48:23 +11:00 |
|
Matthew Honnibal
|
d67d7d5a86
|
* Add test for NER inconsistency bug
|
2015-11-08 16:19:33 +01:00 |
|
Matthew Honnibal
|
44fbdc7260
|
* Fix bug in NER transition system, that sometimes left no valid moves
|
2015-11-08 16:19:12 +01:00 |
|
Matthew Honnibal
|
ab5aac5b2f
|
* Add .rank property to Token and Lexeme, for frequency rank
|
2015-11-08 16:18:25 +01:00 |
|
Matthew Honnibal
|
fde9a22ec2
|
* Add new test for ner
|
2015-11-08 13:57:15 +01:00 |
|
Matthew Honnibal
|
e92371bb54
|
* Fix rule that made Last action invalid if there was a preset of O, since if the entity is already open, that ship has sailed.
|
2015-11-08 22:17:51 +11:00 |
|
Matthew Honnibal
|
3b74739c3e
|
* Download updated data
|
2015-11-08 21:24:25 +11:00 |
|
Matthew Honnibal
|
31da42eb27
|
* Mark tests that require models
|
2015-11-07 19:27:38 +11:00 |
|
Matthew Honnibal
|
8e26a28616
|
* Mark tests that require models
|
2015-11-07 19:10:56 +11:00 |
|
Matthew Honnibal
|
15eab7354f
|
* Remove extraneous test files
|
2015-11-07 18:45:13 +11:00 |
|
Matthew Honnibal
|
6f47074214
|
* Make constructor of ParserModel and TaggerModel the same as AveragedPerceptron, for each pickling.
|
2015-11-07 18:25:17 +11:00 |
|
Matthew Honnibal
|
1cfa20fb17
|
* Fix sentence-final whitespace issue
|
2015-11-07 17:34:46 +11:00 |
|
Matthew Honnibal
|
7663970d5f
|
* Removed unused i variable from Span, and set attributes to read-only
|
2015-11-07 17:06:15 +11:00 |
|
Matthew Honnibal
|
4b3c96d76d
|
* Fix zero-length spans
|
2015-11-07 17:05:16 +11:00 |
|
Matthew Honnibal
|
888c05a7fa
|
* Fix variable naming in StepwiseState, for thinc 4.0
|
2015-11-07 11:02:44 +11:00 |
|
Matthew Honnibal
|
fc2185bfe3
|
* Fix variable naming in StepwiseState, for thinc 4.0
|
2015-11-07 10:48:31 +11:00 |
|
Matthew Honnibal
|
954442a807
|
* Fix variable naming in StepwiseState, for thinc 4.0
|
2015-11-07 10:30:45 +11:00 |
|
Matthew Honnibal
|
06f26d258e
|
* Fix test_basic_create
|
2015-11-07 10:04:37 +11:00 |
|
Matthew Honnibal
|
1d3884c46d
|
* Fix test_basic_create
|
2015-11-07 10:03:56 +11:00 |
|
Matthew Honnibal
|
cc8febcbe1
|
* Fix Span comparison
|
2015-11-07 09:54:14 +11:00 |
|
Matthew Honnibal
|
af70dc166a
|
* Fix Last restriction, that was supposed to prevent conflicts with presets, but was incorrect.
|
2015-11-07 09:52:00 +11:00 |
|
Matthew Honnibal
|
a9b612abdf
|
* Rework the Span-merge patch, to avoid extending the interface of Doc, and avoid virtualizing the Span.start and Span.end indices, to keep Span usage efficient
|
2015-11-07 09:01:12 +11:00 |
|
Matthew Honnibal
|
56499d89ef
|
* Rework the Span-merge patch, to avoid extending the interface of Doc, and avoid virtualizing the Span.start and Span.end indices, to keep Span usage efficient
|
2015-11-07 08:55:34 +11:00 |
|
Andreas Grivas
|
83ca4e0b93
|
* use old merge tests - add more
|
2015-11-07 07:57:04 +11:00 |
|
Andreas Grivas
|
4be7fda453
|
* span start, end -> properties. autoupdate after merge
|
2015-11-07 07:57:04 +11:00 |
|
Andreas Grivas
|
562db6d2d0
|
* merge add lex last - add index finder funcs
|
2015-11-07 07:57:04 +11:00 |
|
Matthew Honnibal
|
a06e3c8963
|
* Fix bone-headed mistake in StateClass.E
|
2015-11-07 07:35:28 +11:00 |
|
Matthew Honnibal
|
d24b8509e4
|
* Correct screw ups from the previous commits
|
2015-11-07 06:51:41 +11:00 |
|
Matthew Honnibal
|
5efad178b5
|
* Set ent tag when close entity
|
2015-11-07 06:09:25 +11:00 |
|
Matthew Honnibal
|
9285f01d26
|
* Fix broken StateClass.E tracking
|
2015-11-07 06:06:39 +11:00 |
|
Matthew Honnibal
|
19136b0e7d
|
* Add better debug message for illegal move
|
2015-11-07 05:34:37 +11:00 |
|
Matthew Honnibal
|
2733816b7b
|
* Fix whitespace
|
2015-11-07 05:31:06 +11:00 |
|
Matthew Honnibal
|
01ab464383
|
* Prevent Begin and In moves from applying in NER if we're at the last token of a sentence, as this would mean the entity would span over a sentence boundary. Re Issue #169
|
2015-11-07 05:30:44 +11:00 |
|
Matthew Honnibal
|
b65633f270
|
* Fix function that returns nth entity in StateClass. Was only returning the first.
|
2015-11-07 05:29:11 +11:00 |
|
Matthew Honnibal
|
410b6f9ec1
|
* Remove deprecated _ml.pyx. We now use the nicer APIs provided by thinc 4.0, and subclass the AveragedPerceptron class.
|
2015-11-07 05:13:10 +11:00 |
|
Matthew Honnibal
|
3c162dcac3
|
* Refactor away from the _ml module, to use thinc 4.0. Still some work needs to be done, e.g. to add __reduce__ to the models, more testing, etc.
|
2015-11-07 03:24:30 +11:00 |
|
Matthew Honnibal
|
9d1b2a103a
|
* Fix capitalization in lemmatizer
|
2015-11-06 05:44:35 +11:00 |
|
Matthew Honnibal
|
6ed3aedf79
|
* Merge vocab changes
|
2015-11-06 00:48:08 +11:00 |
|
Matthew Honnibal
|
72abbb43fb
|
* Add type declarations in strings.pyx
|
2015-11-06 00:47:26 +11:00 |
|
Matthew Honnibal
|
5b2af4864f
|
* When lemmatizing non-noun, non-verb, non-adj words, output lower-case
|
2015-11-06 00:45:09 +11:00 |
|