Matthew Honnibal
|
f787cd29fe
|
Refactor the pipeline classes to make them more consistent, and remove the redundant blank() constructor.
|
2016-10-16 21:34:57 +02:00 |
|
Matthew Honnibal
|
311a985fe0
|
Add input error handling in Doc
|
2016-10-16 18:16:42 +02:00 |
|
Matthew Honnibal
|
06322ba99d
|
Add words and spaces keyword arguments to Doc.
|
2016-10-16 18:13:03 +02:00 |
|
Matthew Honnibal
|
ca51f3b77e
|
Use DependencyParser and EntityRecognizer in the Language class.
|
2016-10-16 17:58:12 +02:00 |
|
Matthew Honnibal
|
195d998a12
|
Fix GoldParse argument to tagger.update
|
2016-10-16 17:05:09 +02:00 |
|
Matthew Honnibal
|
274a4d4272
|
Fix queue Python property in StateClass
|
2016-10-16 17:04:41 +02:00 |
|
Matthew Honnibal
|
e8c8aa08ce
|
Make action_name optional in StepwiseState
|
2016-10-16 17:04:16 +02:00 |
|
Matthew Honnibal
|
4bb73b1a93
|
Fix parser labels in pipeline
|
2016-10-16 17:03:22 +02:00 |
|
Matthew Honnibal
|
a81c5a7abf
|
Fix name of labels keyword to 'actions'.
|
2016-10-16 12:00:27 +02:00 |
|
Matthew Honnibal
|
a079677984
|
Fix omission of O action when creating blank entity recognizer
|
2016-10-16 11:43:25 +02:00 |
|
Matthew Honnibal
|
5444d38cc6
|
Update test for biluo tags
|
2016-10-16 11:42:45 +02:00 |
|
Matthew Honnibal
|
4fc56d4a31
|
Rename 'labels' to 'actions' in parser options
|
2016-10-16 11:42:26 +02:00 |
|
Matthew Honnibal
|
8a6b35d266
|
Delay binding in MakeDoc
|
2016-10-16 11:41:55 +02:00 |
|
Matthew Honnibal
|
52b48b415e
|
Fix GoldParse class
|
2016-10-16 11:41:36 +02:00 |
|
Matthew Honnibal
|
3259a63779
|
Whitespace
|
2016-10-16 01:47:28 +02:00 |
|
Matthew Honnibal
|
509b30834f
|
Add a pipeline module, to collect and wrap processes for annotation
|
2016-10-16 01:47:12 +02:00 |
|
Matthew Honnibal
|
0317cea0ad
|
Fix GoldParse
|
2016-10-15 23:55:07 +02:00 |
|
Matthew Honnibal
|
1c62573a41
|
Fix spacy.train
|
2016-10-15 23:53:46 +02:00 |
|
Matthew Honnibal
|
a48aa15384
|
Improve the API for the GoldParse class.
|
2016-10-15 23:53:29 +02:00 |
|
Matthew Honnibal
|
e07fe92b27
|
Draft a refactored init for the GoldParse class
|
2016-10-15 22:09:52 +02:00 |
|
Matthew Honnibal
|
47afef7d6b
|
Add init.py for gold tests
|
2016-10-15 21:51:28 +02:00 |
|
Matthew Honnibal
|
86ae665c78
|
Add function for entity->biluo transformation
|
2016-10-15 21:51:04 +02:00 |
|
Matthew Honnibal
|
2163fd238f
|
Add tests for entity->biluo transformation
|
2016-10-15 21:50:43 +02:00 |
|
Matthew Honnibal
|
5e923b9bfa
|
Return None in match_best_version if not path exists.
|
2016-10-15 14:47:29 +02:00 |
|
Matthew Honnibal
|
2516382106
|
Fix loading of English in span test
|
2016-10-15 14:44:37 +02:00 |
|
Matthew Honnibal
|
dda2fc6bef
|
Add empty data directory
|
2016-10-15 14:25:25 +02:00 |
|
Matthew Honnibal
|
049197e0ae
|
Update tests, somewhat messily.
|
2016-10-15 14:14:04 +02:00 |
|
Matthew Honnibal
|
1e1a1d9517
|
Update matcher test
|
2016-10-15 14:13:41 +02:00 |
|
Matthew Honnibal
|
9cc9ce0f14
|
Load with default path=False in tests.
|
2016-10-15 14:13:23 +02:00 |
|
Matthew Honnibal
|
08e9134760
|
Change default value of path to True
|
2016-10-15 14:12:54 +02:00 |
|
Matthew Honnibal
|
788657f062
|
Ensure words are added to vocab before test, so that the lexicon is updated correctly.
|
2016-10-15 14:12:18 +02:00 |
|
Matthew Honnibal
|
4a1a2bce68
|
Update version in about.py
|
2016-10-15 13:44:27 +02:00 |
|
Matthew Honnibal
|
6d8cb515ac
|
Break the tokenization stage out of the pipeline into a function 'make_doc'. This allows all pipeline methods to have the same signature.
|
2016-10-14 17:38:29 +02:00 |
|
Matthew Honnibal
|
2cc515b2ed
|
Add add_flag method to Vocab, re Issue #504.
|
2016-10-14 12:15:38 +02:00 |
|
Matthew Honnibal
|
f3be9d0a9a
|
Add tensor field to Lexeme, Token, Doc and Span, so that users have a place to hang neural network outputs
|
2016-10-14 03:24:13 +02:00 |
|
Matthew Honnibal
|
9b55d97a8f
|
Update train method
|
2016-10-13 03:24:53 +02:00 |
|
Matthew Honnibal
|
645d99523a
|
Move merge_sents method into spacy.gold
|
2016-10-13 03:24:29 +02:00 |
|
Matthew Honnibal
|
41f88ce938
|
Fix dep model loading in parser
|
2016-10-12 20:26:38 +02:00 |
|
Matthew Honnibal
|
d9ae2d68af
|
Load features by string-name for backwards compatibility.
|
2016-10-12 20:15:11 +02:00 |
|
Matthew Honnibal
|
a42fbcf946
|
Require model for test_is_properties
|
2016-10-12 19:35:18 +02:00 |
|
Matthew Honnibal
|
20c948361b
|
Use local path in test_lemmatizer
|
2016-10-12 19:35:00 +02:00 |
|
Matthew Honnibal
|
1318d0bc65
|
Test with the non-loaded versions of the English and German pipelines.
|
2016-10-12 19:13:31 +02:00 |
|
Matthew Honnibal
|
0e2bedc373
|
Fix default labels for parser and NER
|
2016-10-12 19:12:40 +02:00 |
|
Matthew Honnibal
|
3a03c668c3
|
Fix message in ParserStateError
|
2016-10-12 14:44:31 +02:00 |
|
Matthew Honnibal
|
6bf505e865
|
Fix error on ParserStateError
|
2016-10-12 14:35:55 +02:00 |
|
Matthew Honnibal
|
ba5e048502
|
Add docstring for Trainer class.
|
2016-10-12 14:26:02 +02:00 |
|
Matthew Honnibal
|
847a4a4182
|
Refactor Language, dropping Language.blank() method.
|
2016-10-12 13:45:58 +02:00 |
|
Matthew Honnibal
|
ea23b64cc8
|
Refactor training, with new spacy.train module. Defaults still a little awkward.
|
2016-10-09 12:24:24 +02:00 |
|
Matthew Honnibal
|
ca32a1ab01
|
Revert "Work on Issue #285: intern strings into document-specific pools, to address streaming data memory growth. StringStore.__getitem__ now raises KeyError when it can't find the string. Use StringStore.intern() to get the old behaviour. Still need to hunt down all uses of StringStore.__getitem__ in library and do testing, but logic looks good."
This reverts commit 8423e8627f .
|
2016-09-30 20:20:22 +02:00 |
|
Matthew Honnibal
|
90baa9c7e6
|
Revert "Changes to matcher.pyx for new StringStore scheme"
This reverts commit 3ff09614e0 .
|
2016-09-30 20:20:13 +02:00 |
|