Matthew Honnibal
|
ee975d36d0
|
* Add stubs to test is_bracket/is_quote/is_left_punct/is_right_punct functions
|
2016-02-04 13:02:25 +01:00 |
|
Matthew Honnibal
|
1a2ee73e98
|
* Add missing pos and tag attributes to API
|
2016-02-02 23:00:53 +01:00 |
|
Matthew Honnibal
|
f204daf27b
|
* Add error warning that a gold tag is unrecognised
|
2016-02-02 22:59:59 +01:00 |
|
Matthew Honnibal
|
99b8906100
|
* Accept punct_labels as an argument to the scorer
|
2016-02-02 22:59:06 +01:00 |
|
Matthew Honnibal
|
e2ed6251d7
|
* Fancy up the CLI for the conll train script
|
2016-02-02 22:58:06 +01:00 |
|
Matthew Honnibal
|
59123443e2
|
* Check for presence/absence of the different models in Language.end_training
|
2016-02-02 22:49:55 +01:00 |
|
Matthew Honnibal
|
7cbff48ace
|
* Set the German lemma rules to be an empty JSON object
|
2016-02-02 22:30:51 +01:00 |
|
Matthew Honnibal
|
d0f06c5cc4
|
* Add missing tags to the German tag map
|
2016-02-02 22:30:22 +01:00 |
|
Matthew Honnibal
|
bf5a7cc598
|
* Update train_pos_tagger example
|
2016-02-02 22:30:00 +01:00 |
|
Matthew Honnibal
|
a676d66807
|
* Update the CoNLL train script, to get working on other languages
|
2016-02-02 22:29:34 +01:00 |
|
Matthew Honnibal
|
6c633f2edc
|
Fix Issue #243: Incorrect gazetteer entry
|
2016-01-30 06:58:29 +11:00 |
|
Matthew Honnibal
|
9721502c81
|
* Update version
|
2016-01-25 15:52:59 +01:00 |
|
Matthew Honnibal
|
907e8cf07d
|
* Add u prefix to string in web example
|
2016-01-25 15:51:38 +01:00 |
|
Matthew Honnibal
|
eba03695ef
|
* Comment out pickle tests
|
2016-01-25 15:51:13 +01:00 |
|
Matthew Honnibal
|
de94e6c525
|
* Mark pickle tests as xfail, due to temp files problem
|
2016-01-25 15:24:17 +01:00 |
|
Matthew Honnibal
|
87172a15c6
|
* Fix runtime error bug that arose from updated Span.root function.
|
2016-01-25 15:22:42 +01:00 |
|
Matthew Honnibal
|
2c8dd91785
|
* Fix first code example on the website
|
2016-01-23 18:09:19 +01:00 |
|
Matthew Honnibal
|
af332f5095
|
* Add some stream of consciousness about NER
|
2016-01-23 13:41:01 +01:00 |
|
Matthew Honnibal
|
3af84cfd6e
|
* Increment version
|
2016-01-21 17:49:27 +01:00 |
|
Matthew Honnibal
|
571d26b773
|
Merge branch 'master' of ssh://github.com/honnibal/spaCy
|
2016-01-21 17:48:32 +01:00 |
|
Matthew Honnibal
|
6842f681e5
|
Merge pull request #234 from henningpeters/master: remove package version constraint
|
2016-01-22 03:48:12 +11:00 |
|
Henning Peters
|
65aeac24cb
|
remove package version constraint
|
2016-01-21 17:40:51 +01:00 |
|
Matthew Honnibal
|
0ec4df6d7c
|
* Write more notes about spaCy's NER
|
2016-01-21 16:37:13 +01:00 |
|
Matthew Honnibal
|
7d16f25218
|
* Update release notes
|
2016-01-21 00:24:21 +01:00 |
|
Matthew Honnibal
|
1270506f7e
|
* Update release notes
|
2016-01-21 00:23:43 +01:00 |
|
Matthew Honnibal
|
792c98a438
|
* Increment version for OSX-fixed release of v0.100
|
2016-01-21 00:23:04 +01:00 |
|
Matthew Honnibal
|
110304f62e
|
* Start writing bootstrap word2vec tutorial
|
2016-01-20 13:51:36 +01:00 |
|
Matthew Honnibal
|
82d011ac43
|
* Fix test for whitespace
|
2016-01-19 20:38:26 +01:00 |
|
Matthew Honnibal
|
e89069dcae
|
* Fix matcher test
|
2016-01-19 20:24:01 +01:00 |
|
Matthew Honnibal
|
63e3d4e27f
|
* Add comment on Vocab.__reduce__
|
2016-01-19 20:11:25 +01:00 |
|
Matthew Honnibal
|
e1282b7f2f
|
* Require user-custom NER classes to work without adding the label.
|
2016-01-19 20:11:03 +01:00 |
|
Matthew Honnibal
|
84c5dfbfc3
|
* Clean up debugging Python list
|
2016-01-19 20:10:32 +01:00 |
|
Matthew Honnibal
|
04d0686b26
|
* Make TransitionSystem.add_action idempotent, i.e. ignore duplicate added actions.
|
2016-01-19 20:10:04 +01:00 |
|
Matthew Honnibal
|
c4a89d56bd
|
* Automatically register any entity types pre-set on the tokens, so that the NER works with user-given entity types.
|
2016-01-19 20:09:26 +01:00 |
|
Matthew Honnibal
|
f0f92793f6
|
* Add test for user NER classes in matcher blocking the NER model. Re Issue #178 and Issue #217
|
2016-01-19 19:23:16 +01:00 |
|
Matthew Honnibal
|
65c5bc4988
|
* Add add_label method, to allow users to register new entity types and dependency labels.
|
2016-01-19 19:11:02 +01:00 |
|
Matthew Honnibal
|
151aa0b0e2
|
* Allow users to call add_label, in order to extend the entity recogniser to new classes. Does not by itself add a class to the model.
|
2016-01-19 19:09:33 +01:00 |
|
Matthew Honnibal
|
c8e0011ebc
|
* Add iterators to the NER and parser transition systems, to get the action types
|
2016-01-19 19:07:43 +01:00 |
|
Matthew Honnibal
|
515493c675
|
* Add xfail test for Issue #225: tokenization with non-whitespace delimiters
|
2016-01-19 13:20:14 +01:00 |
|
Matthew Honnibal
|
7abe653223
|
* Fix imports
|
2016-01-19 03:36:51 +01:00 |
|
Matthew Honnibal
|
590f38bdb2
|
* Add hacky solution to Issue #220. Currently specials.json only supports literal patterns, which doesn't allow us to pre-tag whitespace with the correct tag, SP, as a rule. The data-driven approach should be easy but for some reason fails here. Hard-coding this in Morphology isn't a good solution, but we do want to fix the behaviour right away, and don't want to wait for an architecturally better solution.
|
2016-01-19 03:35:20 +01:00 |
|
Matthew Honnibal
|
445164d5b4
|
* Restore the LOCAL_DATA_DIR global in spacy/en/__init__.py, although this is now deprecated
|
2016-01-19 02:54:56 +01:00 |
|
Matthew Honnibal
|
04177debd0
|
* Remove the limit on sentence boundary detection that prevented it from inserting boundaries on whitespace. Replace it with a check for whitespace in StateClass.fast_forward, so that whitespace is LeftArced when it's on the stack. This should prevent the previous problem of whitespace-only sentences. Should fix Issue #184, but may cause further problems. Needs testing.
|
2016-01-19 02:54:15 +01:00 |
|
Matthew Honnibal
|
7893de3203
|
* Add test for Issue #184: Whitespace at sentence boundary causes sentence boundary error.
|
2016-01-18 23:04:38 +01:00 |
|
Matthew Honnibal
|
bba0a5e078
|
* Handle string paths in default_vocab, default_parser, default_entity in Language class
|
2016-01-18 22:37:24 +01:00 |
|
Matthew Honnibal
|
e825fd9554
|
* Make some of the website tests work without models
|
2016-01-18 18:14:44 +01:00 |
|
Matthew Honnibal
|
334c4b2b57
|
* Disprefer punctuation and spaces as heads of spans
|
2016-01-18 18:14:09 +01:00 |
|
Matthew Honnibal
|
bed36ab0ff
|
* Fix import of HEAD attribute
|
2016-01-18 17:34:43 +01:00 |
|
Matthew Honnibal
|
28c659c1fe
|
* Fix import for numpy
|
2016-01-18 17:25:04 +01:00 |
|
Matthew Honnibal
|
fc36bcf458
|
* Fix import for English
|
2016-01-18 17:14:40 +01:00 |
|