Matthew Honnibal
|
a8abc47811
|
Rename BaseThincComponent --> Pipe
|
2017-10-26 12:40:40 +02:00 |
|
Matthew Honnibal
|
b0f3ea2200
|
Fix names of pipeline components
NeuralDependencyParser --> DependencyParser
NeuralEntityRecognizer --> EntityRecognizer
TokenVectorEncoder --> Tensorizer
NeuralLabeller --> MultitaskObjective
|
2017-10-26 12:38:23 +02:00 |
|
Matthew Honnibal
|
b6b4f1aaf7
|
Merge pull request #1462 from explosion/feature/vector-meta-data
💫 Add vector meta data to model meta.json on train/package and show in docs
|
2017-10-26 11:39:41 +02:00 |
|
Ines Montani
|
090bd00369
|
Merge pull request #1464 from mayukh18/develop_bengali_pronouns
added the bengali pronouns for v2.0
|
2017-10-25 21:55:25 +02:00 |
|
mayukh18
|
1bc07758fa
|
added few bengali pronouns
|
2017-10-25 22:24:40 +05:30 |
|
ines
|
728b609bf9
|
Merge branch 'develop' into feature/vector-meta-data
|
2017-10-25 16:32:22 +02:00 |
|
ines
|
c0b55ebdac
|
Fix PhraseMatcher.__contains__ and add more tests
|
2017-10-25 16:31:11 +02:00 |
|
ines
|
91beacf5e3
|
Fix Matcher.__contains__
|
2017-10-25 16:19:38 +02:00 |
|
ines
|
70de2dd035
|
Display vectors in models directory if available (see #1457)
|
2017-10-25 16:15:37 +02:00 |
|
ines
|
11e3f19764
|
Fix vectors data added after training (see #1457)
|
2017-10-25 16:08:26 +02:00 |
|
ines
|
057954695b
|
Read pipeline and vector data off model in --generate-meta
|
2017-10-25 16:03:26 +02:00 |
|
ines
|
273e638183
|
Add vector data to model meta after training (see #1457)
|
2017-10-25 16:03:05 +02:00 |
|
ines
|
18aae423fb
|
Remove import of non-existing function
|
2017-10-25 15:54:10 +02:00 |
|
ines
|
5117a7d24d
|
Fix whitespace
|
2017-10-25 15:54:02 +02:00 |
|
Matthew Honnibal
|
b5de768852
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-10-25 14:44:16 +02:00 |
|
Matthew Honnibal
|
094512fd47
|
Fix model-mark on regression test.
|
2017-10-25 14:44:00 +02:00 |
|
ines
|
0102561f34
|
Update docs
|
2017-10-25 13:57:55 +02:00 |
|
ines
|
72497c8cb2
|
Remove comments and add TODO
|
2017-10-25 12:15:43 +02:00 |
|
ines
|
4d97efc3b5
|
Add missing docstrings
|
2017-10-25 12:10:16 +02:00 |
|
ines
|
1262aa0bf9
|
Implement PhraseMatcher.__contains__
|
2017-10-25 12:10:04 +02:00 |
|
ines
|
9c733a8849
|
Implement PhraseMatcher.__len__
|
2017-10-25 12:09:56 +02:00 |
|
ines
|
7eebeeaf85
|
Fix Matcher.__contains__
|
2017-10-25 12:09:47 +02:00 |
|
ines
|
7bcec57462
|
Remove unused attribute
|
2017-10-25 12:08:54 +02:00 |
|
ines
|
0b1dcbac14
|
Remove unused function
|
2017-10-25 12:08:46 +02:00 |
|
ines
|
3484174e48
|
Add Language.path
|
2017-10-25 11:57:43 +02:00 |
|
ines
|
4a06eddb5f
|
Update README
|
2017-10-24 22:18:40 +02:00 |
|
ines
|
1730648e19
|
Update pull request template
|
2017-10-24 21:49:11 +02:00 |
|
ines
|
972d9e832c
|
Update README for v2.0
|
2017-10-24 21:49:11 +02:00 |
|
ines
|
63683a5151
|
Port over contributors from master
|
2017-10-24 21:49:11 +02:00 |
|
ines
|
c815ff65f6
|
Update feature list
|
2017-10-24 21:49:11 +02:00 |
|
Ines Montani
|
d3bf488e16
|
Merge pull request #1171 from mollerhoj/support-danish
Improve basic support for Danish
|
2017-10-24 20:29:57 +02:00 |
|
ines
|
7459ecfa87
|
Port over contributor agreements
|
2017-10-24 20:13:34 +02:00 |
|
ines
|
d71702b827
|
Fix formatting
|
2017-10-24 20:11:04 +02:00 |
|
Matthew Honnibal
|
d9bb1e5de8
|
Increment version
|
2017-10-24 17:06:19 +02:00 |
|
Matthew Honnibal
|
908809d488
|
Update tests
|
2017-10-24 17:05:15 +02:00 |
|
Matthew Honnibal
|
66766c1454
|
Restore SP tag to English tag_map, until models migrate
|
2017-10-24 17:05:00 +02:00 |
|
Matthew Honnibal
|
30e67fa808
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-10-24 16:08:23 +02:00 |
|
Matthew Honnibal
|
b0f6fd3f1d
|
Disable tokenizer cache for special-cases. Fixes #1250
|
2017-10-24 16:08:05 +02:00 |
|
Matthew Honnibal
|
63f0bde749
|
Add test for #1250: Tokenizer cache clobbered special-case attrs
|
2017-10-24 16:07:18 +02:00 |
|
ines
|
8492d5be6d
|
Always make lemmatizer return a list of lemmas, not a set
|
2017-10-24 16:00:56 +02:00 |
|
ines
|
95f866f99f
|
Add lookup argument to Lemmatizer.load
|
2017-10-24 16:00:56 +02:00 |
|
ines
|
95f6174516
|
Remove tensorizer from model pipeline example in spacy package
|
2017-10-24 16:00:56 +02:00 |
|
ines
|
6686e53530
|
Allow GitHub embeds to specify optional language
|
2017-10-24 16:00:56 +02:00 |
|
ines
|
56a47f137f
|
Add title description for tokenizer
|
2017-10-24 16:00:56 +02:00 |
|
ines
|
3944c1d6e7
|
Document lemmatizer
|
2017-10-24 16:00:56 +02:00 |
|
ines
|
c9dc88ddfc
|
Document current JSON format for training
|
2017-10-24 16:00:56 +02:00 |
|
ines
|
2b8e7c45e0
|
Use better training data JSON example
|
2017-10-24 16:00:56 +02:00 |
|
ines
|
090aed940a
|
Add test for currently failing span.as_doc case
|
2017-10-24 16:00:56 +02:00 |
|
ines
|
4ef81a9ebc
|
Fix whitespace
|
2017-10-24 16:00:56 +02:00 |
|
Matthew Honnibal
|
18f1c1d0ba
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-10-24 14:29:43 +02:00 |
|