Ines Montani
30d6c2ccc2
Merge branch 'master' into spacy.io
2019-06-26 14:47:46 +02:00
Ines Montani
d361e380b8
Fix matcher callback example ( closes #3862 )
2019-06-26 14:47:26 +02:00
Ines Montani
6ccdf37574
Exclude user_data when copying doc in displaCy ( closes #3882 )
2019-06-26 14:37:05 +02:00
Bram Vanroy
f22704621e
Update CITATION ( #3873 )
...
As discussed in https://github.com/explosion/spaCy/pull/2167 the citation should look slightly different.
2019-06-24 11:03:16 +02:00
Ines Montani
c833d9b314
Add "v.s." to English tokenizer exceptions (see #3868 )
2019-06-20 17:48:45 +02:00
Ines Montani
ae2c208735
Auto-format [ci skip]
2019-06-20 10:36:38 +02:00
Ines Montani
1e0bbb615b
Merge branch 'master' into spacy.io
2019-06-20 10:31:34 +02:00
Guillaume Claret
d7a519a922
Typo ( #3865 )
...
* Typo
* Add contributor agreement
2019-06-20 10:31:19 +02:00
Björn Böing
ebf5a04d6c
Update pretrain docs and add unsupported loss_func error ( #3860 )
...
* Add error to `get_vectors_loss` for unsupported loss function of `pretrain`
* Add missing "--loss-func" argument to pretrain docs. Update pretrain plac annotations to match docs.
* Add missing quotation marks
2019-06-20 10:30:44 +02:00
Alejandro Alcalde
4866a7ee9e
Replaced "learning rate" with its param name. ( #3855 )
...
* Replaced "learning rate" with its param name.
I searched for a while for the name of the learning rate parameter. With `beta1` and `beta2` it's easy, as they are marked as code, but the learning rate wasn't. I think writing the actual parameter name would be helpful.
* Signing SCA
2019-06-20 10:29:20 +02:00
Ines Montani
ec4b1bf1f2
Merge branch 'master' into spacy.io
2019-06-16 14:33:31 +02:00
Ines Montani
81c12640ab
Auto-format [ci skip]
2019-06-16 14:33:20 +02:00
Greg Werner
9041a72d7f
Update tokenizer.md for construction example ( #3790 )
...
* Update tokenizer.md for construction example
Self-contained example. You should really say what `nlp` is so that the example works as is.
* Update CONTRIBUTOR_AGREEMENT.md
* Restore contributor agreement
* Adjust construction examples
2019-06-16 14:32:56 +02:00
Kabir Khan
1e19f34e29
Add optional `id` property to EntityRuler patterns ( #3591 )
...
* Adding support for entity_id in EntityRuler pipeline component
* Adding spaCy contributor agreement
* Updating EntityRuler to use string.format instead of f-strings
* Update Entity Ruler to support an 'id' attribute per pattern that explicitly identifies an entity.
* Fixing tests
* Remove custom extension entity_id and use built in ent_id token attribute.
* Changing entity_id to ent_id for consistent naming
* entity_ids => ent_ids
* Removing kb, cleaning up tests, making util functions private, use rsplit instead of split
2019-06-16 13:29:04 +02:00
Suraj Rajan
46c78d0a41
Dependency tree pattern matcher ( #3465 )
...
* Functional dependency tree pattern matcher
* Tests fail due to inconsistent behaviour
* Renamed dependencymatcher and added optimizations
2019-06-16 13:25:32 +02:00
Paul O'Leary McCann
3f52e12335
Change vector training to work with latest gensim ( fix #3749 ) ( #3757 )
2019-06-16 13:24:06 +02:00
BreakBB
d8573ee715
Update error raising for CLI pretrain to fix #3840 ( #3843 )
...
* Add check for empty input file to CLI pretrain
* Raise error if JSONL is not a dict or contains neither `tokens` nor `text` key
* Skip empty values for correct pretrain keys and log a counter as warning
* Add tests for CLI pretrain core function make_docs.
* Add a short hint for the `tokens` key to the CLI pretrain docs
* Add success message to CLI pretrain
* Update model loading to fix the tests
* Skip empty values and do not create docs out of them
2019-06-16 13:22:57 +02:00
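The input checks listed in this commit can be sketched roughly as follows. This is a minimal standalone illustration, not the code from the pull request: the helper name `read_pretrain_records`, the exception types, and the warning text are all assumptions made for the example.

```python
import json
import warnings


def read_pretrain_records(lines):
    """Parse JSONL lines and keep only records usable for pretraining.

    Hypothetical helper for illustration; not spaCy's implementation.
    """
    records = []
    skipped = 0  # records whose `tokens`/`text` value is empty
    seen = 0     # non-blank lines encountered
    for line in lines:
        line = line.strip()
        if not line:
            continue
        seen += 1
        record = json.loads(line)
        # Every JSONL line must decode to a dict ...
        if not isinstance(record, dict):
            raise TypeError("Expected a JSON object, got: %r" % (record,))
        # ... that has at least one of the two supported keys.
        if "tokens" not in record and "text" not in record:
            raise ValueError("Record needs a 'tokens' or 'text' key")
        value = record["tokens"] if "tokens" in record else record["text"]
        if not value:
            # Skip empty values instead of creating a doc from them.
            skipped += 1
            continue
        records.append(record)
    if seen == 0:
        raise ValueError("Input file is empty")
    if skipped:
        warnings.warn("Skipped %d empty values" % skipped)
    return records, skipped
```

Calling it on `['{"text": "Hello world"}', '{"text": ""}']` keeps the first record and counts the second as skipped.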
Azagh3l
5accfbb938
Update exemples.py ( #3838 )
...
Added missing hyphen and accent.
2019-06-14 09:31:05 +02:00
Ines Montani
f35ce09776
Add regression test for #3839
2019-06-12 13:38:30 +02:00
Ines Montani
aae9034492
Tidy up [ci skip]
2019-06-12 13:38:23 +02:00
Motoki Wu
9c064e6ad9
Add resume logic to spacy pretrain ( #3652 )
...
* Added ability to resume training
* Add to readme
* Remove duplicate entry
2019-06-12 13:29:23 +02:00
Azagh3l
eb3e4263ee
Update lex_attrs.py ( #3835 )
...
Corrected typos, added French (from France) versions of some numbers.
2019-06-11 10:59:16 +02:00
Azagh3l
d0d56635ce
Create Azagh3l.md ( #3836 )
2019-06-11 10:58:32 +02:00
Matthew Honnibal
7f71cf0b02
Merge branch 'master' of https://github.com/explosion/spaCy
2019-06-07 20:41:00 +02:00
Matthew Honnibal
a931d72459
Add merge_subtokens as parser post-process. Re #3830
2019-06-07 20:40:41 +02:00
Ines Montani
5d6b4bb3bd
Update srsly pin
2019-06-07 11:14:32 +02:00
Ines Montani
511977ae5e
Update universe [ci skip]
2019-06-04 11:15:51 +02:00
Ramanan Balakrishnan
eb12703d10
minor fix to broken link in documentation ( #3819 ) [ci skip]
2019-06-04 11:15:35 +02:00
intrafind
436a578369
Create intrafindBreno.md ( #3814 )
2019-06-03 18:33:09 +02:00
intrafind
2bba2a3536
Fix for #3811 ( #3815 )
...
Corrected type of seed parameter.
2019-06-03 18:32:47 +02:00
Ines Montani
40b540dca7
Merge branch 'master' into spacy.io
2019-06-03 12:19:23 +02:00
Ines Montani
62ebc65c62
Update universe [ci skip]
2019-06-03 12:19:13 +02:00
Ines Montani
c44d5beb12
Merge branch 'master' into spacy.io
2019-06-02 13:56:18 +02:00
Ines Montani
e703301129
Update universe [ci skip]
2019-06-02 13:55:55 +02:00
Ines Montani
596c7718b2
Merge branch 'master' into spacy.io
2019-06-02 12:58:24 +02:00
Ines Montani
892e72451f
Update universe [ci skip]
2019-06-02 12:58:12 +02:00
Ines Montani
42de5be90c
Tidy up universe [ci skip]
2019-06-02 12:38:48 +02:00
Nirant
638caba9b5
Add multiple packages to universe.json ( #3809 ) [ci skip]
...
* Add multiple packages to universe.json
Added following packages: NLPArchitect, NLPRe, Chatterbot, alibi, NeuroNER
* Auto-format
* Update slogan (probably just copy-paste mistake)
* Adjust formatting
* Update tags / categories
2019-06-02 12:35:52 +02:00
Germán
86eb817b74
Overwrites default getter for like_num in Spanish by adding _num_words and like_num to lex_attrs.py ( #3810 ) ( closes #3803 )
...
* (#3803 ) Spanish like_num returning false for number-like token
* (#3803 ) Spanish like_num now returning True for number-like token
2019-06-02 12:22:57 +02:00
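For context, a language-specific `like_num` getter usually pairs a digit check with a lookup in a list of number words. The sketch below is a standalone approximation of that idea, not the code from this commit: the `_num_words` list is a small illustrative subset and the exact checks are assumptions.

```python
# Illustrative subset only; the real list in lex_attrs.py is assumed to be longer.
_num_words = [
    "cero", "uno", "dos", "tres", "cuatro", "cinco",
    "seis", "siete", "ocho", "nueve", "diez", "cien", "mil",
]


def like_num(text):
    """Return True if the token text looks like a number."""
    # Drop a leading sign or approximation marker.
    if text.startswith(("+", "-", "±", "~")):
        text = text[1:]
    # Ignore thousands and decimal separators before the digit check.
    text = text.replace(",", "").replace(".", "")
    if text.isdigit():
        return True
    # Simple fractions such as "1/2".
    if text.count("/") == 1:
        num, denom = text.split("/")
        if num.isdigit() and denom.isdigit():
            return True
    # Spelled-out numbers, matched case-insensitively.
    return text.lower() in _num_words
```

With a getter like this, a token such as "diez" is treated as number-like while an ordinary word such as "hola" is not.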
Ines Montani
101da344aa
Merge branch 'master' into spacy.io
2019-06-01 17:36:55 +02:00
Nirant
d4d1eab5e1
Add Baderlab/saber to universe.json ( #3806 )
2019-06-01 17:36:40 +02:00
Nirant
a5d92a3035
Create NirantK.md ( #3807 ) [ci skip]
2019-06-01 17:36:06 +02:00
Ines Montani
6be7d07315
Update UNIVERSE.md
2019-06-01 16:37:06 +02:00
Ines Montani
09e78b52cf
Improve E024 text for incorrect GoldParse ( closes #3558 )
2019-06-01 14:37:27 +02:00
Ines Montani
2b8bfd6cc7
Merge branch 'master' into spacy.io
2019-06-01 11:35:12 +02:00
Ines Montani
0c74506c9c
Fix typos in docs ( closes #3802 ) [ci skip]
2019-06-01 11:35:01 +02:00
Ines Montani
3e6281cc63
Merge branch 'master' into spacy.io
2019-05-31 16:51:12 +02:00
Nipun Sadvilkar
1f13005751
Incorrect Token attribute ent_iob_ description ( #3800 )
...
* Incorrect Token attribute ent_iob_ description
* Add spaCy contributor agreement
2019-05-31 16:50:45 +02:00
Ramanan Balakrishnan
26c37c5a4d
fix all references to BILUO annotation format ( #3797 )
2019-05-31 12:19:19 +02:00
Ines Montani
a7fd42d937
Make jsonschema dependency optional ( #3784 )
2019-05-30 14:34:58 +02:00