svlandeg
0b04d142de
regenerating KB
2019-06-13 22:32:56 +02:00
svlandeg
78dd3e11da
write entity linking pipe to file and keep vocab consistent between kb and nlp
2019-06-13 16:25:39 +02:00
svlandeg
b12001f368
small fixes
2019-06-12 22:05:53 +02:00
Ines Montani
f35ce09776
Add regression test for #3839
2019-06-12 13:38:30 +02:00
Ines Montani
aae9034492
Tidy up [ci skip]
2019-06-12 13:38:23 +02:00
svlandeg
6521cfa132
speeding up training
2019-06-12 13:37:05 +02:00
Motoki Wu
9c064e6ad9
Add resume logic to spacy pretrain ( #3652 )
...
* Added ability to resume training
* Add to readme
* Remove duplicate entry
2019-06-12 13:29:23 +02:00
svlandeg
66813a1fdc
speed up predictions
2019-06-11 14:18:20 +02:00
svlandeg
fe1ed432ef
eval on dev set, varying combos of prior and context scores
2019-06-11 11:40:58 +02:00
Azagh3l
eb3e4263ee
Update lex_attrs.py ( #3835 )
...
Corrected typos, added French (from France) versions of some numbers.
2019-06-11 10:59:16 +02:00
Azagh3l
d0d56635ce
Create Azagh3l.md ( #3836 )
2019-06-11 10:58:32 +02:00
svlandeg
83dc7b46fd
first tests with EL pipe
2019-06-10 21:25:26 +02:00
Matthew Honnibal
7f71cf0b02
Merge branch 'master' of https://github.com/explosion/spaCy
2019-06-07 20:41:00 +02:00
Matthew Honnibal
a931d72459
Add merge_subtokens as parser post-process. Re #3830
2019-06-07 20:40:41 +02:00
svlandeg
7de1ee69b8
training loop in proper pipe format
2019-06-07 15:55:10 +02:00
svlandeg
0486ccabfd
introduce goldparse.links
2019-06-07 13:54:45 +02:00
svlandeg
a5c061f506
storing NEL training data in GoldParse objects
2019-06-07 12:58:42 +02:00
Ines Montani
5d6b4bb3bd
Update srsly pin
2019-06-07 11:14:32 +02:00
svlandeg
61f0e2af65
code cleanup
2019-06-06 20:22:14 +02:00
svlandeg
d8b435ceff
pretraining description vectors and storing them in the KB
2019-06-06 19:51:27 +02:00
svlandeg
5c723c32c3
entity vectors in the KB + serialization of them
2019-06-05 18:29:18 +02:00
svlandeg
9abbd0899f
separate entity encoder to get 64D descriptions
2019-06-05 00:09:46 +02:00
Ines Montani
511977ae5e
Update universe [ci skip]
2019-06-04 11:15:51 +02:00
Ramanan Balakrishnan
eb12703d10
minor fix to broken link in documentation ( #3819 ) [ci skip]
2019-06-04 11:15:35 +02:00
svlandeg
fb37cdb2d3
implementing el pipe in pipes.pyx (not tested yet)
2019-06-03 21:32:54 +02:00
intrafind
436a578369
Create intrafindBreno.md ( #3814 )
2019-06-03 18:33:09 +02:00
intrafind
2bba2a3536
Fix for #3811 ( #3815 )
...
Corrected type of seed parameter.
2019-06-03 18:32:47 +02:00
Ines Montani
62ebc65c62
Update universe [ci skip]
2019-06-03 12:19:13 +02:00
svlandeg
d83a1e3052
Merge branch 'master' into feature/nel-wiki
2019-06-03 09:35:10 +02:00
svlandeg
9e88763dab
60% acc run
2019-06-03 08:04:49 +02:00
Ines Montani
e703301129
Update universe [ci skip]
2019-06-02 13:55:55 +02:00
Ines Montani
892e72451f
Update universe [ci skip]
2019-06-02 12:58:12 +02:00
Ines Montani
42de5be90c
Tidy up universe [ci skip]
2019-06-02 12:38:48 +02:00
Nirant
638caba9b5
Add multiple packages to universe.json ( #3809 ) [ci skip]
...
* Add multiple packages to universe.json
Added the following packages: NLPArchitect, NLPRe, Chatterbot, alibi, NeuroNER
* Auto-format
* Update slogan (probably just copy-paste mistake)
* Adjust formatting
* Update tags / categories
2019-06-02 12:35:52 +02:00
Germán
86eb817b74
Overwrites default getter for like_num in Spanish by adding _num_words and like_num to lex_attrs.py ( #3810 ) ( closes #3803 )
...
* ( #3803 ) Spanish like_num returning False for number-like token
* ( #3803 ) Spanish like_num now returning True for number-like token
2019-06-02 12:22:57 +02:00
Nirant
d4d1eab5e1
Add Baderlab/saber to universe.json ( #3806 )
2019-06-01 17:36:40 +02:00
Nirant
a5d92a3035
Create NirantK.md ( #3807 ) [ci skip]
2019-06-01 17:36:06 +02:00
Ines Montani
6be7d07315
Update UNIVERSE.md
2019-06-01 16:37:06 +02:00
Ines Montani
09e78b52cf
Improve E024 text for incorrect GoldParse ( closes #3558 )
2019-06-01 14:37:27 +02:00
Ines Montani
0c74506c9c
Fix typos in docs ( closes #3802 ) [ci skip]
2019-06-01 11:35:01 +02:00
Nipun Sadvilkar
1f13005751
Incorrect Token attribute ent_iob_ description ( #3800 )
...
* Incorrect Token attribute ent_iob_ description
* Add spaCy contributor agreement
2019-05-31 16:50:45 +02:00
Ramanan Balakrishnan
26c37c5a4d
fix all references to BILUO annotation format ( #3797 )
2019-05-31 12:19:19 +02:00
Ines Montani
a7fd42d937
Make jsonschema dependency optional ( #3784 )
2019-05-30 14:34:58 +02:00
svlandeg
268a52ead7
experimenting with cosine sim for negative examples (not OK yet)
2019-05-29 16:07:53 +02:00
mak
89379a7fa4
Corrected example model URL in requirements.txt ( #3786 )
...
The URL used to show how to add a model to the requirements.txt had the old release path (excl. explosion).
2019-05-29 10:51:55 +02:00
svlandeg
a761929fa5
context encoder combining sentence and article
2019-05-28 18:14:49 +02:00
Ines Montani
a8416c46f7
Use string name in setup.py
...
Hopefully this will trick GitHub's parser into recognising it as a Python package and show us the dependents / "used by" statistics 🤞
2019-05-28 17:11:39 +02:00
svlandeg
992fa92b66
refactor again to clusters of entities and cosine similarity
2019-05-28 00:05:22 +02:00
svlandeg
8c4aa076bc
small fixes
2019-05-27 14:29:38 +02:00
Ujwal Narayan
ed7be3f64c
Update norm_exceptions.py ( #3778 )
...
* Update norm_exceptions.py
Extended the Currency set to include Franc, Indian Rupee, Bangladeshi Taka, Korean Won, Mexican Dollar, and Egyptian Pound
* Fix formatting [ci skip]
2019-05-27 11:52:52 +02:00