Adriane Boyd
fbbbda1954
Fix start/end chars for empty and out-of-bounds spans (#8816)
2021-08-02 19:07:19 +02:00
Adriane Boyd
9ad3b8cf8d
Only add sourced vectors hashes to meta if necessary (#8830)
2021-08-02 18:22:35 +02:00
Nick Sorros
0485cdefcc
Add logger debug for project push and pull (#8860)
...
* Add logger debug for project push and pull
* Sign contributor agreement
2021-08-02 18:13:53 +02:00
themrmax
de076194c4
Make ConsoleLogger flush after each logging line (#8810)
...
This is necessary to avoid "logging blackouts" when running training on Kubernetes pods
2021-08-02 14:33:38 +02:00
Ines Montani
d79dbd0624
Merge pull request #8844 from thomashacker/bugfix/fix-doc-transformer-typo [ci skip]
...
Fix typo in Tok2VecTransformer example config
2021-07-30 09:11:24 +10:00
Ines Montani
4ddee5e84c
Merge pull request #8841 from adrianeboyd/docs/ent-id-sep [ci skip]
...
Fix formatting of ent_id_sep in EntityRuler API docs
2021-07-30 09:11:15 +10:00
Ines Montani
cf9b671566
Merge pull request #8840 from polm/docs/evaluate-speed [ci skip]
2021-07-30 09:11:05 +10:00
Ines Montani
30f20496d5
Merge pull request #8840 from polm/docs/evaluate-speed [ci skip]
2021-07-30 09:10:15 +10:00
Ines Montani
65d163fab5
Adjust formatting [ci skip]
2021-07-30 09:10:04 +10:00
Ines Montani
3a701d3645
Merge pull request #8841 from adrianeboyd/docs/ent-id-sep [ci skip]
...
Fix formatting of ent_id_sep in EntityRuler API docs
2021-07-30 09:09:25 +10:00
Ines Montani
f08be084fb
Merge pull request #8844 from thomashacker/bugfix/fix-doc-transformer-typo [ci skip]
...
Fix typo in Tok2VecTransformer example config
2021-07-30 09:08:59 +10:00
thomashacker
02258916c8
Fix example config typo for transformer architecture
2021-07-29 11:19:40 +02:00
Adriane Boyd
15b12f3e35
Fix formatting of ent_id_sep in EntityRuler API docs
2021-07-29 10:10:12 +02:00
Paul O'Leary McCann
a60cb13910
Update speed entry in metrics table
2021-07-29 16:35:19 +09:00
Paul O'Leary McCann
e125313a50
Revert "Add note about SPEED in output"
...
This reverts commit c92d268176.
2021-07-29 16:34:08 +09:00
Ines Montani
03a742f332
Merge pull request #8814 from polm/docs/migrate-lexeme-tables [ci skip]
2021-07-29 17:19:44 +10:00
Ines Montani
0a1e299d30
Merge pull request #8814 from polm/docs/migrate-lexeme-tables [ci skip]
2021-07-29 17:18:02 +10:00
Paul O'Leary McCann
c92d268176
Add note about SPEED in output
...
In #8823 it was pointed out that the `SPEED` value wasn't documented
anywhere.
2021-07-29 15:03:07 +09:00
Paul O'Leary McCann
8867e60fbb
Update website/docs/usage/v3.md
...
Co-authored-by: Ines Montani <ines@ines.io>
2021-07-29 14:56:56 +09:00
Adriane Boyd
9e9611233f
Remove labels from textcat component config example (#8815)
2021-07-27 13:15:33 +02:00
Paul O'Leary McCann
de5bc8a0e1
Update subset/superset docs (#8795)
...
* Update subset/superset docs
* Update website/docs/usage/rule-based-matching.md
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2021-07-27 13:15:27 +02:00
Adriane Boyd
8547514aa4
Remove labels from textcat component config example (#8815)
2021-07-27 13:14:38 +02:00
Paul O'Leary McCann
76ac95923a
Add note to migration guide about lexeme tables (fix #7290)
...
This just adds the resolution from #6388 to the docs.
2021-07-27 19:19:25 +09:00
Paul O'Leary McCann
67ecdcc3ac
Update subset/superset docs (#8795)
...
* Update subset/superset docs
* Update website/docs/usage/rule-based-matching.md
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2021-07-27 12:08:46 +02:00
Adriane Boyd
81d3a1edb1
Use tokenizer URL_MATCH pattern in LIKE_URL (#8765)
2021-07-27 12:07:01 +02:00
Adriane Boyd
4f28190afe
Merge pull request #8813 from adrianeboyd/chore/develop-v3.2
...
Update develop for v3.2
2021-07-27 11:26:18 +02:00
Ines Montani
7f21c7dfa2
Merge pull request #8794 from explosion/autoblack
...
Auto-format code with black
2021-07-27 12:17:15 +10:00
Ines Montani
34c401f04f
Merge pull request #8801 from polm/fix/respect-no-skip (fixes #8796)
...
Respect the no_skip value
2021-07-27 12:16:47 +10:00
Ines Montani
cf3855ae05
Merge pull request #8806 from Ledenel/master [ci skip]
...
fix typo
2021-07-27 12:15:44 +10:00
Ines Montani
5c762e08d7
Merge pull request #8808 from kevinlu1248/master [ci skip]
...
Changed a CLI command in data-formats.md due to erroneous information
2021-07-27 12:15:35 +10:00
Ines Montani
134cb06af3
Merge pull request #8808 from kevinlu1248/master [ci skip]
...
Changed a CLI command in data-formats.md due to erroneous information
2021-07-27 12:15:16 +10:00
Ines Montani
9bf0d6f2fd
Merge pull request #8806 from Ledenel/master [ci skip]
...
fix typo
2021-07-27 12:14:22 +10:00
Kevin Lu
4a8e9e4e4e
Update data-formats.md
2021-07-25 22:58:53 -07:00
Ledenel
413f745c68
fix broken example in spaCy universe Chatterbot
2021-07-25 15:53:32 +00:00
Paul O'Leary McCann
284b530c63
Respect the no_skip value
...
Seems like the logic for this was just left out. See #8796.
2021-07-24 15:31:17 +09:00
explosion-bot
a58ab6ea22
Auto-format code with black
2021-07-23 08:04:09 +00:00
Adriane Boyd
6bbc2b1956
Reload train corpus in debug data after initialize (#8776)
2021-07-21 22:38:40 +02:00
Paul O'Leary McCann
1d1679d431
Minor speedup
...
This continue should be a break. The current form doesn't cause errors
but using a break will be a bit faster.
2021-07-21 19:50:10 +09:00
svlandeg
f4f270940c
Merge remote-tracking branch 'upstream/master' into spacy.io
2021-07-20 16:14:16 +02:00
Adriane Boyd
d48c01a6f7
Remove extraneous grc test file (#8768)
2021-07-20 15:51:15 +02:00
Sofie Van Landeghem
ffaead8fe0
bump to 3.1.1
2021-07-19 14:48:27 +02:00
Sofie Van Landeghem
83e27d262e
negative tag annotation (#8731)
...
* unit test to unlearn tag via negative annotation
* bump thinc to 8.0.8
2021-07-19 14:39:11 +02:00
Adriane Boyd
0e4b96c97e
Update lexeme ranks for loaded vectors (#8640)
...
Update the ranks for any lexemes that have been added to the vocab
before the vectors are added to the model.
2021-07-19 18:25:54 +10:00
Adriane Boyd
e532c69475
Update Language.replace_pipe for disabled components (#8729)
...
* Fix the index where the replacement is inserted to account for
disabled components
* Allow `Language.replace_pipe` to replace disabled components
2021-07-19 18:06:12 +10:00
Paul O'Leary McCann
a151c62d13
Add sentence map test
2021-07-19 13:05:26 +09:00
Paul O'Leary McCann
3ed0fae671
Add multi-sentence mention test
...
Also formatting.
2021-07-19 13:00:16 +09:00
Paul O'Leary McCann
8bd0474730
Run black
2021-07-18 20:20:22 +09:00
Paul O'Leary McCann
bc081c24fa
Add full traditional scoring
...
This calculates scores as an average of three metrics. As noted in the
code, these metrics all have issues, but we want to use them to match up
with prior work.
This should be replaced with some simpler default scoring and the scorer
here should be moved to an external project to be passed in just for
generating the traditional scores.
2021-07-18 20:13:10 +09:00
Kenneth Enevoldsen
2880ae70b0
removed outdated spacy version for spacymoji
...
From the documentation of spacymoji (and the requirements.txt) it seems like it is not only for version 2.
2021-07-18 19:19:55 +09:00
Kenneth Enevoldsen
812746464b
fixed GitHub link and thumbnail
...
Sorry, I seem to have misunderstood that the GitHub reference shouldn't be a link.
2021-07-18 19:19:37 +09:00