spaCy

mirror of https://github.com/explosion/spaCy.git synced 2026-02-15 19:50:39 +03:00

Author	SHA1	Message	Date
svlandeg	5f0a91cf37	fix conv-depth parameter	2020-05-29 09:56:29 +02:00
Rajat	8b8efa1b42	update spacy universe with my project (#5497 ) * added contextualSpellCheck in spacy universe meta * removed extra formatting by code * updated with permanent links * run json linter used by spacy * filled SCA * updated the description	2020-05-25 11:30:23 +02:00
Sofie Van Landeghem	ae1c179f3a	Remove the nested quote	2020-05-23 17:58:19 +02:00
Jannis	aa53ce6996	Documentation Typo Fix (#5492 ) * Fix typo Change 'realize' to 'realise' * Add contributer agreement	2020-05-22 19:50:26 +02:00
Matthew Honnibal	f6078d866a	Merge pull request #5121 from adrianeboyd/bugfix/revert-token-match Revert token_match priority changes from #4374 and extend token match options	2020-05-22 14:42:51 +02:00
Ines Montani	65c7e82de2	Auto-format and remove 2.3 feature [ci skip]	2020-05-22 13:50:30 +02:00
Adriane Boyd	e4a1b5dab1	Rename to url_match Rename to `url_match` and update docs.	2020-05-22 12:41:03 +02:00
Adriane Boyd	730fa493a4	Merge remote-tracking branch 'upstream/master' into bugfix/revert-token-match	2020-05-22 12:18:00 +02:00
Ines Montani	ee027de032	Update universe and display of videos [ci skip]	2020-05-21 21:54:23 +02:00
Ines Montani	53da6bd672	Add course to landing [ci skip]	2020-05-21 20:45:33 +02:00
Kevin Lu	c7c4cd5fe1	Changed pyate code example in universe.json	2020-05-20 09:11:32 -07:00
Kevin Lu	0a5b140235	Update universe.json	2020-05-19 20:12:21 -07:00
Ines Montani	f333c2a011	Merge pull request #5386 from svlandeg/fix/nel-docs	2020-05-10 12:00:09 +02:00
Travis Hoppe	d4cc18b746	Added author information for NLPre (#5414 ) * Add author links for NLPre and update category * Add contributor statement	2020-05-08 11:28:54 +02:00
adrianeboyd	4a15b559ba	Clarify Token.pos as UPOS (#5419 )	2020-05-08 10:36:25 +02:00
adrianeboyd	a2345618f1	Fix Token API docs from #5375 (#5418 )	2020-05-08 10:25:02 +02:00
Adriane Boyd	565e0eef73	Add tokenizer option for token match with affixes To fix the slow tokenizer URL (#4374) and allow `token_match` to take priority over prefixes and suffixes by default, introduce a new tokenizer option for a token match pattern that's applied after prefixes and suffixes but before infixes.	2020-05-05 10:35:33 +02:00
Adriane Boyd	792c8af8cf	Merge remote-tracking branch 'upstream/master' into bugfix/revert-token-match	2020-05-05 09:25:57 +02:00
svlandeg	ebaed7dcfa	Few more updates to the EL documentation	2020-04-30 10:17:06 +02:00
adrianeboyd	bdff76dede	Various updates/additions to CLI scripts (#5362 ) * `debug-data`: determine coverage of provided vectors * `evaluate`: support `blank:lg` model to make it possible to just evaluate tokenization * `init-model`: add option to truncate vectors to N most frequent vectors from word2vec file * `train`: * if training on GPU, only run evaluation/timing on CPU in the first iteration * if training is aborted, exit with a non-0 exit status	2020-04-29 12:56:46 +02:00
Sofie Van Landeghem	cfdaf99b80	Fix passing of component configuration (#5374 ) * add kwargs to to_disk methods in docs - otherwise crashes on 'exclude' argument * add fix and test for Issue 5137	2020-04-29 12:56:17 +02:00
Ines Montani	63885c1836	Remove u string and auto-format [ci skip]	2020-04-29 12:54:57 +02:00
Sofie Van Landeghem	f67343295d	Update NEL examples and documentation (#5370 ) * simplify creation of KB by skipping dim reduction * small fixes to train EL example script * add KB creation and NEL training example scripts to example section * update descriptions of example scripts in the documentation * moving wiki_entity_linking folder from bin to projects * remove test for wiki NEL functionality that is being moved	2020-04-29 12:53:53 +02:00
adrianeboyd	a6e521cd79	Add is_sent_end token property (#5375 ) Reconstruction of the original PR #4697 by @MiniLau. Removes unused `SENT_END` symbol and `IS_SENT_END` from `Matcher` schema because the Matcher is only going to be able to support `IS_SENT_START`.	2020-04-29 12:53:16 +02:00
Ines Montani	a77754120d	Merge pull request #5177 from nlptechbook/patch-5	2020-04-29 12:52:21 +02:00
Ines Montani	1cbb272a6b	Update website/meta/universe.json	2020-04-29 12:51:44 +02:00
Ines Montani	732629b0dd	Update website/meta/universe.json	2020-04-29 12:51:37 +02:00
adrianeboyd	90ce34db42	Add cuda101 and cuda102 options to setup (#5377 ) * Add cuda101 and cuda102 options to setup * Update cudaNNN options in docs	2020-04-29 12:51:12 +02:00
Louis Guitton	a27c4014f5	Add mlflow to spaCy universe (#5352 ) * Add mlflow to universe * Use mlflow black logo	2020-04-29 10:18:03 +02:00
adrianeboyd	792aa7b6ab	Remove references to textcat spans (#5360 ) Remove references to unimplemented `TextCategorizer` span labels in `GoldParse` and `Doc`.	2020-04-27 18:01:12 +02:00
adrianeboyd	90c754024f	Update nlp.vectors to nlp.vocab.vectors (#5357 )	2020-04-27 10:53:05 +02:00
Mike	481574cbc8	[minor doc change] embedding vis. link is broken in `website/docs/usage/examples.md` (#5325 ) * The embedding vis. link is broken The first link seems to be reasonable for now unless someone has an updated embedding vis they want to share? * contributor agreement * Update Mlawrence95.md * Update website/docs/usage/examples.md Co-Authored-By: Sofie Van Landeghem <svlandeg@users.noreply.github.com> Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>	2020-04-21 20:35:12 +02:00
Ines Montani	b919844fce	Tidy up and fix alignment of landing cards (#5317 )	2020-04-20 20:33:13 +02:00
laszabine	fb73d4943a	Amend documentation to Language.evaluate (#5319 ) * Specified usage of arguments to Language.evaluate * Created contributor agreement	2020-04-16 20:00:18 +02:00
Thomas Thiebaud	1eef60c658	Add spacy_fastlang to universe (#5271 ) * Add spacy_fastlang to universe * Sign SCA	2020-04-15 13:50:46 +02:00
Sofie Van Landeghem	a3965ec13d	tag-map-path since 2.2.4 instead of 2.2.3 (#5289 )	2020-04-14 14:53:47 +02:00
Marek Grzenkowicz	6a8a52650f	[Closes #5292 ] Fix typo in option name "--n-save_every" (#5293 ) * Sign contributor agreement for chopeen * Fix typo in option name and close #5292	2020-04-11 23:35:01 +02:00
Sofie Van Landeghem	7ad0fcf01d	fix json (#5267 )	2020-04-08 12:58:09 +02:00
vincent d warmerdam	f329d5663a	add "whatlies" to spaCy universe (#5252 ) * Add "whatlies" We're releasing it on our side officially on the 16th of April. If possible, let's announce around the same time :) * sign contributor thing * Added fancy gif as the image * Update universe.json Spellin error and spaCy clarification.	2020-04-06 11:29:30 +02:00
nlptechbook	ddf3c2430d	Update universe.json	2020-04-03 12:10:03 -04:00
Sofie Van Landeghem	1137420840	Small doc fixes (#5250 ) * fix link * torchtext instead tochtext	2020-04-03 13:01:43 +02:00
Nikhil Saldanha	d1ddfa1cb7	update docs for EntityRecognizer.predict return type was wrongly written as a tuple, changed to syntax.StateClass	2020-03-28 18:13:02 +01:00
Tiljander	e53232533b	Describing priority rules for overlapping matches (#5197 ) * Describing priority rules for overlapping matches * Create Tiljander.md * Describing priority rules for overlapping matches * Update website/docs/api/entityruler.md Co-Authored-By: Ines Montani <ines@ines.io> Co-authored-by: Ines Montani <ines@ines.io>	2020-03-26 13:13:22 +01:00
adrianeboyd	d88a377bed	Remove Vectors.from_glove (#5209 )	2020-03-26 10:45:47 +01:00
nlptechbook	b52e1ab677	Update universe.json A bot powered by Clarifai Predict API and spaCy. Can be found in Telegram messenger at @pic2phrase_bot	2020-03-21 11:39:15 -04:00
Baciccin	3b53617a69	Add Ligurian language	2020-03-19 21:37:01 -07:00
Ines Montani	80e7e1347e	Update universe.json [ci skip]	2020-03-17 22:21:34 +01:00
Ines Montani	eda6eff8b1	Update universe.json [ci skip]	2020-03-17 22:19:29 +01:00
Ines Montani	16e7301d34	Merge pull request #5161 from pmbaumgartner/master add gobbli to spacy-universe 🥳	2020-03-17 22:18:30 +01:00
Peter B	b04057c204	add mentions of spaCy use	2020-03-17 15:03:43 -04:00

1 2 3 4 5 ...

1650 Commits