spaCy

mirror of https://github.com/explosion/spaCy.git synced 2025-01-24 08:14:15 +03:00

Author	SHA1	Message	Date
Jonathan Besomi	546f3d10d4	Add texthero to universe.json (#5716 ) * Add texthero to universe.json * Add spaCy contributor Agreement	2020-07-07 20:54:22 +02:00
Ines Montani	19b9ea0436	Fix languages.json	2020-06-16 18:34:11 +02:00
Ines Montani	41003a5117	Update Binder version [ci skip]	2020-06-16 17:41:23 +02:00
Ines Montani	fd89f44c0c	Update Binder URL [ci skip]	2020-06-16 17:34:26 +02:00
Adriane Boyd	d5110ffbf2	Documentation updates for v2.3.0 (#5593 ) * Update website models for v2.3.0 * Add docs for Chinese word segmentation * Tighten up Chinese docs section * Merge branch 'master' into docs/v2.3.0 [ci skip] * Merge branch 'master' into docs/v2.3.0 [ci skip] * Auto-format and update version * Update matcher.md * Update languages and sorting * Typo in landing page * Infobox about token_match behavior * Add meta and basic docs for Japanese * POS -> TAG in models table * Add info about lookups for normalization * Updates to API docs for v2.3 * Update adding norm exceptions for adding languages * Add --omit-extra-lookups to CLI API docs * Add initial draft of "What's New in v2.3" * Add new in v2.3 tags to Chinese and Japanese sections * Add tokenizer to migration section * Add new in v2.3 flags to init-model * Typo * More what's new in v2.3 Co-authored-by: Ines Montani <ines@ines.io>	2020-06-16 15:37:35 +02:00
Martino Mensio	de00f967ce	adding spacy-universal-sentence-encoder (#5534 ) * adding spacy-universal-sentence-encoder * update affiliation * updated code example	2020-06-08 20:26:30 +02:00
Rajat	8b8efa1b42	update spacy universe with my project (#5497 ) * added contextualSpellCheck in spacy universe meta * removed extra formatting by code * updated with permanent links * run json linter used by spacy * filled SCA * updated the description	2020-05-25 11:30:23 +02:00
Sofie Van Landeghem	ae1c179f3a	Remove the nested quote	2020-05-23 17:58:19 +02:00
Ines Montani	ee027de032	Update universe and display of videos [ci skip]	2020-05-21 21:54:23 +02:00
Kevin Lu	c7c4cd5fe1	Changed pyate code example in universe.json	2020-05-20 09:11:32 -07:00
Kevin Lu	0a5b140235	Update universe.json	2020-05-19 20:12:21 -07:00
Travis Hoppe	d4cc18b746	Added author information for NLPre (#5414 ) * Add author links for NLPre and update category * Add contributor statement	2020-05-08 11:28:54 +02:00
Ines Montani	63885c1836	Remove u string and auto-format [ci skip]	2020-04-29 12:54:57 +02:00
Ines Montani	a77754120d	Merge pull request #5177 from nlptechbook/patch-5	2020-04-29 12:52:21 +02:00
Ines Montani	1cbb272a6b	Update website/meta/universe.json	2020-04-29 12:51:44 +02:00
Ines Montani	732629b0dd	Update website/meta/universe.json	2020-04-29 12:51:37 +02:00
Louis Guitton	a27c4014f5	Add mlflow to spaCy universe (#5352 ) * Add mlflow to universe * Use mlflow black logo	2020-04-29 10:18:03 +02:00
Thomas Thiebaud	1eef60c658	Add spacy_fastlang to universe (#5271 ) * Add spacy_fastlang to universe * Sign SCA	2020-04-15 13:50:46 +02:00
Sofie Van Landeghem	7ad0fcf01d	fix json (#5267 )	2020-04-08 12:58:09 +02:00
vincent d warmerdam	f329d5663a	add "whatlies" to spaCy universe (#5252 ) * Add "whatlies" We're releasing it on our side officially on the 16th of April. If possible, let's announce around the same time :) * sign contributor thing * Added fancy gif as the image * Update universe.json Spellin error and spaCy clarification.	2020-04-06 11:29:30 +02:00
nlptechbook	ddf3c2430d	Update universe.json	2020-04-03 12:10:03 -04:00
Sofie Van Landeghem	1137420840	Small doc fixes (#5250 ) * fix link * torchtext instead tochtext	2020-04-03 13:01:43 +02:00
nlptechbook	b52e1ab677	Update universe.json A bot powered by Clarifai Predict API and spaCy. Can be found in Telegram messenger at @pic2phrase_bot	2020-03-21 11:39:15 -04:00
Baciccin	3b53617a69	Add Ligurian language	2020-03-19 21:37:01 -07:00
Ines Montani	80e7e1347e	Update universe.json [ci skip]	2020-03-17 22:21:34 +01:00
Ines Montani	eda6eff8b1	Update universe.json [ci skip]	2020-03-17 22:19:29 +01:00
Ines Montani	16e7301d34	Merge pull request #5161 from pmbaumgartner/master add gobbli to spacy-universe 🥳	2020-03-17 22:18:30 +01:00
Peter B	b04057c204	add mentions of spaCy use	2020-03-17 15:03:43 -04:00
Ines Montani	b2b01a5c8b	Update universe.json [ci skip]	2020-03-17 19:53:31 +01:00
Peter B	d2ffb406ad	add gobbli to spacy-universe 🥳	2020-03-17 08:30:29 -04:00
nihil	9cde7eb08c	add spacy_syllables to universe + sign contributor agreement	2020-03-13 18:09:42 +01:00
Ines Montani	1d6aec805d	Fix formatting and update docs for v2.2.4	2020-03-09 11:17:20 +01:00
Ines Montani	4890db6339	Auto-format and fix image [ci skip]	2020-02-23 13:56:50 +01:00
nlptechbook	979a3fd1f5	Update universe.json (#5022 ) e-book is available from https://nostarch.com/NLPPython	2020-02-15 15:44:55 +01:00
Omri Mendels	6ff947e1f9	Added presidio-research to universe.json (#4950 ) * Added presidio-research to universe.json Added a reference to Presidio Research, the data-science toolbox for Microsoft Presidio. * Updated url	2020-02-03 12:57:55 +01:00
Paco Nathan	49fefb6139	Submitting `PyTextRank` for inclusion in the spaCy uniVerse (#4942 ) * submitting PyTextRank for consideration of including in the spaCy uniVerse * including SCA	2020-01-28 11:37:54 +01:00
Bram Vanroy	718704022a	Changes to spacy_conll in universe (#4914 ) * Update information on spacy_conll * Typo fix	2020-01-16 01:56:39 +01:00
Ines Montani	1b838d1313	Divide models into core and starters [ci skip]	2019-12-21 14:10:22 +01:00
Ines Montani	c466e02466	Update universe [ci skip]	2019-12-13 15:57:39 +01:00
Paul O'Leary McCann	f0e3e606a6	Replace python-mecab3 with fugashi for Japanese (#4621 ) * Switch from mecab-python3 to fugashi mecab-python3 has been the best MeCab binding for a long time but it's not very actively maintained, and since it's based on old SWIG code distributed with MeCab there's a limit to how effectively it can be maintained. Fugashi is a new Cython-based MeCab wrapper I wrote. Since it's not based on the old SWIG code it's easier to keep it current and make small deviations from the MeCab C/C++ API where that makes sense. * Change mecab-python3 to fugashi in setup.cfg * Change "mecab tags" to "unidic tags" The tags come from MeCab, but the tag schema is specified by Unidic, so it's more proper to refer to it that way. * Update conftest * Add fugashi link to external deps list for Japanese	2019-11-23 14:31:04 +01:00
richardpaulhudson	8d06386e1e	Update to Holmes Universe entry (#4679 ) * Updated Universe entry for Holmes * Correction * Updated model name * Updated wording	2019-11-21 16:23:24 +01:00
Ines Montani	4b95587ad4	Update universe.json [ci skip]	2019-11-04 13:55:55 +01:00
Yash Patadia	0c396aeed4	add dframcy to universe.json (#4580 )	2019-11-04 13:53:23 +01:00
Ines Montani	726c5dd306	Update universe.json [ci skip]	2019-10-30 13:29:00 +01:00
Neel Kamath	6c036ab57d	Add "spaCy Server" to spaCy Universe (#4553 ) * Add "spaCy Server" to spaCy Universe * Accept the spaCy Contributor Agreement	2019-10-30 13:20:46 +01:00
Nipun Sadvilkar	2a5e71232b	✨ project: pySBD - Python Sentence Boundary Disambiguation (#4455 ) * ✨ project: pySBD - Python Sentence Boundary Disambiguation * 📝 Update links and description * 🐛 Fix missing comma * Update universe.json pysbd as a spacy component through entrypoints * 🚨 Fix universe.json * 📝 Update code_example	2019-10-30 12:13:29 +01:00
Ines Montani	1180304449	Update languages.json [ci skip]	2019-10-26 13:51:42 +02:00
Ines Montani	388ea03065	Update universe.json [ci skip]	2019-10-22 14:54:47 +02:00
Kabir Khan	8a7a30ea1d	Add cookiecutter-spacy-fastapi to spacy universe (#4498 )	2019-10-22 14:50:40 +02:00
Julin S	3ee15fce0d	Update information about Rasa (#4492 ) Rasa has been updated and rasa core and rasa nlu have been merged.	2019-10-22 14:32:31 +02:00
Ines Montani	8f76d6c9ef	Update transformer model details [ci skip]	2019-10-08 15:39:38 +02:00
Ines Montani	12a941d841	Update binder version [ci skip]	2019-10-02 16:47:01 +02:00
Ines Montani	b6670bf0c2	Use consistent spelling	2019-10-02 10:37:39 +02:00
Ines Montani	61263e2fbc	Update universe.json [ci skip]	2019-09-30 13:49:44 +02:00
Ines Montani	3624153591	Update languages.json [ci skip]	2019-09-27 15:15:41 +02:00
Ajinkya Kale	975aebd7e4	typo fix for wordnet_annotator (#4326 )	2019-09-27 11:52:53 +02:00
Eric Semeniuc	09816f8323	update sense2vec version (#4320 )	2019-09-25 12:17:54 +02:00
Sofie Van Landeghem	42340740e3	update neuralcoref example (#4317 )	2019-09-24 10:47:17 +02:00
Ines Montani	d84763727c	Remove unused setting [ci skip]	2019-09-18 21:24:14 +02:00
Ines Montani	dd1810f05a	Update DocBin and add docs	2019-09-18 20:23:21 +02:00
Ines Montani	23e28e2844	Merge branch 'master' into develop	2019-09-15 17:57:09 +02:00
Ines Montani	c7e4ea7154	Update examples and languages.json [ci skip]	2019-09-15 17:56:40 +02:00
Ines Montani	16c2522791	Merge branch 'master' into develop	2019-09-14 16:42:01 +02:00
Ines Montani	86befc80bf	WIP: Add v2.2 page [ci skip]	2019-09-14 16:41:48 +02:00
Ines Montani	76d26a3d5e	Update site.json [ci skip]	2019-09-14 16:32:24 +02:00
Ines Montani	fe87ccc8d1	Update languages.json [ci skip]	2019-09-14 16:23:50 +02:00
Ines Montani	82c16b7943	Remove u-strings and fix formatting [ci skip]	2019-09-12 16:11:15 +02:00
Ines Montani	10257f3131	Document Lookups [ci skip]	2019-09-12 14:00:14 +02:00
Sofie Van Landeghem	0b4b4f1819	Documentation for Entity Linking (#4065 ) * document token ent_kb_id * document span kb_id * update pipeline documentation * prior and context weights as bool's instead * entitylinker api documentation * drop for both models * finish entitylinker documentation * small fixes * documentation for KB * candidate documentation * links to api pages in code * small fix * frequency examples as counts for consistency * consistent documentation about tensors returned by predict * add entity linking to usage 101 * add entity linking infobox and KB section to 101 * entity-linking in linguistic features * small typo corrections * training example and docs for entity_linker * predefined nlp and kb * revert back to similarity encodings for simplicity (for now) * set prior probabilities to 0 when excluded * code clean up * bugfix: deleting kb ID from tokens when entities were removed * refactor train el example to use either model or vocab * pretrain_kb example for example kb generation * add to training docs for KB + EL example scripts * small fixes * error numbering * ensure the language of vocab and nlp stay consistent across serialization * equality with = * avoid conflict in errors file * add error 151 * final adjustements to the train scripts - consistency * update of goldparse documentation * small corrections * push commit * typo fix * add candidate API to kb documentation * update API sidebar with EntityLinker and KnowledgeBase * remove EL from 101 docs * remove entity linker from 101 pipelines / rephrase * custom el model instead of existing model * set version to 2.2 for EL functionality * update documentation for 2 CLI scripts	2019-09-12 11:38:34 +02:00
Ines Montani	2f31f96fce	Update languages.json [ci skip]	2019-09-04 18:15:42 +02:00
Ines Montani	2245e95e2d	Update languages.json [ci skip]	2019-09-04 17:11:40 +02:00
Ines Montani	b91425f803	Update universe.json [ci skip]	2019-08-28 13:45:06 +02:00
Ines Montani	aedae8b4c5	Update universe.json [ci skip]	2019-08-28 11:59:06 +02:00
Ines Montani	8114933f01	Fix universe.json [ci skip]	2019-08-27 12:13:42 +02:00
Ines Montani	48385552c6	Update languages.json [ci skip]	2019-08-27 11:52:51 +02:00
yanaiela	5d7bc26735	new universe project - the numeric fused-head (#4192 ) * new universe project * Update website/meta/universe.json Co-Authored-By: Ines Montani <ines@ines.io> * Update website/meta/universe.json Co-Authored-By: Ines Montani <ines@ines.io>	2019-08-25 17:25:28 +02:00
Ines Montani	b072c13017	Update universe with videos [ci skip]	2019-08-21 21:35:37 +02:00
Pavle Vidanović	4fe9329bfb	Serbian language code update "rs" -> "sr" (#4159 ) * Serbian stopwords added. (cyrillic alphabet) * spaCy Contribution agreement included. * Test initialize updated * Serbian language code update. --bugfix	2019-08-21 19:57:37 +02:00
Ines Montani	072860fcd0	Auto-format [ci skip]	2019-08-20 14:46:41 +02:00
Andrei-Marius Avram	199589228e	Added RONEC to spaCy Universe (#4151 ) * Added RONEC to spaCy Universe * Added contributor file * Corrected date from .github/contributors/avramandrei.md * Convert tabs to spaces * Remove duplicate keys Can only have one GitHub link unfortunately * Also add models category * Adjust ID This is used to generate the URL, so a simpler string is better	2019-08-20 14:46:07 +02:00
Jeno	91441f169c	Update universe.json to include negspacy (#4132 )	2019-08-16 17:48:17 +02:00
Jeno Pizarro	2e6e0321dd	Update universe.json to include negspacy	2019-08-16 10:24:09 -04:00
Ines Montani	1f4d8bf77e	Update universe.json [ci skip]	2019-08-09 17:42:37 +02:00
ICLR&D	87e40b17a0	Add entry for Blackstone in universe.json (#4101 ) * Add entry for Blackstone in universe.json Add an entry for the Blackstone project. Checked JSON is valid. * Create ICLRandD.md * Fix indentation (tabs to spaces) It looks like during validation, the JSON file automatically changed spaces to tabs. This caused the diff to show everything as changed, which is obviously not true. This hopefully fixes that. * Try to fix formatting for diff * Fix diff Co-authored-by: Ines Montani <ines@ines.io>	2019-08-09 17:16:51 +02:00
Ines Montani	a2ac2e873f	Update Binder version [ci skip]	2019-08-08 13:03:45 +02:00
Ines Montani	3e60afacf9	Add Serbian to languages [ci skip]	2019-08-07 13:38:25 +02:00
Ines Montani	1dc28a9ecb	Update Binder version [ci skip]	2019-08-07 13:38:12 +02:00
Ines Montani	7f3212e2f5	💫 Sync branches (#4084 ) [ci skip] * Update from master * Re-added Universe readme (#3688) (closes #3680) * Fix typo * Add version tag to `--base-model` argument (closes #3720) * fixing regex matcher examples (#3708) (#3719) * Improve Token.prob and Lexeme.prob docs (resolves #3701) * Fix DependencyParser.predict docs (resolves #3561) * Update languages.json Co-authored-by: Bram Vanroy <Bram.Vanroy@UGent.be> Co-authored-by: Aaron Kub <aaronkub@gmail.com>	2019-08-05 14:32:54 +02:00
Ines Montani	0f740fad1a	Update universe.json [ci skip]	2019-08-05 14:30:07 +02:00
Mohammed Daudali	23ec07debd	Correct typo for AllenAI url on homepage (#4050 ) * Typo fix for AllenAI url Changed incorrect home page url for AllenAI from appenai.org to allenai.org * Sign contributor agreement * Change date format	2019-07-31 00:16:33 +02:00
Ines Montani	4ebb4865fe	Update languages.json	2019-07-10 11:19:48 +02:00
cedar101	58f06e6180	Korean support (#3901 ) * start lang/ko * add test codes * using natto-py * add test_ko_tokenizer_full_tags() * spaCy contributor agreement * external dependency for ko * collections.namedtuple for python version < 3.5 * case fix * tuple unpacking * add jongseong(final consonant) * apply mecab option * Remove Pipfile for now Co-authored-by: Ines Montani <ines@ines.io>	2019-07-09 22:23:16 +02:00
Ines Montani	4f1dae1c6b	Update languages and examples (see #1107 )	2019-06-26 16:19:17 +02:00
Ines Montani	511977ae5e	Update universe [ci skip]	2019-06-04 11:15:51 +02:00
Ines Montani	62ebc65c62	Update universe [ci skip]	2019-06-03 12:19:13 +02:00
Ines Montani	e703301129	Update universe [ci skip]	2019-06-02 13:55:55 +02:00
Ines Montani	892e72451f	Update universe [ci skip]	2019-06-02 12:58:12 +02:00
Ines Montani	42de5be90c	Tidy up universe [ci skip]	2019-06-02 12:38:48 +02:00
Nirant	638caba9b5	Add multiple packages to universe.json (#3809 ) [ci skip] * Add multiple packages to universe.json Added following packages: NLPArchitect, NLPRe, Chatterbot, alibi, NeuroNER * Auto-format * Update slogan (probably just copy-paste mistake) * Adjust formatting * Update tags / categories	2019-06-02 12:35:52 +02:00
Nirant	d4d1eab5e1	Add Baderlab/saber to universe.json (#3806 )	2019-06-01 17:36:40 +02:00

1 2 3 4

175 Commits