spaCy

mirror of https://github.com/explosion/spaCy.git synced 2025-11-01 00:17:44 +03:00

Author	SHA1	Message	Date
Adriane Boyd	f94168a41e	Backport bugfixes from v3.1.0 to v3.0 (#8739 ) * Fix scoring normalization (#7629) * fix scoring normalization * score weights by total sum instead of per component * cleanup * more cleanup * Use a context manager when reading model (fix #7036) (#8244) * Fix other open calls without context managers (#8245) * Don't add duplicate patterns all the time in EntityRuler (fix #8216) (#8246) * Don't add duplicate patterns (fix #8216) * Refactor EntityRuler init This simplifies the EntityRuler init code. This is helpful as prep for allowing the EntityRuler to reset itself. * Make EntityRuler.clear reset matchers Includes a new test for this. * Tidy PhraseMatcher instantiation Since the attr can be None safely now, the guard if is no longer required here. Also renamed the `_validate` attr. Maybe it's not needed? * Fix NER test * Add test to make sure patterns aren't increasing * Move test to regression tests * Exclude generated .cpp files from package (#8271) * Fix non-deterministic deduplication in Greek lemmatizer (#8421) * Fix setting empty entities in Example.from_dict (#8426) * Filter W036 for entity ruler, etc. (#8424) * Preserve paths.vectors/initialize.vectors setting in quickstart template * Various fixes for spans in Docs.from_docs (#8487) * Fix spans offsets if a doc ends in a single space and no space is inserted * Also include spans key in merged doc for empty spans lists * Fix duplicate spacy package CLI opts (#8551) Use `-c` for `--code` and not additionally for `--create-meta`, in line with the docs. * Raise an error for textcat with <2 labels (#8584) * Raise an error for textcat with <2 labels Raise an error if initializing a `textcat` component without at least two labels. * Add similar note to docs * Update positive_label description in API docs * Add Macedonian models to website (#8637) * Fix Azerbaijani init, extend lang init tests (#8656) * Extend langs in initialize tests * Fix az init * Fix ru/uk lemmatizer mp with spawn (#8657) Use an instance variable instead a class variable for the morphological analzyer so that multiprocessing with spawn is possible. * Use 0-vector for OOV lexemes (#8639) * Set version to v3.0.7 Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com> Co-authored-by: Paul O'Leary McCann <polm@dampfkraft.com>	2021-07-19 09:20:40 +02:00
Adriane Boyd	8a95475b3d	Set version to v3.0.6 (#7854 )	2021-04-22 16:33:26 +02:00
Adriane Boyd	53a3b967ac	Update thinc pin and set version to v3.0.5 (#7389 )	2021-03-10 11:10:53 +01:00
Adriane Boyd	3b911ee5ef	Set version to v3.0.4 (#7376 )	2021-03-09 16:49:41 +01:00
Ines Montani	660642902a	Increment version [ci skip]	2021-02-14 13:36:13 +11:00
Matthew Honnibal	b31471b5b8	Set version to v3.0.2	2021-02-13 23:50:00 +11:00
Matthew Honnibal	b6a198481b	Set version to v3.0.0	2021-02-02 20:26:17 +11:00
Sofie Van Landeghem	f319d2765f	Add capture argument to project_run (#6878 ) * add capture argument to project_run and run_commands * git bump to 3.0.1 * Set version to 3.0.1.dev0 Co-authored-by: Matthew Honnibal <honnibal+gh@gmail.com>	2021-02-02 10:11:15 +08:00
Ines Montani	14f631f52c	Update parent package and version [ci skip]	2021-01-30 20:12:42 +11:00
Matthew Honnibal	e8674c5c42	Set version to v3.0.0rc5	2021-01-26 14:55:41 +11:00
Sofie Van Landeghem	e680efc7cc	Set annotations in update (#6767 ) * bump to 3.0.0rc4 * do set_annotations in component update calls * update docs and remove set_annotations flag * fix EL test	2021-01-20 11:49:25 +11:00
Adriane Boyd	084fc575aa	Set version to v3.0.0rc3	2020-11-03 17:29:57 +01:00
Sofie Van Landeghem	75a202ce65	TextCat updates and fixes (#6263 ) * small fix in example imports * throw error when train_corpus or dev_corpus is not a string * small fix in custom logger example * limit macro_auc to labels with 2 annotations * fix typo * also create parents of output_dir if need be * update documentation of textcat scores * refactor TextCatEnsemble * fix tests for new AUC definition * bump to 3.0.0a42 * update docs * rename to spacy.TextCatEnsemble.v2 * spacy.TextCatEnsemble.v1 in legacy * cleanup * small fix * update to 3.0.0rc2 * fix import that got lost in merge * cursed IDE * fix two typos	2020-10-18 14:50:41 +02:00
Ines Montani	ff4267d181	Fix success message [ci skip]	2020-10-15 14:42:08 +02:00
Ines Montani	10611bf56a	Increment version [ci skip]	2020-10-15 13:30:11 +02:00
Sofie Van Landeghem	f8a1c1afd6	avoid dropout at runtime (#6247 )	2020-10-13 14:39:59 +02:00
Ines Montani	a0e12c136b	Increment version [ci skip]	2020-10-13 10:00:53 +02:00
Ines Montani	4fa967ea84	Increment version [ci skip]	2020-10-11 13:10:58 +02:00
Ines Montani	bfa3931c9d	Revert added_strings change (#6236 )	2020-10-10 18:55:07 +02:00
Ines Montani	796f8b9424	Increment version	2020-10-09 18:00:27 +02:00
Sofie Van Landeghem	d093d6343b	TrainablePipe (#6213 ) * rename Pipe to TrainablePipe * split functionality between Pipe and TrainablePipe * remove unnecessary methods from certain components * cleanup * hasattr(component, "pipe") should be sufficient again * remove serialization and vocab/cfg from Pipe * unify _ensure_examples and validate_examples * small fixes * hasattr checks for self.cfg and self.vocab * make is_resizable and is_trainable properties * serialize strings.json instead of vocab * fix KB IO + tests * fix typos * more typos * _added_strings as a set * few more tests specifically for _added_strings field * bump to 3.0.0a36	2020-10-08 21:33:49 +02:00
Matthew Honnibal	1a500f9717	Set version to v3.0.0a35	2020-10-06 14:19:07 +02:00
Ines Montani	9ca283a899	Merge branch 'develop' into feature/project-spacy-version	2020-10-05 21:06:07 +02:00
Ines Montani	582701519e	Remove __release__ flag	2020-10-05 20:00:49 +02:00
Sofie Van Landeghem	f4f49f5877	update blis (#6198 ) * allow higher blis version * fix typo * bump to 3.0.0a34 * fix pins in other files	2020-10-05 14:58:56 +02:00
Ines Montani	4b15ff7504	Increment version [ci skip]	2020-10-04 22:47:04 +02:00
Matthew Honnibal	70b9de8e58	Set version to v3.0.0a32	2020-10-03 19:26:52 +02:00
Ines Montani	d6c967401f	Increment version	2020-10-03 17:20:47 +02:00
Matthew Honnibal	7b127f307e	Set version to v3.0.0a30	2020-10-03 16:06:42 +02:00
Ines Montani	568768643e	Increment version [ci skip]	2020-10-02 01:50:13 +02:00
Ines Montani	da30701cd1	Increment version [ci skip]	2020-10-01 21:58:11 +02:00
Ines Montani	1700c8541e	Increment version [ci skip]	2020-10-01 17:57:16 +02:00
Sofie Van Landeghem	009ba14aaf	Fix pretraining in train script (#6143 ) * update pretraining API in train CLI * bump thinc to 8.0.0a35 * bump to 3.0.0a26 * doc fixes * small doc fix	2020-09-25 15:47:10 +02:00
Ines Montani	20b89a9717	Increment version [ci skip]	2020-09-24 16:57:02 +02:00
Ines Montani	3f751e68f5	Increment version [ci skip]	2020-09-24 14:45:41 +02:00
Ines Montani	3f77eb749c	Increment version [ci skip]	2020-09-23 19:50:15 +02:00
Ines Montani	3c3863654e	Increment version [ci skip]	2020-09-23 16:54:43 +02:00
Ines Montani	db7126ead9	Increment version	2020-09-22 10:31:26 +02:00
Ines Montani	b3327c1e45	Increment version [ci skip]	2020-09-21 16:04:30 +02:00
Matthew Honnibal	8303d101a5	Set version to v3.0.0a19	2020-09-17 00:18:49 +02:00
Matthew Honnibal	475323cd36	Set version to v3.0.0a18	2020-09-14 22:05:43 +02:00
Matthew Honnibal	fdd2340f6c	Set version to v3.0.0a17	2020-09-13 23:52:03 +02:00
Ines Montani	908f3a4494	Update default projects repo [ci skip]	2020-09-10 11:42:14 +02:00
Sofie Van Landeghem	8e7557656f	Renaming gold & annotation_setter (#6042 ) * version bump to 3.0.0a16 * rename "gold" folder to "training" * rename 'annotation_setter' to 'set_extra_annotations' * formatting	2020-09-09 10:31:03 +02:00
Matthew Honnibal	5d09e3e154	Set version to v3.0.0a15	2020-09-08 15:25:10 +02:00
Matthew Honnibal	12e1279f6b	Set version to v3.0.0a14	2020-09-05 04:13:53 +02:00
Matthew Honnibal	9341cbc013	Set version to v3.0.0a13	2020-08-30 23:10:43 +02:00
Matthew Honnibal	6547472347	Set version to v3.0.0a12	2020-08-26 04:02:34 +02:00
Matthew Honnibal	8d6e1ce306	Update v3.0.0a11	2020-08-25 00:32:08 +02:00
Matthew Honnibal	001546c19e	Set version to v3.0.0a10	2020-08-23 21:15:38 +02:00

1 2 3 4 5 ...

298 Commits