Commit Graph

15660 Commits

Author SHA1 Message Date
richardpaulhudson
42b7b8d509 Major refactoring 2022-10-21 12:01:24 +02:00
richardpaulhudson
f7d9942e7c Intermediate state 2022-10-20 21:48:53 +02:00
richardpaulhudson
2707d30ce0 Intermediate state 2022-10-19 23:20:11 +02:00
richardpaulhudson
356a341096 Add note 2022-10-14 20:29:22 +02:00
richardpaulhudson
fa5724e927 Remove unnecessary endianness stuff 2022-10-14 20:24:32 +02:00
richardpaulhudson
342433f09d Change to trigger CI 2022-10-14 18:22:04 +02:00
richardpaulhudson
07b6b53dae Correction 2022-10-14 17:22:48 +02:00
richardpaulhudson
c6cf5f2cb4 Fix indentation problem 2022-10-14 17:09:59 +02:00
richardpaulhudson
c116e11942 Add search char byte array feature 2022-10-14 17:03:52 +02:00
richardpaulhudson
1e9176f9c5 Intermediate state 2022-10-13 20:50:25 +02:00
richardpaulhudson
fc99b97e3c Merge branch 'feature/etl' of https://github.com/richardpaulhudson/spacy into feature/etl 2022-10-13 12:21:54 +02:00
richardpaulhudson
be363a7710 Intermediate state necessary to test equivalence 2022-10-13 12:20:56 +02:00
Richard Hudson
92762e69b4 Merge branch 'master' into feature/etl 2022-10-06 17:04:54 +02:00
richardpaulhudson
f410c066f4 Documentation improvements 2022-10-06 15:40:51 +02:00
richardpaulhudson
761d5ab9c3 Update errors 2022-10-06 15:12:41 +02:00
richardpaulhudson
581f380c00 Python code and documentation 2022-10-06 15:10:27 +02:00
richardpaulhudson
06fe50a12d Corrections 2022-10-06 08:04:50 +02:00
richardpaulhudson
f2c73aa85d Corrections 2022-10-06 07:50:35 +02:00
richardpaulhudson
7d4e99425b Another temporary type:ignore 2022-10-05 19:30:10 +02:00
richardpaulhudson
2a6c1cf63c Add temporary #type:ignore s 2022-10-05 19:15:18 +02:00
richardpaulhudson
ed76c89968 Remove extra lines 2022-10-05 18:57:10 +02:00
richardpaulhudson
28da06780e Remove extra line 2022-10-05 18:56:15 +02:00
richardpaulhudson
cbe2010e48 Format with black 2022-10-05 18:54:26 +02:00
richardpaulhudson
523bb2ad0b Temporarily commented out code 2022-10-05 18:51:08 +02:00
richardpaulhudson
6bb8d26528 Improvements 2022-10-05 18:35:46 +02:00
richardpaulhudson
d6c77659dc New error 2022-10-05 14:18:07 +02:00
richardpaulhudson
f712e0bc4a Performance improvements 2022-10-05 14:17:28 +02:00
Adriane Boyd
8cd77dd54c Sync flake8 version across requirements (#11580) 2022-10-04 11:23:04 +02:00
Sofie Van Landeghem
70e21dfcad
PR to test importlib-metadata (#11569)
* empty commit

* restrict importlib-metadata to lower than 5.0.0

* restrict importlib-metadata also for validate CI step

* set fixed version for CI

* try flake8 5.0.4 in CI validation step

* from importlib-metadata from requirements again
2022-10-03 13:04:03 +02:00
Paul O'Leary McCann
087cc74c6a
Remove mention of 1.7 from issue template (#11570)
It's rare to have anyone using v1 anymore, so this message is no longer
helpful.
2022-10-03 11:53:21 +02:00
richardpaulhudson
d296ae9d8e Intermediate state 2022-09-30 22:26:14 +02:00
richardpaulhudson
da63b9448b Intermediate state 2022-09-29 22:09:18 +02:00
Gabriele Picco
ff9002b726
Add Zshot Spacy plugin (#11557)
* Add Zshot Spacy plugin

Add Zshot (Zero and Few shot named entity & relationships recognition) Spacy plugin

* Update website/meta/universe.json

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>

* Update website/meta/universe.json

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2022-09-29 17:34:44 +02:00
Sofie Van Landeghem
bcda8bc1e7
update mypy to latest version (#11546)
* update mypy and disable it for python 3.6

* ignoring mypy's type redefinition error
2022-09-29 14:24:40 +02:00
richardpaulhudson
644d6131af Intermediate state 2022-09-29 13:14:42 +02:00
Paul O'Leary McCann
ba63f57f81 Update docs to reflect Doc input to Language (#11555) 2022-09-29 18:50:29 +09:00
Adriane Boyd
6d7630c5d3 Allow overriding spacy_version in spacy package meta (#11552) 2022-09-29 10:44:06 +02:00
Peter Baumgartner
e794d4ae39
debug data Spancat Table Improvements (#11504)
* update

* fix format function

* pull out _format_number

* format with black
2022-09-28 17:16:05 +02:00
Raphael Mitsch
aea16719be
Simplify and clarify enable/disable behavior of spacy.load() (#11459)
* Change enable/disable behavior so that arguments take precedence over config options. Extend error message on conflict. Add warning message in case of overwriting config option with arguments.

* Fix tests in test_serialize_pipeline.py to reflect changes to handling of enable/disable.

* Fix type issue.

* Move comment.

* Move comment.

* Issue UserWarning instead of printing wasabi message. Adjust test.

* Added pytest.warns(UserWarning) for expected warning to fix tests.

* Update warning message.

* Move type handling out of fetch_pipes_status().

* Add global variable for default value. Use id() to determine whether used values are default value.

* Fix default value for disable.

* Rename DEFAULT_PIPE_STATUS to _DEFAULT_EMPTY_PIPES.
2022-09-27 14:22:36 +02:00
Taniguchi Yasufumi
9557b0fb01 Add spacy-partial-tagger to spaCy Universe (#11538) 2022-09-27 14:11:50 +02:00
Paul O'Leary McCann
a44b7d4622
Add experimental coref docs (#11291)
* Add experimental coref docs

* Docs cleanup

* Apply suggestions from code review

Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>

* Apply changes from code review

* Fix prettier formatting

It seems a period after a number made this think it was a list?

* Update docs on examples for initialize

* Add docs for coref scorers

* Remove 3.4 notes from coref

There won't be a "new" tag until it's in core.

* Add docs for span cleaner

* Fix docs

* Fix docs to match spacy-experimental

These weren't properly updated when the code was moved out of spacy
core.

* More doc fixes

* Formatting

* Update architectures

* Fix links

* Fix another link

Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
Co-authored-by: svlandeg <svlandeg@github.com>
2022-09-27 18:11:23 +09:00
Adriane Boyd
877671e09a
Preserve missing entity annotation in augmenters (#11540)
Preserve both `-` and `O` annotation in augmenters rather than relying
on `Example.to_dict`'s default support for one option outside of labeled
entity spans.

This is intended as a temporary workaround for augmenters for v3.4.x.
The behavior of `Example` and related IOB utils could be improved in the
general case for v3.5.
2022-09-27 10:16:51 +02:00
Paul O'Leary McCann
936a5f0506 Fix English pipeline names in 3.4 release notes (#11542) 2022-09-27 08:25:24 +02:00
Richard Hudson
6f692a06d5
Remove side effects from Doc.__init__() (#11506)
* Remove side effects from Doc.__init__()

* Changes based on review comment

* Readd test

* Change interface of Doc.__init__()

* Simplify test

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>

* Update doc.md

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
2022-09-26 15:58:21 +02:00
Basile Dura
f40d2fac29 fix: remove duplicate v3.2 (#11530) 2022-09-23 13:18:51 +02:00
richardpaulhudson
6f42d79c1e Intermediate state 2022-09-16 20:00:20 +02:00
Raphael Mitsch
af9b01ef97
Add dependency check to project step runs (#11226)
* Add dependency check to project step running.

* Fix dependency mismatch warning.

* Remove newline.

* Add types-setuptools to setup.cfg.

* Move types-setuptools to test requirements. Move warnings into _validate_requirements(). Handle file reading in project_run().

* Remove newline formatting for output of package conflicts.

* Show full version conflict message instead of just package name.

* Update spacy/cli/project/run.py

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>

* Fix typo.

* Re-add rephrasing of message for conflicting packages. Remove requirements path redundancy.

* Update spacy/cli/project/run.py

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>

* Update spacy/cli/project/run.py

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>

* Print unified message for requirement conflicts and missing requirements.

* Update spacy/cli/project/run.py

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>

* Fix warning message.

* Print conflict/missing messages individually.

* Print conflict/missing messages individually.

* Add check_requirements setting in project.yml to disable requirements check.

* Update website/docs/usage/projects.md

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>

* Update website/docs/usage/projects.md

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>

* Update description of project.yml structure in projects.md.

* Update website/docs/usage/projects.md

Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>

* Prettify projects docs.

Co-authored-by: Adriane Boyd <adrianeboyd@gmail.com>
Co-authored-by: Sofie Van Landeghem <svlandeg@users.noreply.github.com>
2022-09-16 16:54:31 +02:00
richardpaulhudson
d575b9f8d4 Return 64-bit integers 2022-09-16 13:28:58 +02:00
github-actions[bot]
279358be63
Auto-format code with black (#11513)
Co-authored-by: explosion-bot <explosion-bot@users.noreply.github.com>
2022-09-16 11:50:19 +02:00
Sofie Van Landeghem
df0b815c23
more explicit Example constructor example (#11489)
* make constructor example for Example more explicit

* shorten example and add spaces
2022-09-16 09:26:33 +02:00