Matthw Honnibal
85ed5730a2
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
2020-06-30 01:14:16 +02:00
Ines Montani
e8033df81e
Also handle python3 and pip3
2020-06-29 20:30:42 +02:00
Ines Montani
c874dde66c
Show help on "spacy project"
2020-06-29 20:11:34 +02:00
Ines Montani
1d2c646e57
Fix init and remove .dvc/plots
2020-06-29 20:07:21 +02:00
Matthw Honnibal
5bed6fc431
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
2020-06-29 19:55:24 +02:00
svlandeg
1176783310
fix one more shlex.split
2020-06-29 18:37:42 +02:00
svlandeg
ff233d5743
print details on error msg (e.g. PermissionError on specific file)
2020-06-29 18:22:33 +02:00
svlandeg
894b8e7ff6
throw warning (instead of crashing) when temp dir can't be cleaned
2020-06-29 18:16:39 +02:00
svlandeg
efe7eb71f2
create subfolder in working dir
2020-06-29 17:46:08 +02:00
svlandeg
3487214ba1
fix shlex.split for non-posix
2020-06-29 17:45:47 +02:00
Ines Montani
126050f259
Improve asset fetching
...
Get all paths first and run dvc add once so it only shows one progress bar and one combined git command (if repo is git repo)
2020-06-29 16:55:24 +02:00
Ines Montani
7c08713baa
Improve error messages
2020-06-29 16:54:47 +02:00
Ines Montani
24664efa23
Import project_run_all function
2020-06-29 16:54:19 +02:00
svlandeg
f8dddeda27
print help msg when just calling 'project' without args
2020-06-29 16:38:15 +02:00
svlandeg
bf43ebbf61
fix typo's
2020-06-29 16:32:25 +02:00
Matthew Honnibal
67928036f2
Set version to v3.0.0.dev12
2020-06-29 14:45:43 +02:00
Matthew Honnibal
2d715451a2
Revert "Convert custom user_data to token extension format for Japanese tokenizer ( #5652 )" ( #5665 )
...
This reverts commit 1dd38191ec
.
2020-06-29 14:34:15 +02:00
Sofie Van Landeghem
8d3c0306e1
refactor fixes ( #5664 )
...
* fixes in ud_train, UX for morphs
* update pyproject with new version of thinc
* fixes in debug_data script
* cleanup of old unused error messages
* remove obsolete TempErrors
* move error messages to errors.py
* add ENT_KB_ID to default DocBin serialization
* few fixes to simple_ner
* fix tags
2020-06-29 14:33:00 +02:00
Adriane Boyd
1dd38191ec
Convert custom user_data to token extension format for Japanese tokenizer ( #5652 )
...
* Convert custom user_data to token extension format
Convert the user_data values so that they can be loaded as custom token
extensions for `inflection`, `reading_form`, `sub_tokens`, and `lemma`.
* Reset Underscore state in ja tokenizer tests
2020-06-29 14:20:26 +02:00
Adriane Boyd
167df42cb6
Move lemmatizer is_base_form to language settings ( #5663 )
...
Move `Lemmatizer.is_base_form` to the language settings so that each
language can provide a language-specific method as
`LanguageDefaults.is_base_form`.
The existing English-specific `Lemmatizer.is_base_form` is moved to
`EnglishDefaults`.
2020-06-29 14:16:57 +02:00
Sofie Van Landeghem
fc3cb1fa9e
NER align tests ( #5656 )
...
* one_to_man works better. misalignment doesn't yet.
* fix tests
* restore example
* xfail alignment tests
2020-06-29 13:59:17 +02:00
Matthew Honnibal
2d9604d39c
Set version to v3.0.0.dev11
2020-06-29 13:56:46 +02:00
Matthew Honnibal
0a54022138
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
2020-06-29 13:56:20 +02:00
Matthew Honnibal
acbf6345c9
Fix thinc dependency
2020-06-29 13:56:07 +02:00
Matthw Honnibal
da50473701
Tweak efficiency of arc_eager.set_costs
2020-06-29 12:17:41 +02:00
Ines Montani
bac8a8d766
Merge branch 'feature/project-cli' into develop
2020-06-29 10:49:05 +02:00
Sofie Van Landeghem
cfeb2ba4d7
updating thinc also in pyproject.toml
2020-06-29 09:51:20 +02:00
Matthew Honnibal
e14bf9decb
Set version to v3.0.0.dev9
2020-06-28 23:58:10 +02:00
Matthew Honnibal
7c9178d503
Update requirements
2020-06-28 23:56:45 +02:00
Matthew Honnibal
58c8f731bd
Set version to v3.0.0.dev9
2020-06-28 23:53:14 +02:00
Ines Montani
569376e34e
Replace curl with requests
2020-06-28 16:25:53 +02:00
Ines Montani
dbe86b3453
Update project.py
2020-06-28 15:45:19 +02:00
Ines Montani
dbfa292ed3
Output more stats in evaluate
2020-06-28 15:34:28 +02:00
Ines Montani
90b7fa8fed
Run DVC command in project dir
2020-06-28 15:33:53 +02:00
Ines Montani
2f6ee0d018
Tidy up, document and add custom clone logic
2020-06-28 15:08:35 +02:00
Matthew Honnibal
dc7a9be9f8
Merge branch 'feature/project-cli' of https://github.com/explosion/spaCy into feature/project-cli
2020-06-28 14:07:53 +02:00
Matthew Honnibal
e08257d401
Add example of how to do sparse-checkout
2020-06-28 14:07:32 +02:00
Ines Montani
1b331237aa
Update hashing and config update
2020-06-28 13:17:19 +02:00
Ines Montani
f385344286
Update asset logic and add import-url
2020-06-28 13:07:31 +02:00
Ines Montani
d6aa4cb478
Update asset logic
2020-06-28 12:40:11 +02:00
Ines Montani
ed46951842
Update
2020-06-28 12:24:59 +02:00
Ines Montani
d54f33441a
Merge branch 'feature/project-cli' of https://github.com/explosion/spaCy into feature/project-cli
2020-06-27 21:17:00 +02:00
Ines Montani
cd0dd78276
Simplify model loading (now supported via load_model)
2020-06-27 21:16:57 +02:00
Matthew Honnibal
8e3baebdce
Merge branch 'feature/project-cli' of https://github.com/explosion/spaCy into feature/project-cli
2020-06-27 21:16:18 +02:00
Matthew Honnibal
d8c70b415e
Fix Example usage in evaluate
2020-06-27 21:15:25 +02:00
Ines Montani
e33d2b1bea
Add success message
2020-06-27 21:15:13 +02:00
Ines Montani
42eb381ec6
Improve output handling in evaluate
2020-06-27 21:13:11 +02:00
Ines Montani
df22d490b1
Tidy up types
2020-06-27 21:13:06 +02:00
Ines Montani
6678bd80c2
Check if deps exist in non-DVC commands
2020-06-27 20:57:26 +02:00
Ines Montani
fe06697150
Fix package command and add version option
2020-06-27 20:36:08 +02:00