Adriane Boyd
0104bd1600
Sort the AttributeRuler matches by rule order
Sort the returned matches by rule order (the `match_id`) so that the
rules are applied in the order they were added. This is necessary, for
instance, if the `AttributeRuler` is used for the tag map and later
rules require POS tags.
2020-08-28 21:01:06 +02:00
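The ordering described in commit 0104bd1600 can be illustrated with a minimal sketch. It assumes matches are plain `(match_id, start, end)` tuples and that a lower `match_id` means the rule was added earlier; the helper name `sort_matches_by_rule_order` is hypothetical and is not taken from the spaCy source.

```python
from typing import List, Tuple

# Hypothetical match shape: (match_id, start, end). Assumption: match_id
# reflects the order in which the rule was added (0 = first rule).
Match = Tuple[int, int, int]


def sort_matches_by_rule_order(matches: List[Match]) -> List[Match]:
    """Return matches ordered by match_id so earlier rules are applied first."""
    # sorted() is stable, so matches from the same rule keep their
    # original relative order.
    return sorted(matches, key=lambda m: m[0])


matches = [(2, 0, 1), (0, 3, 4), (1, 0, 1), (0, 1, 2)]
ordered = sort_matches_by_rule_order(matches)
```

With this ordering, a rule that sets a POS tag always runs before a later rule that depends on that tag.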
Ines Montani
6a999c9303
Remove outdated component attr check
2020-08-28 20:59:19 +02:00
Adriane Boyd
8674b17651
Serialize AttributeRuler.patterns
Serialize `AttributeRuler.patterns` instead of the individual lists to
simplify the serialization and so that patterns are reloaded exactly as
they were originally provided (preserving `_attrs_unnormed`).
2020-08-28 20:44:45 +02:00
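The idea in commit 8674b17651 (storing one list of patterns rather than several parallel lists) can be sketched roughly as follows. This is an illustration only: the pattern dictionaries, the `to_bytes`/`from_bytes` helpers, and the use of JSON are assumptions made for the example and may not match how spaCy serializes the component.

```python
import json
from typing import Any, Dict, List

# Hypothetical patterns, kept exactly as the user supplied them.
patterns: List[Dict[str, Any]] = [
    {"patterns": [[{"ORTH": "the"}]], "attrs": {"LEMMA": "the", "POS": "DET"}, "index": 0},
    {"patterns": [[{"TAG": "NNS"}]], "attrs": {"POS": "NOUN"}, "index": 0},
]


def to_bytes(pats: List[Dict[str, Any]]) -> bytes:
    """Serialize the single pattern list (JSON stands in for the real format)."""
    return json.dumps(pats).encode("utf8")


def from_bytes(data: bytes) -> List[Dict[str, Any]]:
    """Reload the pattern list exactly as it was provided."""
    return json.loads(data.decode("utf8"))


restored = from_bytes(to_bytes(patterns))
```

Because only the original pattern list is stored, any normalized internal state can be rebuilt from it on load instead of being serialized separately.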
Ines Montani
10da74382f
Raise if disabled components are removed before DisabledPipes.restore
2020-08-28 20:35:26 +02:00
Ines Montani
1e0363290e
Remove todos and update docstrings
2020-08-28 20:34:46 +02:00
Ines Montani
cad988da7f
Allow component decorators to re-run with same function
2020-08-28 16:27:22 +02:00
Ines Montani
3ce5be4b76
Allow loaded but disabled components
2020-08-28 15:20:14 +02:00
Ines Montani
89f692bc8a
Merge pull request #5992 from svlandeg/feature/wandb-restrict-config
2020-08-28 15:05:29 +02:00
Ines Montani
9c4049b57f
Merge pull request #5986 from explosion/fix/language-config-interpolate-disk-bytes
2020-08-28 15:03:52 +02:00
Ines Montani
adc050cdc5
Fix code style in test [ci skip]
2020-08-28 15:03:21 +02:00
svlandeg
05a1bafa15
fix type
2020-08-28 14:08:33 +02:00
svlandeg
33883aa764
rename field
2020-08-28 14:06:23 +02:00
svlandeg
1d8c4070aa
add disable_fields to wandb_logger
2020-08-28 13:55:32 +02:00
Ines Montani
a51b4f3a19
Merge branch 'develop' into fix/language-config-interpolate-disk-bytes
2020-08-28 13:21:17 +02:00
Ines Montani
03dde511b4
Merge pull request #5987 from explosion/feature/debug-config [ci skip]
2020-08-28 11:30:18 +02:00
Ines Montani
96ad472361
Merge pull request #5990 from svlandeg/fix/cli-error
2020-08-28 11:29:07 +02:00
Ines Montani
62e9967228
Merge branch 'develop' into fix/language-config-interpolate-disk-bytes
2020-08-28 11:19:36 +02:00
Ines Montani
4ca2698f85
Merge branch 'develop' into feature/debug-config
2020-08-28 11:19:17 +02:00
Adriane Boyd
48df50533d
Update sentence segmentation usage docs
Update sentence segmentation usage docs to incorporate `senter`.
2020-08-28 10:58:16 +02:00
svlandeg
9a8255ffd5
two tests because of different exit type
2020-08-28 10:50:26 +02:00
svlandeg
73baaf330a
update error type
2020-08-28 10:46:21 +02:00
svlandeg
72a87095d9
add loggers registry
2020-08-27 20:26:28 +02:00
svlandeg
aa9e0c9c39
small fix
2020-08-27 19:56:52 +02:00
svlandeg
8cde6ccb7d
Merge remote-tracking branch 'upstream/develop' into feature/vectors-docs
2020-08-27 19:56:09 +02:00
Matthew Honnibal
c558ca4485
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
2020-08-27 19:47:26 +02:00
svlandeg
556e975a30
various fixes
2020-08-27 19:24:44 +02:00
Matthew Honnibal
d3ffe4ca63
Fix error when tagger was initialized with no labels
2020-08-27 18:56:58 +02:00
Ines Montani
d1780db6a4
Tidy up and use different error [ci skip]
2020-08-27 18:56:55 +02:00
Ines Montani
ff4175e839
Add more info to debug config
2020-08-27 18:17:58 +02:00
Ines Montani
daac8ebacd
Don't interpolate config on Language deserialization
2020-08-27 16:44:36 +02:00
svlandeg
329e490560
small import fixes
2020-08-27 14:50:43 +02:00
svlandeg
28e4ba7270
fix references to TransformerListener
2020-08-27 14:33:28 +02:00
svlandeg
4d37ac3f33
configure_custom_sent_spans example
2020-08-27 14:14:16 +02:00
svlandeg
c68169f83f
fix link
2020-08-27 10:19:43 +02:00
svlandeg
acc794c975
example of writing to other custom attribute
2020-08-27 10:10:10 +02:00
svlandeg
559b65f2e0
adjust references to null_annotation_setter to trfdata_setter
2020-08-27 09:43:32 +02:00
Matthew Honnibal
e1e1760fd6
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
2020-08-27 03:22:11 +02:00
Matthew Honnibal
95adb58f15
Force tagger to pass batch of docs into model in begin_training
2020-08-27 03:21:03 +02:00
Ines Montani
cdc114e212
Merge pull request #5977 from explosion/refactor/vector-names
2020-08-26 19:03:16 +02:00
Ines Montani
8692d176f6
Merge pull request #5978 from explosion/feature/update-wasabi
Update wasabi: new diff_strings and MarkdownRenderer
2020-08-26 19:02:52 +02:00
Ines Montani
adc5d42f61
Remove more unused files
2020-08-26 15:59:46 +02:00
Ines Montani
696f167478
Add diff example to docs [ci skip]
2020-08-26 15:57:54 +02:00
Matthew Honnibal
9b22714a4e
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
2020-08-26 15:48:45 +02:00
Matthew Honnibal
172af24f95
Fix upload and download
2020-08-26 15:48:23 +02:00
Ines Montani
a5fff1df51
Remove outdated non-empty output dir warning [ci skip]
2020-08-26 15:45:51 +02:00
Matthew Honnibal
2d520d3b45
Remove unused error
2020-08-26 15:41:14 +02:00
Adriane Boyd
90d88729e0
Add AttributeRuler.score (#5963)
* Add AttributeRuler.score
Add scoring for TAG / POS / MORPH / LEMMA if these are present in the
assigned token attributes.
Add default score weights (that don't really make a lot of sense) so
that the scores are in the default config in some form.
* Update docs
2020-08-26 15:39:30 +02:00
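The scoring behaviour described in commit 90d88729e0 (score TAG / POS / MORPH / LEMMA only when the ruler assigns them) might look something like the sketch below. Tokens are modelled as plain dictionaries, and the function name `score_assigned_attrs` and the `*_acc` key names are hypothetical choices for this example, not the library's API.

```python
from typing import Dict, List, Set

SCORABLE = ("TAG", "POS", "MORPH", "LEMMA")


def score_assigned_attrs(
    gold_tokens: List[Dict[str, str]],
    pred_tokens: List[Dict[str, str]],
    assigned_attrs: Set[str],
) -> Dict[str, float]:
    """Compute per-attribute accuracy, skipping attributes no rule assigns."""
    scores: Dict[str, float] = {}
    for attr in SCORABLE:
        if attr not in assigned_attrs:
            continue  # attribute is never set by the rules, so do not score it
        correct = sum(g[attr] == p[attr] for g, p in zip(gold_tokens, pred_tokens))
        scores[f"{attr.lower()}_acc"] = correct / len(gold_tokens) if gold_tokens else 0.0
    return scores


gold = [{"TAG": "DT", "LEMMA": "the"}, {"TAG": "NN", "LEMMA": "cat"}]
pred = [{"TAG": "DT", "LEMMA": "the"}, {"TAG": "NN", "LEMMA": "cats"}]
scores = score_assigned_attrs(gold, pred, {"TAG", "LEMMA"})
```

Here only `tag_acc` and `lemma_acc` are reported, since no rule in the example assigns POS or MORPH.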
Ines Montani
3aec98ca38
Update wasabi: new diff_strings and MarkdownRenderer
2020-08-26 15:33:11 +02:00
Sofie Van Landeghem
79d460e3a2
Weights & Biases logger for train CLI (#5971)
* quick test as part of train script
* train_logger in config, default ConsoleLogger in loggers catalogue
* entitiy typo
* add wandb_logger
* cleanup
* Update spacy/cli/train_logger.py
Co-authored-by: Ines Montani <ines@ines.io>
* move loggers to gold.loggers
Co-authored-by: Ines Montani <ines@ines.io>
2020-08-26 15:24:33 +02:00
Ines Montani
cb54f0d779
Merge pull request #5970 from explosion/chore/clean-examples-extra
Clean out /examples and /bin
2020-08-26 15:20:41 +02:00