Ines Montani
82e88f0e3b
Merge pull request #6379 from svlandeg/fix/labels-constructor
2020-12-08 06:29:56 +01:00
Adriane Boyd
78085fab1f
Check for spacy-nightly package in download ( #6502 )
...
Also check for spacy-nightly in download so that `--no-deps` isn't set
for normal nightly installs.
2020-12-04 09:40:03 +01:00
Ines Montani
63f83e7034
Merge pull request #6470 from adrianeboyd/feature/license-in-package
2020-12-04 03:55:54 +01:00
Sofie Van Landeghem
d6c616a125
Fixes in test suite ( #6457 )
...
* fix slow test for textcat readers
* cleanup test_issue5551
* add explicit score weight
* cleanup
2020-12-02 12:57:08 +01:00
Adriane Boyd
31ec9a906e
Clean up 3rd party license info ( #6478 )
...
Move scikit-learn license from `Scorer` to
`licenses/3rd_party_licenses.txt`.
2020-12-02 10:15:23 +01:00
Adriane Boyd
591cd48aa8
Remove config.cfg from MANIFEST
2020-12-01 12:58:02 +01:00
Adriane Boyd
b0dd13e0ba
Support LICENSE in spacy package
...
If present, include the file `input_dir/LICENSE` at the top level of the
packaged model.
2020-11-30 13:43:58 +01:00
Adriane Boyd
1442d2f213
Improve simple training example in v3 migration ( #6438 )
...
* Create the examples once
* Use the examples in the initialization
* Provide the batch size
* Fix `begin_training` migration example
2020-11-30 09:39:45 +08:00
Sofie Van Landeghem
079f6ea474
avoid resolving the full config ( #6465 )
2020-11-30 09:34:29 +08:00
Ines Montani
9beba7164f
Make jinja2 top-level import
...
No problem anymore since it's now an official dependency
2020-11-27 15:17:14 +08:00
Ines Montani
d21d2c2e59
Don't multiply accuracy by 100
2020-11-27 15:15:51 +08:00
Adriane Boyd
26296ab223
Add error message if DocBin zlib decompress fails ( #6394 )
...
Add a better error message if DocBin zlib decompress fails, indicating
that the data is not in `DocBin` format.
2020-11-27 14:39:49 +08:00
Sofie Van Landeghem
165993d8e5
fix typo in transformer docs ( #6404 )
2020-11-19 14:11:38 +01:00
Adriane Boyd
96726ec1f6
Fix DocBin init in training example ( #6396 )
2020-11-17 14:36:44 +01:00
svlandeg
73fc1ed963
remove labels from morphologizer constructor
2020-11-11 21:48:50 +01:00
svlandeg
d5a920325f
remove labels from constructor
2020-11-11 21:34:12 +01:00
svlandeg
fcd79e0655
remove set_morphology from docs
2020-11-11 21:32:34 +01:00
Adriane Boyd
a7e7d6c6c9
Ignore misaligned in Morphologizer.get_loss ( #6363 )
...
Fix bug where `Morphologizer.get_loss` treated misaligned annotation as
`EMPTY_MORPH` rather than ignoring it. Remove unneeded default `EMPTY_MORPH`
mappings.
2020-11-10 20:15:09 +08:00
Sofie Van Landeghem
a0c899a0ff
Fix textcat + transformer architecture ( #6371 )
...
* add pooling to textcat TransformerListener
* maybe_get_dim in case it's null
2020-11-10 20:14:47 +08:00
Ines Montani
3ca5c7082d
Use pip install . in quickstart [ci skip]
2020-11-10 17:27:49 +08:00
Ines Montani
de6453940e
Merge pull request #6305 from svlandeg/feature/score-docs [ci skip]
2020-11-10 02:52:11 +01:00
Ines Montani
d7950c5ada
Merge pull request #6297 from adrianeboyd/docs/nightly-conda-install [ci skip]
2020-11-10 02:45:52 +01:00
Ines Montani
448bfbdc30
Remove conda from nightly install widget [ci skip]
2020-11-10 09:44:52 +08:00
Ines Montani
363ac73c72
Update docs [ci skip]
2020-11-09 12:43:26 +08:00
Sofie Van Landeghem
8ef056cf98
fix embed_size in Entity Linker architecture ( #6343 )
2020-11-04 22:20:13 +01:00
Ines Montani
019a1dd5e8
Fix v3 overview [ci skip]
2020-11-03 18:10:06 +01:00
Adriane Boyd
1c4df8fd09
Replace pytokenizations with internal alignment ( #6293 )
...
* Replace pytokenizations with internal alignment
Replace pytokenizations with internal alignment algorithm that is
restricted to only allow differences in whitespace and capitalization.
* Rename `spacy.training.align` to `spacy.training.alignment` to contain
the `Alignment` dataclass
* Implement `get_alignments` in `spacy.training.align`
* Refactor trailing whitespace handling
* Remove unnecessary exception for empty docs
Allow a non-empty whitespace-only doc to be aligned with an empty doc
* Remove empty docs exceptions completely
2020-11-03 16:24:38 +01:00
Adriane Boyd
a4b32b9552
Handle missing reference values in scorer ( #6286 )
...
* Handle missing reference values in scorer
Handle missing values in reference doc during scoring where it is
possible to detect an unset state for the attribute. If no reference
docs contain annotation, `None` is returned instead of a score. `spacy
evaluate` displays `-` for missing scores and the missing scores are
saved as `None`/`null` in the metrics.
Attributes without unset states:
* `token.head`: relies on `token.dep` to recognize unset values
* `doc.cats`: unable to handle missing annotation
Additional changes:
* add optional `has_annotation` check to `score_scans` to replace
`doc.sents` hack
* update `score_token_attr_per_feat` to handle missing and empty morph
representations
* fix bug in `Doc.has_annotation` for normalization of `IS_SENT_START`
vs. `SENT_START`
* Fix import
* Update return types
2020-11-03 15:47:18 +01:00
Adriane Boyd
5d2cb86c34
Fix on_match callback for DependencyMatcher ( #6313 )
...
Fix `DependencyMatcher` so that the callback is called only once per
match.
2020-10-31 12:20:27 +01:00
Sofie Van Landeghem
2918923541
fix resolving of dot notation ( #6326 )
2020-10-31 12:17:06 +01:00
Adriane Boyd
dc816bba9d
Fix node name typo in dependency matcher example ( #6311 )
2020-10-28 16:32:46 +01:00
Sofie Van Landeghem
ace6ae435b
set pydantic upper pin to 1.7 for now ( #6308 )
2020-10-26 23:31:08 +01:00
svlandeg
77688b0072
fix config
2020-10-26 11:14:34 +01:00
svlandeg
5878ff6bcd
cleanup
2020-10-26 11:13:02 +01:00
svlandeg
e95d9caa87
small edits
2020-10-26 11:09:25 +01:00
svlandeg
a664994a81
adding score method to explanation of new component
2020-10-26 10:52:47 +01:00
svlandeg
080066ae74
remove TODO note
2020-10-26 10:37:25 +01:00
Ines Montani
2c9804038d
Fix success message [ci skip]
2020-10-23 16:11:54 +02:00
Adriane Boyd
253480353c
Remove zh from quickstart extras
2020-10-23 11:39:25 +02:00
Adriane Boyd
af26886fff
Fix formatting
2020-10-23 11:38:14 +02:00
Adriane Boyd
c0b76f4c19
Add install step to "Compile from source"
2020-10-23 11:36:36 +02:00
Adriane Boyd
8fe7ede667
Add install step to source install quickstart
2020-10-23 11:34:43 +02:00
Adriane Boyd
4299a7f654
Setup / install / quickstart updates
...
* Add `cuda110` to setup.cfg and quickstart dropdown
* Switch to `pip` for pip-only packages in conda quickstart instructions
* Update zh pkuseg install message with version range and conda
* Remove `zh` from `extras_require` because the default doesn't require
additional packages
2020-10-23 11:27:54 +02:00
Ines Montani
270c836bd6
Merge pull request #6276 from adrianeboyd/chore/add-jinja2
2020-10-20 10:05:53 +02:00
Ines Montani
6523f2daac
Merge pull request #6273 from adrianeboyd/bugfix/detailed-scores-in-evaluate2
2020-10-20 10:03:09 +02:00
Adriane Boyd
3629296757
Fix requirements, remove version pins
2020-10-19 19:04:42 +02:00
Adriane Boyd
56077e7e64
Add dependency for jinja2
2020-10-19 18:58:15 +02:00
Adriane Boyd
fbe65b257b
Convert accuracy numbers on website models page
2020-10-19 18:55:55 +02:00
Ines Montani
b6b1c1e23c
Merge pull request #6271 from walterhenry/develop-proof [ci skip]
2020-10-19 16:31:43 +02:00
Adriane Boyd
563a21834e
Save raw scores in evaluate output
2020-10-19 15:49:09 +02:00