Matthew Honnibal
98acf5ffe4
💫 Allow passing of config parameters to specific pipeline components ( #3386 )
...
* Add component_cfg kwarg to begin_training
* Document component_cfg arg to begin_training
* Update docs and auto-format
* Support component_cfg across Language
* Format
* Update docs and docstrings [ci skip]
* Fix begin_training
2019-03-10 23:36:47 +01:00
Ines Montani
8dbf1e9037
Also fix #3387 on develop
2019-03-10 23:36:28 +01:00
Tim
40d33261d6
Fixed typo in example of html visualizer ( #3387 )
...
* Fixed typo
* Add contributor agreement for tmetzl
2019-03-10 23:36:13 +01:00
Ines Montani
c998cde7e2
Auto-format [ci skip]
2019-03-10 19:22:59 +01:00
Ines Montani
37cb09b90a
Merge branch 'develop' into spacy.io
2019-03-10 19:19:12 +01:00
Ines Montani
7ba3a5d95c
💫 Make serialization methods consistent ( #3385 )
...
* Make serialization methods consistent
exclude keyword argument instead of random named keyword arguments and deprecation handling
* Update docs and add section on serialization fields
2019-03-10 19:16:45 +01:00
Ines Montani
9a8f169e5c
Update v2-1.md
2019-03-10 18:58:51 +01:00
Ines Montani
67e38690d4
Un-xfail passing tests and tidy up
2019-03-10 18:42:16 +01:00
Matthew Honnibal
27dd820753
Fix vocab deserialization when loading already present lexemes ( #3383 )
...
* Fix vocab deserialization bug. Closes #2153
* Un-xfail test for #2153
2019-03-10 17:21:19 +01:00
Matthew Honnibal
d6eaa71afc
Handle scalar values in doc.from_array()
2019-03-10 16:54:03 +01:00
Matthew Honnibal
61e5ce02a4
Add xfailing test for #2153
2019-03-10 16:36:29 +01:00
Matthew Honnibal
7461e5e055
Fix batch bug in issue #3344
2019-03-10 16:01:34 +01:00
Matthew Honnibal
8a6272f842
Un-xfail test
2019-03-10 15:51:15 +01:00
Matthew Honnibal
4e80fc41ad
Make doc.from_array() consistent with doc.to_array(). Closes #3382
2019-03-10 15:50:48 +01:00
Ines Montani
0426689db8
💫 Improve Doc.to_json and add Doc.is_nered ( #3381 )
...
* Use default return instead of else
* Add Doc.is_nered to indicate if entities have been set
* Add properties in Doc.to_json if they were set, not if they're available
This way, if a processed Doc exports "pos": None, it means that the tag was explicitly unset. If it exports "ents": [], it means that entity annotations are available but that this document doesn't contain any entities. Before, this would have been unclear and problematic for training.
2019-03-10 15:24:34 +01:00
Ines Montani
7984543953
Add xfailing test for to_array/from_array string attrs
2019-03-10 15:08:15 +01:00
Ines Montani
6bbf4ea309
Simplify tests and avoid tokenizing
2019-03-10 15:05:56 +01:00
Matthew Honnibal
a5b1f6dcec
Fix NER when preset entities cross sentence boundaries ( #3379 )
...
💫 Fix NER when preset entities cross sentence boundaries
2019-03-10 14:53:03 +01:00
Ines Montani
3fe5811fa7
Only link model after download if shortcut link ( #3378 )
2019-03-10 13:02:24 +01:00
Matthew Honnibal
231bc7bb7b
Add xfailing test for #3345
2019-03-10 13:00:15 +01:00
Matthew Honnibal
bdc77848f5
Add helper method to apply a transition in parser/NER
2019-03-10 13:00:00 +01:00
Matthew Honnibal
78aba46530
Update feature/lemmatizer from develop
2019-03-10 02:45:33 +01:00
Matthew Honnibal
5431c47b91
Refactor morphology slightly
2019-03-10 00:59:51 +00:00
Matthew Honnibal
0f12082465
Refactor morphologizer
2019-03-09 22:54:59 +00:00
Matthew Honnibal
41a3016019
Refactor morphologizer class map
2019-03-09 20:55:33 +01:00
Matthew Honnibal
ce1fe8a510
Add comment
2019-03-09 17:51:17 +00:00
Matthew Honnibal
28c26e212d
Fix textcat model for GPU
2019-03-09 17:50:08 +00:00
Ines Montani
16fa4d6b90
Update .travis.yml
2019-03-09 14:36:52 +01:00
Ines Montani
d12af4c92c
Update README.md
2019-03-09 13:07:35 +01:00
Ines Montani
b65e2f554f
Update CI
...
Only keep Travis for Python 2.7 for now until we've sorted out unicode build in Azure Pipelines
2019-03-09 13:06:18 +01:00
Ines Montani
b11ca720f8
Update azure-pipelines.yml
2019-03-09 12:56:39 +01:00
Ines Montani
610fb306bd
Revert hyphens
2019-03-09 12:51:53 +01:00
Matthew Honnibal
f742900f83
Set pos attribute in morphologizer
2019-03-09 11:51:11 +00:00
Matthew Honnibal
a6d153b0a0
Add UPOS as morphological field in ud_train
2019-03-09 11:50:50 +00:00
Matthew Honnibal
bba5f57f91
Add method to export utf8 array to Doc
2019-03-09 11:50:27 +00:00
Matthew Honnibal
e1a83d15ed
Add support for character features to Tok2Vec
2019-03-09 11:50:08 +00:00
Matthew Honnibal
eae384ebb2
Add POS to morphological fields
2019-03-09 11:49:44 +00:00
Ines Montani
bbabb6aaae
Escape more hyphens
2019-03-09 12:41:05 +01:00
Ines Montani
b8db219850
Auto-format
2019-03-09 12:40:58 +01:00
Ines Montani
d59f8cff29
Re-add missing travis.sh for now
2019-03-09 03:18:11 +01:00
Ines Montani
a145bfe627
Try escaping hyphens again
2019-03-09 03:06:50 +01:00
Ines Montani
3f01228253
Merge branch 'master' into develop
2019-03-09 03:01:42 +01:00
Ines Montani
db03558288
Fix flake8
2019-03-09 02:59:29 +01:00
Ines Montani
40def86fdf
Try running flake8 first
2019-03-09 02:56:20 +01:00
Ines Montani
9531213846
Remove other CI
2019-03-09 02:56:08 +01:00
Ines Montani
b9c71fc0f0
Fix flags
2019-03-09 02:46:04 +01:00
Ines Montani
ae09b6a6cf
Try fixing unicode inconsistencies on Python 2
2019-03-09 02:37:50 +01:00
Ines Montani
d957d7a697
Auto-format
2019-03-09 02:37:41 +01:00
Ines Montani
fe0c8e147c
Update azure-pipelines.yml
2019-03-09 02:28:21 +01:00
Ines Montani
b28de881da
Update azure-pipelines.yml
2019-03-09 02:27:13 +01:00