Matthew Honnibal
2669190b85
Normalize props for morph exceptions
2019-03-07 18:32:36 +01:00
Matthew Honnibal
e585b50458
Fix features in English tag map
2019-03-07 18:32:09 +01:00
Matthew Honnibal
0ad09b16ad
Add header for morphanalysis
2019-03-07 17:24:57 +01:00
Matthew Honnibal
fed0371db7
Remove enums from morphology
2019-03-07 17:14:57 +01:00
Matthew Honnibal
932d7dde1c
Fix compile error
2019-03-07 14:34:54 +01:00
Matthew Honnibal
b9ade7d4e0
Add MorphAnalysisC struct
2019-03-07 14:03:07 +01:00
Matthew Honnibal
b69013e2d7
Fix passing of morphological features to lemmatizer
2019-03-07 13:11:38 +01:00
Matthew Honnibal
74db1d9602
Revert "Space out symbols enum, to make maintaining easier"
This reverts commit be5235369c.
2019-03-07 12:52:30 +01:00
Matthew Honnibal
c773b5011c
Revert "Fix StringStore after symbols changes"
This reverts commit bcfe3bd312.
2019-03-07 12:52:15 +01:00
Matthew Honnibal
bcfe3bd312
Fix StringStore after symbols changes
2019-03-07 12:51:11 +01:00
Matthew Honnibal
d0ca64bb07
Fix imports in morphanalysis
2019-03-07 12:14:53 +01:00
Matthew Honnibal
6734cfec88
Add comment
2019-03-07 12:14:37 +01:00
Matthew Honnibal
be5235369c
Space out symbols enum, to make maintaining easier
2019-03-07 12:14:23 +01:00
Matthew Honnibal
34651c8ddf
Fix lemmatizer
2019-03-07 12:13:47 +01:00
Matthew Honnibal
8805966460
Fix moved Morphologizer class
2019-03-07 10:46:27 +01:00
Matthew Honnibal
21008ad2d8
Draft API for morphological analysis class
2019-03-07 10:45:24 +01:00
Matthew Honnibal
fc1cc4c529
Move morphologizer under spacy/pipes
2019-03-07 01:36:26 +01:00
Matthew Honnibal
bfa52d9d8a
Move morphologizer within spacy/pipes
2019-03-07 01:34:32 +01:00
Matthew Honnibal
98dfe5e433
Fix ud_train.py
2019-03-07 01:31:23 +01:00
Matthew Honnibal
ae7c728c5f
Fix json dependency
2019-03-07 01:17:19 +01:00
Matthew Honnibal
010f846d5f
Fix dependencies in morphologizer
2019-03-07 00:16:51 +01:00
Matthew Honnibal
3993f41cc4
Update morphology branch from develop
2019-03-07 00:14:43 +01:00
Ines Montani
6bd34e9d54
Expose Japanese stop words (closes #3346)
2019-03-06 14:21:15 +01:00
Ines Montani
85deb96278
Fix whitespace
2019-03-06 14:20:34 +01:00
Ines Montani
23f6ebf0f3
Add missing " (closes #3343)
2019-02-27 16:37:03 +01:00
Ines Montani
533b580c19
Add test for stray print statements in languages (see #3342)
2019-02-27 16:04:30 +01:00
Ines Montani
48a2046d1c
Remove stray print statement (closes #3342)
2019-02-27 15:35:04 +01:00
Ines Montani
07d7c0a1af
Fix whitespace
2019-02-27 15:34:21 +01:00
Ines Montani
9b62639d19
Auto-format [ci skip]
2019-02-27 14:24:55 +01:00
Matthew Honnibal
656edcb984
Set version to v2.1.0a10
2019-02-27 12:26:13 +01:00
Matthew Honnibal
f1d77eb140
💫 Improve handling of missing NER tags (closes #2603) (#3341)
* Improve handling of missing NER tags
GoldParse can accept missing NER tags, if entities is provided
in BILUO format (rather than as spans). Missing tags can be provided
as None values.
Fix bug that occurred when first tag was a None value. Closes #2603.
* Document specification of missing NER tags.
2019-02-27 12:06:32 +01:00
Ines Montani
e359bdd0e3
Auto-format
2019-02-27 11:56:45 +01:00
Matthew Honnibal
4a3371acd5
Make doc[0].is_sent_start == True (closes #2869) (#3340)
* Make doc[0] have sent_start True. Closes #2869
* Document that doc[0].is_sent_start defaults True.
2019-02-27 11:17:17 +01:00
Matthew Honnibal
2d3ce89b78
Improve matcher tests re issue #3328
2019-02-27 10:25:56 +01:00
Matthew Honnibal
8d6954e0e7
Fix matcher bug #3328
2019-02-27 10:25:39 +01:00
Ines Montani
aadf586789
Add xfailing test for #3331
2019-02-25 22:33:30 +01:00
Matthew Honnibal
3cdd3eb518
Set version to v2.1.0a9
2019-02-25 21:55:19 +01:00
Matthew Honnibal
b449be0f04
Add comment re issue #3170
2019-02-25 21:24:03 +01:00
Matthew Honnibal
9ccd6a3062
Fix head-outside-sentence bug. Fixes #3170
2019-02-25 21:21:44 +01:00
Matthew Honnibal
f2fae1f186
Add batch size argument to Language.evaluate(). Closes #3263
2019-02-25 19:30:33 +01:00
Ines Montani
f135d663f7
Update conftest.py
2019-02-25 15:55:29 +01:00
Ines Montani
76ce8b2662
Merge branch 'master' into develop
2019-02-25 15:54:55 +01:00
Julia Makogon
f1c3108d52
Fixing pymorphy2 dependency issue (#3329) (closes #3327)
* Classes for Ukrainian; small fix in Russian.
* Contributor agreement
* pymorphy2 initialization split for ru and uk (#3327)
* stop-words fixed
* Unit-tests updated
2019-02-25 15:48:17 +01:00
Ines Montani
1a735e0f1f
Add regression test for #3328
2019-02-25 10:12:58 +01:00
Ines Montani
dfbed07d3b
Remove unused temp errors
2019-02-24 22:26:08 +01:00
Ines Montani
62b558ab72
💫 Support lexical attributes in retokenizer attrs (closes #2390) (#3325)
* Fix formatting and whitespace
* Add support for lexical attributes (closes #2390)
* Document lexical attribute setting during retokenization
* Assign variable outside of nested loop
2019-02-24 21:13:51 +01:00
Ines Montani
a48deb4081
Merge regression tests
2019-02-24 21:03:39 +01:00
Ines Montani
8f6c193a4d
Delete _test_issue1622.py
2019-02-24 20:33:31 +01:00
Ines Montani
c8e967c78d
Try include previously segfaulting test
2019-02-24 20:32:46 +01:00
Ines Montani
328b589deb
Merge regression tests
2019-02-24 20:31:38 +01:00