svlandeg
d1d6f16776
fix the fix
2020-06-18 19:15:32 +02:00
svlandeg
e822367cf7
prevent writing dummy values like deps because that could interfere with sent_start values
2020-06-18 17:47:59 +02:00
svlandeg
0b6d45eae1
various small fixes
2020-06-18 15:55:00 +02:00
svlandeg
1c71f2310c
fix renames and simple_ner labels
2020-06-18 15:33:28 +02:00
svlandeg
64fc840a5d
bugfix tok2vec
2020-06-18 15:24:40 +02:00
svlandeg
01f9ae774c
small fixes
2020-06-18 14:01:19 +02:00
svlandeg
0c6f1f3891
fix BiluoPushDown parsing entities
2020-06-18 13:00:03 +02:00
svlandeg
cd790aaa2a
fix parser tests to work with example (most still failing)
2020-06-18 11:19:22 +02:00
svlandeg
9f43ba839a
throw informative error when running the components with the wrong type of objects
2020-06-18 10:36:05 +02:00
svlandeg
6712d0b5db
textcat bugfix
2020-06-18 10:09:56 +02:00
svlandeg
40b2b21eef
small bug fix
2020-06-17 23:33:51 +02:00
svlandeg
d6c4dd6eea
pipe() takes docs, not examples
2020-06-17 21:29:36 +02:00
svlandeg
0f123af35e
ensure test keeps working with non-linked entities
2020-06-17 21:13:38 +02:00
svlandeg
6d73e139b0
fix entity linker
2020-06-17 21:12:25 +02:00
svlandeg
be5934b827
fix tagger
2020-06-17 19:42:11 +02:00
svlandeg
10d396977e
add support for MORPH in to/from_array, fix morphologizer overfitting test
2020-06-17 17:48:07 +02:00
svlandeg
1a151b10d6
correct silly typo
2020-06-17 14:48:14 +02:00
svlandeg
f6c451b650
cleanup
2020-06-17 14:45:54 +02:00
svlandeg
2d9f406188
fix test_cli
2020-06-17 14:42:48 +02:00
svlandeg
f7ad8e8c83
various fixes in scripts - needs to be further tested
2020-06-17 12:05:58 +02:00
svlandeg
3c4f9e4cc4
fix augment (needs further testing)
2020-06-17 10:46:29 +02:00
Ines Montani
6d712f3e06
Merge pull request #5599 from adrianeboyd/docs/v2.3.0-minor
2020-06-16 13:49:25 -07:00
svlandeg
4ed399c848
minibatch utility can deal with strings, docs or examples
2020-06-16 21:35:55 +02:00
Adriane Boyd
02369f91d3
Fix spacy convert argument
2020-06-16 20:41:17 +02:00
Adriane Boyd
f0fd77648f
Change example title to Dr.
Change example title to Dr. so the current model does exclude the title
in the initial example.
2020-06-16 20:36:21 +02:00
Adriane Boyd
a6abdfbc3c
Fix numpy.zeros() dtype for Doc.from_array
2020-06-16 20:35:45 +02:00
Adriane Boyd
9aff317ca7
Update POS in tagging example
2020-06-16 20:26:57 +02:00
Adriane Boyd
457babfa0c
Update alignment example for new gold.align
2020-06-16 20:22:03 +02:00
svlandeg
8b66c11ff2
add spaces to json output format
2020-06-16 19:30:03 +02:00
svlandeg
ba80ad7efd
fixed some tests + WIP roundtrip unit test
2020-06-16 18:26:50 +02:00
Ines Montani
41003a5117
Update Binder version [ci skip]
2020-06-16 17:41:23 +02:00
Ines Montani
fd89f44c0c
Update Binder URL [ci skip]
2020-06-16 17:34:26 +02:00
Ines Montani
44af53bdd9
Add pkuseg warnings and auto-format [ci skip]
2020-06-16 17:13:35 +02:00
Ines Montani
a9e5b840ee
Fix typos and auto-format [ci skip]
2020-06-16 16:38:45 +02:00
Ines Montani
1d3e8b7578
Merge pull request #5595 from explosion/v2.3.x
2020-06-16 07:37:10 -07:00
Ines Montani
e9d3e177f0
Merge branch 'master' into v2.3.x
2020-06-16 16:31:38 +02:00
Ines Montani
bb54f54369
Fix model accuracy table [ci skip]
2020-06-16 16:10:12 +02:00
Adriane Boyd
d5110ffbf2
Documentation updates for v2.3.0 (#5593)
* Update website models for v2.3.0
* Add docs for Chinese word segmentation
* Tighten up Chinese docs section
* Merge branch 'master' into docs/v2.3.0 [ci skip]
* Merge branch 'master' into docs/v2.3.0 [ci skip]
* Auto-format and update version
* Update matcher.md
* Update languages and sorting
* Typo in landing page
* Infobox about token_match behavior
* Add meta and basic docs for Japanese
* POS -> TAG in models table
* Add info about lookups for normalization
* Updates to API docs for v2.3
* Update adding norm exceptions for adding languages
* Add --omit-extra-lookups to CLI API docs
* Add initial draft of "What's New in v2.3"
* Add new in v2.3 tags to Chinese and Japanese sections
* Add tokenizer to migration section
* Add new in v2.3 flags to init-model
* Typo
* More what's new in v2.3
Co-authored-by: Ines Montani <ines@ines.io>
2020-06-16 15:37:35 +02:00
svlandeg
43d41d6bb6
allow None as BILUO annotation
2020-06-16 15:30:05 +02:00
svlandeg
44a0f9c2c8
test_gold_biluo_different_tokenization works
2020-06-16 15:21:20 +02:00
svlandeg
1c35b8efcd
fix spaces
2020-06-16 12:08:25 +02:00
svlandeg
6fea5fa4bd
attempt to fix cases with weird spaces
2020-06-16 11:52:29 +02:00
svlandeg
0702a1d3fb
fix test for misaligned
2020-06-15 23:10:47 +02:00
svlandeg
a28f8f369e
Fix many-to-one IOB codes
2020-06-15 23:06:22 +02:00
svlandeg
12886b787b
fixing NER one-to-many alignment
2020-06-15 22:44:17 +02:00
Matthew Honnibal
7ff447c5a0
Set version to v2.3.0
2020-06-15 18:22:25 +02:00
Matthew Honnibal
a0bf73a5dd
Merge branch 'whatif/arrow' of https://github.com/explosion/spaCy into whatif/arrow
2020-06-15 18:16:01 +02:00
Matthew Honnibal
c66f93299e
Remove TokenAnnotation code from nonproj
2020-06-15 18:14:47 +02:00
Matthew Honnibal
c95494739c
Fix import
2020-06-15 18:11:10 +02:00
Matthew Honnibal
8f978f2031
Fix import
2020-06-15 18:10:47 +02:00