Matthew Honnibal
|
2bc7d87c70
|
Add example for training text classifier
|
2017-07-22 20:15:32 +02:00 |
|
Matthew Honnibal
|
9bae0ddc50
|
Fix minibatching
|
2017-07-22 20:14:49 +02:00 |
|
Matthew Honnibal
|
ded0df5e2f
|
Expose hyper-param as keyword arg
|
2017-07-22 20:14:37 +02:00 |
|
Matthew Honnibal
|
f5de8deeec
|
Increment version
|
2017-07-22 20:04:53 +02:00 |
|
Matthew Honnibal
|
b55714d5d1
|
Make gold_tuples arg optional in begin_training
|
2017-07-22 20:04:43 +02:00 |
|
Matthew Honnibal
|
ed6c85fa3c
|
Fix loading of text categories in GoldParse
|
2017-07-22 20:04:03 +02:00 |
|
Matthew Honnibal
|
6ffec9dfea
|
Update _ml, for textcat model
|
2017-07-22 20:03:40 +02:00 |
|
Matthew Honnibal
|
d6a5c2c85a
|
Add test for NER
|
2017-07-22 01:48:58 +02:00 |
|
Matthew Honnibal
|
28244df4da
|
Add test for beam parsing
|
2017-07-22 01:48:35 +02:00 |
|
Matthew Honnibal
|
c86445bdfd
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-07-22 01:14:28 +02:00 |
|
Matthew Honnibal
|
b3a749610e
|
Fix name of TextCategorizer
|
2017-07-22 01:14:07 +02:00 |
|
Matthew Honnibal
|
2424493970
|
Remove unnecessary import of Mock
|
2017-07-22 01:13:54 +02:00 |
|
Matthew Honnibal
|
baa3d81c35
|
Add text categorizer to Language
|
2017-07-22 01:13:36 +02:00 |
|
Matthew Honnibal
|
a6a2159969
|
Add slot for text categories to Doc
|
2017-07-22 00:34:15 +02:00 |
|
Matthew Honnibal
|
374ab3ecfb
|
Increment alpha version
|
2017-07-22 00:32:49 +02:00 |
|
Matthew Honnibal
|
289f23df51
|
Test beam parsing
|
2017-07-20 15:03:10 +02:00 |
|
Matthew Honnibal
|
3da1063b36
|
Add beam decoding to parser, to allow NER uncertainties
|
2017-07-20 15:02:55 +02:00 |
|
Matthew Honnibal
|
0ca5832427
|
Improve negative example handling in NER oracle
|
2017-07-20 00:18:49 +02:00 |
|
Matthew Honnibal
|
a231b56d40
|
Add text-classification hook to pipeline
|
2017-07-20 00:18:15 +02:00 |
|
Matthew Honnibal
|
7ea50182a5
|
Add support for text-classification labels to GoldParse
|
2017-07-20 00:17:47 +02:00 |
|
Matthew Honnibal
|
727481377e
|
Add text-classifier thinc models
|
2017-07-20 00:17:17 +02:00 |
|
Matthew Honnibal
|
f014138c11
|
Fix parser tests
|
2017-07-20 00:16:52 +02:00 |
|
Ines Montani
|
c91642efd5
|
Port over changes from #1168
|
2017-07-01 11:43:54 +02:00 |
|
Ines Montani
|
e265e34e18
|
Merge pull request #1153 from jimregan/polish
add tokeniser exceptions for Polish
|
2017-06-27 14:48:00 +02:00 |
|
Jim Regan
|
d81ceb0cd5
|
Merge branch 'develop' into polish
|
2017-06-26 22:42:27 +01:00 |
|
Jim O'Regan
|
2f84c73585
|
a start
|
2017-06-26 22:40:04 +01:00 |
|
Jim O'Regan
|
28d7f0a672
|
reference
|
2017-06-26 22:38:28 +01:00 |
|
Ines Montani
|
01c7c09c7f
|
Merge pull request #1146 from jarle/doc-patch
Fix small typo in the new spaCy 101 guide
|
2017-06-26 10:41:18 +02:00 |
|
Jarle Mathiesen
|
f20533ec0c
|
fix small typo
|
2017-06-24 12:31:33 +02:00 |
|
Matthew Honnibal
|
91e52543ef
|
Merge pull request #1118 from Gregory-Howard/patch-2
Update _tokenizer_exceptions_list (adding cities)
|
2017-06-20 11:16:07 +02:00 |
|
Matthew Honnibal
|
8ea785e01a
|
Merge pull request #1119 from oroszgy/patch-3
Fixed conllu converter
|
2017-06-20 11:14:41 +02:00 |
|
Ines Montani
|
f64e3efc76
|
Merge pull request #1128 from thinline72/patch-1
Changed the capital of Lithuania to Vilnius
|
2017-06-13 13:14:43 +02:00 |
|
Savva Kolbachev
|
800a8faff4
|
Changed the capital of Lithuania to Vilnius
Hi,
There is a typo about the capital of Lithuania.
Vilnius is the capital of Lithuania https://en.wikipedia.org/wiki/Vilnius
Ljubljana is the capital of Slovenia https://en.wikipedia.org/wiki/Ljubljana
|
2017-06-12 23:27:00 +03:00 |
|
Ines Montani
|
6eae9f943a
|
Merge pull request #1125 from Tpt/french_noun_chunks
Adds function to extract French noun chunks
|
2017-06-12 21:25:33 +02:00 |
|
Ines Montani
|
57f64b9e1c
|
Merge pull request #1124 from v3t3a/patch-3
docs - Fix url error for Displacy Ent visualizer
|
2017-06-12 21:20:32 +02:00 |
|
Ines Montani
|
b2a28028cf
|
Merge pull request #1115 from v3t3a/patch-2
docs - Add read() method when opening file (Lightning tour)
|
2017-06-12 21:19:25 +02:00 |
|
Ines Montani
|
fe8d136ae0
|
Merge pull request #1114 from v3t3a/patch-1
docs - Update doc.jade (Just remove a duplicate 'doc =')
|
2017-06-12 21:19:02 +02:00 |
|
Tpt
|
7745b3ae04
|
Adds noun chunks to French syntax iterators
|
2017-06-12 15:29:58 +02:00 |
|
Tpt
|
57e8254f63
|
Adds function to extract French noun chunks
|
2017-06-12 15:20:49 +02:00 |
|
Vetea
|
eae1f7b19c
|
Fix url error for Displacy Ent visualizer
|
2017-06-12 14:30:02 +02:00 |
|
György Orosz
|
62dbf9025c
|
Fixed conllu converter
|
2017-06-09 22:53:56 +02:00 |
|
Grégory Howard
|
cd974b32b7
|
Update _tokenizer_exceptions_list (adding cities)
|
2017-06-09 17:58:18 +02:00 |
|
ines
|
49026a1346
|
Fix typos in example (see #1105)
|
2017-06-08 19:15:50 +02:00 |
|
Vetea
|
cc3aee1189
|
Add read() method when opening file
Add read() method when opening the file, to avoid:
```TypeError: Argument 'string' has incorrect type (expected str, got _io.TextIOWrapper)```
Tested with:
spaCy: v2.0.0 Alpha
Python: 3.5.2+ (default, Sep 22 2016, 12:18:14)
|
2017-06-08 11:27:09 +02:00 |
|
Vetea
|
8e20cf6368
|
Update doc.jade
Just remove a duplicate 'doc ='
|
2017-06-08 10:35:58 +02:00 |
|
ines
|
34a2eecb17
|
Add simple "naughty strings" test (see #1107)
|
2017-06-06 17:43:51 +02:00 |
|
ines
|
6b799bac54
|
Fix formatting and details
|
2017-06-06 14:37:49 +02:00 |
|
ines
|
6c34b1a65b
|
Update alpha thread link
|
2017-06-06 00:58:12 +02:00 |
|
ines
|
045574a936
|
Update package name and increment version
|
2017-06-05 20:41:30 +02:00 |
|
Matthew Honnibal
|
1f5874a927
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-05 20:20:00 +02:00 |
|