ines
|
ab8ffbaab7
|
Add text classification to v2 overview
|
2017-07-22 17:56:51 +02:00 |
|
ines
|
f085b88f9d
|
Add TextCategorizer API docs stub
|
2017-07-22 17:56:33 +02:00 |
|
ines
|
ab1a4e8b3c
|
Add Tensorizer API docs stub
|
2017-07-22 17:56:25 +02:00 |
|
ines
|
0fb89dd204
|
Add text classification usage guide template
|
2017-07-22 17:56:07 +02:00 |
|
ines
|
d05ab1b3a0
|
Add text classification to 101 overview and change order
|
2017-07-22 17:55:53 +02:00 |
|
ines
|
d2a7e5b8e5
|
Add GoldParse.cats attribute
|
2017-07-22 17:55:35 +02:00 |
|
ines
|
23d976ed00
|
Add Doc.cats attribute and missing v2 tag
|
2017-07-22 17:55:14 +02:00 |
|
Ines Montani
|
1ddbeddca2
|
Fix typo
|
2017-07-22 15:00:58 +02:00 |
|
Matthew Honnibal
|
d6a5c2c85a
|
Add test for NER
|
2017-07-22 01:48:58 +02:00 |
|
Matthew Honnibal
|
28244df4da
|
Add test for beam parsing
|
2017-07-22 01:48:35 +02:00 |
|
Matthew Honnibal
|
c86445bdfd
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-07-22 01:14:28 +02:00 |
|
Matthew Honnibal
|
b3a749610e
|
Fix name of TextCategorizer
|
2017-07-22 01:14:07 +02:00 |
|
Matthew Honnibal
|
2424493970
|
Remove unnecessary import of Mock
|
2017-07-22 01:13:54 +02:00 |
|
Matthew Honnibal
|
baa3d81c35
|
Add text categorizer to Language
|
2017-07-22 01:13:36 +02:00 |
|
Matthew Honnibal
|
a6a2159969
|
Add slot for text categories to Doc
|
2017-07-22 00:34:15 +02:00 |
|
Matthew Honnibal
|
374ab3ecfb
|
Increment alpha version
|
2017-07-22 00:32:49 +02:00 |
|
Matthew Honnibal
|
289f23df51
|
Test beam parsing
|
2017-07-20 15:03:10 +02:00 |
|
Matthew Honnibal
|
3da1063b36
|
Add beam decoding to parser, to allow NER uncertainties
|
2017-07-20 15:02:55 +02:00 |
|
Matthew Honnibal
|
0ca5832427
|
Improve negative example handling in NER oracle
|
2017-07-20 00:18:49 +02:00 |
|
Matthew Honnibal
|
a231b56d40
|
Add text-classification hook to pipeline
|
2017-07-20 00:18:15 +02:00 |
|
Matthew Honnibal
|
7ea50182a5
|
Add support for text-classification labels to GoldParse
|
2017-07-20 00:17:47 +02:00 |
|
Matthew Honnibal
|
727481377e
|
Add text-classifer thinc models
|
2017-07-20 00:17:17 +02:00 |
|
Matthew Honnibal
|
f014138c11
|
Fix parser tests
|
2017-07-20 00:16:52 +02:00 |
|
mollerhoj
|
85144835da
|
Add Tag_map for Danish
|
2017-07-03 15:52:55 +02:00 |
|
mollerhoj
|
64c732918a
|
Add Morph_rules. (TODO: Not working?)
|
2017-07-03 15:52:55 +02:00 |
|
mollerhoj
|
3b2cb107a3
|
Add like_num functionality to Danish
|
2017-07-03 15:49:51 +02:00 |
|
mollerhoj
|
e8f40ceed8
|
Add short names of months to tokenizer_exceptions
|
2017-07-03 15:49:51 +02:00 |
|
mollerhoj
|
e840077601
|
Add some basic tests for Danish
|
2017-07-03 15:49:51 +02:00 |
|
mollerhoj
|
23025d3b05
|
Clean up a couple of strange English stopwords
|
2017-07-03 15:41:59 +02:00 |
|
mollerhoj
|
dc5be7d2f3
|
Cleanup list of Danish stopwords
|
2017-07-03 15:40:58 +02:00 |
|
Ines Montani
|
c91642efd5
|
Port over changes from #1168
|
2017-07-01 11:43:54 +02:00 |
|
Jim O'Regan
|
70f4d26c10
|
bounds checks
|
2017-06-28 10:59:46 +01:00 |
|
Jim O'Regan
|
1ba38b2036
|
some helpers; the Irish part of UD only has 2500 sentences so this will need source of morphology
|
2017-06-28 00:42:00 +01:00 |
|
Jim O'Regan
|
559e03605a
|
b'
|
2017-06-27 22:42:16 +01:00 |
|
Ines Montani
|
e265e34e18
|
Merge pull request #1153 from jimregan/polish
add tokeniser exceptions for Polish
|
2017-06-27 14:48:00 +02:00 |
|
Jim Regan
|
d81ceb0cd5
|
Merge branch 'develop' into polish
|
2017-06-26 22:42:27 +01:00 |
|
Jim O'Regan
|
2f84c73585
|
a start
|
2017-06-26 22:40:04 +01:00 |
|
Jim O'Regan
|
28d7f0a672
|
reference
|
2017-06-26 22:38:28 +01:00 |
|
Jim O'Regan
|
e12defdd9c
|
missed a couple
|
2017-06-26 22:24:14 +01:00 |
|
Jim O'Regan
|
c1e4e0f3bf
|
just now discovered that you can do multiwords
|
2017-06-26 22:19:39 +01:00 |
|
Jim O'Regan
|
5e5f94c1c0
|
fix dup
|
2017-06-26 21:57:00 +01:00 |
|
Jim O'Regan
|
a8dff9133e
|
add POS
|
2017-06-26 21:53:41 +01:00 |
|
Jim O'Regan
|
3c4d83aa6e
|
CLA
|
2017-06-26 21:32:48 +01:00 |
|
Jim O'Regan
|
e9213f54de
|
missed one
|
2017-06-26 21:29:21 +01:00 |
|
Jim O'Regan
|
1eb7cc3017
|
attempt a port from #1147
|
2017-06-26 21:24:55 +01:00 |
|
Ines Montani
|
01c7c09c7f
|
Merge pull request #1146 from jarle/doc-patch
Fix small typo in the new spaCy 101 guide
|
2017-06-26 10:41:18 +02:00 |
|
Jarle Mathiesen
|
f20533ec0c
|
fix small typo
|
2017-06-24 12:31:33 +02:00 |
|
Matthew Honnibal
|
91e52543ef
|
Merge pull request #1118 from Gregory-Howard/patch-2
Update _tokenizer_exceptions_list (adding cities)
|
2017-06-20 11:16:07 +02:00 |
|
Matthew Honnibal
|
8ea785e01a
|
Merge pull request #1119 from oroszgy/patch-3
Fixed conllu converter
|
2017-06-20 11:14:41 +02:00 |
|
Ines Montani
|
f64e3efc76
|
Merge pull request #1128 from thinline72/patch-1
Changed the capital of Lithuania to Vilnius
|
2017-06-13 13:14:43 +02:00 |
|