Matthew Honnibal
|
aca53b95e1
|
Link architecture blurb
|
2017-06-04 13:10:06 +02:00 |
|
Matthew Honnibal
|
64ca5123bb
|
Add Architecture 101 blurb
|
2017-06-04 13:09:19 +02:00 |
|
Matthew Honnibal
|
e77ed953f4
|
Update GPU instructions
|
2017-06-04 12:03:22 +02:00 |
|
ines
|
1d3b012e56
|
Update adding languages docs and add 101
|
2017-06-03 23:54:23 +02:00 |
|
Matthew Honnibal
|
7ca215bc26
|
Resolve lex_attr_getters conflict
|
2017-06-03 16:12:01 -05:00 |
|
Matthew Honnibal
|
21eef90dbc
|
Support specifying which GPU
|
2017-06-03 16:10:23 -05:00 |
|
Matthew Honnibal
|
d0e42f9275
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-03 15:30:32 -05:00 |
|
Matthew Honnibal
|
8a17b99b1c
|
Use NORM attribute, not LOWER
|
2017-06-03 15:30:16 -05:00 |
|
ines
|
4c643d74c5
|
Add norm exceptions to other Language classes
|
2017-06-03 22:29:21 +02:00 |
|
ines
|
a3715a81d5
|
Update adding languages guide
|
2017-06-03 22:16:38 +02:00 |
|
ines
|
ec6d2bc81d
|
Add table of contents mixin
|
2017-06-03 22:16:26 +02:00 |
|
ines
|
fa7e576c57
|
Change order of exception dicts
|
2017-06-03 21:52:06 +02:00 |
|
Matthew Honnibal
|
3f5c85d8de
|
Reorder setting of lex attrs, to avoid clobbering
|
2017-06-03 14:47:55 -05:00 |
|
Matthew Honnibal
|
aeb7520133
|
Make norm use lower-case
|
2017-06-03 14:47:38 -05:00 |
|
Matthew Honnibal
|
de3954843e
|
Populate norm exceptions with lower-case
|
2017-06-03 14:47:12 -05:00 |
|
Matthew Honnibal
|
f6955a459c
|
Fix prev commit
|
2017-06-03 14:38:37 -05:00 |
|
Matthew Honnibal
|
468ca6c760
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-03 14:33:51 -05:00 |
|
Matthew Honnibal
|
c647a0d33e
|
Fix training counter for gold preprocessing
|
2017-06-03 14:33:39 -05:00 |
|
ines
|
e47eef5e03
|
Update German tokenizer exceptions and tests
|
2017-06-03 21:07:44 +02:00 |
|
ines
|
d77c2cc8bb
|
Add tests for English norm exceptions
|
2017-06-03 20:59:50 +02:00 |
|
ines
|
0d6fa8b241
|
Add German norm exceptions
|
2017-06-03 20:54:18 +02:00 |
|
ines
|
5bd311c77e
|
Fix update of norm exceptions
|
2017-06-03 20:54:09 +02:00 |
|
Matthew Honnibal
|
94e063ae2a
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-03 13:31:40 -05:00 |
|
Matthew Honnibal
|
fea1144e6d
|
Set max batch size in evaluate
|
2017-06-03 13:31:33 -05:00 |
|
Matthew Honnibal
|
805495af27
|
Fix off-by-one in number of tags
|
2017-06-03 13:29:23 -05:00 |
|
Matthew Honnibal
|
e62f46d39f
|
Clarify gold.pyx slightly
|
2017-06-03 13:28:52 -05:00 |
|
Matthew Honnibal
|
43353b5413
|
Improve train CLI script
|
2017-06-03 13:28:20 -05:00 |
|
ines
|
746653880c
|
Add English norm exceptions to lex_attrs
|
2017-06-03 20:27:28 +02:00 |
|
ines
|
095eeeb12f
|
Update English tokenizer exceptions and add norms
|
2017-06-03 20:27:16 +02:00 |
|
ines
|
e5d426406a
|
Add base norm exceptions
|
2017-06-03 20:27:05 +02:00 |
|
ines
|
4c2bbc3ccc
|
Add add_lookups util function
|
2017-06-03 19:44:47 +02:00 |
|
ines
|
05fe6758a7
|
Set lexeme attributes for tokenizer special cases
|
2017-06-03 19:44:39 +02:00 |
|
ines
|
3152ee5ca2
|
Update serialization tests for tokenizer
|
2017-06-03 17:05:28 +02:00 |
|
ines
|
7c919aeb09
|
Make sure serializers and deserializers are ordered
|
2017-06-03 17:05:09 +02:00 |
|
ines
|
1ebd0d3f27
|
Add assert_packed_msg_equal util function
|
2017-06-03 17:04:30 +02:00 |
|
ines
|
9acf8686f7
|
Update note on compact mode issues
|
2017-06-03 13:31:16 +02:00 |
|
ines
|
b0225183c2
|
Update displaCy defaults
|
2017-06-03 13:27:06 +02:00 |
|
ines
|
de974f7bef
|
Add serializer tests for tokenizer
|
2017-06-03 13:26:34 +02:00 |
|
ines
|
0153b66a86
|
Return self in Tokenizer.from_bytes
|
2017-06-03 13:26:13 +02:00 |
|
ines
|
82154a1861
|
Add letter spacing to arrow label
|
2017-06-03 13:25:41 +02:00 |
|
ines
|
32c6f05de9
|
Adjust spacing and sizing in compact mode
|
2017-06-03 13:25:32 +02:00 |
|
ines
|
cc8c8617a4
|
Shut down displaCy server on KeyboardInterrupt
|
2017-06-03 13:24:56 +02:00 |
|
ines
|
70fbba7d08
|
Clone Doc to never merge punctuation on original Doc
|
2017-06-03 13:24:43 +02:00 |
|
ines
|
c60431357d
|
Port over docs typo corrections
|
2017-06-03 11:31:30 +02:00 |
|
ines
|
459a1e8470
|
Fix whitespace
|
2017-06-03 11:31:18 +02:00 |
|
ines
|
5109bba910
|
Port over fix from #1070
|
2017-06-03 11:31:11 +02:00 |
|
ines
|
71954d5fe7
|
Update Thinc version
|
2017-06-03 10:32:53 +02:00 |
|
ines
|
d21459f87d
|
Update serializer tests
|
2017-06-02 21:42:26 +02:00 |
|
ines
|
c862527474
|
Add more variations of .env to gitignore
|
2017-06-02 21:08:39 +02:00 |
|
ines
|
6669583f4e
|
Use OrderedDict
|
2017-06-02 21:07:56 +02:00 |
|