ines
|
acd65c00f6
|
Add serialization tests for StringStore and Vocab
|
2017-06-02 10:57:42 +02:00 |
|
ines
|
41a6adf1f6
|
Initialise Vocab length correctly
|
2017-06-02 10:57:25 +02:00 |
|
ines
|
53b82f972a
|
Add strings to Vocab in init, instead of StringStore
|
2017-06-02 10:57:06 +02:00 |
|
ines
|
023f38bdd4
|
Fix return value of Vocab.from_bytes
|
2017-06-02 10:56:40 +02:00 |
|
ines
|
9692c98f57
|
Add test utils for temp file and temp dir
|
2017-06-02 10:56:09 +02:00 |
|
Francisco Aranda
|
70a2180199
|
fix(spanish sentence segmentation): remove tokenizer exceptions the break sentence segmentation. Aligned with training corpus
|
2017-06-02 08:19:57 +02:00 |
|
Francisco Aranda
|
5b385e7d78
|
feat(spanish model): add the spanish noun chunker
|
2017-06-02 08:14:06 +02:00 |
|
Matthew Honnibal
|
c650bc481c
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-01 13:03:57 -05:00 |
|
Matthew Honnibal
|
c8f4bed36f
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-01 12:24:21 -05:00 |
|
Matthew Honnibal
|
307d615c5f
|
Fix serialization for tagger when tag_map has changed
|
2017-06-01 12:18:36 -05:00 |
|
ines
|
9064fbbf1e
|
Fix empty arguments in mixins
|
2017-06-01 18:57:02 +02:00 |
|
ines
|
8bee34126d
|
Update model size
|
2017-06-01 18:22:35 +02:00 |
|
ines
|
6c908700c4
|
Add alpha badge
|
2017-06-01 18:20:33 +02:00 |
|
ines
|
36b20d66bf
|
Add alpha banner
|
2017-06-01 18:11:49 +02:00 |
|
ines
|
a6d99f8dab
|
Add prefix to option IDs to avoid conflicts
|
2017-06-01 17:56:18 +02:00 |
|
ines
|
1e918b871c
|
Remove infoboxes
|
2017-06-01 17:53:47 +02:00 |
|
ines
|
ab83dd5d25
|
Fix lightning tour example
|
2017-06-01 17:53:41 +02:00 |
|
ines
|
c6dc2fafc0
|
Add Spanish and move example sentences to meta
|
2017-06-01 17:49:56 +02:00 |
|
Matthew Honnibal
|
1d18cedae8
|
Fiddle with msgpack bytes vs unicode
|
2017-06-01 10:48:43 -05:00 |
|
ines
|
1bebc6392c
|
Add source files to pipeline components
|
2017-06-01 17:38:06 +02:00 |
|
ines
|
7a2380f617
|
Rename "nn_tagger" to "tagger"
|
2017-06-01 17:37:53 +02:00 |
|
ines
|
e5ae6ccf4e
|
Fix typo
|
2017-06-01 16:46:15 +02:00 |
|
ines
|
a3e4f91f4a
|
Only load vocab if it exists
|
2017-06-01 14:38:35 +02:00 |
|
ines
|
b577ed79ee
|
Move social image logic out to function and move files
|
2017-06-01 14:27:44 +02:00 |
|
ines
|
8fc52878f7
|
Make graphic smaller
|
2017-06-01 13:03:54 +02:00 |
|
ines
|
5e60b09dcd
|
Fix custom tokenizer example
|
2017-06-01 13:02:50 +02:00 |
|
ines
|
706cec6d58
|
Move annotation specs up
|
2017-06-01 13:02:43 +02:00 |
|
ines
|
fd77917c5a
|
Remove bottom padding from sidebar
|
2017-06-01 13:02:36 +02:00 |
|
ines
|
8274dffad6
|
Update NER training draft
|
2017-06-01 12:51:36 +02:00 |
|
ines
|
04fac3f52a
|
Add NER training example code
|
2017-06-01 12:47:47 +02:00 |
|
ines
|
7f5e7e7320
|
Fix typo
|
2017-06-01 12:47:36 +02:00 |
|
ines
|
5cef1dd305
|
Always use develop branch of GitHub links in ALPHA mode
|
2017-06-01 12:47:30 +02:00 |
|
ines
|
992559bf9a
|
Fix formatting and remove unused imports
|
2017-06-01 12:47:18 +02:00 |
|
Matthew Honnibal
|
d310b0aab3
|
Merge branch 'develop' of https://github.com/explosion/spaCy into develop
|
2017-06-01 04:58:03 -05:00 |
|
Matthew Honnibal
|
3ff7d7fcef
|
Merge for updated requirements
|
2017-06-01 04:57:47 -05:00 |
|
ines
|
4a927154d8
|
Update v2 docs
|
2017-06-01 11:56:32 +02:00 |
|
ines
|
03bbb96db8
|
Remove outdated examples
|
2017-06-01 11:56:02 +02:00 |
|
Matthew Honnibal
|
5eae3b9a1e
|
Fix to/from disk in tagger
|
2017-06-01 04:55:49 -05:00 |
|
ines
|
789e69b73f
|
Update training guide
|
2017-06-01 11:53:23 +02:00 |
|
ines
|
2f40d6e7e7
|
Add training 101
|
2017-06-01 11:53:16 +02:00 |
|
ines
|
abed463bbb
|
Update serialization 101
|
2017-06-01 11:52:58 +02:00 |
|
ines
|
72380c952a
|
Update training section in NER guide and add links
|
2017-06-01 11:52:49 +02:00 |
|
ines
|
d5c8d2f5fd
|
Update about.py and increment version
|
2017-06-01 11:52:24 +02:00 |
|
ines
|
77dca25c7f
|
Update Language API docs
|
2017-06-01 11:51:31 +02:00 |
|
ines
|
9c975c4882
|
Add training illustrations
|
2017-06-01 11:51:22 +02:00 |
|
ines
|
bea6e6bfad
|
Allow annotation row to take children
|
2017-06-01 11:51:14 +02:00 |
|
Matthew Honnibal
|
4c97371051
|
Fixes for thinc 6.7
|
2017-06-01 04:22:16 -05:00 |
|
Matthew Honnibal
|
53d00a0371
|
Move weight serialization to Thinc
|
2017-06-01 03:04:36 -05:00 |
|
Matthew Honnibal
|
ae8010b526
|
Move weight serialization to Thinc
|
2017-06-01 02:56:12 -05:00 |
|
Gyorgy Orosz
|
f0c3b09242
|
More robust Hungarian tokenizer.
|
2017-05-31 22:28:40 +02:00 |
|