Ines Montani
7c66691790
Merge pull request #1197 from jsparedes/patch-1
...
Fix url broken
2017-07-21 14:05:26 +02:00
Jorge Paredes
fadacd0d47
Fix url broken
...
The related url to **custom named entities** was broken
2017-07-16 10:06:32 -05:00
Ines Montani
2d22b63e09
Merge pull request #1186 from lgenerknol/master
...
.../cli/#foo is 404
2017-07-13 17:33:55 +02:00
lgenerknol
2b219caf0d
.../cli/#foo is 404
...
https://spacy.io/docs/usage/cli/#package is a 404.
Changed to https://spacy.io/docs/usage/cli#package
Definitely a larger fix possible to deal with trailing slashes
2017-07-12 13:12:24 -04:00
Ines Montani
d79fa8743a
Merge pull request #1185 from lgenerknol/master
...
Missing markup char
2017-07-12 17:27:42 +02:00
lgenerknol
6cf2690943
Missing markup char
...
Frontend displayed:
```
If start_idx and do not mark[...]
```
Note the missing "end_idx" after 'and'.
2017-07-12 11:06:16 -04:00
Ines Montani
9eca6503c1
Merge pull request #1157 from polm/master
...
Add basic Japanese Tokenizer Test
2017-07-10 13:07:11 +02:00
Paul O'Leary McCann
bc87b815cc
Add comment clarifying what LANGUAGES does
2017-07-09 16:28:55 +09:00
Paul O'Leary McCann
04e6a65188
Remove Japanese from LANGUAGES
...
LANGUAGES is a list of languages whose tokenizers get run through a
variety of generic tests. Since the generic tests don't check the JA
fixture, it blows up when it can't find janome. -POLM
2017-07-09 16:23:26 +09:00
Ines Montani
2b9411bb54
Merge pull request #1181 from val314159/patch-1
...
make this work in python2.7
2017-07-08 00:15:47 +02:00
val314159
19d4706f69
make this work in python2.7
2017-07-07 13:18:17 -07:00
Ines Montani
84eb9d6bd3
Merge pull request #1167 from callumkift/fix/docs-ner-training
...
Fixed error training NER documentation and example
2017-07-01 11:46:31 +02:00
Ines Montani
0c7f5af5ee
Merge pull request #1168 from gispk47/master
...
Update zh language error
2017-07-01 11:43:12 +02:00
gispk47
669bd14213
Update __init__.py
...
remove the empty string return from jieba.cut,this will cause the list of tokens cant be pushed assert error
2017-07-01 13:12:00 +08:00
Callum Kift
dfaeee1f37
fixed bug in training ner documentation and example
2017-06-30 09:56:33 +02:00
Paul O'Leary McCann
c336193392
Parametrize and extend Japanese tokenizer tests
2017-06-29 00:09:40 +09:00
Paul O'Leary McCann
30a34ebb6e
Add importorskip for janome
2017-06-29 00:09:20 +09:00
Paul O'Leary McCann
e56fea14eb
Add basic Japanese tokenizer test
2017-06-28 01:24:25 +09:00
Paul O'Leary McCann
84041a2bb5
Make create_tokenizer work with Japanese
2017-06-28 01:18:05 +09:00
Ines Montani
f69ff15089
Update CONTRIBUTORS.md
2017-06-27 14:49:02 +02:00
Ines Montani
d6e08f2bf6
Merge pull request #1142 from garfieldnate/patch-1
...
fix confusing typo
2017-06-26 10:41:47 +02:00
Nathan Glenn
81166c3d56
fix confusing typo
...
This document describes the `Vocab` class, not the `Span` class.
2017-06-21 19:22:30 +02:00
Ines Montani
9335736c20
Merge pull request #1127 from bartbroere/master
...
Fixed a minor typo in the documentation
2017-06-13 13:15:20 +02:00
Bart Broere
e3be243e06
Merge pull request #1 from explosion/master
...
Update
2017-06-12 22:06:59 +02:00
Ines Montani
6b94c3cf00
Merge pull request #1126 from ianmobbs/master
...
Added html5lib==1.0b8 to requirements.txt
2017-06-12 21:18:24 +02:00
Ian Mobbs
d19ce29a23
Create requirements.txt
2017-06-12 13:21:44 -04:00
Bart Broere
e4a45ae55f
Very minor documentation fix
2017-06-12 12:28:51 +02:00
Ines Montani
47aaecd974
Merge pull request #1109 from oroszgy/patch-2
...
Fixed typo in cli/package.py
2017-06-07 16:39:40 +02:00
György Orosz
fa26041da6
Fixed typo in cli/package.py
2017-06-07 16:19:08 +02:00
Ines Montani
3cceabbf32
Update README.rst
2017-06-06 14:39:54 +02:00
ines
6ef04afdc8
Update docs with Spanish model
2017-06-06 12:49:25 +02:00
ines
6071c727d2
Add more env options to gitignore
2017-06-06 12:49:17 +02:00
Ines Montani
99d02b2bb6
Update CONTRIBUTORS.md
2017-06-06 03:20:20 +02:00
Ines Montani
4cbe55622d
Update README.rst
2017-06-05 21:32:36 +02:00
Ines Montani
289650a857
Merge pull request #1103 from kootenpv/patch-1
...
for easy copy & paste
2017-06-05 20:42:40 +02:00
Pascal van Kooten
e66cd9cc70
for easy copy & paste
2017-06-05 20:41:28 +02:00
Ines Montani
e7ef51b382
Update tokenizer_exceptions.py
2017-06-02 19:00:01 +02:00
Ines Montani
83467a00a7
Update README.rst
2017-06-02 12:42:22 +02:00
Ines Montani
c4e62c7651
Update README.rst
2017-06-02 12:39:44 +02:00
Ines Montani
af466496f1
Rename appveyor.yml to .appveyor.yml
2017-06-02 12:33:57 +02:00
Ines Montani
3e16535fef
Update appveyor.yml
2017-06-02 12:31:31 +02:00
Ines Montani
3c2cce8efc
Update appveyor.yml
2017-06-02 12:27:36 +02:00
Ines Montani
0404b5f43b
Update appveyor.yml
2017-06-02 12:18:51 +02:00
Ines Montani
86277d4ef2
Create appveyor.yml
2017-06-02 12:13:59 +02:00
Ines Montani
81918155ef
Merge pull request #1096 from recognai/master
...
Spanish model features
2017-06-02 11:07:27 +02:00
Francisco Aranda
70a2180199
fix(spanish sentence segmentation): remove tokenizer exceptions the break sentence segmentation. Aligned with training corpus
2017-06-02 08:19:57 +02:00
Francisco Aranda
5b385e7d78
feat(spanish model): add the spanish noun chunker
2017-06-02 08:14:06 +02:00
ines
36b20d66bf
Add alpha banner
2017-06-01 18:11:49 +02:00
ines
a6d99f8dab
Add prefix to option IDs to avoid conflicts
2017-06-01 17:56:18 +02:00
ines
1e918b871c
Remove infoboxes
2017-06-01 17:53:47 +02:00