Ines Montani
|
e7ef51b382
|
Update tokenizer_exceptions.py
|
2017-06-02 19:00:01 +02:00 |
|
Francisco Aranda
|
70a2180199
|
fix(spanish sentence segmentation): remove tokenizer exceptions the break sentence segmentation. Aligned with training corpus
|
2017-06-02 08:19:57 +02:00 |
|
oeg
|
f0bcd0babb
|
fix(model): Add SPACE to es tag_map. Fixing error in morphology.pyx when SP tag is missing
|
2017-04-20 11:36:24 +02:00 |
|
oeg
|
936a297241
|
fix(model): Fix tag map for fixing issues with tag SPACE
|
2017-04-19 23:30:21 +02:00 |
|
oeg
|
c693d40791
|
feature(model): Add support for creating the Spanish model, including rich tagset, configuration, and basich tests
|
2017-04-06 18:48:45 +02:00 |
|
Matthew Honnibal
|
5c66cffafd
|
Add tag map for Spanish
|
2017-03-16 18:05:15 -05:00 |
|
ines
|
66c1f194f9
|
Use consistent unicode declarations
|
2017-03-12 13:07:28 +01:00 |
|
Matthew Honnibal
|
fba67fa342
|
Fix Issue #736: Times were being tokenized with incorrect string values.
|
2017-01-12 11:21:01 +01:00 |
|
Ines Montani
|
0dec90e9f7
|
Use global abbreviation data languages and remove duplicates
|
2017-01-08 20:36:00 +01:00 |
|
Ines Montani
|
1436b9f15a
|
Fix formatting and consistency
|
2016-12-23 21:36:01 +01:00 |
|
Ines Montani
|
1d64527727
|
Update Spanish tokenizer
Remove reflexive pronouns as they're part of an open class, fix
mistakes and add exceptions
|
2016-12-23 21:36:01 +01:00 |
|
Ines Montani
|
3c87c71d43
|
Add tokenizer exceptions for a.m. and p.m. in Spanish
|
2016-12-21 18:19:10 +01:00 |
|
Ines Montani
|
d60380418e
|
Update tokenizer exceptions for Spanish
|
2016-12-21 18:06:17 +01:00 |
|
Ines Montani
|
2b2ea8ca11
|
Reorganise language data
|
2016-12-18 16:54:19 +01:00 |
|
Ines Montani
|
e0a7b5c612
|
Fix formatting
|
2016-12-17 12:33:09 +01:00 |
|
Ines Montani
|
08162dce67
|
Move shared functions and constants to global language data
|
2016-12-17 12:32:48 +01:00 |
|
Ines Montani
|
6a60a61086
|
Move update_exc to global language data utils
|
2016-12-17 12:29:02 +01:00 |
|
Ines Montani
|
487ce1e20a
|
Add encoding declaration
|
2016-12-17 12:25:44 +01:00 |
|
Ines Montani
|
1a284d342e
|
Add Spanish language data
|
2016-12-08 19:47:03 +01:00 |
|
Matthew Honnibal
|
41a4766c1c
|
Fix infixes in spanish and portuguese
|
2016-11-02 20:43:12 +01:00 |
|
Matthew Honnibal
|
ad1c747c6b
|
Fix stray POS in language stubs
|
2016-11-02 20:37:55 +01:00 |
|
Matthew Honnibal
|
6dbf4f7ad7
|
Stub out support for French, Spanish, Italian and Portuguese
|
2016-11-02 20:02:41 +01:00 |
|