ines
|
417d45f5d0
|
Add lemmatizer data as variable on language data
Don't create lookup lemmatizer within Language class and just pass in
the data so it can be set on Token creation
|
2017-10-11 02:24:58 +02:00 |
|
ines
|
0c2343d73a
|
Tidy up language data
|
2017-10-11 02:22:49 +02:00 |
|
ines
|
bb5c631402
|
Implement like_num getter for French (via #1161)
|
2017-09-26 16:47:45 +02:00 |
|
ines
|
1fe5e1a4d1
|
Add language example sentences (see #1107)
da, de, en, es, fr, he, it, nb, pl, pt, sv
|
2017-08-19 12:22:29 +02:00 |
|
Matthew Honnibal
|
91e52543ef
|
Merge pull request #1118 from Gregory-Howard/patch-2
Update _tokenizer_exceptions_list (adding cities)
|
2017-06-20 11:16:07 +02:00 |
|
Tpt
|
7745b3ae04
|
Adds noun chunks to French syntax iterators
|
2017-06-12 15:29:58 +02:00 |
|
Grégory Howard
|
cd974b32b7
|
Update _tokenizer_exceptions_list (adding cities)
|
2017-06-09 17:58:18 +02:00 |
|
ines
|
4c643d74c5
|
Add norm exceptions to other Language classes
|
2017-06-03 22:29:21 +02:00 |
|
ines
|
924e8506de
|
Move Defaults subclass to module scope (necessary for pickling)
|
2017-05-20 19:02:27 +02:00 |
|
ines
|
e895d1afd7
|
Reorganise French punctuation rules
|
2017-05-09 00:00:54 +02:00 |
|
ines
|
a91278cb32
|
Rename _URL_PATTERN to URL_PATTERN
|
2017-05-09 00:00:00 +02:00 |
|
ines
|
73b577cb01
|
Fix relative imports
|
2017-05-08 22:29:04 +02:00 |
|
ines
|
ae99990f63
|
Fix formatting
|
2017-05-08 22:23:48 +02:00 |
|
ines
|
f46ffe3e89
|
Move language data to /lang module
|
2017-05-08 20:00:40 +02:00 |
|