Lemmatization of Adjectives - French : adding rules and vocabulary (#3045)

* modifying FR lemmatisation for Adjectives

* adding contributor agreement for amperinet

* correcting some errors in vocabulary files
Amandine Périnet 2018-12-16 18:11:07 +01:00 committed by Ines Montani
parent 6ae8e49bff
commit 361554f629
4 changed files with 55418 additions and 49076 deletions


@@ -101,6 +101,6 @@ mark both statements:
 | Name | Amandine Périnet |
 | Company name (if applicable) | 365Talents |
 | Title or role (if applicable) | Data Science Researcher |
-| Date | 04/12/2018 |
+| Date | 12/12/2018 |
 | GitHub username | amperinet |
 | Website (optional) | |

File diff suppressed because it is too large

File diff suppressed because it is too large


@@ -3,9 +3,26 @@ from __future__ import unicode_literals
 ADJECTIVE_RULES = [
-    ["s", ""],
+    ["a", "a"],
+    ["aux", "al"],
+    ["c", "c"],
+    ["d", "d"],
     ["e", ""],
-    ["es", ""]
+    ["é", "é"],
+    ["eux", "eux"],
+    ["f", "f"],
+    ["i", "i"],
+    ["ï", "ï"],
+    ["l", "l"],
+    ["m", "m"],
+    ["n", "n"],
+    ["o", "o"],
+    ["p", "p"],
+    ["r", "r"],
+    ["s", ""],
+    ["t", "t"],
+    ["u", "u"],
+    ["y", "y"]
 ]
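
Each entry in ADJECTIVE_RULES is a [suffix, replacement] pair. The sketch below shows one plausible way such pairs could be applied to an inflected adjective. It is an illustration only, not the lemmatizer code from this commit: the function name apply_suffix_rules, the reduced rule list, and the tiny ADJECTIVE_INDEX vocabulary are all assumptions made for the example. Identity pairs such as ["t", "t"] are treated here as a way to propose the unchanged word as a candidate, so that a form already in the vocabulary is returned as its own lemma.

```python
# Hypothetical subset of the rules shown in the diff above.
ADJECTIVE_RULES = [
    ["aux", "al"],
    ["e", ""],
    ["eux", "eux"],
    ["s", ""],
    ["t", "t"],
]

# Hypothetical vocabulary of known adjective lemmas (not taken from the commit).
ADJECTIVE_INDEX = {"national", "grand", "petit", "heureux"}


def apply_suffix_rules(word, index, rules):
    """Return candidate lemmas for `word`, preferring candidates found in `index`."""
    word = word.lower()
    known, unknown = [], []
    for old, new in rules:
        if not word.endswith(old):
            continue
        # Strip the matched suffix and attach the replacement.
        candidate = word[: len(word) - len(old)] + new
        if not candidate:
            continue
        target = known if candidate in index else unknown
        if candidate not in target:
            target.append(candidate)
    # Fall back to out-of-vocabulary candidates, then to the word itself.
    return known or unknown or [word]


if __name__ == "__main__":
    for form in ["nationaux", "grande", "petits", "heureux", "petit"]:
        print(form, apply_suffix_rules(form, ADJECTIVE_INDEX, ADJECTIVE_RULES))
```

With these toy inputs, "nationaux" maps to "national" through the ["aux", "al"] pair, "grande" and "petits" lose their final "e" and "s", and "heureux" and "petit" are returned unchanged because the identity pairs propose a candidate that is already in the vocabulary.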