spaCy/spacy/lang/fa/lemmatizer/lemma_rules.json
Ines Montani 5ca7dd0f94
💫 WIP: Basic lookup class scaffolding and JSON for all lemmati… (#4167)
* Improve load_language_data helper

* WIP: Add Lookups implementation

* Start moving lemma data over to JSON

* WIP: move data over for more languages

* Convert more languages

* Fix lemmatizer fixtures in tests

* Finish conversion

* Auto-format JSON files

* Fix test for now

* Make sure tables are stored on instance
2019-08-22 14:21:32 +02:00

41 lines
882 B
JSON

{
"adj": [
["ین", ""],
["‌ترین", ""],
["ترین", ""],
["‌تر", ""],
["تر", ""],
["‌ای", ""]
],
"noun": [
["ایان", "ا"],
["ویان", "و"],
["ایانی", "ا"],
["ویانی", "و"],
["گان", "ه"],
["گانی", "ه"],
["گان", ""],
["گانی", ""],
["ان", ""],
["انی", ""],
["ات", ""],
["ات", "ه"],
["ات", "ت"],
["اتی", ""],
["اتی", "ه"],
["اتی", "ت"],
["ها", ""],
["ها", ""],
["‌های", ""],
["های", ""],
["‌هایی", ""],
["هایی", ""]
],
"verb": [],
"punct": [
["“", "\""],
["”", "\""],
["‘", "'"],
["’", "'"]
]
}
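
Each entry in the tables above is a `[suffix, replacement]` pair keyed by part of speech. The following is a minimal sketch of how such suffix rules could be applied to produce candidate lemmas. It is not spaCy's actual lemmatizer code: the helper name `candidate_lemmas` and the small `RULES_JSON` excerpt are hypothetical and exist only for illustration. The sketch also assumes that a rule with an empty suffix should be skipped, since it would match every word.

```python
import json

# Hypothetical excerpt of the rule tables; each rule is [suffix, replacement].
RULES_JSON = """
{
    "adj": [["ترین", ""], ["تر", ""]],
    "noun": [["ها", ""], ["ان", ""], ["گان", "ه"]]
}
"""


def candidate_lemmas(word, rules):
    """Try every suffix rule in order and collect the resulting forms.

    Falls back to the word itself when no rule matches.
    """
    candidates = []
    for old, new in rules:
        # Assumption: skip empty suffixes, which would match any word.
        if old and word.endswith(old):
            form = word[: len(word) - len(old)] + new
            if form and form not in candidates:
                candidates.append(form)
    return candidates or [word]


rules = json.loads(RULES_JSON)
print(candidate_lemmas("کتابها", rules["noun"]))
print(candidate_lemmas("بزرگترین", rules["adj"]))
print(candidate_lemmas("پرندگان", rules["noun"]))
```

Several rules can match the same word (the last call matches both the `ان` and the `گان` rule), so a rule table alone only yields candidates. Choosing among them needs an extra step, for example a check against a list of known word forms.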