spaCy

mirror of https://github.com/explosion/spaCy.git synced 2026-02-18 05:00:41 +03:00

Paul O'Leary McCann 43eedf73f2 [ja] Stash tokenizer output for speed. Before this commit, the MeCab tokenizer had to be called twice when creating a Doc: once during tokenization and once during tagging. This commit creates a JapaneseDoc wrapper class for Doc that stashes the parsed tokenizer output to remove the redundant processing. -POLM		2017-10-15 23:33:25 +09:00
__init__.py	[ja] Stash tokenizer output for speed	2017-10-15 23:33:25 +09:00
language_data.py	Add newline	2017-05-03 09:38:07 +02:00
stop_words.py	Add newline	2017-05-03 09:38:29 +02:00
tag_map.py	Update the TAG_MAP	2017-08-22 00:02:55 +09:00
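
The commit message above describes stashing the tokenizer output on a Doc wrapper so that tagging can reuse it instead of running the tokenizer a second time. The following is only a minimal sketch of that idea, not the code in `__init__.py`: `CountingTokenizer`, `StashingDoc`, `make_doc`, and `tag` are hypothetical names, and a whitespace split stands in for MeCab so the example runs without spaCy or MeCab installed.

```python
class CountingTokenizer:
    """Hypothetical stand-in for an expensive external tokenizer such as MeCab."""

    def __init__(self):
        self.calls = 0  # how many times the expensive parse has run

    def parse(self, text):
        self.calls += 1
        # Dummy analysis: split on whitespace and attach a placeholder tag.
        return [(word, "NOUN" if word.istitle() else "X") for word in text.split()]


class StashingDoc:
    """Hypothetical wrapper that keeps the parsed output next to the words."""

    def __init__(self, words, parsed):
        self.words = words
        self.parsed = parsed  # stashed tokenizer output, reused by the tagger


def make_doc(tokenizer, text):
    # The tokenizer is called exactly once, here.
    parsed = tokenizer.parse(text)
    return StashingDoc([surface for surface, _ in parsed], parsed)


def tag(doc):
    # Tagging reads the stashed output instead of parsing the text again.
    return [pos for _, pos in doc.parsed]


tokenizer = CountingTokenizer()
doc = make_doc(tokenizer, "Tokyo is big")
tags = tag(doc)
```

After running this, `tokenizer.calls` is 1 even though both the words and the tags were produced, which is the saving the commit message refers to.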