spaCy/spacy/ja
Paul O'Leary McCann 43eedf73f2 [ja] Stash tokenizer output for speed
Before this commit, the MeCab tokenizer had to be called twice when
creating a Doc: once during tokenization and once during tagging. This
commit creates a JapaneseDoc wrapper class for Doc that stashes the parsed
tokenizer output to remove redundant processing. -POLM
2017-10-15 23:33:25 +09:00
__init__.py       [ja] Stash tokenizer output for speed  2017-10-15 23:33:25 +09:00
language_data.py  Add newline                            2017-05-03 09:38:07 +02:00
stop_words.py     Add newline                            2017-05-03 09:38:29 +02:00
tag_map.py        Update the TAG_MAP                     2017-08-22 00:02:55 +09:00
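
The stashing idea described in the commit message above can be illustrated with a minimal sketch. This is not the code from __init__.py: FakeAnalyzer, StashingDoc, tokenize and tag are hypothetical names, and the analyzer is a toy stand-in for MeCab so that the example runs without any external dependency. It only shows the general pattern of parsing once and keeping the result on the document object so the tagger can reuse it.

```python
class FakeAnalyzer:
    """Hypothetical stand-in for an expensive analyzer such as MeCab."""

    def __init__(self):
        self.calls = 0  # counts how many times parse() runs

    def parse(self, text):
        self.calls += 1
        # Toy rule (assumption): split on whitespace, tag everything "NOUN".
        return [(word, "NOUN") for word in text.split()]


class StashingDoc:
    """Hypothetical wrapper that keeps the analyzer output next to the words."""

    def __init__(self, words, parsed):
        self.words = words
        self.parsed = parsed  # stashed (surface, tag) pairs


def tokenize(analyzer, text):
    # The analyzer runs exactly once, here.
    parsed = analyzer.parse(text)
    words = [surface for surface, _ in parsed]
    return StashingDoc(words, parsed)


def tag(doc):
    # The tagger reads the stashed output instead of parsing the text again.
    return [pos for _, pos in doc.parsed]


analyzer = FakeAnalyzer()
doc = tokenize(analyzer, "a b c")
tags = tag(doc)
print(doc.words, tags, analyzer.calls)
```

Without the stash, tag would have to call analyzer.parse on the raw text a second time; with it, the call counter stays at one for each document.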