mirror of https://github.com/explosion/spaCy.git
from __future__ import unicode_literals

import pytest


def test1(en_tokenizer):
    # en_tokenizer is a pytest fixture supplied by the test suite's
    # conftest.py; it provides spaCy's English tokenizer.
    words = ['JAPAN', 'GET', 'LUCKY']
    # tokens_from_list builds a Doc directly from pre-tokenized words,
    # bypassing the tokenizer's own segmentation.
    tokens = en_tokenizer.tokens_from_list(words)
    assert len(tokens) == 3
    assert tokens[0].orth_ == 'JAPAN'
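For reference, tokens_from_list belongs to the old spaCy API; in later releases (v2+) a Doc is built from pre-tokenized words via the Doc constructor instead. A minimal sketch of the equivalent check, assuming a current spaCy install (this snippet is illustrative and not part of the original file):

import spacy
from spacy.tokens import Doc

# Build a Doc from pre-tokenized words, skipping the tokenizer entirely.
nlp = spacy.blank('en')
doc = Doc(nlp.vocab, words=['JAPAN', 'GET', 'LUCKY'])

assert len(doc) == 3
assert doc[0].orth_ == 'JAPAN'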