spaCy/test_tokens_from_list.py at 4b07c17d6fc094e2c5f9891a0be0686e10a7db48 - spaCy - Gitea

explosion/spaCy

mirror of https://github.com/explosion/spaCy.git synced 2024-09-21 19:39:13 +03:00

Matthew Honnibal 877abb0e5b * Set up tokenizer/ tests properly, using a session-scoped fixture to avoid long load/unload times. Tokenizer tests now complete in 20 seconds.

2015-06-07 17:24:49 +02:00



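from __future__ import unicode_literals

import pytest


def test1(en_tokenizer):
    # en_tokenizer is supplied by the test suite's fixtures (session-scoped, per the commit message).
    words = ['JAPAN', 'GET', 'LUCKY']
    # Build tokens from a pre-split word list rather than from raw text.
    tokens = en_tokenizer.tokens_from_list(words)
    assert len(tokens) == 3
    assert tokens[0].orth_ == 'JAPAN'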
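
The commit message mentions a session-scoped fixture that avoids repeated load/unload times, but the fixture itself is not shown in this file. A minimal sketch of how such a fixture might look in a `conftest.py` follows. `StubToken`, `StubTokenizer`, and `load_tokenizer` are hypothetical stand-ins invented for illustration, not spaCy's classes or API; only the `en_tokenizer` name, `tokens_from_list`, and `orth_` come from the test above.

```python
# conftest.py (sketch): a session-scoped fixture shared by every test in the run.
import pytest


class StubToken(object):
    """Hypothetical stand-in for a token; exposes only `orth_`."""

    def __init__(self, text):
        self.orth_ = text


class StubTokenizer(object):
    """Hypothetical stand-in for the real tokenizer, which is slow to load."""

    def tokens_from_list(self, words):
        # One token per pre-split word, in the original order.
        return [StubToken(word) for word in words]


def load_tokenizer():
    # Assumption: in the real suite this would be the expensive load step.
    return StubTokenizer()


@pytest.fixture(scope="session")
def en_tokenizer():
    # scope="session" asks pytest to build the fixture once per test run and
    # reuse the same object in every test that requests `en_tokenizer`.
    return load_tokenizer()
```

With a fixture like this in place, any test function that lists `en_tokenizer` as a parameter receives the shared object, so the load cost is paid once per run rather than once per test.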