spaCy/spacy/tests/regression/test_issue429.py

# coding: utf-8
from __future__ import unicode_literals

from ...matcher import Matcher

import pytest


@pytest.mark.models('en')
def test_issue429(EN):
    # Issue 429: No valid actions for NER after matcher adds a new entity label.
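    def merge_phrases(matcher, doc, i, matches):
        # Only act on the callback for the final match, so merging happens once.
        if i != len(matches) - 1:
            return None
        # The match ID is reused as the entity label for each matched span.
        spans = [(ent_id, ent_id, doc[start:end]) for ent_id, start, end in matches]
        for ent_id, label, span in spans:
            span.merge('NNP' if label else span.root.tag_, span.text, EN.vocab.strings[label])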

    doc = EN('a')
    matcher = Matcher(EN.vocab)
    matcher.add('TEST', merge_phrases, [{'ORTH': 'a'}])
    doc = EN.make_doc('a b c')
    EN.tensorizer(doc)
    EN.tagger(doc)
    matcher(doc)
    EN.entity(doc)