spaCy/spacy/syntax
Matthew Honnibal bb911e5f4e Fix #3830: 'subtok' label being added even if learn_tokens=False (#4188)
* Prevent subtok label if not learning tokens

The parser introduces the subtok label to mark tokens that should be
merged during post-processing. Previously this happened even if we did
not have the --learn-tokens flag set. This patch passes the config
through to the parser, to prevent the problem.

* Make merge_subtokens a parser post-process if learn_subtokens

* Fix train script

* Add test for 3830: subtok problem

* Fix handlign of non-subtok in parser training
2019-08-23 17:54:00 +02:00
..
__init__.pxd * Work on greedy parser 2014-12-16 22:46:55 +11:00
__init__.py * Work on greedy parser 2014-12-16 22:46:55 +11:00
_beam_utils.pxd Export hash_state from beam_utils 2019-03-15 15:20:28 +01:00
_beam_utils.pyx Use hash_state in beam 2019-03-15 15:22:58 +01:00
_parser_model.pxd Fix handling of added labels. Resolves #3189 2019-02-24 16:41:41 +01:00
_parser_model.pyx Normalize over all actions in parser, not just valid ones 2019-03-15 15:22:16 +01:00
_state.pxd Fix NER when preset entities cross sentence boundaries (#3379) 2019-03-10 14:53:03 +01:00
_state.pyx Tidy up syntax 2017-10-27 19:45:57 +02:00
arc_eager.pxd WIP on stringstore change. 27 failures 2017-05-28 14:06:40 +02:00
arc_eager.pyx Fix #3830: 'subtok' label being added even if learn_tokens=False (#4188) 2019-08-23 17:54:00 +02:00
ner.pxd WIP on stringstore change. 27 failures 2017-05-28 14:06:40 +02:00
ner.pyx Tidy up property code style (#3391) 2019-03-11 15:59:09 +01:00
nn_parser.pxd 💫 Better support for semi-supervised learning (#3035) 2018-12-10 16:25:33 +01:00
nn_parser.pyx Fix #3830: 'subtok' label being added even if learn_tokens=False (#4188) 2019-08-23 17:54:00 +02:00
nonproj.pxd integrated pseudo-projective parsing into parser 2016-03-01 10:09:08 +01:00
nonproj.pyx Merge master into develop. Big merge, many conflicts -- need to review 2018-04-29 14:49:26 +02:00
stateclass.pxd Fix memory leak in beam parser 2017-11-14 02:11:40 +01:00
stateclass.pyx Fix memory leak in beam parser 2017-11-14 02:11:40 +01:00
transition_system.pxd Revert "Merge branch 'develop' of https://github.com/explosion/spaCy into develop" 2018-03-27 19:23:02 +02:00
transition_system.pyx 💫 Make serialization methods consistent (#3385) 2019-03-10 19:16:45 +01:00