mirror of
https://github.com/explosion/spaCy.git
synced 2025-01-26 09:14:32 +03:00
609c0ba557
* Tidy up train-from-config a bit * Fix accidentally quadratic perf in TokenAnnotation.brackets When we're reading in the gold data, we had a nested loop where we looped over the brackets for each token, looking for brackets that start on that word. This is accidentally quadratic, because we have one bracket per word (for the POS tags). So we had an O(N**2) behaviour here that ended up being pretty slow. To solve this I'm indexing the brackets by their starting word on the TokenAnnotations object, and having a property to provide the previous view. * Fixes |
||
---|---|---|
.. | ||
converters | ||
__init__.py | ||
convert.py | ||
debug_data.py | ||
download.py | ||
evaluate.py | ||
info.py | ||
init_model.py | ||
package.py | ||
pretrain.py | ||
profile.py | ||
train_from_config.py | ||
train.py | ||
validate.py |