Change mention limit to match reference implementations

This generall means fewer spans are considered, which makes individual
steps in training faster but can make training take longer to find the
good spans.
This commit is contained in:
Paul O'Leary McCann 2021-08-08 19:55:52 +09:00
parent 1d1679d431
commit 56803d3909

View File

@ -237,8 +237,8 @@ def coarse_prune(
# calculate the doc length
doclen = ends[-1] - starts[0]
# XXX seems to make more sense to use menlen than doclen here?
# mlimit = min(mention_limit, int(mention_limit_ratio * doclen))
mlimit = min(mention_limit, int(mention_limit_ratio * menlen))
# coref-hoi uses doclen (number of words).
mlimit = min(mention_limit, int(mention_limit_ratio * doclen))
# csel is a 1d integer list
csel = select_non_crossing_spans(tops, starts, ends, mlimit)
# add the offset so these indices are absolute