diff --git a/website/docs/api/spancategorizer.mdx b/website/docs/api/spancategorizer.mdx
index dcc7db903..04655c742 100644
--- a/website/docs/api/spancategorizer.mdx
+++ b/website/docs/api/spancategorizer.mdx
@@ -89,7 +89,7 @@ architectures and their arguments and hyperparameters.
| `negative_weight` 3.5.1 | Multiplier for the loss terms. It can be used to downweight the negative samples if there are too many. It is only used when `add_negative_label` is `True`. Defaults to `1.0`. ~~float~~ |
| `allow_overlap` 3.5.1 | If `True`, the data is assumed to contain overlapping spans. It is only available when `max_positive` is exactly 1. Defaults to `True`. ~~bool~~ |
-> ⚠️ Note that if you set a non-default value for `spans_key`, you'll have to
+> ⚠️ Caution: if you set a non-default value for `spans_key`, you'll have to
> update `[training.score_weights]` as well so that weights are computed
> properly. I. e. for `span_key == "myspankey"`, include this in your config:
>