Z-Scores Over Gut Feel: Teaching Similarity Scores What “Weird” Actually Means

Thu, 02 Apr 2026 08:00:00 +0000

A fixed similarity threshold is easy to implement and surprisingly hard to justify. It assumes your data behaves consistently, your model behaves consistently, and your definition of “normal” does not drift. None of those assumptions tend to hold for long.

Up to this point in the series, we have treated cosine similarity as a raw signal. A number comes out, we compare it to a cutoff, and we move on. Here we start treating those numbers as a distribution instead.

Statistics on Thoughts and code

Z-Scores Over Gut Feel: Teaching Similarity Scores What “Weird” Actually Means