Search A/B

Quoted vs tokenized Reddit search: an A/B test

Searching Reddit for a multi-word phrase: do you want the exact phrase, or any post containing the words? We measured which one actually helps.

The noise problem

Search Reddit for “dating communication skills” without quotes and the top result is a drama-aggregator subreddit — none of whose posts contain the phrase. Loose, tokenized matching treats your phrase as a bag of words and pulls in anything vaguely adjacent.

Exact-phrase (“quoted”) search fixes the precision but risks the opposite failure: too few results to be useful. We tested whether that risk is real.

The test

We ran ten keywords — a mix of one-, two-, and three-word phrases across two domains — through the same Reddit search endpoint the pipeline uses, quoted and unquoted, and counted results. The bar: a quoted query needed at least five posts to count as usable.

The headline

Keyword typeResult
Multi-word keywords tested8
Usable quoted (≥5 exact-match posts)6 (75%)
Fell back to loose matching2
Single-word (quoting changes nothing)2

Six of eight multi-word keywords returned enough exact-phrase results to use directly. The two that underflowed fall back safely to loose matching, so no keyword loses coverage.

The pattern: do humans type this phrase?

The keywords that underflowed shared a shape — three loosely-connected concepts nobody types verbatim, like “dating communication skills” or “online dating struggles”. People write “I’m struggling with online dating”, not the noun phrase.

The ones that worked were real idioms people post as-is: “communication in relationships”, “bad at flirting”. So the predictor of success isn’t word count — it’s whether the phrase is something a human actually types.

What we shipped

Quoted-with-fallback: for any multi-word keyword, try the exact phrase first; if it clears the five-post bar, use it; otherwise fall back to loose matching. It costs about one extra search call on the keywords that need the fallback — roughly 1.25 calls per multi-word keyword on average.

Caveats we keep in view

Small sample (eight multi-word keywords, two domains), and Reddit’s search ranker shifts over time, so individual counts near the threshold can flip. The 75% verdict is directionally solid, not a precise constant.

How the pipeline works

Validate what people actually say, not what you wish they would.