The Counterfactual Dilemma

On picking the right comparison.

The invisible half

Every experiment rests on a question you can't answer: what would have happened if we hadn't done the thing?

This hypothetical – the outcome in the world without intervention – is called the counterfactual. It's the invisible half of every experiment. You can never observe it, only construct it. And how you construct it determines whether your conclusions are sound or fiction.

Getting it wrong is easy. A recurring part of my work is finding where someone built the wrong counterfactual and showing why their conclusions collapse.

The same error runs outside data too, into decisions where no one audits the comparison.

A confident wrong answer

A company runs an ad campaign. Budget constraints mean only 40% of the eligible audience actually sees it. The other 60% are in the experiment pool but never get the ad.

When measuring whether it worked, the tempting shortcut: compare the exposed group against all of control. Treatment on the left, control on the right. The exposed group bought more. Campaign worked.

Except the exposed 40% didn't get there randomly. They're systematically different from the unreached majority: more active, more likely to be in-market, more likely to convert regardless. Comparing them against an unfiltered control doesn't measure the campaign's effect. It measures self-selection wearing the campaign's name.

The fix is precise: if it's not like for like, it's not a valid comparison. Everything else is just a mirror angled to confirm what you already believed.

A bad counterfactual gives you a confident wrong answer. That's worse than no answer at all.

The Silicon Valley trap

The tech mythology runs like this: the dropout who built a billion-dollar company at 19, the engineer who turned down Google at 22 and made something from scratch. These stories are real. They're also a broken counterfactual for anyone who uses them as a benchmark.

The reason isn't talent – it might be equal, might not be, and you can't measure it cleanly enough to know. What you can measure is starting position. And starting position is what you inherited before the gun went off, not when you entered the race.

There's a tailwind that accumulates invisibly when you grow up inside an ecosystem: the university network that quietly routes opportunities your way, the childhood friend who joined a fund, the professor who makes an introduction, the cultural literacy that means you know how rooms work before you ever enter one. Nobody earned these things. They're interest that compounds on being born in the right place. The safety net that makes failure survivable is capital, not courage.

When you arrive mid-life from somewhere else, you're not starting at zero. You're starting negative, working to close a gap that others never had to think about. The skills travel. The tailwind doesn't come with them.

Which means the comparison group you've been measuring against was never valid.

A sample size of one

Counterfactual reasoning works when you can find a valid comparison group: random assignment, similar distributions, comparable starting conditions. The more your conditions diverge, the less any external benchmark tells you.

Stack enough divergent variables and the pool of valid comparisons shrinks.

At some point it reaches one.

When n=1, there is no control group. There never was. Every comparison you've been running – every time you measured yourself against the mythology – was already invalid. Not directionally off, but structurally broken. The number was never measuring what you thought it was.

The method that holds is pre-post: measuring against your own prior state. Not am I where they are? but am I further than I was? It strips away the noise of incomparable starting conditions and measures the only quantity you have a clean read on: your own trajectory.

From where you actually started, are you gaining?

Raisul ChowdhuryWriting & building

Hi, I'm Raisul. I build AI products at Google and write about tech, craft, and what comes next.

Connect    More info

Painted night scene — a figure at a glowing laptop, child's family drawing on the wall, mountain valley and city lights through the windows

Amplified IntelligenceNotes on where machine ends and we begin.

The Perfect Stranger (coming soon)The Perfect Mirror (coming soon)A Bubble of One (coming soon)Inception (coming soon)The Replacement Trap (coming soon)Thinking in a Second Language (coming soon)

WritingSelected essays
and notes

The Counterfactual DilemmaOn picking the right comparison.2026
Hello WorldDeciding to write anyway.2026

Entries2026

AboutMe

Hi, I'm Raisul Chowdhury, a data scientist and product builder in San Francisco.

I grew up in Dhaka, moved to Toronto in my twenties, did an MBA at Kellogg, and have been in California with my wife and son ever since. Before Google, I helped grow StackAdapt from 70 to 1,000 people. And way before that, product and growth across digital platforms in emerging markets.

My titles changed over time, but the work I do stayed the same: understand the user, find the gap, ship it, learn, ship again.

These days I build AI products at Google and write about where the machines end and we begin. The bet underneath: AI that makes the person using it sharper, not smaller. I don't know if it's right yet. I'm building toward it.

When I'm not working, I'm usually at the park with my son, making waffles, or sitting with music and chai thinking about what comes next.

Experience

  • Senior Data ScientistGoogleCurrently teaching YouTube what to play next.
  • MBA in Artificial IntelligenceKellogg School of ManagementNorthwestern's first AI-focused MBA. Half the syllabus changed during the program.
  • Head of Audience & Optimization ProductsStackAdaptMade up a job. Built a team to do it. Shipped what the job was supposed to ship.
  • Product & GrowthRocket Internet GmbHDesigned a mobile platform for users who couldn't read.

Elsewhere