Product Design UX UI Enterprise

Distillery

An in-house A/B testing platform for a video-streaming organization — comparable in shape to Netflix's BLAZE.

Distillery — product lockup

Why Distillery existed

The streaming team needed to run real experiments against real customers — not gut-feel design changes. We didn't have a tool for that. Off-the-shelf platforms didn't speak to our data warehouse, and the engineering org had built ad-hoc scripts that only a handful of analysts could read.

So we built our own. Distillery was the in-house A/B testing platform for the streaming product, designed to make experimentation legible to PMs, designers, and engineers alike.

It's one thing to ask whether a blue button beats an orange button. It's another to communicate variants, control groups, mean effects, and statistical significance to people who don't live in the data every day. That second job — translation — is what the design needed to solve.

What I actually did

UX designers aren't scientists or mind readers. The job here was to ask the questions everyone else assumed they already knew the answer to:

  • Who needs to launch an experiment, and what's the most they can be expected to know?
  • What's the smallest amount of statistical literacy we can assume? What can we surface visually instead?
  • How do we make a "wait, that's significant — but is it the right kind of significant?" moment legible?

I worked closely with stakeholders in product, data science, and engineering — translating business requirements into deliverables developers could build against. A researcher, a UX designer with too much coffee, and a UI designer can take you a long way if you keep asking the right questions.

distillery / describe
Describe step — name the test, the hypothesis, the success metric
distillery / metrics
Metrics step — primary, secondary, and guardrail metrics

A guided five-step flow — Describe, Variants, Metrics, Plan, Tech Settings, Launch — replaced what used to be a YAML config file.

distillery / launch
Launch step — sampling rate, allocation, and reporting
distillery / results
Results — significance and behavior metrics in plain language

Launch and read the results in the same surface — no spreadsheet handoff, no tribal knowledge required.

distillery / histogram
Distribution histogram — content consumed by variant
distillery / trend
Cumulative confidence interval over time

Statistical significance, expressed in plots a PM can actually read.

Five steps, one mental model

The product reads like a checklist because it is one. Distillery walks the experimenter through six discrete decisions — describe the test, define variants, pick the metrics, plan the rollout, set tech parameters, launch — with progress indicators that reveal where you are and what's incomplete.

The same shape that makes the tool approachable is what makes the data trustworthy on the other end. If you've named your hypothesis, declared your safeguards, and selected your customer success metric upfront, the results screen has something honest to compare against.

Design principle

Hide the math. Surface the decisions. Make the trade-offs visible.

[ 04 ] Outcome

Distillery became the streaming team's default experimentation surface — not because the team was forced to use it, but because it spoke their language. Decisions that used to live in tickets started living in the product, where everyone could see them.

[ Up next ]
Standup