Question 1

Is a single winning test worth the cost of a CRO program?

Accepted Answer

A single big winner rarely justifies the program. A consistent cadence of 2-4 tests per month, winning or losing, compounds into real revenue over a year. We size engagements so the expected cumulative lift covers the retainer within 3-4 months. If your traffic or volume can't support that, we'll say so on the discovery call.

Question 2

Don't I need a ton of traffic to run A/B tests?

Accepted Answer

To detect a 2-3 percentage-point lift with statistical significance, you generally want at least 20,000 sessions per variant per test. Below that, tests take 6+ weeks to call and the risk of false positives climbs. We design experiment plans that match your traffic: fewer, higher-impact tests for mid-traffic sites, and a faster cadence for high-traffic sites.
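As a rough illustration of where a per-variant session count comes from, here is a minimal sketch of the standard sample-size approximation for a two-sided two-proportion z-test. The function name, the 5% significance level, the 80% power, and the example baseline and lift are all assumptions chosen for illustration; they are not figures from the answer above, and the required sample depends heavily on your own baseline conversion rate.

```python
from math import ceil, sqrt
from statistics import NormalDist


def sessions_per_variant(baseline, lift, alpha=0.05, power=0.80):
    """Approximate sessions needed per variant (hypothetical helper).

    baseline: control conversion rate, e.g. 0.03 for 3%
    lift:     absolute lift to detect, e.g. 0.005 for +0.5 points
    alpha:    two-sided significance level (assumed 5%)
    power:    chance of detecting a true lift of this size (assumed 80%)
    """
    p1 = baseline
    p2 = baseline + lift
    p_bar = (p1 + p2) / 2

    # Critical values from the standard normal distribution.
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)

    numerator = (
        z_alpha * sqrt(2 * p_bar * (1 - p_bar))
        + z_beta * sqrt(p1 * (1 - p1) + p2 * (1 - p2))
    ) ** 2
    return ceil(numerator / (p2 - p1) ** 2)


if __name__ == "__main__":
    # Assumed example: 3% baseline, detect a +0.5 point absolute lift.
    print(sessions_per_variant(baseline=0.03, lift=0.005))
```

Halving the lift you want to detect roughly quadruples the sessions required, which is why lower-traffic sites are better served by fewer tests aimed at larger effects.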

Question 3

What tools do you use for experimentation?

Accepted Answer

We use GrowthBook for most engineering-adjacent teams (open source, self-hostable, free), Optimizely for enterprise teams already using it, LaunchDarkly when feature flags and experiments are combined, and VWO or Convert for marketing-led teams that want a visual editor. We configure, integrate, and hand off. We are not tool salespeople.

Question 4

Do you write copy and design the variants?

Accepted Answer

We write the hypothesis and experiment spec. For copy and design, we partner with your in-house team or bring a designer we've worked with. We own the statistical rigor, the analysis, and the recommendation. You own the creative craft.

Question 5

What about p-hacking and false positives?

Accepted Answer

We use predefined success metrics, locked sample sizes, and Bayesian analysis or sequential testing where appropriate. No peeking, no cherry-picking. Every test ships with a pre-registration doc before launch and a post-test memo after. If a test is inconclusive, we say so - we don't manufacture wins.
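To make the Bayesian option concrete, here is a minimal sketch of one common approach: a Beta-Binomial model that estimates the probability that variant B converts better than variant A. The uniform Beta(1, 1) prior, the function name, the number of draws, and the example counts are assumptions for illustration only; this is not a description of the exact analysis used in any given engagement.

```python
import random


def prob_b_beats_a(conv_a, n_a, conv_b, n_b, draws=20000, seed=0):
    """Monte Carlo estimate of P(rate_B > rate_A) (hypothetical helper).

    conv_*: conversions observed in each variant
    n_*:    sessions observed in each variant
    Assumes a uniform Beta(1, 1) prior on each conversion rate.
    """
    rng = random.Random(seed)  # fixed seed so the estimate is reproducible
    wins = 0
    for _ in range(draws):
        # Sample each rate from its posterior: Beta(1 + successes, 1 + failures).
        rate_a = rng.betavariate(1 + conv_a, 1 + n_a - conv_a)
        rate_b = rng.betavariate(1 + conv_b, 1 + n_b - conv_b)
        if rate_b > rate_a:
            wins += 1
    return wins / draws


if __name__ == "__main__":
    # Made-up counts: 300/10,000 conversions for A versus 340/10,000 for B.
    print(prob_b_beats_a(300, 10_000, 340, 10_000))
```

The decision threshold (for example, the probability at which a variant ships) belongs in the pre-registration doc, fixed before launch, so the result cannot be tuned after the data arrives.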

Question 6

How do you decide what to test first?

Accepted Answer

We run a research sprint first: analytics review, session replay analysis, on-site surveys, funnel breakdown, and stakeholder interviews. The output is a prioritised backlog scored by expected impact, confidence in the hypothesis, and ease of build. High-impact, high-confidence, low-effort tests go first.
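The scoring described above can be sketched as a small ranking routine. The 1-10 scales, the simple average used to combine the three factors, and the example ideas are all assumptions for illustration; the answer does not say how the three scores are weighted or combined.

```python
from dataclasses import dataclass


@dataclass
class Idea:
    """A backlog item scored on three assumed 1-10 scales."""
    name: str
    impact: int       # expected impact if the hypothesis holds
    confidence: int   # confidence in the hypothesis
    ease: int         # ease of build (higher means less effort)

    @property
    def score(self) -> float:
        # Assumption: an unweighted average of the three factors.
        return (self.impact + self.confidence + self.ease) / 3


def prioritise(ideas):
    """Return ideas sorted from highest to lowest score."""
    return sorted(ideas, key=lambda idea: idea.score, reverse=True)


if __name__ == "__main__":
    # Hypothetical backlog entries, not real client data.
    backlog = [
        Idea("Rewrite pricing page headline", impact=6, confidence=7, ease=9),
        Idea("Rebuild checkout flow", impact=9, confidence=6, ease=2),
        Idea("Add trust badges to cart", impact=4, confidence=5, ease=10),
    ]
    for idea in prioritise(backlog):
        print(f"{idea.score:.1f}  {idea.name}")
```

An average keeps the ranking easy to explain to stakeholders; a weighted sum or a product of the factors would penalise low-confidence ideas more heavily.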

Experiments that ship, with the statistics to back them.

Six deliverables, one program

Research sprint

Experiment program

Tool implementation

Statistical rigor

Post-test analysis

Program operations

Four phases to a compounding program

From first call to dashboards your clients trust

Research

Instrument

Ship

Compound

Questions we hear often

Is a single winning test worth the cost of a CRO program?

Don't I need a ton of traffic to run A/B tests?

What tools do you use for experimentation?

Do you write copy and design the variants?

What about p-hacking and false positives?

How do you decide what to test first?

Ready to ship tests that actually compound?
