A method of product experimentation in which users are randomly split into two or more cohorts (e.g., Group A and Group B), with each group exposed to a different variation of a feature such as copy, design, or pricing. A primary success metric is established before the test begins, and the outcome is typically measured using frequentist statistics and confidence intervals to determine if one version outperforms the other — or if results are inconclusive.