Sequential A/B Testing Keeps the World Streaming Netflix Part 1: Continuous Data
<p>Can you spot any difference between the two data streams below? Each observation is the time interval between a Netflix member hitting the play button and playback commencing, i.e., <em>play-delay</em>. These observations are from a particular type of A/B test that Netflix runs called a software canary or regression-driven experiment. More on that below — for now, what’s important is that we want to <strong>quickly</strong> and <strong>confidently</strong> identify any difference in the distribution of play-delay — or conclude that, within some tolerance, there is no difference.</p>
<p>In this blog post, we will develop a statistical procedure to do just that, and describe the impact of these developments at Netflix. The key idea is to switch from a “fixed time horizon” to an “any-time valid” framing of the problem.</p>
<p><a href="https://netflixtechblog.com/sequential-a-b-testing-keeps-the-world-streaming-netflix-part-1-continuous-data-cba6c7ed49df"><strong>Website</strong></a></p>