Multi-armed bandit algorithms such as Thompson sampling (TS) have been put forth for decades as useful tools for optimizing sequential decision-making in online experiments. By skewing the allocation ratio towards superior arms, they can minimize …
Adaptive digital field experiments are continually increasing in their breadth of use in fields like mobile health and digital education. Using adaptive experimentation in education can help not only to explore and eventually compare various arms but …