Cost-Efficient Fixed-Width Confidence Intervals for the Difference of Two Bernoulli Proportions
We study properties of confidence intervals (CIs) for the difference of two Bernoulli distributions' success parameters, p_x - p_y, in the case where the goal is to obtain a CI of a given half-width while minimizing sampling costs when the observation costs may be different between the two distributions. Assuming that we are provided with preliminary estimates of the success parameters, we propose three different methods for constructing fixed-width CIs: (i) a two-stage sampling procedure, (ii) a sequential method that carries out sampling in batches, and (iii) an ℓ-stage "look-ahead" procedure. We use Monte Carlo simulation to show that, under diverse success probability and observation cost scenarios, our proposed algorithms obtain significant cost savings versus their baseline counterparts (up to 50% for the two-stage procedure, up to 15% for the sequential methods). Furthermore, for the battery of scenarios under study, our sequential-batches and ℓ-stage "look-ahead" procedures approximately obtain the nominal coverage while also meeting the desired width requirement. Our sequential-batching method turned out to be more efficient than the "look-ahead" method from a computational standpoint, with average running times at least an order-of-magnitude faster over all the scenarios tested.
READ FULL TEXT