Machine Learning on data with sPlot background subtraction
Data analysis in high energy physics has to deal with data samples produced from different sources. One of the most widely used ways to unfold their contributions is the sPlot technique. It uses the results of a maximum likelihood fit to assign weights to events. Some weights produced by sPlot are by design negative. Negative weights make it difficult to apply machine learning methods. Loss function becomes unbounded and neural network training divergent. In this paper we propose a mathematically rigorous way to transform the weights obtained by sPlot into class probabilities conditioned on observables thus enabling to apply any machine learning algorithm out-of-the box.
READ FULL TEXT