Machine Learning on data with sPlot background subtraction

05/28/2019
by   Maxim Borisyak, et al.
0

Data analysis in high energy physics has to deal with data samples produced from different sources. One of the most widely used ways to unfold their contributions is the sPlot technique. It uses the results of a maximum likelihood fit to assign weights to events. Some weights produced by sPlot are by design negative. Negative weights make it difficult to apply machine learning methods. Loss function becomes unbounded and neural network training divergent. In this paper we propose a mathematically rigorous way to transform the weights obtained by sPlot into class probabilities conditioned on observables thus enabling to apply any machine learning algorithm out-of-the box.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset