Fair Classification with Noisy Protected Attributes

06/08/2020
by   L. Elisa Celis, et al.

Due to the growing deployment of classification algorithms in various social contexts, developing methods that are fair with respect to protected attributes such as gender or race is an important problem. However, the information about protected attributes in datasets may be inaccurate, either because of issues with data collection or because the protected attributes are themselves predicted by algorithms. Such inaccuracies can prevent existing fair classification algorithms from achieving the desired fairness guarantees. Motivated by this, we study fair classification problems in which the protected attributes in the data may be “noisy”. In particular, we consider a noise model in which any protected type may be flipped to another with some fixed probability. We propose a “denoised” fair optimization formulation that can incorporate very general fairness goals via a set of constraints, mitigates the effects of such noise perturbations, and comes with provable guarantees. Empirically, we show that our framework can lead to near-perfect statistical parity with only a slight loss in accuracy at significant noise levels.
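
To make the flip-noise model concrete, below is a minimal Python sketch (not the paper's algorithm). It simulates a binary protected attribute whose labels flip with a fixed probability, shows how the observed statistical-parity gap shrinks under that noise, and then recovers estimates of the true group-wise selection rates by inverting the flip matrix, assuming the flip probability is known. All names (`eta`, `z_true`, `y_hat`) and the correction step are illustrative assumptions, not taken from the paper.

```python
# Minimal sketch of the flip-noise model on a binary protected attribute,
# plus one standard unbiasing step when the flip probability eta is known.
import numpy as np

rng = np.random.default_rng(0)
n, eta = 100_000, 0.2            # sample size and assumed flip probability

# True protected attribute and a classifier whose selection rate depends on it,
# so the true statistical-parity gap is nonzero (0.7 vs 0.5).
z_true = rng.binomial(1, 0.4, size=n)
y_hat = rng.binomial(1, np.where(z_true == 1, 0.7, 0.5))

# Noise model from the abstract: each protected type is flipped to the other
# with fixed probability eta, independently of everything else.
flips = rng.binomial(1, eta, size=n)
z_noisy = np.where(flips == 1, 1 - z_true, z_true)

def group_rates(z, y):
    """Selection rate P(y_hat = 1 | group) for groups 0 and 1."""
    return np.array([y[z == 0].mean(), y[z == 1].mean()])

print("true  rates:", group_rates(z_true, y_hat))
print("noisy rates:", group_rates(z_noisy, y_hat))   # gap shrinks under noise

# Denoising (illustrative, assumes eta is known):
# T[z, a] = P(z_noisy = a | z_true = z).  Joint and marginal probabilities
# measured with the noisy attribute are linear images of the true ones under
# T^T, so inverting T^T recovers estimates of the true group-wise rates.
T = np.array([[1 - eta, eta],
              [eta, 1 - eta]])
joint_noisy = np.array([np.mean((z_noisy == a) & (y_hat == 1)) for a in (0, 1)])
marg_noisy = np.array([np.mean(z_noisy == a) for a in (0, 1)])
joint_est = np.linalg.solve(T.T, joint_noisy)
marg_est = np.linalg.solve(T.T, marg_noisy)
print("denoised rate estimates:", joint_est / marg_est)
```

Running this, the rates measured against the noisy attribute are pulled toward each other (so a classifier can look fairer than it is), while the inverted estimates are close to the true 0.5 and 0.7; the paper's constraint-based formulation addresses the same distortion within the optimization itself.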
