Non-Convex SGD Learns Halfspaces with Adversarial Label Noise

06/11/2020
by Ilias Diakonikolas, et al.

We study the problem of agnostically learning homogeneous halfspaces in the distribution-specific PAC model. For a broad family of structured distributions, including log-concave distributions, we show that non-convex SGD efficiently converges to a solution with misclassification error $O(\mathrm{OPT}) + \epsilon$, where $\mathrm{OPT}$ is the misclassification error of the best-fitting halfspace. In sharp contrast, we show that optimizing any convex surrogate inherently leads to misclassification error of $\omega(\mathrm{OPT})$, even under Gaussian marginals.
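To make the setting concrete, here is a minimal sketch of projected SGD on a non-convex sigmoidal surrogate of the 0-1 loss for a homogeneous halfspace under Gaussian marginals. It is an illustration under stated assumptions, not the paper's algorithm: the specific surrogate (a logistic function of the margin with scale `sigma`), the step-size schedule `lr0 / sqrt(t + 1)`, and the function name `sgd_halfspace` are all hypothetical choices, and independent random label flips stand in for adversarial label noise only because they are easy to simulate.

```python
import math
import random


def sigmoid(t):
    """Numerically stable logistic function."""
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)


def dot(a, b):
    return sum(p * q for p, q in zip(a, b))


def normalize(v):
    n = math.sqrt(dot(v, v))
    return [p / n for p in v]


def sgd_halfspace(w_star, w0, steps=20000, noise_rate=0.1,
                  sigma=0.5, lr0=0.5, seed=0):
    """Projected SGD on the unit sphere for the non-convex surrogate
    loss(w; x, y) = sigmoid(-y * <w, x> / sigma).

    Assumptions (not taken from the source): x ~ N(0, I), labels come from
    sign(<w_star, x>) and are flipped independently with probability
    noise_rate, and the step size decays as lr0 / sqrt(t + 1).
    """
    rng = random.Random(seed)
    w = normalize(w0)
    for t in range(steps):
        # Draw one labelled example with a randomly flipped label.
        x = [rng.gauss(0.0, 1.0) for _ in w_star]
        y = 1.0 if dot(w_star, x) >= 0 else -1.0
        if rng.random() < noise_rate:
            y = -y
        # Gradient of sigmoid(-y <w, x> / sigma) with respect to w.
        s = sigmoid(-y * dot(w, x) / sigma)
        coef = -s * (1.0 - s) * y / sigma
        g = [coef * xi for xi in x]
        # Remove the component along w so the step stays tangent to the sphere.
        gw = dot(g, w)
        g = [gi - gw * wi for gi, wi in zip(g, w)]
        # Gradient step followed by projection back onto the unit sphere.
        lr = lr0 / math.sqrt(t + 1)
        w = normalize([wi - lr * gi for wi, gi in zip(w, g)])
    return w


w_star = [1.0, 0.0, 0.0, 0.0, 0.0]
w_hat = sgd_halfspace(w_star, w0=[0.0, 1.0, 0.0, 0.0, 0.0])
print("cosine to target:", dot(w_hat, w_star))
```

Because the sigmoid is bounded, a mislabelled point far from the decision boundary contributes almost no gradient. A convex surrogate such as the hinge loss grows without bound on such points, which gives some intuition for the contrast the abstract draws between the two approaches.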
