Large-scale pre-trained language models such as BERT are popular solutio...
Clinical notes are assigned ICD codes - sets of codes for diagnoses and
...
Prediction using the ground truth sounds like an oxymoron in machine
lea...
The importance of parameter selection in supervised learning is well kno...
We study the problem of learning similarity by using nonlinear embedding...
Deep learning involves a difficult non-convex optimization problem, whic...
Deep learning involves a difficult non-convex optimization problem with ...