Yuguang Yue

Featured Co-authors

Mingyuan Zhou
110 publications
Yunhao Tang
41 publications
Zhendong Wang
29 publications
Purnamrita Sarkar
25 publications
Wenzhe Shi
16 publications
Wenyuan Li
16 publications
Mingzhang Yin
14 publications
Corey W. Arnold
12 publications
Zichen Wang
12 publications
William Speier
10 publications
Jonathan J Hunt
10 publications

research

∙ 01/19/2022

Learning to Rank For Push Notifications Using Pairwise Expected Regret

Listwise ranking losses have been widely studied in recommender systems....

Yuguang Yue, et al.

research

∙ 07/13/2020

Implicit Distributional Reinforcement Learning

To improve the sample efficiency of policy-gradient based reinforcement ...

Yuguang Yue, et al.

research

∙ 02/10/2020

Discrete Action On-Policy Learning with Action-Value Critic

Reinforcement learning (RL) in discrete action space is ubiquitous in re...

Yuguang Yue, et al.

research

∙ 10/18/2019

Semi-supervised Learning using Adversarial Training with Good and Bad Samples

In this work, we investigate semi-supervised learning (SSL) for image cl...

Wenyuan Li, et al.

research

∙ 10/17/2019

A Unified Framework for Tuning Hyperparameters in Clustering Problems

Selecting hyperparameters for unsupervised learning problems is difficul...

Xinjie Fan, et al.

research

∙ 05/04/2019

ARSM: Augment-REINFORCE-Swap-Merge Estimator for Gradient Backpropagation Through Categorical Variables

To address the challenge of backpropagating the gradient through categor...

Mingzhang Yin, et al.

research

∙ 07/21/2018

T-optimal design for multivariate polynomial regression using semidefinite programming

We consider T-optimal experiment design problems for discriminating mult...

Yuguang Yue, et al.
