Shivaram Kalyanakrishnan | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Peter Stone
126 publications
Milind Tambe
58 publications
Shalabh Bhatnagar
50 publications
Kevin Leyton-Brown
36 publications
Sarit Kraus
35 publications
Ece Kamar
27 publications
Julie Shah
24 publications
Kalpesh Krishna
23 publications
Sourav Das
17 publications
Vihari Piratla
14 publications
David Parkes
13 publications

research

∙ 11/28/2022

Some Upper Bounds on the Running Time of Policy Iteration on Deterministic MDPs

Policy Iteration (PI) is a widely used family of algorithms to compute o...

0 Ritesh Goenka, et al. ∙

research

∙ 10/31/2022

Artificial Intelligence and Life in 2030: The One Hundred Year Study on Artificial Intelligence

In September 2016, Stanford's "One Hundred Year Study on Artificial Inte...

0 Peter Stone, et al. ∙

research

∙ 09/10/2021

PAC Mode Estimation using PPR Martingale Confidence Sequences

We consider the problem of correctly identifying the mode of a discrete ...

0 Shubham Anand Jain, et al. ∙

research

∙ 02/07/2021

An Analysis of Frame-skipping in Reinforcement Learning

In the practice of sequential decision making, agents are often designed...

0 Shivaram Kalyanakrishnan, et al. ∙

research

∙ 09/16/2020

Lower Bounds for Policy Iteration on Multi-action MDPs

Policy Iteration (PI) is a classical family of algorithms to compute an ...

6 Kumar Ashutosh, et al. ∙

research

∙ 01/24/2019

Regret Minimisation in Multi-Armed Bandits Using Bounded Arm Memory

In this paper, we propose a constant word (RAM model) algorithm for regr...

0 Arghya Roy Chaudhuri, et al. ∙

research

∙ 01/24/2019

PAC Identification of Many Good Arms in Stochastic Multi-Armed Bandits

We consider the problem of identifying any k out of the best m arms in a...

0 Arghya Roy Chaudhuri, et al. ∙

research

∙ 11/17/2017

RLWS: A Reinforcement Learning based GPU Warp Scheduler

The Streaming Multiprocessors (SMs) of a Graphics Processing Unit (GPU) ...

0 Jayvant Anantpur, et al. ∙

Success!

An error occurred