research
          
      
      ∙
      08/29/2023
    Pure Exploration under Mediators' Feedback
Stochastic multi-armed bandits are a sequential-decision-making framewor...
          
            research
          
      
      ∙
      05/07/2023
    Truncating Trajectories in Monte Carlo Reinforcement Learning
In Reinforcement Learning (RL), an agent acts in an unknown environment ...
          
            research
          
      
      ∙
      07/25/2022
    Optimizing Empty Container Repositioning and Fleet Deployment via Configurable Semi-POMDPs
With the continuous growth of the global economy and markets, resource i...
          
            research
          
      
      ∙
      05/18/2021
    Meta-Reinforcement Learning by Tracking Task Non-stationarity
Many real-world domains are subject to a structured non-stationarity whi...
          
            research
          
      
      ∙
      07/01/2020
     
             
  
  
     
                             share
 share