Trust Region Policy Optimization (TRPO) is a popular and empirically
suc...
Inspired by double q learning algorithm, the double DQN algorithm was
or...
In recent years, there have been many deep structures for Reinforcement
...
The K-means algorithm is among the most commonly used data clustering
me...
In this paper, a novel stepwise learning approach based on estimating de...
In this paper, a new self-organizing fuzzy neural network model is prese...