Q Learning Algorithm - Search News

News

New “bandit” algorithm uses light for better bets - EurekAlert!

Unlike basic Q-learning algorithms, which generally focus on finding the optimal path to maximize rewards, the modified bandit Q-learning algorithm aims to learn the optimal Q value for every ...

JSTOR Daily10mon

Q-Learning for Risk-Sensitive Control on JSTOR

We propose for risk-sensitive control of finite Markov chains a counterpart of the popular Q-learning algorithm for classical Markov decision processes. The algorithm is shown to converge with ...

Results that may be inaccessible to you are currently showing.

Hide inaccessible results

News

New “bandit” algorithm uses light for better bets - EurekAlert!

Q-Learning for Risk-Sensitive Control on JSTOR

Trending now