A Weighted Smooth Q-Learning Algorithm

Vijesh, Antony; Sumithra Rudresha, Shreyas

Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/15903

Full metadata record

DC Field	Value	Language
dc.contributor.author	Vijesh, Antony	en_US
dc.contributor.author	Sumithra Rudresha, Shreyas	en_US
dc.date.accessioned	2025-04-22T17:45:32Z	-
dc.date.available	2025-04-22T17:45:32Z	-
dc.date.issued	2025	-
dc.identifier.citation	Antony Vijesh, V., & Shreyas, S. R. (2025). A Weighted Smooth Q-Learning Algorithm. IEEE Control Systems Letters. https://doi.org/10.1109/LCSYS.2025.3551265	en_US
dc.identifier.issn	2475-1456	-
dc.identifier.other	EID(2-s2.0-105002334487)	-
dc.identifier.uri	https://doi.org/10.1109/LCSYS.2025.3551265	-
dc.identifier.uri	https://dspace.iiti.ac.in/handle/123456789/15903	-
dc.description.abstract	Q-learning and double Q-learning are well-known sample-based, off-policy reinforcement learning algorithms. However, Q-learning suffers from overestimation bias, while double Q-learning suffers from underestimation bias. To address these issues, this letter proposes a weighted smooth Q-learning (WSQL) algorithm. The proposed algorithm employs a weighted combination of the mellowmax operator and the log-sum-exp operator in place of the maximum operator. Firstly, a new stochastic approximation based result is derived and as a consequence the almost sure convergence of the proposed WSQL is presented. Further, a sufficient condition for the boundedness of WSQL algorithm is obtained. Numerical experiments are conducted on benchmark examples to validate the effectiveness of the proposed weighted smooth Q-learning algorithm. © 2017 IEEE.	en_US
dc.language.iso	en	en_US
dc.publisher	Institute of Electrical and Electronics Engineers Inc.	en_US
dc.source	IEEE Control Systems Letters	en_US
dc.subject	Estimation bias	en_US
dc.subject	log-sum-exp	en_US
dc.subject	mellowmax	en_US
dc.subject	reinforcement learning	en_US
dc.subject	stochastic approximation	en_US
dc.title	A Weighted Smooth Q-Learning Algorithm	en_US
dc.type	Journal Article	en_US
Appears in Collections:	Department of Mathematics

Files in This Item:

There are no files associated with this item.

Show simple item record

Altmetric Badge: