Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/15855
Full metadata record
DC FieldValueLanguage
dc.contributor.authorVijesh, Antonyen_US
dc.contributor.authorSumithra Rudresha, Shreyasen_US
dc.date.accessioned2025-04-11T06:15:40Z-
dc.date.available2025-04-11T06:15:40Z-
dc.date.issued2025-
dc.identifier.citationAntony Vijesh, V., & Shreyas, S. R. (2025). A Weighted Smooth Q-Learning Algorithm. IEEE Control Systems Letters. https://doi.org/10.1109/LCSYS.2025.3551265en_US
dc.identifier.issn2475-1456-
dc.identifier.otherEID(2-s2.0-105000627093)-
dc.identifier.urihttps://doi.org/10.1109/LCSYS.2025.3551265-
dc.identifier.urihttps://dspace.iiti.ac.in/handle/123456789/15855-
dc.description.abstractQ-learning and double Q-learning are well-known sample-based, off-policy reinforcement learning algorithms. However, Q-learning suffers from overestimation bias, while double Q-learning suffers from underestimation bias. To address these issues, this letter proposes a weighted smooth Q-learning (WSQL) algorithm. The proposed algorithm employs a weighted combination of the mellowmax operator and the log-sum-exp operator in place of the maximum operator. Firstly, a new stochastic approximation based result is derived and as a consequence the almost sure convergence of the proposed WSQL is presented. Further, a sufficient condition for the boundedness of WSQL algorithm is obtained. Numerical experiments are conducted on benchmark examples to validate the effectiveness of the proposed weighted smooth Q-learning algorithm. © 2017 IEEE.en_US
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineers Inc.en_US
dc.sourceIEEE Control Systems Lettersen_US
dc.subjectEstimation biasen_US
dc.subjectLog-sum-expen_US
dc.subjectMellowmaxen_US
dc.subjectReinforcement learningen_US
dc.subjectStochastic approximationen_US
dc.titleA Weighted Smooth Q-Learning Algorithmen_US
dc.typeJournal Articleen_US
Appears in Collections:Department of Mathematics

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: