Please use this identifier to cite or link to this item:
https://dspace.iiti.ac.in/handle/123456789/15903
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Vijesh, Antony | en_US |
dc.contributor.author | Sumithra Rudresha, Shreyas | en_US |
dc.date.accessioned | 2025-04-22T17:45:32Z | - |
dc.date.available | 2025-04-22T17:45:32Z | - |
dc.date.issued | 2025 | - |
dc.identifier.citation | Antony Vijesh, V., & Shreyas, S. R. (2025). A Weighted Smooth Q-Learning Algorithm. IEEE Control Systems Letters. https://doi.org/10.1109/LCSYS.2025.3551265 | en_US |
dc.identifier.issn | 2475-1456 | - |
dc.identifier.other | EID(2-s2.0-105002334487) | - |
dc.identifier.uri | https://doi.org/10.1109/LCSYS.2025.3551265 | - |
dc.identifier.uri | https://dspace.iiti.ac.in/handle/123456789/15903 | - |
dc.description.abstract | Q-learning and double Q-learning are well-known sample-based, off-policy reinforcement learning algorithms. However, Q-learning suffers from overestimation bias, while double Q-learning suffers from underestimation bias. To address these issues, this letter proposes a weighted smooth Q-learning (WSQL) algorithm. The proposed algorithm employs a weighted combination of the mellowmax operator and the log-sum-exp operator in place of the maximum operator. Firstly, a new stochastic approximation based result is derived and as a consequence the almost sure convergence of the proposed WSQL is presented. Further, a sufficient condition for the boundedness of WSQL algorithm is obtained. Numerical experiments are conducted on benchmark examples to validate the effectiveness of the proposed weighted smooth Q-learning algorithm. © 2017 IEEE. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Institute of Electrical and Electronics Engineers Inc. | en_US |
dc.source | IEEE Control Systems Letters | en_US |
dc.subject | Estimation bias | en_US |
dc.subject | log-sum-exp | en_US |
dc.subject | mellowmax | en_US |
dc.subject | reinforcement learning | en_US |
dc.subject | stochastic approximation | en_US |
dc.title | A Weighted Smooth Q-Learning Algorithm | en_US |
dc.type | Journal Article | en_US |
Appears in Collections: | Department of Mathematics |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
Altmetric Badge: