Please use this identifier to cite or link to this item:
https://dspace.iiti.ac.in/handle/123456789/16432
Title: | Double Successive Over-Relaxation Q-Learning With an Extension to Deep Reinforcement Learning |
Authors: | Sumithra Rudresha, Shreyas |
Keywords: | Deep reinforcement learning (RL);Markov decision processes (MDPs);overestimation bias;successive over-relaxation (SOR) |
Issue Date: | 2025 |
Publisher: | Institute of Electrical and Electronics Engineers Inc. |
Citation: | Shreyas, S. R. (2025). Double Successive Over-Relaxation Q-Learning With an Extension to Deep Reinforcement Learning. IEEE Transactions on Neural Networks and Learning Systems. https://doi.org/10.1109/TNNLS.2025.3576581 |
Abstract: | Q-learning (QL) is a widely used algorithm in reinforcement learning (RL), but its convergence can be slow, especially when the discount factor is close to one. Successive over-relaxation (SOR) QL, which introduces a relaxation factor to speed up convergence, addresses this issue but has two major limitations. In the tabular setting, the relaxation parameter depends on transition probability, making it not entirely model-free, and it suffers from overestimation bias. To overcome these limitations, we propose a sample-based, model-free double SORQL (MF-DSORQL) algorithm. Theoretically and empirically, this algorithm is shown to be less biased than SORQL. Furthermore, in the tabular setting, the convergence analysis under boundedness assumptions on iterates is discussed. The proposed algorithm is extended to large-scale problems using deep RL. Finally, both the tabular version of the proposed algorithm and its deep RL extension are tested on benchmark examples. © 2012 IEEE. |
URI: | https://dx.doi.org/10.1109/TNNLS.2025.3576581 https://dspace.iiti.ac.in:8080/jspui/handle/123456789/16432 |
ISSN: | 2162-237X |
Type of Material: | Journal Article |
Appears in Collections: | Department of Mathematics |
Files in This Item:
There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
Altmetric Badge: