Double Successive Over-Relaxation Q-Learning With an Extension to Deep Reinforcement Learning

Sumithra Rudresha, Shreyas

Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/16432

Title:	Double Successive Over-Relaxation Q-Learning With an Extension to Deep Reinforcement Learning
Authors:	Sumithra Rudresha, Shreyas
Keywords:	Deep reinforcement learning (RL);Markov decision processes (MDPs);overestimation bias;successive over-relaxation (SOR)
Issue Date:	2025
Publisher:	Institute of Electrical and Electronics Engineers Inc.
Citation:	Shreyas, S. R. (2025). Double Successive Over-Relaxation Q-Learning With an Extension to Deep Reinforcement Learning. IEEE Transactions on Neural Networks and Learning Systems. https://doi.org/10.1109/TNNLS.2025.3576581
Abstract:	Q-learning (QL) is a widely used algorithm in reinforcement learning (RL), but its convergence can be slow, especially when the discount factor is close to one. Successive over-relaxation (SOR) QL, which introduces a relaxation factor to speed up convergence, addresses this issue but has two major limitations. In the tabular setting, the relaxation parameter depends on transition probability, making it not entirely model-free, and it suffers from overestimation bias. To overcome these limitations, we propose a sample-based, model-free double SORQL (MF-DSORQL) algorithm. Theoretically and empirically, this algorithm is shown to be less biased than SORQL. Furthermore, in the tabular setting, the convergence analysis under boundedness assumptions on iterates is discussed. The proposed algorithm is extended to large-scale problems using deep RL. Finally, both the tabular version of the proposed algorithm and its deep RL extension are tested on benchmark examples. © 2012 IEEE.
URI:	https://dx.doi.org/10.1109/TNNLS.2025.3576581 https://dspace.iiti.ac.in:8080/jspui/handle/123456789/16432
ISSN:	2162-237X
Type of Material:	Journal Article
Appears in Collections:	Department of Mathematics

Files in This Item:

There are no files associated with this item.

Show full item record

Altmetric Badge: