Please use this identifier to cite or link to this item:
https://dspace.iiti.ac.in/handle/123456789/10408
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Gutgutia, Yash Vardhan | en_US |
dc.contributor.author | Ahuja, Kapil [Guide] | en_US |
dc.date.accessioned | 2022-07-06T06:28:27Z | - |
dc.date.available | 2022-07-06T06:28:27Z | - |
dc.date.issued | 2022-05-25 | - |
dc.identifier.uri | https://dspace.iiti.ac.in/handle/123456789/10408 | - |
dc.description.abstract | Reinforcement Learning has witnessed significant advancement in solving various decision-making problems in Machine Learning (ML), most of which involve more than one agent. We categorize such problems as multi-agent problems and utilize Multi-Agent Reinforcement Learning (MARL) to solve them. In this project, we shall have a look at a family of multi-agent environments (PettingZoo) and analyze two state-of-the-art multi-agent algorithms - Multi-Agent Deep Deterministic Policy Gradient (MADDPG) and Deep Deterministic Policy Gradient (DDPG). We aim to train the agents in these newly developed multi-agent environments under both algorithms. After that, we shall analyze their training curves using appropriate benchmarking techniques and re-establish how MADDPG’s centralized critic plays an essential role in communication/coordination-based agents. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Department of Computer Science and Engineering, IIT Indore | en_US |
dc.relation.ispartofseries | BTP599;CSE 2022 GUT | - |
dc.subject | Computer Science and Engineering | en_US |
dc.title | Analysing emerging algorithms in multi-agent reinforcement learning | en_US |
dc.type | B.Tech Project | en_US |
Appears in Collections: | Department of Computer Science and Engineering_BTP |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
BTP_599_Yash_Vardhan_Gutgutia_180001064.pdf | | 1.93 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.