Analysing emerging algorithms in multi-agent reinforcement learning

Gutgutia, Yash Vardhan; Ahuja, Kapil [Guide]

Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/10408

Full metadata record

DC Field	Value	Language
dc.contributor.author	Gutgutia, Yash Vardhan	en_US
dc.contributor.author	Ahuja, Kapil [Guide]	en_US
dc.date.accessioned	2022-07-06T06:28:27Z	-
dc.date.available	2022-07-06T06:28:27Z	-
dc.date.issued	2022-05-25	-
dc.identifier.uri	https://dspace.iiti.ac.in/handle/123456789/10408	-
dc.description.abstract	Reinforcement Learning has advanced significantly in solving decision-making problems in Machine Learning (ML), most of which involve more than one agent. We categorize such problems as multi-agent problems and use Multi-Agent Reinforcement Learning (MARL) to solve them. In this project, we examine a family of multi-agent environments (PettingZoo) and analyze two state-of-the-art multi-agent algorithms: Multi-Agent Deep Deterministic Policy Gradient (MADDPG) and Deep Deterministic Policy Gradient (DDPG). We aim to train the agents in these newly developed multi-agent environments under both algorithms. We then analyze their training curves using appropriate benchmarking techniques and re-establish that MADDPG’s centralized critic plays an essential role for agents that depend on communication and coordination.	en_US
dc.language.iso	en	en_US
dc.publisher	Department of Computer Science and Engineering, IIT Indore	en_US
dc.relation.ispartofseries	BTP599;CSE 2022 GUT	-
dc.subject	Computer Science and Engineering	en_US
dc.title	Analysing emerging algorithms in multi-agent reinforcement learning	en_US
dc.type	B.Tech Project	en_US
Appears in Collections:	Department of Computer Science and Engineering_BTP

Files in This Item:

File	Description	Size	Format
BTP_599_Yash_Vardhan_Gutgutia_180001064.pdf		1.93 MB	Adobe PDF
