Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/17366
Title: DNN training in geo-distributed data centers using VXLAN and EVPN
Authors: Inam, Naved
Supervisors: Sharma, Sidharth
Keywords: Computer Science and Engineering
Issue Date: 23-May-2025
Publisher: Department of Computer Science and Engineering, IIT Indore
Series/Report no.: MT368;
Abstract: The exponential growth of data and increasing demands for privacy and compliance drove the shift toward geo-distributed training of Deep Neural Networks (DNNs). Traditional centralized training in a single data center faced limitations such as WAN bandwidth constraints, high latency, and cross-border data restrictions. Conventional VLAN-based approaches also lacked scalability for multi-tenant, distributed environments due to limited ID space and poor isolation. To overcome these challenges, this thesis designed and evaluated a scalable, resilient, multi-tenant virtual network architecture for geo-distributed DNN training. The system used VXLAN as a Layer 2 overlay and EVPN over MP-BGP as the control plane to extend network reachability across data centers while enabling isolated virtual networks. A realistic emulated environment was built using Containerlab and FRR, with a spine-leaf topology modeling inter-data center communication. The network was enhanced with Equal Cost Multi Path (ECMP) routing for load balancing and Bidirectional Forwarding Detection (BFD) for fast failure recovery. Prometheus, combined with SNMP and ping exporters, enabled continuous monitoring of link health, device uptime, and overall network connectivity.
URI: https://dspace.iiti.ac.in:8080/jspui/handle/123456789/17366
Type of Material: Thesis_M.Tech
Appears in Collections:Department of Computer Science and Engineering_ETD

Files in This Item:
File Description SizeFormat 
MT_368_Naved_Inam_2302101010.pdf5.61 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: