A feature-enhanced shift graph convolutional network and its application in skeleton-based action recognition

Roy, Ananya

Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/14281

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	Tiwari, Aruna	-
dc.contributor.advisor	Singh, Sanjay	-
dc.contributor.author	Roy, Ananya	-
dc.date.accessioned	2024-08-17T11:18:11Z	-
dc.date.available	2024-08-17T11:18:11Z	-
dc.date.issued	2024-06-27	-
dc.identifier.uri	https://dspace.iiti.ac.in/handle/123456789/14281	-
dc.description.abstract	Human action recognition is the process of automatically identifying and classifying human actions in a video sequence. It involves analyzing the motion and appearance of humans in the video and recognizing the action they are performing. Human actions can be represented using various data modalities, such as RGB videos, skeleton graphs, depth sequences and heat maps. In recent years, skeleton-based action recognition has drawn a lot of attention in the area of computer vision. To extract skeletal data, pose estimation algorithms are used on action videos to track the key joints involved in the action. These joints are connected with edges representing the bones involved in the action, thus forming a graph structure of the human action. Skeleton data is lightweight as compared to video data, and is more robust against changes in appearance, lighting conditions, background clutter and camera viewpoints. Among existing methods, Graph Convolutional Networks (GCNs) have achieved exceptional results as they are highly efficient in feature extraction from non-euclidean or irregular data. However, most existing GCN-based methods are computationally expensive and have inflexible receptive fields, due to which their expressiveness is limited. As a result, focus has shifted towards building lightweight architectures which require fewer parameters. One such method is Shift-GCN [1], which uses shift graph operations that are both lightweight and increase the flexibility of receptive fields in both spatial and temporal dimensions. However, although this method captures non-local and distant spatial relationships in a lightweight and more efficient manner, it does not perform well on fine-grained actions that have subtle differences and require capturing graph connection information.	en_US
dc.language.iso	en	en_US
dc.publisher	Department of Computer Science and Engineering, IIT Indore	en_US
dc.relation.ispartofseries	MSR045;	-
dc.subject	Computer Science and Engineering	en_US
dc.title	A feature-enhanced shift graph convolutional network and its application in skeleton-based action recognition	en_US
dc.type	Thesis_MS Research	en_US
Appears in Collections:	Department of Computer Science and Engineering_ETD

Files in This Item:

File	Description	Size	Format
MSR045_Ananya_Roy_2104101013.pdf		1.47 MB	Adobe PDF	View/Open

Show simple item record

Altmetric Badge: