Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/10558
Title: Combining Holistic Source Code Representation with Siamese Neural Networks for Detecting Code Clones
Authors: Patel, Smit
Keywords: C++ (programming language);Cloning;Codes (symbols);Data flow analysis;Deep neural networks;Flow graphs;Network coding;Semantics;Syntactics;Trees (mathematics);Abstract syntax tree;Abstract Syntax Trees;Code clone;Control flow graph;Control-flow graphs;Deep learning;Functional code clone;Functional codes;Neural-networks;Siamese neural network;Embeddings
Issue Date: 2022
Publisher: Springer Science and Business Media Deutschland GmbH
Citation: Patel, S., & Sinha, R. (2022). Combining Holistic Source Code Representation with Siamese Neural Networks for Detecting Code Clones. In D. Clark, H. Menendez, & A. R. Cavalli (Eds.), Testing Software and Systems (Vol. 13045, pp. 148–159). Springer International Publishing. https://doi.org/10.1007/978-3-031-04673-5_12
Abstract: Code clones can be defined as two identical pieces of code having the same or similar functionality. Code clone detection is critical to improve and sustain code quality. Current methods are unable to extract semantic and syntactic features and classify code bases satisfactorily. We propose a novel two-stage machine-learning approach towards code clone detection. Firstly, multiple intermediate representations of source code are extracted and combined to generate a holistic embedding based on a recently proposed technique. Next, we use these embeddings to train an Intermediate Merge Siamese Neural Network to detect functional code clones. Siamese Neural Networks are a state-of-the-art machine learning architecture particularly suited to code clone detection. This novel combination allows for learning subtle syntactic and semantic features and identifying previously undetectable similarities. Our solution shows a significant improvement in code clone detection, as shown by experimental evaluation over the OJClone C++ dataset. © 2022, IFIP International Federation for Information Processing.
URI: https://doi.org/10.1007/978-3-031-04673-5_12
https://dspace.iiti.ac.in/handle/123456789/10558
ISBN: 978-3031046728
ISSN: 0302-9743
Type of Material: Conference Paper
Appears in Collections:Department of Mechanical Engineering

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: