Please use this identifier to cite or link to this item: https://dspace.iiti.ac.in/handle/123456789/10558
Full metadata record
DC FieldValueLanguage
dc.contributor.authorPatel, Smiten_US
dc.date.accessioned2022-07-15T10:45:40Z-
dc.date.available2022-07-15T10:45:40Z-
dc.date.issued2022-
dc.identifier.citationPatel, S., & Sinha, R. (2022). Combining Holistic Source Code Representation with Siamese Neural Networks for Detecting Code Clones. In D. Clark, H. Menendez, & A. R. Cavalli (Eds.), Testing Software and Systems (Vol. 13045, pp. 148–159). Springer International Publishing. https://doi.org/10.1007/978-3-031-04673-5_12en_US
dc.identifier.isbn978-3031046728-
dc.identifier.issn0302-9743-
dc.identifier.otherEID(2-s2.0-85130246343)-
dc.identifier.urihttps://doi.org/10.1007/978-3-031-04673-5_12-
dc.identifier.urihttps://dspace.iiti.ac.in/handle/123456789/10558-
dc.description.abstractCode clones can be defined as two identical pieces of code having the same or similar functionality. Code clone detection is critical to improve and sustain code quality. Current methods are unable to extract semantic and syntactic features and classify code bases satisfactorily. We propose a novel two-stage machine-learning approach towards code clone detection. Firstly, multiple intermediate representations of source code are extracted and combined to generate a holistic embedding based on a recently proposed technique. Next, we use these embeddings to train an Intermediate Merge Siamese Neural Network to detect functional code clones. Siamese Neural Networks are a state-of-the-art machine learning architecture particularly suited to code clone detection. This novel combination allows for learning subtle syntactic and semantic features and identifying previously undetectable similarities. Our solution shows a significant improvement in code clone detection, as shown by experimental evaluation over the OJClone C++ dataset. © 2022, IFIP International Federation for Information Processing.en_US
dc.language.isoenen_US
dc.publisherSpringer Science and Business Media Deutschland GmbHen_US
dc.sourceLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)en_US
dc.subjectC++ (programming language)en_US
dc.subjectCloningen_US
dc.subjectCodes (symbols)en_US
dc.subjectData flow analysisen_US
dc.subjectDeep neural networksen_US
dc.subjectFlow graphsen_US
dc.subjectNetwork codingen_US
dc.subjectSemanticsen_US
dc.subjectSyntacticsen_US
dc.subjectTrees (mathematics)en_US
dc.subjectAbstract syntax treeen_US
dc.subjectAbstract Syntax Treesen_US
dc.subjectCode cloneen_US
dc.subjectControl flow graphen_US
dc.subjectControl-flow graphsen_US
dc.subjectDeep learningen_US
dc.subjectFunctional code cloneen_US
dc.subjectFunctional codesen_US
dc.subjectNeural-networksen_US
dc.subjectSiamese neural networken_US
dc.subjectEmbeddingsen_US
dc.titleCombining Holistic Source Code Representation with Siamese Neural Networks for Detecting Code Clonesen_US
dc.typeConference Paperen_US
Appears in Collections:Department of Mechanical Engineering

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Altmetric Badge: