• Türkçe
    • English
  • English 
    • Türkçe
    • English
  • Login
View Item 
  •   FSM Vakıf
  • Fakülteler / Faculties
  • Mühendislik Fakültesi / Faculty of Engineering
  • Yazılım Mühendisliği
  • View Item
  •   FSM Vakıf
  • Fakülteler / Faculties
  • Mühendislik Fakültesi / Faculty of Engineering
  • Yazılım Mühendisliği
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.

Optimizing Pre-Trained Code Embeddings With Triplet Loss for Code Smell Detection

Thumbnail

View/Open

Ana Makale (2.162Mb)

Access

info:eu-repo/semantics/openAccess

Date

2025

Author

İslamoğlu, Ertuğrul
Adalı, Ömer Kerem
Aydın, Musa
Nizam, Ali

Metadata

Show full item record

Citation

NİZAM, Ali, Ertuğrul İSLMAOĞLU, Ömer Kerem ADALI & Musa AYDIN. "Optimizing Pre-Trained Code Embeddings With Triplet Loss for Code Smell Detection." IEEE Access, 13 (2025): 1-16.

Abstract

Code embedding represents code semantics in vector form. Although code embedding-based systems have been successfully applied to various source code analysis tasks, further research is required to enhance code embedding for better code analysis capabilities, aiming to surpass the performance and functionality of static code analysis tools. In addition, standard methods for improving code embedding are essential to develop more effective embedding-based systems, similar to augmentation techniques in the image processing domain. This study aims to create a contrastive learning-based system to explore the potential of a generic method for enhancing code embedding for code classification tasks. A triplet lossbased deep learning network is designed to optimize in-class similarity and increase the distance between classes. An experimental dataset that contains code from Java, Python, and PHP programming languages and 4 different code smells is created by collecting code from open-source repositories on GitHub. We evaluate the proposed system’s effectiveness with widely used BERT, CodeBERT, and GraphCodeBERT pretrained models to create code embedding for the code classification task of code smell detection. Our findings indicate that the proposed system may offer improvements in accuracy, an average of 8% and a maximum of 13% for models. These results suggest that incorporating contrastive learning techniques into the generation process of code representation as a preprocessing step can enhance performance in code analysis.

Source

IEEE Access

Volume

13

URI

https://ieeexplore.ieee.org/document/10890964
https://hdl.handle.net/11352/5189

Collections

  • Bilgisayar Mühendisliği Bölümü [214]
  • Scopus İndeksli Yayınlar / Scopus Indexed Publications [756]
  • WOS İndeksli Yayınlar / WOS Indexed Publications [661]
  • Yapay Zeka ve Veri Mühendisliği Bölümü [13]
  • Yazılım Mühendisliği [21]



DSpace software copyright © 2002-2015  DuraSpace
Contact Us | Send Feedback
Theme by 
@mire NV
 

 




| Policy | Guide | Contact |

DSpace@FSM

by OpenAIRE
Advanced Search

sherpa/romeo

Browse

All of DSpaceCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsTypeLanguageDepartmentCategoryPublisherAccess TypeInstitution AuthorThis CollectionBy Issue DateAuthorsTitlesSubjectsTypeLanguageDepartmentCategoryPublisherAccess TypeInstitution Author

My Account

LoginRegister

Statistics

View Google Analytics Statistics

DSpace software copyright © 2002-2015  DuraSpace
Contact Us | Send Feedback
Theme by 
@mire NV
 

 


|| Policy || Guide || Library || FSM Vakıf University || OAI-PMH ||

FSM Vakıf University, İstanbul, Turkey
If you find any errors in content, please contact:

Creative Commons License
FSM Vakıf University Institutional Repository is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 Unported License..

DSpace@FSM:


DSpace 6.2

tarafından İdeal DSpace hizmetleri çerçevesinde özelleştirilerek kurulmuştur.