21st MSR 2024: Lisbon, Portugal
- Diomidis Spinellis, Alberto Bacchelli, Eleni Constantinou:
21st IEEE/ACM International Conference on Mining Software Repositories, MSR 2024, Lisbon, Portugal, April 15-16, 2024. ACM 2024, ISBN 979-8-4007-0587-8 - Gwenyth Lincroft, Minsung Cho, Katherine Hough, Mahsa Bazzaz, Jonathan Bell:
Thirty-Three Years of Mathematicians and Software Engineers: A Case Study of Domain Expertise and Participation in Proof Assistant Ecosystems. 1-13 - Can Li, Jingxuan Zhang, Yixuan Tang, Zhuhang Li, Tianyue Sun:
Boosting API Misuse Detection via Integrating API Constraints from Multiple Sources. 14-26 - Ricardo de Sousa Job, Andre Hora:
Availability and Usage of Platform-Specific APIs: A First Empirical Study. 27-31 - Jordan Samhi, Tegawendé F. Bissyandé, Jacques Klein:
AndroLibZoo: A Reliable Dataset of Libraries Based on Software Dependency Analysis. 32-36 - Damien Jaime, Joyce El Haddad, Pascal Poizat:
Goblin: A Framework for Enriching and Querying the Maven Central Dependency Graph. 37-41 - Mahmoud Jahanshahi, Audris Mockus:
Dataset: Copy-based Reuse in Open Source Software. 42-47 - Shane McIntosh:
Mining Our Way Back to Incremental Builds for DevOps Pipelines. 48-49 - Guoliang Zhao, Stefanos Georgiou, Ying Zou, Safwat Hassan, Derek Truong, Toby Corbin:
Enhancing Performance Bug Prediction Using Performance Code Metrics. 50-62 - Xingyu Li, Zheng Zhang, Zhiyun Qian, Trent Jaeger, Chengyu Song:
An Investigation of Patch Porting Practices of the Linux Kernel Ecosystem. 63-74 - Philip Oliver, Jens Dietrich, Craig Anslow, Michael Homer:
CrashJS: A NodeJS Benchmark for Automated Crash Reproduction. 75-87 - Xhulja Shahini, Andreas Metzger, Klaus Pohl:
An Empirical Study on Just-in-time Conformal Defect Prediction. 88-99 - Mahi Begoug, Moataz Chouchen, Ali Ouni, Eman Abdullah AlOmar, Mohamed Wiem Mkaouer:
Fine-Grained Just-In-Time Defect Prediction at the Block Level in Infrastructure-as-Code (IaC). 100-112 - Kaibo Liu, Yudong Han, Yiyang Liu, Jie M. Zhang, Zhenpeng Chen, Federica Sarro, Gang Huang, Yun Ma:
TrickyBugs: A Dataset of Corner-case Bugs in Plausible Programs. 113-117 - André Silva, Nuno Saavedra, Martin Monperrus:
GitBug-Java: A Reproducible Benchmark of Recent Java Bugs. 118-122 - Dirk Beyer, Lars Grunske, Matthias Kettl, Marian Lingsch Rosenfeld, Moeketsi Raselimo:
P3: A Dataset of Partial Program Patches. 123-127 - K. D. Pramod, W. T. N. De Silva, W. U. K. Thabrew, Ridwan Shariffdeen, Sandareka Wickramanayake:
BugsPHP: A dataset for Automated Program Repair in PHP. 128-132 - Zhipeng Liu, Meng Yan, Zhipeng Gao, Dong Li, Xiaohong Zhang, Dan Yang:
AW4C: A Commit-Aware C Dataset for Actionable Warning Identification. 133-137 - Suhaib Mujahid, Diego Elias Costa, Marco Castelluccio:
Predicting the Impact of Crashes Across Release Channels. 138-139 - Sangameshwar Patil, Balaraman Ravindran:
Zero-shot Learning based Alternatives for Class Imbalanced Learning Problem in Enterprise Software Defect Analysis. 140-141 - Liangxuan Wu, Yanjie Zhao, Xinyi Hou, Tianming Liu, Haoyu Wang:
ChatGPT Chats Decoded: Uncovering Prompt Patterns for Superior Solutions in Software Development Lifecycle. 142-146 - Konstantinos Moratis, Themistoklis Diamantopoulos, Dimitrios-Nikitas Nastos, Andreas L. Symeonidis:
Write me this Code: An Analysis of ChatGPT Quality for Producing Source Code. 147-151 - Mohammed Latif Siddiq, Lindsay Roney, Jiahao Zhang, Joanna C. S. Santos:
Quality Assessment of ChatGPT Generated Code and their Use by Developers. 152-156 - Balreet Grewal, Wentao Lu, Sarah Nadi, Cor-Paul Bezemer:
Analyzing Developer Use of ChatGPT Generated Code in Open Source GitHub Projects. 157-161 - Piotr Przymus, Mikolaj Fejzer, Jakub Narebski, Krzysztof Stencel:
How I Learned to Stop Worrying and Love ChatGPT. 162-166 - Kailun Jin, Chung-Yu Wang, Hung Viet Pham, Hadi Hemmati:
Can ChatGPT Support Developers? An Empirical Evaluation of Large Language Models for Code Generation. 167-171 - Rachna Raj, Diego Elias Costa:
The role of library versions in Developer-ChatGPT conversations. 172-176 - Md. Fazle Rabbi, Arifa I. Champa, Minhaz Fahim Zibran, Md. Rakibul Islam:
AI Writes, We Analyze: The ChatGPT Python Code Saga. 177-181 - Arifa I. Champa, Md. Fazle Rabbi, Costain Nachuma, Minhaz F. Zibran:
ChatGPT in Action: Analyzing Its Use in Software Development. 182-186 - Suad Mohamed, Abdullah Parvin, Esteban Parra:
Chatting with AI: Deciphering Developer Conversations with ChatGPT. 187-191 - Yue Zhang, Rachel Meredith, Wilson Reeves, Julia Coriolano, Muhammad Ali Babar, Akond Rahman:
Does Generative AI Generate Smells Related to Container Orchestration?: An Exploratory Study with Kubernetes Manifests. 192-196 - Ertugrul Sagdic, Arda Bayram, Md. Rakibul Islam:
On the Taxonomy of Developers' Discussion Topics with ChatGPT. 197-201 - Eman Abdullah AlOmar, Anushkrishna Venkatakrishnan, Mohamed Wiem Mkaouer, Christian D. Newman, Ali Ouni:
How to Refactor this Code? An Exploratory Study on Developer-ChatGPT Refactoring Conversations. 202-206 - Soham Deo, Divya Hinge, Omkar Sandip Chavan, Yaxuan Olivia Wang, Mohamed Wiem Mkaouer:
Analyzing Developer-ChatGPT Conversations for Software Refactoring: An Exploratory Study. 207-211 - Moataz Chouchen, Narjes Bessghaier, Mahi Begoug, Ali Ouni, Eman Abdullah AlOmar, Mohamed Wiem Mkaouer:
How Do Software Developers Use ChatGPT? An Exploratory Study on GitHub Pull Requests. 212-216 - Joy Krishan Das, Saikat Mondal, Chanchal K. Roy:
Investigating the Utility of ChatGPT in the Issue Tracking System: An Exploratory Study. 217-221 - Saikat Mondal, Suborno Deb Bappon, Chanchal K. Roy:
Enhancing User Interaction in ChatGPT: Characterizing and Consolidating Multiple Prompts for Issue Resolution. 222-226 - Tao Xiao, Christoph Treude, Hideaki Hata, Kenichi Matsumoto:
DevGPT: Studying Developer-ChatGPT Conversations. 227-230 - Giovanni Rosa, Simone Scalabrino, Gregorio Robles, Rocco Oliveto:
Not all Dockerfile Smells are the Same: An Empirical Evaluation of Hadolint Writing Practices by Experts. 231-241 - Anamaria-Roberta Preda, Christoph Mayr-Dorn, Atif Mashkoor, Alexander Egyed:
Supporting High-Level to Low-Level Requirements Coverage Reviewing with Large Language Models. 242-253 - Md. Anaytul Islam, Muhammad Asaduzzaman, Shaowei Wang:
On the Executability of R Markdown Files. 254-264 - Souhaila Serbout, Cesare Pautasso:
APIstic: A Large Collection of OpenAPI Metrics. 265-277 - Hong Yi Lin, Patanamon Thongtanunam, Christoph Treude, Wachiraphan Charoenwet:
Improving Automated Code Reviews: Learning from Experience. 278-283 - Tushar Sharma:
Multi-faceted Code Smell Detection at Scale using DesigniteJava 2.0. 284-288 - Edi Sutoyo, Andrea Capiluppi:
SATDAUG - A Balanced and Augmented Dataset for Detecting Self-Admitted Technical Debt. 289-293 - Mingzhao Liang, Wachiraphan Charoenwet, Patanamon Thongtanunam:
Curated Email-Based Code Reviews Datasets. 294-298 - André C. Hora:
TestDossier: A Dataset of Tested Values Automatically Extracted from Test Execution. 299-303 - Saurabhsingh Rajput, Maria Kechagia, Federica Sarro, Tushar Sharma:
Greenlight: Highlighting TensorFlow APIs Energy Footprint. 304-308 - Kesina Baral, Jack Johnson, Junayed Mahmud, Sabiha Salma, Mattia Fazzini, Julia Rubin, Jeff Offutt, Kevin Moran:
Automating GUI-based Test Oracles for Mobile Apps. 309-321 - Liu Wang, Conghui Zheng, Haoyu Wang, Xiapu Luo, Gareth Tyson, Yi Wang, Shangguang Wang:
Global Prosperity or Local Monopoly? Understanding the Geography of App Popularity. 322-334 - Sabiha Salma, S M Hasan Mansur, Yule Zhang, Kevin Moran:
GuiEvo: Automated Evolution of Mobile Application GUIs. 335-347 - Magdalena Steinböck, Jakob Bleier, Mikka Rainer, Tobias Urban, Christine Utz, Martina Lindorfer:
Comparing Apples to Androids: Discovery, Retrieval, and Matching of iOS and Android Apps for Cross-Platform Analyses. 348-360 - Elliott Wen, Jiaxiang Zhou, Xiapu Luo, Giovanni Russello, Jens Dietrich:
Keep Me Updated: An Empirical Study on Embedded JavaScript Engines in Android Apps. 361-372 - Sahrima Jannat Oishwee, Natalia Stakhanova, Zadia Codabux:
Large Language Model vs. Stack Overflow in Addressing Android Permission Related Challenges. 373-383 - Yasaman Abedini, Mohammad Hadi Hajihosseini, Abbas Heydarnoori:
DATAR: A Dataset for Tracking App Releases. 384-388 - Marco Alecci, Pedro Jesús Ruiz Jiménez, Kevin Allix, Tegawendé F. Bissyandé, Jacques Klein:
AndroZoo: A Retrospective with a Glimpse into the Future. 389-393 - Oseremen Joy Idialu, Noble Saji Mathews, Rungroj Maipradit, Joanne M. Atlee, Meiyappan Nagappan:
Whodunit: Classifying Code as Human Authored or GPT-4 generated- A case study on CodeChef problems. 394-406 - Nafiseh Nikeghbal, Amir Hossein Kargaran, Abbas Heydarnoori:
GIRT-Model: Automated Generation of Issue Report Templates. 407-418 - Ahmed Saeed Alsayed, Hoa Khanh Dam, Chau Nguyen:
MicroRec: Leveraging Large Language Models for Microservice Recommendation. 419-430 - Wenxin Jiang, Jerin Yasmin, Jason Jones, Nicholas Synovic, Jiashen Kuo, Nathaniel Bielanski, Yuan Tian, George K. Thiruvathukal, James C. Davis:
PeaTMOSS: A Dataset and Initial Analysis of Pre-Trained Models in Open-Source Software. 431-443 - Binger Chen, Jacek Golebiowski, Ziawasch Abedjan:
Data Augmentation for Supervised Code Translation Learning. 444-456 - Amir M. Mir, Mehdi Keshani, Sebastian Proksch:
On the Effectiveness of Machine Learning-based Call Graph Pruning: An Empirical Study. 457-468 - Giuseppe Colavito, Filippo Lanubile, Nicole Novielli, Luigi Quaranta:
Leveraging GPT-like LLMs to Automate Issue Labeling. 469-480 - Kei Koyanagi, Dong Wang, Kotaro Noguchi, Masanari Kondo, Alexander Serebrenik, Yasutaka Kamei, Naoyasu Ubayashi:
Exploring the Effect of Multiple Natural Languages on Code Suggestion Using GitHub Copilot. 481-486 - Md. Rakibul Islam, Md. Fazle Rabbi, Youngeun Jo, Arifa I. Champa, Ethan Young, Camden Wilson, Gavin Scott, Minhaz Fahim Zibran:
A Four-Dimension Gold Standard Dataset for Opinion Mining in Software Engineering. 487-491 - Anisha Islam, Kalvin Eng, Abram Hindle:
Opening the Valve on Pure-Data: Usage Patterns and Programming Practices of a Data-Flow Based Visual Programming Language. 492-497 - Daniel Sokolowski, David Spielmann, Guido Salvaneschi:
The PIPr Dataset of Public Infrastructure as Code Programs. 498-503 - Dario Amoroso d'Aragona, Alexander Bakhtin, Xiaozhou Li, Ruoyu Su, Lauren Adams, Ernesto Aponte, Francis Boyle, Patrick Boyle, Rachel Koerner, Joseph Lee, Fangchao Tian, Yuqing Wang, Jesse Nyyssölä, Ernesto Quevedo, Md Shahidur Rahaman, Amr S. Abdelfattah, Mika Mäntylä, Tomás Cerný, Davide Taibi:
A Dataset of Microservices-based Open-Source Projects. 504-509 - Christian Birchler, Cyrill Rohrbach, Timo Kehrer, Sebastiano Panichella:
SensoDat: Simulation-based Sensor Dataset of Self-driving Cars. 510-514 - Ramtin Ehsani, Mia Mohammad Imran, Robert Zita, Kostadin Damevski, Preetha Chatterjee:
Incivility in Open Source Projects: A Comprehensive Annotated Dataset of Locked GitHub Issue Threads. 515-519 - Davi Tabosa, Oton Pinheiro, Lincoln S. Rocha, Windson Viana:
A Dataset of Atoms of Confusion in the Android Open Source Project. 520-524 - Andrea D'Angelo, Claudio Di Sipio, Cristiano Politowski, Riccardo Rubei:
PlayMyData: a curated dataset of multi-platform video games. 525-529 - Margaret-Anne D. Storey:
Questioning the Questions We Ask About the Impact of AI on Software Engineering : MSR 2024 Keynote. 530 - Harshitha Menon, Daniel Nichols, Abhinav Bhatele, Todd Gamblin:
Learning to Predict and Improve Build Successes in Package Ecosystems. 531-542 - Ajiromola Kola-Olawuyi, Nimmi Rashinika Weeraddana, Meiyappan Nagappan:
The Impact of Code Ownership of DevOps Artefacts on the Outcome of DevOps CI Builds. 543-555 - Zhili Zeng, Tao Xiao, Maxime Lamothe, Hideaki Hata, Shane McIntosh:
A Mutation-Guided Assessment of Acceleration Approaches for Continuous Integration: An Empirical Study of YourBase. 556-568 - Nyyti Saarimäki, Sira Vegas, Valentina Lenarduzzi, Davide Taibi, Mikel Robredo:
Cohort Studies for Mining Software Repositories. 569-570 - Rosalia Tufano, Antonio Mastropaolo, Federica Pepe, Ozren Dabic, Massimiliano Di Penta, Gabriele Bavota:
Unveiling ChatGPT's Usage in Open Source Projects: A Mining-based Study. 571-583 - Emna Ksontini, Aycha Abid, Rania Khalsi, Marouane Kessentini:
DRMiner: A Tool For Identifying And Analyzing Refactorings In Dockerfile. 584-594 - Jiaqi Wu, Lingfeng Bao, Xiaohu Yang, Xin Xia, Xing Hu:
A Large-Scale Empirical Study of Open Source License Usage: Practices and Challenges. 595-606 - Joel Castaño, Silverio Martínez-Fernández, Xavier Franch, Justus Bogner:
Analyzing the Evolution and Maintenance of ML Models on Hugging Face. 607-618 - Florian Sihler, Lukas Pietzschmann, Raphael Straub, Matthias Tichy, Andor Diera, Abdelhalim Hafedh Dahou:
On the Anatomy of Real-World R Code for Static Analysis. 619-630 - Huy Nguyen, Christoph Treude, Patanamon Thongtanunam:
Encoding Version History Context for Better Code Representation. 631-636 - Martin Weyssow, Claudio Di Sipio, Davide Di Ruscio, Houari A. Sahraoui:
CodeLL: A Lifelong Learning Dataset to Support the Co-Evolution of Data and Language Models of Code. 637-641 - Daniel Garijo, Miguel Arroyo, Esteban González, Christoph Treude, Nicola Tarocco:
Bidirectional Paper-Repository Tracing in Software Engineering. 642-646 - Mojtaba Mostafavi Ghahfarokhi, Arash Asgari, Mohammad Abolnejadian, Abbas Heydarnoori:
DistilKaggle: A Distilled Dataset of Kaggle Jupyter Notebooks. 647-651 - Sophia Vargas, Georg J. P. Link, JaYoung Lee:
Estimating Usage Of Open Source Projects. 652-653 - Georges Aaron Randrianaina, Djamel Eddine Khelladi, Olivier Zendra, Mathieu Acher:
Options Matter: Documenting and Fixing Non-Reproducible Builds in Highly-Configurable Systems. 654-664 - João Helis Bernardo, Daniel Alencar da Costa, Sérgio Queiroz de Medeiros, Uirá Kulesza:
How do Machine Learning Projects use Continuous Integration Practices? An Empirical Study on GitHub Actions. 665-676 - Guillaume Cardoen, Tom Mens, Alexandre Decan:
A dataset of GitHub Actions workflow histories. 677-681 - Pooya Rostami Mazrae, Alexandre Decan, Tom Mens:
gawd: A Differencing Tool for GitHub Actions Workflows. 682-686 - Natarajan Chidambaram, Tom Mens, Alexandre Decan:
RABBIT: A tool for identifying bot accounts based on their recent GitHub event history. 687-691 - Hassan Onsori Delicheh, Alexandre Decan, Tom Mens:
Quantifying Security Issues in Reusable JavaScript Actions in GitHub Workflows. 692-703 - Nicolás E. Díaz Ferreyra, Mojtaba Shahin, Mansooreh Zahedi, Sodiq Quadri, Riccardo Scandariato:
What Can Self-Admitted Technical Debt Tell Us About Security? A Mixed-Methods Study. 704-715 - Triet Huynh Minh Le, Xiaoning Du, Muhammad Ali Babar:
Are Latent Vulnerabilities Hidden Gems for Software Vulnerability Prediction? An Empirical Study. 716-727 - Nusrat Zahan, Philipp Burckhardt, Mikola Lysenko, Feross Aboukhadijeh, Laurie A. Williams:
MalwareBench: Malware samples are not enough. 728-732 - Simone Scalco, Ranindya Paramitha:
Hash4Patch: A Lightweight Low False Positive Tool for Finding Vulnerability Patch Commits. 733-737 - Chao Ni, Liyu Shen, Xiaohu Yang, Yan Zhu, Shaohua Wang:
MegaVul: A C/C++ Vulnerability Dataset with Comprehensive Code Representations. 738-742