AI-assisted software evolution
My current projects focus on generative AI for code translation, library replacement, architecture modernization, and the responsible integration of AI into software engineering workflows.
Software Engineering Researcher
I study how software is built, maintained, reused, and evolved across organizations and open source ecosystems.
I am a Member of Technical Staff Researcher at the Software Engineering Institute at Carnegie Mellon University, working in the AI Workflow and Architecture Modernization group. My research connects empirical software engineering, software architecture, open source and InnerSource development, mining software repositories, and applied AI for software evolution.
Contact me at tapajitdey [at] cmu [dot] edu
Bio
Tapajit Dey is a Member of Technical Staff Researcher at the Software Engineering Institute at Carnegie Mellon University, working in the AI Workflow and Architecture Modernization group. His current work includes applied research on software architecture, large-scale refactoring, software project selection, open source project categorization, and generative AI for software translation and library replacement.
He completed his Ph.D. in Computer Science at the University of Tennessee, Knoxville with Dr. Audris Mockus, where his research focused on applying data mining and empirical software engineering techniques to analyze software ecosystems and software supply chains. Before joining SEI, he was a Postdoctoral Researcher and later Research Fellow at Lero, the Science Foundation Ireland Research Centre for Software at the University of Limerick, working with Prof. Brian Fitzgerald on InnerSource and open source software development.
He was one of the founding members of the Lero Open Source Program Office. He received his Bachelor's and Master's degrees under the dual-degree program from the Indian Institute of Technology, Kharagpur in Electronics and Electrical Communication Engineering and worked at IBM India Software Lab for three years before starting his Ph.D.
Upcoming
Artifacts Evaluation Co-Chair for the ICSE 2027 Artifact Evaluation Track.
Accepted paper at ICSE 2026 SEET: Prompting Without Principles: Are Students Transferring Software Engineering Knowledge to LLM Use?
Program committee member for the MSR 2026 Technical Papers Track.
Program committee member for the ESEC/FSE 2026 Tool Demonstrations Track.
Research
My current projects focus on generative AI for code translation, library replacement, architecture modernization, and the responsible integration of AI into software engineering workflows.
Applied research on architecture design, analysis, refactoring, and validation workflows for large and complex software systems, now in the context of AI workflow and architecture modernization.
Large-scale empirical studies of open source supply chains, project selection, project health, contributor behavior, and ecosystem structure.
Research on InnerSource adoption, incentives, newcomer pathways, and organizational practices for collaborative software development.
Projects
SEI-CMU project team member. Fiscal Year 2025 project, funded at $1.5M, investigating how generative AI can support software translation across languages at scale. Contributed to the funding proposal with James Ivers.
SEI-CMU project team member. Fiscal Year 2024 project, funded at $500K, exploring generative AI for automating large-scale library replacement. Contributed to the funding proposal with Ipek Ozkaya, James Ivers, and Robert Edman.
At Lero, led and contributed to work on InnerSource adoption and open source newcomers within the €6M Trustworthy Responsible Efficient Engineering of Software research program, in collaboration with industry partners.
Activity coordinator for TROPIC, a €190K National Open Research Forum-funded program to develop and roll out open research training across Ireland and across disciplines.
Teaching & Mentoring
Selected Publications
Tapajit Dey, Brian Fitzgerald, and Sherae Daniel. HICSS 2025.
Tapajit Dey, Jonathan Loungani, and James Ivers. PROMISE 2024.
Robert Healy, Kieran Conboy, Tapajit Dey, Edwin Lewzey, and Brian Fitzgerald. XP 2024.
Robert Healy, Tapajit Dey, Kieran Conboy, and Brian Fitzgerald. XP 2023.
Tapajit Dey, Willem Jiang, and Brian Fitzgerald. IEEE Software, 2022.
Tapajit Dey, Andrey Karnauch, and Audris Mockus. ICSE 2021.
For the full and current publication record, see Google Scholar, DBLP, and my CV.
Publication Records
| Work | Venue | Records |
|---|---|---|
| Why Program Increment Estimates Miss: Hidden Variables for Planning | Software Excellence Alliance Monthly Tech Talk, February 2026 | Video |
| CROSS: A Contributor-Project Interaction Lifecycle Model for Open Source Software | HICSS 2025 | Preprint |
| Smarter Project Selection for Software Engineering Research | PROMISE 2024 | DOI |
| Comparing Stability and Sustainability in Agile Systems | XP 2024 | Publication |
| Representation of Developer Expertise in Open Source Software | ICSE 2021 | DOI Preprint Slides Video Bib |
| Effect of Technical and Social Factors on Pull Request Quality for the NPM Ecosystem | ESEM 2020 | DOI Preprint Slides Video Bib |
| Detecting and Characterizing Bots That Commit Code | MSR 2020 | DOI Preprint Slides Video Bib |
| An Exploratory Study of Bot Commits | ICSE Workshops 2020 | DOI Preprint Slides Video Bib |
| Patterns of Effort Contribution and Demand in the NPM Ecosystem | PROMISE 2019 | DOI Preprint Slides Bib |
| Modeling Relationship between Post-Release Faults and Usage in Mobile Software | PROMISE 2018 | DOI Preprint Slides Bib |
| Are Software Dependency Supply Chain Metrics Useful in Predicting Change of Popularity of NPM Packages? | PROMISE 2018 | DOI Preprint Slides Bib |
The earlier generated publication pages are preserved as source evidence in the full publication record and the dataset and preprint record.
Recognitions
Awarded to the Lero Open Science Committee by the Young European Research Universities Network. Award announcement
Awarded for Patterns of Effort Contribution and Demand and User Classification based on Participation Patterns in NPM Ecosystem. Conference record
University of Tennessee, Knoxville, College of Engineering.
Service