<aside> 💡 Source Code for the project here: https://github.com/AstuteSource/SEERS

</aside>

GitHub - AstuteSource/SEERS: This repository serves as the public workspace for the software engineering cohort of the CIS department's 2024 Junior Seminar course.

Challenge: Mutatino testing, a valuable software testing technique, can be computationally expensive. Companies often spend significant time and resources running mutation tests on their code

Task: In collaboration with another student and Dr.Gregory M. Kapfhammer, I aimed to develop a tool that could predict mutation scores earlier in the testing process. This would allow developers to focus their efforts on mutations more likely to reveal defects and improve overall testing efficiency

Action: We focus on Python projects and investigated the use of machine learning models to predict mutation scores based on the analysis of code anti-patterns.

Screenshot 2024-05-23 at 10.50.24 PM.png

Screenshot 2024-05-23 at 10.50.37 PM.png

DETAILED REPORT

  1. Data Collection and Preprocessing

The dataset for study comprised 323 functions extracted from 12 open-source Python projects. Each function was analyzed to extract relevant information, including its code, mutation source, and a list of identified code patterns.

Data Preprocessing:

Data Preprocessing

  1. Pattern Extraction and Refinement

The initial set of code patterns exhibited several issues: