How Prompt Complexity Affects GPT-3.5 Mutation Generation Accuracy | HackerNoon
The comparisons show that GPT-3.5 achieves the highest bug detection rates on both Defects4J and ConDefects, which points to strong mutation generation capabilities.
Comparing Costs, Usability and Results Diversity of Mutation Testing Techniques | HackerNoon
The analysis shows that GPT-3.5 and CodeLlama-30bInstruct generate more mutations than traditional methods, but the traditional methods are significantly faster and cheaper.