LLM4SDR

LLM4SDR is a novel approach that leverages Large Language Models (LLMs) to fully automate the construction of open-source software defect repositories. It systematically addresses key challenges in repository construction through three main phases:

Data Preparation LLM4SDR uses LLMs to generate high-quality commit descriptions by synthesizing information from commit messages, issue reports, pull requests, and related comments. This ensures that commit messages are accurate and informative, even when the original messages are incomplete or ambiguous.

Defect Patch Identification To detect defect-related (bug-fixing) patches, LLM4SDR employs a Random Forest (RF) model that uses diverse features, including code diff metrics and analyses generated by LLMs and the static analysis tool Semgrep . Combining these sources improves precision and recall in patch detection.

Critical Variable Identification LLM4SDR identifies variables related to software defects by combining a patch-based technique with LLM-driven refinement. The LLM filters and augments candidate variables to produce a final set of critical variables that directly contribute to defect introduction and repair.

Important document description

run_llm_message.py: Leverages LLMs to integrate information from multiple sources and generate detailed commit descriptions.

llm_analyzer.py: Analyzing commits using a large model.

train.py: Train a classifier model.

keyvar_extractor_llm.py: Using large models to assist in extracting key variables.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
GitHubAPI.py		GitHubAPI.py
LICENSE		LICENSE
README.md		README.md
keyvar_extractor_llm.py		keyvar_extractor_llm.py
llm_analyzer.py		llm_analyzer.py
llm_request.py		llm_request.py
method_extractor.py		method_extractor.py
patch_prompts.py		patch_prompts.py
prompt_cv.py		prompt_cv.py
run_llm_message.py		run_llm_message.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LLM4SDR

Important document description

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LLM4SDR

Important document description

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages