Date of Graduation

5-2024

Document Type

Thesis

Degree Name

Bachelor of Science in Computer Science

Degree Level

Undergraduate

Department

Computer Science and Computer Engineering

Advisor/Mentor

Li, Qinghua

Committee Member

Gauch, Susan

Second Committee Member

Panda, Brajendra

Abstract

Artificial intelligence has progressed rapidly in recent years, greatly revolutionizing the world and helping to automate increasingly complex tasks. However, there are still some disciplines where the problem of automation has not yet been thoroughly tackled, such as in software vulnerability management. Vulnerability management is a critical component of cybersecurity for an organization. Until recently, learning about a vulnerability required a security operator to manually search and read through online information and security advisories to find needed information. Doing so is a significantly time-consuming task, as these advisories tend to be quite lengthy and packed with information about various technologies. To simplify this process, we first present a new approach that can extract crucial information about a vulnerability from provided documents using large language models (LLMs) such as ChatGPT, generating a concise summary of the vulnerability from the documents. This approach can save much of the operator’s time that would need to be spent reading the lengthy documents. While this approach works well for those that already have documents about vulnerabilities to summarize, operators still need to find relevant documents for the vulnerability of interest first, e.g., through web search. To further automate this process, we then introduce a modified implementation of the basic summarization approach as a web browser extension, which can automatically generate a summary about a vulnerability of interest from web search results. That way, an operator only needs to manually type a search about a vulnerability, e.g., “CVE-2023-31147”, in a search engine, and then our solution can automatically generate a concise summary of valuable information about the vulnerability. This enhancement integrates a robust and automated source gathering method and introduces several new filtering parameters that allow users to specify requirements pertaining to the documents and summaries generated by the approach. The results of these approaches are then evaluated on their ability to include various important pieces of information in the generated summaries. We find that the basic approach of summarizing over provided documents performs exceptionally well. The web browser extension has a slight decrease in accuracy as the cost of automation but is still very accurate.

Keywords

Large Language Models; Prompt Engineering; Automatic Summarization; Vulnerability Management

Citation

Elder, W. (2024). Automatic Extraction of Vulnerability Information for Security Operators Using GPT Models. Computer Science and Computer Engineering Undergraduate Honors Theses Retrieved from https://scholarworks.uark.edu/csceuht/137

Download

Included in

Artificial Intelligence and Robotics Commons, Software Engineering Commons

COinS

Computer Science and Computer Engineering Undergraduate Honors Theses

Automatic Extraction of Vulnerability Information for Security Operators Using GPT Models

Date of Graduation

Document Type

Degree Name

Degree Level

Department

Advisor/Mentor

Committee Member

Second Committee Member

Abstract

Keywords

Citation

Included in

Search

Links

Browse

Contact Us

Computer Science and Computer Engineering Undergraduate Honors Theses

Automatic Extraction of Vulnerability Information for Security Operators Using GPT Models

Author

Date of Graduation

Document Type

Degree Name

Degree Level

Department

Advisor/Mentor

Committee Member

Second Committee Member

Abstract

Keywords

Citation

Included in

Share

Search

Links

Browse

Contact Us