Triaging runtime crash via LLM #253

fdt622 · 2024-05-07T13:38:13Z

Triaging runtime crash via LLM mentioned in #221

To reduce the workload of manually triaging runtime crash, we leverage LLM to identify the cause of crash. We pass fuzz target code and crash information to LLM, ask LLM with designed prompts,and save the reponse from LLM. The crash information contains stack trace and sanitizer output. We can also pass related project code to LLM in the future.

Conditions to LLM triaging

Currently, triaging runtime crash with LLM would be activated only when all the following conditions are met:

Build successfully.
Trigger crash.

Input to LLM triaging

To retrieve crash information from fuzz log, we add function extract_crash_info to class SemanticCheckResult in experiment/fuzz_target_error.py file. The retrieved information is stored in variable crash_info in class RunResult. The fuzz target code can be obtained from result/output-ProjectName-FunctionName/fixed_targets folder.

Prompt design

We require that the LLM: 1) definitively ascertain whether the crash is due to errors within the fuzz target or results from a vulnerability in the project under test; 2) deliver a thorough analysis of the findings. Two triage prompts, triage_priming.txt and triage_problem.txt, are appended to prompts/template_xml folder. Three functions, build_triage_prompt, _format_triage_priming, and _format_triage_problem, are added to llm_toolkit/prompt_builder.py file. The generated triage prompt would be saved in result/output-ProjectName-FunctionName/fixed_targets/TargetName-triage folder.

Below is a generated prompt example:

[{"role": "system", "content": "Given the following crash report and fuzz target, investigate the cause of the crash.\n"}, {"role": "user", "content": "Below is the crash report:\n<log>\n=================================================================\r\n\u001b[1m\u001b[31m==13==ERROR: AddressSanitizer: SEGV on unknown address 0x0000000003a7 (pc 0x000000572ba3 bp 0x7ffd25f0ba10 sp 0x7ffd25f0b970 T0)\r\n\u001b[1m\u001b[0m==13==The signal is caused by a READ memory access.\r\n==13==Hint: address points to the zero page.\r\nSCARINESS: 10 (null-deref)\r\n    #0 0x572ba3 in finalizeTable /src/liblouis/liblouis/compileTranslationTable.c:4377:13\r\n    #1 0x570e1c in _lou_getTranslationTable /src/liblouis/liblouis/compileTranslationTable.c:5059:8\r\n    #2 0x56c574 in LLVMFuzzerTestOneInput /src/liblouis/tests/fuzzing/table_fuzzer.cc:11:50\r\n    #3 0x43de33 in fuzzer::Fuzzer::ExecuteCallback(unsigned char const*, unsigned long) /src/llvm-project/compiler-rt/lib/fuzzer/FuzzerLoop.cpp:611:15\r\n    #4 0x43d61a in fuzzer::Fuzzer::RunOne(unsigned char const*, unsigned long, bool, fuzzer::InputInfo*, bool, bool*) /src/llvm-project/compiler-rt/lib/fuzzer/FuzzerLoop.cpp:514:3\r\n    #5 0x43ece9 in fuzzer::Fuzzer::MutateAndTestOne() /src/llvm-project/compiler-rt/lib/fuzzer/FuzzerLoop.cpp:757:19\r\n    #6 0x43f9b5 in fuzzer::Fuzzer::Loop(std::__Fuzzer::vector<fuzzer::SizedFile, std::__Fuzzer::allocator<fuzzer::SizedFile> >&) /src/llvm-project/compiler-rt/lib/fuzzer/FuzzerLoop.cpp:895:5\r\n    #7 0x42ed1f in fuzzer::FuzzerDriver(int*, char***, int (*)(unsigned char const*, unsigned long)) /src/llvm-project/compiler-rt/lib/fuzzer/FuzzerDriver.cpp:912:6\r\n    #8 0x458372 in main /src/llvm-project/compiler-rt/lib/fuzzer/FuzzerMain.cpp:20:10\r\n    #9 0x7f72da84a082 in __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x24082) (BuildId: 87b331c034a6458c64ce09c03939e947212e18ce)\r\n    #10 0x41f75d in _start (/out/table_fuzzer+0x41f75d)\r\n\r\nDEDUP_TOKEN: finalizeTable--_lou_getTranslationTable--LLVMFuzzerTestOneInput\r\nAddressSanitizer can not provide additional info.\r\nSUMMARY: AddressSanitizer: SEGV /src/liblouis/liblouis/compileTranslationTable.c:4377:13 in finalizeTable\r\n==13==ABORTING\r\nMS: 3 CMP-ChangeByte-InsertRepeatedBytes- DE: \"\\000\\000\\000\\001\"-; base unit: 9fe132914f92e1ad968c7b50f9c606d3922e86c6\r\n0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x0,0x23,0x20,0x5d,0x6b,0x6f,0x69,0x0,0x0,0x0,0x1,0x0,0x0,0x0,0x6f,0x4c,0x65,\r\n\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000\\000# ]koi\\000\\000\\000\\001\\000\\000\\000oLe\r\nartifact_prefix='./'; Test unit written to ./crash-578f79fcf8db7fb291a77fb9c20abf8c01eff949\r\nBase64: AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAACMgXWtvaQAAAAEAAABvTGU=\r\nstat::number_of_executed_units: 938\r\nstat::average_exec_per_sec:     0\r\nstat::new_units_added:          57\r\nstat::slowest_unit_time_sec:    0\r\nstat::peak_rss_mb:              46\r\n\n</log>\n\nBelow is the fuzz target:\n<code>\n#include <fuzzer/FuzzedDataProvider.h>\n#include <liblouis/internal.h>\n\nextern \"C\" int LLVMFuzzerTestOneInput(const uint8_t *data, size_t size) {\n    FuzzedDataProvider stream(data, size);\n\n    // Consume a random length string\n    std::string table_name = stream.ConsumeRandomLengthString();\n\n    // Call the _lou_getTranslationTable function\n    const TranslationTableHeader *table_header = _lou_getTranslationTable(table_name.c_str());\n\n    // Perform some operations with the table_header if needed\n    // For example, checking if the table_header is not NULL\n    if (table_header != nullptr) {\n        // Do something with the table_header\n    }\n\n    // Return 0 to indicate successful execution\n    return 0;\n}\n</code>\n\nTriage the crash:\nFirstly, you must clearly determine whether the crash is caused by errors within the fuzz target, such as incorrect function usage, improper parameter initializations, missing setup steps, etc., or if it stems from a vulnerability in the project under test.\nSecondly, you are required to provide a comprehensive analysis of your findings.\n<solution>"}]

LLM triaging

One function, triage_crash, is added to experiment/evaluator.py file. The core component of LLM triage, crash_triage.py, is appended to llm_toolkit folder. crash_triage.py mainly contains functions llm_triage and apply_llm_triage.

Response from LLM triaging

One function, parse_triage_response, is appended to llm_toolkit/output_parser.py file. The generated triage response from LLM would be saved in result/output-ProjectName-FunctionName/fixed_targets/TargetName-triage folder.

Below is a reponse example:

### Triage Analysis:

1. **Crash Cause:** The crash occurred in the `finalizeTable` function at line 4377 of `compileTranslationTable.c`. The crash is a segmentation fault (SEGV) caused by a READ memory access at an unknown address.

2. **Fuzz Target Analysis:**
   - The fuzz target reads input data using `FuzzedDataProvider` and extracts a random length string as `table_name`.
   - It then calls the `_lou_getTranslationTable` function with `table_name` as input.
   - The crash happens during the execution of the fuzz target, specifically within the `_lou_getTranslationTable` function.

3. **Root Cause Analysis:**
   - The crash is likely due to a bug in the `_lou_getTranslationTable` function or its interaction with the provided `table_name`.
   - The fuzz target does not seem to have any obvious issues that could directly cause a crash, such as invalid memory accesses or incorrect function usage.

4. **Next Steps:**
   - To investigate further, you need to analyze the implementation of the `_lou_getTranslationTable` function in the `compileTranslationTable.c` file.
   - Look for potential issues such as NULL pointer dereferences, out-of-bounds memory accesses, or incorrect handling of input data.
   - Consider running the fuzz target with different inputs to see if the crash is consistently reproducible.
   - Use debugging tools like AddressSanitizer, Valgrind, or GDB to pinpoint the exact cause of the crash within the `_lou_getTranslationTable` function.

5. **Recommendations:**
   - Review the implementation of `_lou_getTranslationTable` function for any potential vulnerabilities.
   - Validate input data handling within the function to ensure it can handle all possible inputs gracefully.
   - Test the function with various inputs, including edge cases, to uncover any hidden issues.
   - Consider adding additional checks and error handling mechanisms to prevent crashes in case of unexpected input scenarios.

6. **Conclusion:**
   - The crash is most likely caused by an issue within the `_lou_getTranslationTable` function or its interaction with the provided input data.
   - Further investigation into the function's implementation and input handling is necessary to identify and fix the root cause of the crash.

Additions within `results` folder

result/output-ProjectName-FunctionName/fixed_targets/TargetName-triage folder is created to store the generated prompt file and trirage rawoutput from LLM.

Additions within results report

Triage report is appended to results report web. We add a column Triage under Run logs to store triage reports. One function, get_triage, is added to report/web.py file.

Experiment based on the idea from google#221 and more.

DonggeLiu · 2024-05-21T04:24:37Z

BTW could you please also resolve the conflict if you have time?
Sorry that we pushed some changes before reviewing your code.

Related: https://github.com/google/oss-fuzz-gen/actions/runs/9175918219/job/25229988627

Collect code coverage for each of the successfully generated integrations. --------- Signed-off-by: David Korczynski <david@adalogics.com>

Signed-off-by: David Korczynski <david@adalogics.com>

…le#297) Signed-off-by: David Korczynski <david@adalogics.com>

- Adjusts the autogen.sh build to be more general (no need to have Makefile as this can be generated by the autogen.sh in some cases). - Uses a unified place for templating - Minor adjustment in directory name of shared folder "autogen-results-X" is now always just "autogen-results" --------- Signed-off-by: David Korczynski <david@adalogics.com>

fdt622 · 2024-05-29T04:13:18Z

BTW could you please also resolve the conflict if you have time? Sorry that we pushed some changes before reviewing your code.

Hi, I have modified and merged the code. There are two main changes.

One is to add a triage column to benchmark.html:
Mainly, I create class "TriageResult", add variable "triage" in evaluator.py and builder_runner.py.

The other is to add return statement at the final part of "if crash" condition:
https://github.com/fdt622/oss-fuzz-gen/blob/ade92be31ad1d106f6b7dcb4f1dbb1f73a7afec9/experiment/builder_runner.py#L332
This is to make sure ”crash_info“ is returned for further triage as long as the crash is triggered.

Please tell me if further changes are needed. Thank you.

fdt622 and others added 30 commits May 5, 2024 20:00

update gitignore

6464230

support azuregpt

80cf8b1

triage crash with llm, not been tested

32eb90d

Merge branch 'google:main' into main

cee82b0

fix triage condition

5efbf96

change regex for crash_info

5b2a89b

change regex for crash_info, change triage condition

5da55bb

change crash_info location

a55b0d6

debugging

bc8de3b

change regex for crash_info

3a8da5d

change regex for crash_info

7979b53

add todo

1a55de3

Merge branch 'google:main' into main

22d201f

add code format setting

e901d6e

modify triage prompt, supplement pr

a0a1774

add todo

d68fc4e

simplify

f848cd2

Create temp

32d1b0a

Experiment based on the idea from google#221 and more.

adjust triage dir

e976dfd

adjust triage dir

518bdf9

add example in pr

802e199

add triage to web

6c844db

fix get_triage failure

09adcb8

adjust triage conditions

dec34af

supplement web instructions in pr

9bcc46c

update pr

a3a9653

set NUM_EXP to 20

0306846

adjust triage conditions

278ecdc

setback NUM_EXP

683e662

ignore pr

4515e95

DonggeLiu and others added 28 commits May 21, 2024 23:55

experimental: add new issue trophies (google#288)

39bebba

Update vulnerable requests package (google#290)

e116dbc

Related: https://github.com/google/oss-fuzz-gen/actions/runs/9175918219/job/25229988627

experimental: add coverage stats (google#282)

92317fe

Collect code coverage for each of the successfully generated integrations. --------- Signed-off-by: David Korczynski <david@adalogics.com>

experimental: add two new build heuristics (google#292)

a3a2fbd

Signed-off-by: David Korczynski <david@adalogics.com>

delete .DS_Store in .gitignore

eafb74e

create namedtuple 'ParseResult'

bf8b5a2

move crash_info

4d829db

simplify 'if condition' in 'triage_crash'

d86d19e

use single quotes

07d2cf4

keep comment consistency

a4c5155

simplify 'if condition' in 'extract_crash_info'

aa314d1

name crash_triager

f312ba1

use single quote

305aa43

delete triage_all_crashes

d70f71c

change print to logging

de2744a

use single quote

c8a27ae

use single quote

756ac4d

run_all_experiments: fix project lang when generating benchmark (goog…

338d19b

…le#297) Signed-off-by: David Korczynski <david@adalogics.com>

change triage template

0de21f1

modify triage location in web report

31771fc

add triage result at benchmark.html

c2769ea

add warning when fail to match crash information

23543ff

merge

87e6755

try to fix triage's json serializ bug

19745ab

if crash, must give crash_info to triage

34750c5

add empty line after triage part on web

ade92be

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Triaging runtime crash via LLM #253

Triaging runtime crash via LLM #253

fdt622 commented May 7, 2024

DonggeLiu commented May 21, 2024

fdt622 commented May 29, 2024

Triaging runtime crash via LLM #253

Are you sure you want to change the base?

Triaging runtime crash via LLM #253

Conversation

fdt622 commented May 7, 2024

Triaging runtime crash via LLM mentioned in #221

Conditions to LLM triaging

Input to LLM triaging

Prompt design

LLM triaging

Response from LLM triaging

Additions within results folder

Additions within results report

DonggeLiu commented May 21, 2024

fdt622 commented May 29, 2024

Additions within `results` folder