Logging System
Our logs contain system prompts, user prompts, model outputs, and debugging information.
- System Prompts: Each log starts with the system prompt, which introduces the CTF and the specifics of the task.
- User Prompts: Describe the challenge and instruct the model to install necessary packages or connect to a container server.
- Model Outputs: The model's responses to the user prompts.
- Debugging Information: Debug messages and output from external tools.
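As a rough illustration, a single log can be thought of as holding these four sections. The Python dict layout and field names below are assumptions made for illustration only, not the exact on-disk format the framework writes.

```python
# Hypothetical shape of one log record (illustrative only; the real
# format produced by the framework may differ).
example_log = {
    "system_prompt": "You are solving a CTF challenge...",  # introduces the CTF and task specifics
    "user_prompts": [
        "Challenge description and setup instructions...",  # e.g. install packages, connect to container
    ],
    "model_outputs": [
        "Model response to each user prompt...",
    ],
    "debug": [
        "Debug messages and output from external tools...",
    ],
}
```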
We can analyze the logs using the provided script:
python scripts/log_summary.py -m {MODEL_NAME} -l {PATH_TO_LOGS} -t {CHALLENGE_CATEGORY}
- {MODEL_NAME}: The model that was used.
- {PATH_TO_LOGS}: The path to the logs, which is introduced here.
- {CHALLENGE_CATEGORY}: Optional. If omitted, the script analyzes all logs under the given log path; if provided, choose one of six categories: crypto, forensics, pwn, reverse, web, misc.
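For example, a concrete invocation could look like the line below; the model name and log path are placeholders chosen for illustration, not values prescribed by the framework.

```
python scripts/log_summary.py -m gpt-4 -l logs/gpt-4/ -t pwn
```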
For each challenge, the analysis reports the challenge name, its solve rate, and any mistakes or reasons the challenge was not solved. This helps you evaluate and measure model performance.
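To make the aggregation concrete, here is a minimal sketch of how a per-challenge solve rate could be computed from a directory of logs. It assumes each log is a JSON file with "challenge" and "solved" fields; those field names, like the logs/ directory, are assumptions for illustration rather than the actual schema used by scripts/log_summary.py.

```python
import json
from collections import defaultdict
from pathlib import Path

def solve_rates(log_dir: str) -> dict[str, float]:
    """Compute the fraction of solved attempts per challenge (illustrative sketch)."""
    attempts = defaultdict(int)
    solves = defaultdict(int)
    for path in Path(log_dir).glob("*.json"):
        record = json.loads(path.read_text())
        name = record["challenge"]            # assumed field: challenge name
        attempts[name] += 1
        solves[name] += int(record.get("solved", False))  # assumed field: solved flag
    return {name: solves[name] / attempts[name] for name in attempts}

if __name__ == "__main__":
    for challenge, rate in solve_rates("logs").items():
        print(f"{challenge}: {rate:.0%} solved")
```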