ARTKIT Gandalf Challenge Download - ARTKIT Gandalf Challenge Source code download

English

中文(简体) 中文(繁体) 한국어 日本語 English Português Español Русский العربية Indonesia Deutsch Français ภาษาไทย

Home>Programming related>AI Source Code

ARTKIT Gandalf Challenge

AI Source Code

1.0.0

Download

Exposing Jailbreak Vulnerabilities in LLM Applications with ARTKIT

Automated prompt-based testing to extract passwords from the Gandalf Challenge's LLM system

Link to article: https://towardsdatascience.com/exposing-jailbreak-vulnerabilities-in-llm-applications-with-artkit-d2df5f56ece8

Background

As large language models (LLMs) become more widely adopted across different industries and domains, significant security risks have emerged and intensified. Several of these key concerns include breaches of data privacy, the potential for biases, and the risk of information manipulation.
Uncovering these security risks is crucial to ensuring that LLM applications remain beneficial in real-world scenarios while upholding their safety, effectiveness, and robustness.
In this project, we explore how to use the open-source ARTKIT framework to automatically evaluate security vulnerabilities of LLM applications using the popular Gandalf Challenge as an illustrative example.

Alt text

Files

gandalf_challenge.ipynb: Jupyter notebook containing the codes for the walkthrough

References

Official ARTKIT GitHub Repo
Play the Gandalf Challenge

Acknowledgements

Special thanks to Sean Anggani, Andy Moon, Matthew Wong, Randi Griffin, and Andrea Gao!

Expand

Additional Information

Version 1.0.0
Type AI Source Code
Update Time 2024-12-16
size 582.66KB
From Github

Related Applications

PBA Bowling Challenge latest version

2023-10-12
Sniper Challenge game

2023-08-27
WANNABE CHALLENGE

2023-04-08
Wheelie Challenge Chinese version

2023-04-07
BSL Winter Games Challenge

2022-08-20
Running Challenge

2022-07-29

Recommended for You

chat.petals.dev

Other source code

1.0.0
GPT Prompt Templates

Other source code

1.0.0
GPTyped

Other source code

GPTyped 1.0.5
node telegram bot api

AI Source Code

v0.50.0
typebot.io

AI Source Code

v3.1.2
python wechaty getting started

AI Source Code

1.0.0
waymo open dataset

Other source code

December 2023 Update
termwind

Other categories

v2.3.0
wp functions

Other categories

1.0.0

Related Information All