jailbreaking · GitHub Topics

#Large Language Models#[CCS'24] A dataset consisting of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

ChatGPT jailbreak Large Language Models prompt llm-security jailbreaking

Jupyter Notebook 3.23 k

7 months ago

cyberark / FuzzyAI

#Large Language Models#A powerful tool for automated LLM fuzzing. It is designed to help developers and security researchers identify and mitigate potential jailbreaks in their LLM APIs.

jailbreak jailbreaking Large Language Models Artificial Intelligence Security Fuzzing/Fuzz testing llm-evaluation llm-security ai-red-team

Jupyter Notebook 664

18 days ago

epeth0mus / Fugu15

#IOS#Open Source iOS 15 - iOS 15.6 Jailbreak Project

iOS ios15 jailbreak jailbreaking

C 247

3 years ago

rubaljain / frida-jb-bypass

Frida script to bypass jailbreak detection in iOS applications

frida jailbreak jailbreaking

JavaScript 78

6 years ago

tml-epfl / llm-past-tense

Does Refusal Training in LLMs Generalize to the Past Tense? [ICLR 2025]

generalization jailbreaking Large Language Models robustness

Python 70

6 months ago

doronz88 / pylera1n

#IOS#Python adaptation for palera1n

iOS iphone jailbreak jailbreaking Python Command-line Interface

Python 38

3 years ago

dobriban / Principles-of-AI-LLMs

Materials for the course Principles of AI: LLMs at UPenn (Stat 9911, Spring 2025). LLM architectures, training paradigms (pre- and post-training, alignment), test-time computation, reasoning, safety a...

Artificial Intelligence alignment circuits Teaching fine-tuning hallucination inference interpretability jailbreaking Large Language Models rlhf robustness safety transformers

2 months ago

LylaCoding / FriendGPT

#Large Language Models#An extensive prompt for creating a friendly persona from a chatbot-style model such as ChatGPT

Artificial Intelligence ChatGPT Hacking jailbreaking

2 years ago

Decimation / Cydia-GitHub-Template

Cydia repo

cydia apt jailbreaking Apple deb repo

HTML 23

6 years ago

mehrankmlf / SecurityKit

Security Kit is a lightweight framework that helps to achieve a security layer

jailbreak obfuscation owasp Reverse Engineering Security Swift Virtual Private Network cydia encryption-decryption jailbreaking

Swift 22

2 years ago

Dylbin / dylbin.github.io

#IOS#APT distribution repository for jailbroken iOS devices (rootless / rootful).

iOS jailbreak cydia jailbreaking rootless

JavaScript 17

14 days ago

Aeneon / TDK

During the development of Suave7 and its predecessors, we've created a lot of icons and UI images and we would like to share them with you. The Theme Developer Kit contains nearly 5,600 icons, more t...

photoshop icons Image ios-ui jailbreak jailbreaking theme theme-development

22 days ago

guillermo-moran / Eclipse-Dark-Mode

#IOS#Customizable Dark Mode Extension for iOS 13+

Dark Mode iOS jailbreak jailbreaking

Logos 14

5 years ago

amazon-science / TurboFuzzLLM

TurboFuzzLLM: Turbocharging Mutation-based Fuzzing for Effectively Jailbreaking Large Language Models in Practice

ai-safety guardrails jailbreaking large-language-models red-teaming responsible-ai

Python 12

10 days ago

hekatos / tweaks

Source code for bypass tweaks hosted under https://github.com/hekatos/repo. Licensed under 0BSD except submodules

jailbreaking bypass

Logos 11

3 years ago

FuturraGroup / SecurityKit

SecurityKit is a lightweight, easy-to-use Swift library that helps protect iOS apps according to the OWASP MASVS standard, chapter v8, providing an advanced security and anti-tampering layer.

cydia encryption-decryption jailbreak jailbreaking obfuscation owasp Reverse Engineering Security Swift Virtual Private Network

Swift 11

5 months ago

AetherPrior / TrickLLM

#Natural Language Processing#This repository contains the code for the paper "Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks" by Abhinav Rao, Sachin Vashishta*, Atharva Naik*, Somak Aditya, and ...

jailbreaking Large Language Models Natural Language Processing

Jupyter Notebook 8

1 year ago

Tobias-B-Besemer / Howto_-_iPhones

LV-Crew.org_(LVC)_-_Howto_-_iPhones

jailbreak jailbreaking iphone howto how-to howtos howto-tutorial

8 years ago

liuyaojialiuyaojia / Awesome-LLM-Security-Paper

#Large Language Models#Your best LLM security paper library

data-extraction jailbreaking llm-security agent Large Language Models prompt-injection

10 months ago

AmeliazOli / ChatGPT-Evil-Confidant-Mode

#Large Language Models#"ChatGPT Evil Confidant Mode" delves into a controversial and unethical use of AI, highlighting how specific prompts can generate harmful and malicious responses from ChatGPT.

aitools Chatbot ChatGPT ChatGPT API chatgpt3 jailbreak jailbreaking openai prompt prompts

1 year ago