jailbreaking

verazuo / jailbreak_llms

#Large language models# [CCS'24] A dataset consisting of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

ChatGPT, jailbreak, large language models, prompt, llm-security, jailbreaking
Jupyter Notebook 3.23k
7 months ago
cyberark / FuzzyAI

#Large language models# A powerful tool for automated LLM fuzzing. It is designed to help developers and security researchers identify and mitigate potential jailbreaks in their LLM APIs.

jailbreak, jailbreaking, large language models, artificial intelligence, security, Fuzzing/Fuzz testing, llm-evaluation, llm-security, ai-red-team
Jupyter Notebook 664
18 days ago
epeth0mus / Fugu15

#iOS# Open Source iOS 15 - iOS 15.6 Jailbreak Project

iOS, ios15, jailbreak, jailbreaking
C 247
3 years ago
rubaljain / frida-jb-bypass

Frida script to bypass jailbreak detection in iOS applications

frida, jailbreak, jailbreaking
JavaScript 78
6 years ago
tml-epfl / llm-past-tense

Does Refusal Training in LLMs Generalize to the Past Tense? [ICLR 2025]

generalization, jailbreaking, large language models, robustness
Python 70
6 months ago
doronz88 / pylera1n

#iOS# Python adaptation for palera1n

iOS, iphone, jailbreak, jailbreaking, Python, command-line interface
Python 38
3 years ago
dobriban / Principles-of-AI-LLMs

Materials for the course Principles of AI: LLMs at UPenn (Stat 9911, Spring 2025). LLM architectures, training paradigms (pre- and post-training, alignment), test-time computation, reasoning, safety a...

artificial intelligence, alignment, circuits, teaching, fine-tuning, hallucination, inference, interpretability, jailbreaking, large language models, rlhf, robustness, safety, transformers
38
2 months ago
LylaCoding / FriendGPT

#Large language models# An extensive prompt to make a friendly persona from a chatbot-like model such as ChatGPT

artificial intelligence, ChatGPT, Hacking, jailbreaking
31
2 years ago
Decimation / Cydia-GitHub-Template

Cydia repo

cydia, apt, jailbreaking, Apple, deb, repo
HTML 23
6 years ago
mehrankmlf / SecurityKit

Security Kit is a lightweight framework that helps to achieve a security layer

jailbreak, obfuscation, owasp, reverse engineering, security, Swift, Virtual Private Network, cydia, encryption-decryption, jailbreaking
Swift 22
2 years ago
Dylbin / dylbin.github.io

#iOS# APT distribution repository for jailbroken iOS devices (rootless / rootful).

iOS, jailbreak, cydia, jailbreaking, rootless
JavaScript 17
14 days ago
Aeneon / TDK

During the development of Suave7 and its predecessors, we've created a lot of icons and UI images, and we would like to share them with you. The Theme Developer Kit contains nearly 5,600 icons, more t...

photoshop, icons, Image, ios-ui, jailbreak, jailbreaking, theme, theme-development
15
22 days ago
guillermo-moran / Eclipse-Dark-Mode

#iOS# Customizable Dark Mode Extension for iOS 13+

Dark Mode, iOS, jailbreak, jailbreaking
Logos 14
5 years ago
amazon-science / TurboFuzzLLM

TurboFuzzLLM: Turbocharging Mutation-based Fuzzing for Effectively Jailbreaking Large Language Models in Practice

ai-safety, guardrails, jailbreaking, large-language-models, red-teaming, responsible-ai
Python 12
10 days ago
hekatos / tweaks

Source code for bypass tweaks hosted under https://github.com/hekatos/repo. Licensed under 0BSD except submodules

jailbreaking, bypass
Logos 11
3 years ago
FuturraGroup / SecurityKit

SecurityKit is a lightweight, easy-to-use Swift library that helps protect iOS apps according to the OWASP MASVS standard, chapter v8, providing an advanced security and anti-tampering layer.

cydia, encryption-decryption, jailbreak, jailbreaking, obfuscation, owasp, reverse engineering, security, Swift, Virtual Private Network
Swift 11
5 months ago
AetherPrior / TrickLLM

#Natural language processing# This repository contains the code for the paper "Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks" by Abhinav Rao, Sachin Vashishta*, Atharva Naik*, Somak Aditya, and ...

jailbreaking, large language models, natural language processing
Jupyter Notebook 8
1 year ago
Tobias-B-Besemer / Howto_-_iPhones

LV-Crew.org_(LVC)_-_Howto_-_iPhones

jailbreak, jailbreaking, iphone, howto, how-to, howtos, howto-tutorial
7
8 years ago
liuyaojialiuyaojia / Awesome-LLM-Security-Paper

#Large language models# Your best LLM security paper library

data-extraction, jailbreaking, llm-security, agent, large language models, prompt-injection
6
10 months ago
AmeliazOli / ChatGPT-Evil-Confidant-Mode

#Large language models# "ChatGPT Evil Confidant Mode" delves into a controversial and unethical use of AI, highlighting how specific prompts can generate harmful and malicious responses from ChatGPT.

aitools, chatbot, ChatGPT, ChatGPT API, chatgpt3, jailbreak, jailbreaking, openai, prompt, prompts
6
1 year ago