GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

jailbreaking

Website
Wikipedia
https://static.github-zh.com/github_avatars/verazuo?size=40
verazuo / jailbreak_llms

#大语言模型#[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).

ChatGPTjailbreak大语言模型promptllm-securityjailbreaking
Jupyter Notebook 3.17 k
6 个月前
https://static.github-zh.com/github_avatars/cyberark?size=40
cyberark / FuzzyAI

#大语言模型#A powerful tool for automated LLM fuzzing. It is designed to help developers and security researchers identify and mitigate potential jailbreaks in their LLM APIs.

jailbreakjailbreaking大语言模型人工智能安全Fuzzing/Fuzz testingllm-evaluationllm-securityai-red-team
Jupyter Notebook 603
12 天前
https://static.github-zh.com/github_avatars/epeth0mus?size=40
epeth0mus / Fugu15

#IOS#Open Source iOS 15 - iOS 15.6 Jailbreak Project

iOSios15jailbreakjailbreaking
C 247
3 年前
https://static.github-zh.com/github_avatars/rubaljain?size=40
rubaljain / frida-jb-bypass

Frida script to bypass the iOS application Jailbreak Detection

fridajailbreakjailbreaking
JavaScript 78
6 年前
https://static.github-zh.com/github_avatars/tml-epfl?size=40
tml-epfl / llm-past-tense

Does Refusal Training in LLMs Generalize to the Past Tense? [ICLR 2025]

generalizationjailbreaking大语言模型robustness
Python 69
5 个月前
https://static.github-zh.com/github_avatars/doronz88?size=40
doronz88 / pylera1n

#IOS#Python adaptation for pelara1n

iOSiphonejailbreakjailbreakingPython命令行界面
Python 37
2 年前
https://static.github-zh.com/github_avatars/dobriban?size=40
dobriban / Principles-of-AI-LLMs

Materials for the course Principles of AI: LLMs at UPenn (Stat 9911, Spring 2025). LLM architectures, training paradigms (pre- and post-training, alignment), test-time computation, reasoning, safety a...

人工智能alignmentcircuits教学fine-tuninghallucinationinferenceinterpretabilityjailbreaking大语言模型rlhfrobustnesssafetytransformers
34
1 天前
https://static.github-zh.com/github_avatars/LylaCoding?size=40
LylaCoding / FriendGPT

#大语言模型#An extensive prompt to make a friendly persona from a chatbot-like model like ChatGPT

人工智能ChatGPTHackingjailbreaking
31
2 年前
https://static.github-zh.com/github_avatars/Decimation?size=40
Decimation / Cydia-GitHub-Template

Cydia repo

cydiaaptjailbreakingAppledebrepo
HTML 23
6 年前
https://static.github-zh.com/github_avatars/mehrankmlf?size=40
mehrankmlf / SecurityKit

Security Kit is a lightweight framework that helps to achieve a security layer

jailbreakobfuscationowasp逆向工程安全SwiftVirtual Private Networkcydiaencryption-decryptionjailbreaking
Swift 21
2 年前
https://static.github-zh.com/github_avatars/Dylbin?size=40
Dylbin / dylbin.github.io

#IOS#iOS APT distribution repository for rootful and rootless jailbreaks

iOSjailbreakcydiajailbreakingrootless
JavaScript 16
3 个月前
https://static.github-zh.com/github_avatars/Aeneon?size=40
Aeneon / TDK

During the Development of Suave7 and it's Predecessors, we've created a lot of Icons and UI-Images and we would like to share them with you. The Theme Developer Kit contains nearly 5.600 Icons, more t...

photoshopiconsImageios-uijailbreakjailbreakingthemetheme-development
15
3 个月前
https://static.github-zh.com/github_avatars/guillermo-moran?size=40
guillermo-moran / Eclipse-Dark-Mode

#IOS#Customizable Dark Mode Extension for iOS 13+

Dark ModeiOSjailbreakjailbreaking
Logos 14
5 年前
https://static.github-zh.com/github_avatars/hekatos?size=40
hekatos / tweaks

Source code for bypass tweaks hosted under https://github.com/hekatos/repo. Licensed under 0BSD except submodules

jailbreakingbypass
Logos 11
3 年前
https://static.github-zh.com/github_avatars/AetherPrior?size=40
AetherPrior / TrickLLM

#自然语言处理#This repository contains the code for the paper "Tricking LLMs into Disobedience: Formalizing, Analyzing, and Detecting Jailbreaks" by Abhinav Rao, Sachin Vashishta*, Atharva Naik*, Somak Aditya, and ...

jailbreaking大语言模型自然语言处理
Jupyter Notebook 8
1 年前
https://static.github-zh.com/github_avatars/FuturraGroup?size=40
FuturraGroup / SecurityKit

SecurityKit is a lightweight, easy-to-use Swift library that helps protect iOS apps according to the OWASP MASVS standard, chapter v8, providing an advanced security and anti-tampering layer.

cydiaencryption-decryptionjailbreakjailbreakingobfuscationowasp逆向工程安全SwiftVirtual Private Network
Swift 8
3 个月前
https://static.github-zh.com/github_avatars/Tobias-B-Besemer?size=40
Tobias-B-Besemer / Howto_-_iPhones

LV-Crew.org_(LVC)_-_Howto_-_iPhones

jailbreakjailbreakingiphonehowtohow-tohowtoshowto-tutorial
7
8 年前
https://static.github-zh.com/github_avatars/liuyaojialiuyaojia?size=40
liuyaojialiuyaojia / Awesome-LLM-Security-Paper

#大语言模型#Your best llm security paper library

data-extractionjailbreakingllm-securityagent大语言模型prompt-injection
6
9 个月前
https://static.github-zh.com/github_avatars/AmeliazOli?size=40
AmeliazOli / ChatGPT-Evil-Confidant-Mode

#大语言模型#"ChatGPT Evil Confidant Mode" delves into a controversial and unethical use of AI, highlighting how specific prompts can generate harmful and malicious responses from ChatGPT.

aitools聊天机器人ChatGPTChatGPT APIchatgpt3jailbreakjailbreakingopenaipromptprompts
5
1 年前
https://static.github-zh.com/github_avatars/AmeliazOli?size=40
AmeliazOli / ChatGPT-Developer-Mode

#大语言模型#ChatGPT Developer Mode is a jailbreak prompt introduced to perform additional modifications and customization of the OpenAI ChatGPT model.

aitoolaitoolsAndroidchatbot-applicationChatGPTChatGPT APIchatgpt-appchatgpt-botchatgpt-pluginchatgpt-pluginschatgpt3iOSjailbreakjailbreakingpromptpromptsWeb app
5
1 年前
loading...