Test your prompts, agents, and RAGs. AI red teaming, pentesting, and vulnerability scanning for LLMs. Compare the performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with comma...
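The "simple declarative configs" mentioned above can be sketched as follows; this is a minimal illustrative example, and the model IDs and assertion values are assumptions, not taken from the original project:

```yaml
# promptfooconfig.yaml -- hypothetical sketch; check the promptfoo docs for the current schema
prompts:
  - "Summarize the following text in one sentence: {{text}}"

providers:
  # assumed provider identifiers for comparing two models side by side
  - openai:gpt-4o-mini
  - anthropic:messages:claude-3-5-sonnet-latest

tests:
  - vars:
      text: "Declarative configs let you compare LLM outputs without writing code."
    assert:
      - type: contains
        value: "LLM"
```

Running `promptfoo eval` against a file like this would evaluate every prompt/test pair on each provider and report the comparison.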
Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪
LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.
Test, compare, and optimize your AI prompts in minutes
The prompt engineering, prompt management, and prompt evaluation tool for TypeScript, JavaScript, and Node.js.
LLM Prompt Test helps you test Large Language Model (LLM) prompts to ensure they consistently meet your expectations.
Community Plugin for Genkit to use Promptfoo
Sample project demonstrating how to use Promptfoo, a test framework for evaluating the output of generative AI models.
A collection of prompts that I use on a day-to-day basis for work and leisure.
Quickstart guide for using PromptFoo to evaluate LLM prompts via CLI or Colab.
A pytest-based framework for testing multi-agent AI systems. It provides a flexible and extensible platform for complex multi-agent simulations. Supports many integrations like LiteLLM, CrewAI, LangC...
Sample implementation demonstrating how to use Firebase Genkit with Promptfoo
A dynamic and interactive playground for testing and refining prompts with OpenAI's language models. Includes customizable inputs for prompts, advanced model settings, and live response streaming for ...
🐙 Team Agents unifies 82 AI specialists to solve challenges with intelligent chat, a requirements analyst, and document upload. A futuristic, modular platform.
EvalWise is a developer-friendly platform for LLM evaluation and red teaming that helps test AI models for safety, compliance, and performance issues.