GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

prompt-testing

Website
Wikipedia
https://static.github-zh.com/github_avatars/promptfoo?size=40
promptfoo / promptfoo

#大语言模型#Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with comma...

大语言模型prompt-engineeringpromptsllmopsprompt-testingTestingragevaluationevaluation-frameworkllm-evalllm-evaluationllm-evaluation-framework持续集成CI/CDpentestingred-teamingvulnerability-scanners
TypeScript 8.37 k
5 小时前
msoedov/agentic_security
https://static.github-zh.com/github_avatars/msoedov?size=40
msoedov / agentic_security

Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪

llm-securityai-red-teamllm-evaluationllm-evaluation-frameworkprompt-testingagent-framework
Python 1.67 k
2 天前
https://static.github-zh.com/github_avatars/babelcloud?size=40
babelcloud / LLM-RGB

#大语言模型#LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.

benchmark大语言模型promptprompt-engineeringprompt-testing
TypeScript 164
4 个月前
https://static.github-zh.com/github_avatars/aralyekta?size=40
aralyekta / prompttester

Test, compare, and optimize your AI prompts in minutes

llm-evaluationllm-toolsprompt-testing
JavaScript 8
1 个月前
https://static.github-zh.com/github_avatars/prompt-foundry?size=40
prompt-foundry / typescript-sdk

#大语言模型#The prompt engineering, prompt management, and prompt evaluation tool for TypeScript, JavaScript, and NodeJS.

prompt-engineeringprompt-managementprompt-testingTypeScriptllm-evalllm-evaluationopen-aigptgpt-3gpt-4大语言模型llm-opsllmops
TypeScript 6
1 年前
https://static.github-zh.com/github_avatars/calibrtr?size=40
calibrtr / llm-prompt-test

#大语言模型#LLM Prompt Test helps you test Large Language Models (LLMs) prompts to ensure they consistently meet your expectations.

large-language-models大语言模型promptprompt-engineeringprompt-testingpromptsTestingTest automationTest-driven development
TypeScript 5
1 年前
https://static.github-zh.com/github_avatars/yukinagae?size=40
yukinagae / genkitx-promptfoo

#大语言模型#Community Plugin for Genkit to use Promptfoo

人工智能evaluationevaluation-frameworkFirebasegenkit大语言模型llm-evalllm-evaluationllm-evaluation-frameworkllmops插件promptprompt-testingpromptsTesting
TypeScript 4
8 个月前
https://static.github-zh.com/github_avatars/yukinagae?size=40
yukinagae / promptfoo-sample

#大语言模型#Sample project demonstrates how to use Promptfoo, a test framework for evaluating the output of generative AI models

evaluationevaluation-framework大语言模型llm-evalllm-evaluationllm-evaluation-frameworkllmopsprompt-testingpromptsTesting
1
1 年前
https://static.github-zh.com/github_avatars/jairerazodev?size=40
jairerazodev / prompt-testing

prompt-testing

prompt-testing
1
3 年前
https://static.github-zh.com/github_avatars/abdullahkhalid00?size=40
abdullahkhalid00 / prompt-db

#大语言模型#A collection of prompts that I use on a day-to-day basis for work and leisure.

ChatGPTjinja2Markdownprompt-engineeringprompt-testingpromptstext
1
1 年前
https://static.github-zh.com/github_avatars/ashleysally00?size=40
ashleysally00 / promptfoo-quickstart-guide

#大语言模型#Quickstart guide for using PromptFoo to evaluate LLM prompts via CLI or Colab.

cli-toolcolab大语言模型openaiprompt-engineeringprompt-testing
1
1 个月前
https://static.github-zh.com/github_avatars/radoslaw-sz?size=40
radoslaw-sz / maia

#大语言模型#A pytest-based framework for testing multi AI agents systems. It provides a flexible and extensible platform for complex multi-agent simulations. Supports many integrations like LiteLLM, CrewAI, LangC...

agents人工智能框架大语言模型PythonTestingai-testing-toolprompt-engineeringprompt-testingagentic
Python 1
7 天前
https://static.github-zh.com/github_avatars/yukinagae?size=40
yukinagae / genkit-promptfoo-sample

#大语言模型#Sample implementation demonstrating how to use Firebase Genkit with Promptfoo

evaluationevaluation-frameworkgenkit大语言模型llm-evalllm-evaluationllm-evaluation-frameworkllmopsprompt-testingpromptsTesting
TypeScript 0
1 年前
https://static.github-zh.com/github_avatars/Sigmakib2?size=40
Sigmakib2 / openai-prompt-testing-playground

#大语言模型#A dynamic and interactive playground for testing and refining prompts with OpenAI's language models. Includes customizable inputs for prompts, advanced model settings, and live response streaming for ...

人工智能ChatGPTopenaiplaygroundpromptprompt-engineeringprompt-testing
HTML 0
8 个月前
https://static.github-zh.com/github_avatars/snowz123?size=40
snowz123 / team-agents

#大语言模型#🐙 Team Agents unifica 82 especialistas en IA para resolver desafíos con chat inteligente, analista de requisitos y subida de documentos. Plataforma futurista y modular.

agentagent-simulationsagentsAzureChatGPTgenerative-ailangchain大语言模型llm-evaluationllm-evaluation-frameworkllm-securitymulti-agentsprompt-testing
Python 0
2 天前
https://static.github-zh.com/github_avatars/bluewave-labs?size=40
bluewave-labs / evalwise

#大语言模型#EvalWise is a developer-friendly platform for LLM evaluation and red teaming that helps test AI models for safety, compliance, and performance issues

evals大语言模型llm-evaluationllmopsprompt-engineeringprompt-testingragrag-evaluation
Python 0
10 天前