rag-evaluation · GitHub Topics

#大语言模型#🐢 Open-Source Evaluation & Testing for AI & LLM systems

mlops ml-validation ml-testing llmops responsible-ai fairness-ai llm-eval llm-evaluation rag-evaluation ai-security llm-security ai-red-team red-team-tools 大语言模型

Python 4.74 k

23 天前

Marker-Inc-Korea / AutoRAG

#大语言模型#AutoRAG: An Open-Source Framework for Retrieval-Augmented Generation (RAG) Evaluation & Optimization with AutoML-Style Automation

analysis automl benchmarking document-parser embeddings evaluation 大语言模型 llm-evaluation llm-ops Open Source ops optimization pipeline Python qa rag rag-evaluation retrieval-augmented-generation

Python 4.15 k

1 个月前

Agenta-AI / agenta

The open-source LLMOps platform: prompt playground, prompt management, LLM evaluation, and LLM observability all in one place.

llm-tools prompt-engineering prompt-management llm-evaluation llm-framework rag-evaluation llm-observability llm-as-a-judge llm-monitoring llm-platform llm-playground llmops-platform

Python 3.03 k

1 天前

frutik / Awesome-RAG

rag rag-evaluation

345

1 年前

vectara / open-rag-eval

Open source RAG evaluation package

evaluation-metrics rag retrieval-augmented-generation 监控 rag-evaluation

Python 269

23 天前

LLAMATOR-Core / llamator

#自然语言处理#Framework for testing vulnerabilities of large language models (LLM).

attack 大语言模型自然语言处理 Python 安全 ai-security red-team hallucinations llm-security rag-evaluation 人工智能 rag jailbreak owasp red-team-tools agent vulnerability

Python 134

1 天前

mts-ai / rurage

information-retrieval llm-evaluation question-answering rag rag-evaluation

Python 31

4 个月前

oztrkoguz / RAG-Framework-Evaluation

This project aims to compare different Retrieval-Augmented Generation (RAG) frameworks in terms of speed and performance.

autogen crewai langchain llamaindex rag rag-evaluation swarms

Python 14

1 年前

ioannis-papadimitriou / rag-playground

A framework for systematic evaluation of retrieval strategies and prompt engineering in RAG systems, featuring an interactive chat interface for document analysis.

聊天机器人 llm-inference rag-evaluation retrieval-augmented-generation

Python 9

7 个月前

rostyslavshovak / RAG-Retrieval-Augmented-Generation

RAG Chatbot for Financial Analysis

gradio-interface Open Source qdrant-vector-database rag retrieval-augmented-generation langchain pdf rag-evaluation

Python 8

5 个月前

simranjeet97 / Learn_RAG_from_Scratch_LLM

Learn Retrieval-Augmented Generation (RAG) from Scratch using LLMs from Hugging Face and Langchain or Python

人工智能 generative-ai llm-apps llm-evaluation llm-framework llm-training rag rag-evaluation retrieval-augmented-generation

Jupyter Notebook 5

6 个月前

shaadclt / EvalRAG

A comprehensive evaluation toolkit for assessing Retrieval-Augmented Generation (RAG) outputs using linguistic, semantic, and fairness metrics

rag rag-evaluation

Python 4

3 个月前

fkapsahili / EntRAG

#大语言模型#EntRAG - Enterprise RAG Benchmark

benchmark dataset evaluation generative-ai 大语言模型 rag retrieval-augmented-generation retrieval knowledge-graph llm-evaluation rag-evaluation

Python 3

2 个月前

Kaos599 / BetterRAG

BetterRAG: Powerful RAG evaluation toolkit for LLMs. Measure, analyze, and optimize how your AI processes text chunks with precision metrics. Perfect for RAG systems, document processing, and embeddin...

embeddings evaluation evaluation-framework optimization rag rag-evaluation

Python 1

4 个月前

sprakash21 / aws-genai-rageval-bot

RAG Pipeline Evaluation and monitoring on AWS using RAGAS

genai-chatbot 监控 rag-evaluation

Python 1

9 个月前

AnasAber / MLflow_with_RAG

Using MLflow to deploy your RAG pipeline, using LLamaIndex, Langchain and Ollama/HuggingfaceLLMs/Groq

evaluation-metrics mlflow mlops rag rag-evaluation llamaindex CI/CD 部署

Python 1

6 个月前

keitabroadwater / llm-eval-lab

A web sandbox for hands-on learning of LLM and RAG Evaluation

evaluation-framework FastAPI gpt4 llm-evaluation llmops Next rag-evaluation

TypeScript 0

5 个月前

OranDanon / RAG-application

RAG Chatbot over pre-defined set of articles about LangChain

generative-ai rag rag-evaluation

Python 0

4 个月前

OranDanon / Gen-AI-Assignment

Home assignment featuring two AI projects: a Medical Q&A Bot for Israeli HMOs and a National Insurance Form Extractor. Built with Azure OpenAI to demonstrate practical GenAI implementation skills.

azure-ai generative-ai rag-evaluation

Python 0

4 个月前

jhaayush2004 / RAG-Evaluation

Different approaches to evaluate RAG !!!

langchain rag rag-evaluation wandb

Jupyter Notebook 0

1 年前