#

factuality

https://static.github-zh.com/github_avatars/stanford-oval?size=40

#自然语言处理#WikiChat is an improved RAG. It stops the hallucination of large language models by retrieving data from a corpus.

Python 1.52 k
5 个月前
https://static.github-zh.com/github_avatars/Libr-AI?size=40

Loki: Open-source solution designed to automate the process of verifying factuality

Python 1.11 k
1 年前
https://static.github-zh.com/github_avatars/google-deepmind?size=40

Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".

Python 640
1 个月前
https://static.github-zh.com/github_avatars/voidism?size=40

Official implementation for the paper "DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models"

Python 512
8 个月前
https://static.github-zh.com/github_avatars/amazon-science?size=40

RefChecker provides automatic checking pipeline and benchmark dataset for detecting fine-grained hallucinations generated by Large Language Models.

Python 392
4 个月前
https://static.github-zh.com/github_avatars/shmsw25?size=40

A package to evaluate factuality of long-form generation. Original implementation of our EMNLP 2023 paper "FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation"

Python 380
5 个月前
https://static.github-zh.com/github_avatars/chaitanyamalaviya?size=40

[Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers

Python 133
2 年前
https://static.github-zh.com/github_avatars/voidism?size=40

Code for the EMNLP 2024 paper "Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps"

Python 130
1 年前
https://static.github-zh.com/github_avatars/BharathxD?size=40

#大语言模型#This AI fact-checking system, built with LangGraph, dissects text into verifiable claims, cross-referencing them with real-world evidence via web searches. It then generates detailed accuracy reports,...

TypeScript 67
1 个月前
https://static.github-zh.com/github_avatars/salesforce?size=40

#自然语言处理#Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond"

Jupyter Notebook 59
8 个月前
https://static.github-zh.com/github_avatars/amazon-science?size=40

Implementation of the paper "FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations (NAACL 2022)"

Python 50
2 年前
https://static.github-zh.com/github_avatars/dmis-lab?size=40

OLAPH: Improving Factuality in Biomedical Long-form Question Answering

Python 38
1 年前
https://static.github-zh.com/github_avatars/pphuc25?size=40

#自然语言处理#Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation

Python 35
2 年前
https://static.github-zh.com/github_avatars/ChanLiang?size=40

#大语言模型#[EMNLP 2023] Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators

Python 32
2 年前
https://static.github-zh.com/github_avatars/zjunlp?size=40

KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality

Python 29
2 个月前
https://static.github-zh.com/github_avatars/JayZhang42?size=40

#大语言模型#SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433

Python 28
9 个月前
https://static.github-zh.com/github_avatars/khuangaf?size=40

Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"

Jupyter Notebook 26
1 年前
https://static.github-zh.com/github_avatars/MiuLab?size=40

Source code of our EMNLP 2024 paper "FactAlign: Long-form Factuality Alignment of Large Language Models"

Jupyter Notebook 19
1 年前
https://static.github-zh.com/github_avatars/amazon-science?size=40

#大语言模型#Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"

Python 15
9 个月前
https://static.github-zh.com/github_avatars/mbzuai-nlp?size=40

#大语言模型#A lightweight, agent-style framework for fact-checking atomic claims using iterative retrieval and verification. Reduces LLM and search cost while maintaining strong factuality performance.

Python 12
3 个月前
loading...
Website
Wikipedia