mmlu · GitHub Topics

#自然语言处理#A large-scale 7B pretraining language model developed by BaiChuan-Inc.

Python 5.69 k

1 年前

#自然语言处理#A series of large language models developed by Baichuan Intelligent Technology

Python 4.12 k

10 个月前

#自然语言处理#A 13B large language model developed by Baichuan Intelligent Technology

Python 2.97 k

2 年前

#大语言模型#A Contamination-free Multi-task Language Understanding Benchmark [Official, ACL 2025]

119

4 个月前

[NeurIPS 2023 Spotlight] In-Context Impersonation Reveals Large Language Models' Strengths and Biases

Python 22

9 个月前

#数据仓库#AGI-Elo: How Far Are We From Mastering A Task?

Python 6

4 个月前

#大语言模型#CLI tool to evaluate LLM factuality on MMLU benchmark.

Python 2

4 天前

LLMs' performance analysis on CPU, GPU, Execution Time and Energy Usage

Java 0

1 年前