#

streetfighterai

https://static.github-zh.com/github_avatars/OpenGenerativeAI?size=40

#大语言模型#Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM

Jupyter Notebook 1.45 k
6 个月前
https://static.github-zh.com/github_avatars/mennahasan31?size=40

#自然语言处理#llm_benchmark is a comprehensive benchmarking tool for evaluating the performance of various Large Language Models (LLMs) on a range of natural language processing tasks. It provides a standardized fr...

0
7 个月前
Website
Wikipedia