Benchmark for evaluating open-ended generation
2020-11-24
否
2024-11-06T03:22:01Z
#大语言模型#Chinese safety prompts for evaluating and improving the safety of LLMs. 中文安全prompts,用于评估和提升大模型的安全性。
Technical Report: Is ChatGPT a Good NLG Evaluator? A Preliminary Study
#计算机科学#Free MLOps course from DataTalks.Club
Windows compile of bitsandbytes for use in text-generation-webui.
A framework for few-shot evaluation of language models.
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
Diffusion-LM
Code for paper "G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment"
#自然语言处理#Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.
#大语言模型#利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
0 条讨论