#

qlora

bitsandbytes-foundation/bitsandbytes
https://static.github-zh.com/github_avatars/bitsandbytes-foundation?size=40
Python 7.58 k
9 小时前
https://static.github-zh.com/github_avatars/yangjianxin1?size=40

#大语言模型#Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Python 6.55 k
1 年前
https://static.github-zh.com/github_avatars/iusztinpaul?size=40

🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training, and deploying a real-time financial advisor LLM system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 𝘷𝘪𝘥𝘦𝘰 & 𝘳𝘦�...

Jupyter Notebook 3.35 k
9 个月前
https://static.github-zh.com/github_avatars/ssbuild?size=40
Python 1.55 k
6 个月前
https://static.github-zh.com/github_avatars/X-D-Lab?size=40
Python 687
1 年前
https://static.github-zh.com/github_avatars/jianzhnie?size=40

#大语言模型#Easy and Efficient Finetuning LLMs. (Supported LLama, LLama2, LLama3, Qwen, Baichuan, GLM , Falcon) 大模型高效量化训练+部署.

Python 613
8 个月前
https://static.github-zh.com/github_avatars/X-D-Lab?size=40

#大语言模型#🌿孙思邈中文医疗大模型(Sunsimiao):提供安全、可靠、普惠的中文医疗大模型

Python 459
1 年前
https://static.github-zh.com/github_avatars/yangjianxin1?size=40

#大语言模型#Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型

Python 413
2 年前
https://static.github-zh.com/github_avatars/ddzipp?size=40

AutoAudit—— the LLM for Cyber Security 网络安全大语言模型

HTML 346
7 个月前
https://static.github-zh.com/github_avatars/WangRongsheng?size=40

#大语言模型#The official codes for "Aurora: Activating chinese chat capability for Mixtral-8x7B sparse Mixture-of-Experts through Instruction-Tuning"

Python 266
1 年前
https://static.github-zh.com/github_avatars/taishan1994?size=40

对llama3进行全参微调、lora微调以及qlora微调。

Python 208
1 年前
https://static.github-zh.com/github_avatars/yangjianxin1?size=40

#大语言模型#LongQLoRA: Extent Context Length of LLMs Efficiently

Python 166
2 年前
https://static.github-zh.com/github_avatars/ssbuild?size=40
Python 145
1 年前
loading...
Website
Wikipedia