unsloth · GitHub Topics

#大语言模型#Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.

fine-tuning llama 大语言模型 lora mistral gemma llama3 unsloth deepseek deepseek-r1 gemma3 text-to-speech tts qwen qwen3 agent 人工智能 openai gpt-oss

Python 45.45 k

14 小时前

unslothai / notebooks

100+ Fine-tuning LLM Notebooks on Google Colab, Kaggle, and more.

unsloth

Jupyter Notebook 3.64 k

2 天前

neural-maze / rick-llm

#大语言模型#Make Llama 3.1 8B talk in Rick Sanchez’s style

llama3 大语言模型 ollama unsloth huggingface

Jupyter Notebook 116

8 个月前

GAD-cell / vlm-grpo

An implementation of GRPO for Unsloth's VLMs training

grpo huggingface unsloth vlm reinforcement-learning

Python 73

1 个月前

sinanuozdemir / oreilly-pytorch-dl

#计算机科学#Code for Deep Learning for Modern AI

bert 深度学习 llama3 大语言模型 neural-networks distillation llama mnist quantization clip diffusion dreambooth multimodal unsloth

Jupyter Notebook 45

6 个月前

Breeze648 / MedCoT-7B

#自然语言处理#本项目利用医学领域的 CoT 数据对 Deepseek-R1-Distill-Qwen-7B 进行微调，通过 QLoRA 量化和 Unsloth 加速训练，显著提升模型在复杂医学推理任务中的慢思考能力。知识蒸馏技术使轻量级模型获得大模型的推理优势，实现高效、准确且具有解释性的医学问答系统。

人工智能 chain-of-thought deepseek-r1 distillation 大语言模型 lora medical-application 自然语言处理 qlora qwen unsloth

Python 28

6 个月前

Cre4T3Tiv3 / unsloth-llama3-alpaca-lora

#大语言模型#Advanced 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with production-grade optimization. Memory-efficient training on consumer GPUs for instruction-following specialization. Demonstrates cutting-e...

4bit alpaca colab finetuning gradio huggingface instruction-tuning llama3 大语言模型 lora Open Source peft qlora transformers unsloth

Jupyter Notebook 26

2 个月前

qqqqqf-q / Qing-Digital-Self

#大语言模型#数字分身项目,并且包含了搭建(复现)教程 Qing's digital self, including setup tutorial

人工智能聊天机器人大语言模型 qlora qwen unsloth digital-twin finetune finetune-llm huggingface

Python 24

9 天前

shaheennabi / Production-Ready-Instruction-Finetuning-of-Meta-Llama-3.2-3B-Instruct-Project

Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations. Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-awar...

finetuning gguf huggingface meta qlora quantization Open Source production-ready unsloth gpu inference training peft

Jupyter Notebook 21

7 个月前

0xZee / DeepSeek-R1-FineTuning

Fine-Tuning of DeepSeek-Style Reasoning Models | RL + Quantization Implementation

deepseek-r1 lora qlora reinforcement-learning unsloth

Jupyter Notebook 15

7 个月前

Eviltr0N / Make-AI-Clone-of-Yourself

#大语言模型#Cloning Yourself using your whatsapp chat history and training a model on it.

人工智能 finetuning llama3 llama3-finetune unsloth WhatsApp whatsapp-clone 大语言模型 ollama

Jupyter Notebook 15

1 年前

deep-div / Fine-Tuning-LLMs-and-VisionModels

#大语言模型#Fine-Tuning LLMs (Gemma, LLaMA, Mistral, etc.) A practical guide to fine-tuning various large language models using popular frameworks. Includes examples, scripts, and tips for efficient training on c...

deepseek finetuning-llms gemma generative-ai huggingface large-language-models llama 大语言模型 transformers unsloth

Jupyter Notebook 14

8 天前

alisonmitchell / Biomedical-Knowledge-Graph

Information extraction from unstructured text to build a knowledge graph using techniques from traditional NLP to pre-trained transformers and LLMs for NER and Linking, and Relation Extraction.

arxiv coreference-resolution groq knowledge-graph llamaindex named-entity-recognition relation-extraction unsloth langchain biomedical

Jupyter Notebook 14

9 个月前

QuangNguyen2910 / AutClothingChatbot

#大语言模型#PTIT's Major Project: Website Programming - This repo contains a chatbot for a clothing store. The chatbot acts as an employee with specific knowledge about clothing consultation, website support, and...

聊天机器人 langchain 大语言模型 rag unsloth vector-database

Jupyter Notebook 13

1 年前

muhammad-fiaz / finetune-web-ui

#数据仓库#Finetune Web UI is a user-interface for training and deploying pre-trained models.

数据集 fine-tuning finetune finetuning-llms generative-ai gpt huggingface large-language-models transformers unsloth gradio

Python 10

1 个月前

IAmSkyDra / finetune-quantize-llms

Materials for CSE Summer School Hackathon 2024

大语言模型 unsloth

Jupyter Notebook 10

10 个月前

SrikarVeluvali / Astor-AI

#大语言模型#AstorAI is a user-friendly medical chatbot powered by Retrieval-Augmented Generation (RAG) and the advanced LLama 3 model. It offers real-time, accurate responses to a wide range of medical queries, e...

Flask huggingface llama3 大语言模型 MongoDB ollama React transformers unsloth

Jupyter Notebook 10

10 个月前

bastienpo / unsloth_finetuning

Finetuning of Gemma-2 2B for structured output

人工智能 fine-tuning gemma2 llamacpp Python unsloth

Jupyter Notebook 9

1 年前

mirabdullahyaser / LLaMA3-Financial-Analyst

#大语言模型#LLM-powered financial analyst using LoRA-tuned Llama-3 and RAG pipeline to answer complex queries over SEC 10-K filings with contextual accuracy.

embeddings finance financial-analysis fine-tuning huggingface llama3 大语言模型 lora rag unsloth vector-database 聊天机器人 question-answering

Jupyter Notebook 8

7 个月前

harshit433 / ResurrectAI

#计算机科学#ResurrectAI is an AI-driven chat application designed to bring the wisdom and knowledge of great historical personalities to life. Leveraging advanced language models and fine-tuning techniques, Resur...

finetuning-llms Firebase Flask Flutter html-css-javascript llama3-1 ollama Python unsloth 人工智能聊天机器人 conversational-ai language-model 机器学习

Dart 8

1 年前