#大语言模型#Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
#大语言模型#Make Llama 3.1 8B talk in Rick Sanchez’s style
#计算机科学#Code for Deep Learning for Modern AI
Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations. Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-awar...
#自然语言处理#本项目利用医学领域的 CoT 数据对 Deepseek-R1-Distill-Qwen-7B 进行微调,通过 QLoRA 量化和 Unsloth 加速训练,显著提升模型在复杂医学推理任务中的慢思考能力。知识蒸馏技术使轻量级模型获得大模型的推理优势,实现高效、准确且具有解释性的医学问答系统。
An implementation of GRPO for Unsloth's VLMs training
#大语言模型#Fine-Tuning LLMs (Gemma, LLaMA, Mistral, etc.) A practical guide to fine-tuning various large language models using popular frameworks. Includes examples, scripts, and tips for efficient training on c...
#大语言模型#PTIT's Major Project: Website Programming - This repo contains a chatbot for a clothing store. The chatbot acts as an employee with specific knowledge about clothing consultation, website support, and...
#大语言模型#Cloning Yourself using your whatsapp chat history and training a model on it.
Information extraction from unstructured text to build a knowledge graph using techniques from traditional NLP to pre-trained transformers and LLMs for NER and Linking, and Relation Extraction.
Fine-Tuning of DeepSeek-Style Reasoning Models | RL + Quantization Implementation
#大语言模型#AstorAI is a user-friendly medical chatbot powered by Retrieval-Augmented Generation (RAG) and the advanced LLama 3 model. It offers real-time, accurate responses to a wide range of medical queries, e...
Finetuning of Gemma-2 2B for structured output
#大语言模型#Fine-tuning Llama 3.2 3B Instruct model for text generation using Unsloth AI
#数据仓库#Finetune Web UI is a user-interface for training and deploying pre-trained models.
#计算机科学#ResurrectAI is an AI-driven chat application designed to bring the wisdom and knowledge of great historical personalities to life. Leveraging advanced language models and fine-tuning techniques, Resur...
#大语言模型#PDF 문서에서 GPU 가속 처리로 고품질 질의응답(QA) 데이터를 자동 생성하고 LLM을 효율적으로 파인튜닝하는 솔루션입니다. Unstructured 라이브러리와 AWS Bedrock Claude로 도메인 특화 QA 쌍을 생성하고, LoRA 기법으로 경량 모델을 훈련합니다.
#大语言模型#LLM-powered financial analyst using LoRA-tuned Llama-3 and RAG pipeline to answer complex queries over SEC 10-K filings with contextual accuracy.
Análise Avançada de Dados com Causalidade e Aprendizado por Reforço