GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

gptq

Website
Wikipedia
https://static.github-zh.com/github_avatars/intel?size=40
intel / neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

low-precisionpruningsparsityauto-tuningknowledge-distillationquantizationquantization-aware-trainingpost-training-quantizationsmoothquantlarge-language-modelsgptqint8
Python 2.43 k
3 天前
https://static.github-zh.com/github_avatars/ModelCloud?size=40
ModelCloud / GPTQModel

Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

gptqpeftquantizationtransformersvllm
Python 606
18 天前
https://static.github-zh.com/github_avatars/intel?size=40
intel / auto-round

Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA and HPU. Seamlessly integrated with Torchao, Transformers, and vLLM.

gptqquantizationrounding
Python 499
2 天前
https://static.github-zh.com/github_avatars/shm007g?size=40
shm007g / LLaMA-Cult-and-More

#大语言模型#Large Language Models for All, 🦙 Cult and More, Stay in touch !

alpacaChatGPTgptllamaggmlgpt4gptqvicunaPyTorchTensorflowtransformersdeepspeed大语言模型
HTML 446
2 年前
https://static.github-zh.com/github_avatars/bobazooba?size=40
bobazooba / xllm

#大语言模型#🦖 X—LLM: Cutting Edge & Easy LLM Finetuning

alpacacerebrasChatGPT深度学习深度神经网络gptgpt-4gptqlarge-language-modelsllamallama2大语言模型mistralopenaivicunaZephyr RTOSPyTorchtorch
Python 403
1 年前
https://static.github-zh.com/github_avatars/1b5d?size=40
1b5d / llm-api

#大语言模型#Run any Large Language Model behind a unified API

ChatGPTgptqhuggingfacelangchainllamallamacpp大语言模型llm-inference机器学习Python
Python 169
2 年前
https://static.github-zh.com/github_avatars/chenhunghan?size=40
chenhunghan / ialacol

#大语言模型#🪶 Lightweight OpenAI drop-in replacement for Kubernetes

人工智能helmKuberneteslangchain大语言模型PythonopenaicloudnativeggmlgpullamacppCUDAgptqllm-inferencellm-serving
Python 145
1 年前
https://static.github-zh.com/github_avatars/abhinand5?size=40
abhinand5 / gptq_for_langchain

#大语言模型#A guide about how to use GPTQ models with langchain

人工智能gptgptqlangchainlanguage-model大语言模型quantizationwizardlm
Jupyter Notebook 40
2 年前
https://static.github-zh.com/github_avatars/ziwang-com?size=40
ziwang-com / zero-lora

#大语言模型#zero零训练llm调参

gptgptqllama大语言模型lora
31
2 年前
https://static.github-zh.com/github_avatars/tripathiarpan20?size=40
tripathiarpan20 / self-improvement-4all

Private self-improvement coaching with open-source LLMs

faisslangchainPythongptqtransformers
Python 15
1 年前
https://static.github-zh.com/github_avatars/hcd233?size=40
hcd233 / Aris-AI-Model-Server

#大语言模型#An OpenAI Compatible API which integrates LLM, Embedding and Reranker. 一个集成 LLM、Embedding 和 Reranker 的 OpenAI 兼容 API

人工智能embeddingFastAPIgptq大语言模型MLXopenai-compatible-apiragrerankersentence-transformersvllm
Python 14
2 个月前
https://static.github-zh.com/github_avatars/chinoll?size=40
chinoll / chatsakura

#大语言模型#ChatSakura:Open-source multilingual conversational model.(开源多语言对话大模型)

gradioPyTorchbloomChatGPTinstruct-gpt大语言模型gptqtransformers
Python 14
2 年前
https://static.github-zh.com/github_avatars/seyf1elislam?size=40
seyf1elislam / LocalLLM_OneClick_Colab

#大语言模型#Run gguf LLM models in Latest Version TextGen-webui

colab-notebookggufgptq大语言模型localllamalocalllmPython
Jupyter Notebook 11
8 个月前
https://static.github-zh.com/github_avatars/matlok-ai?size=40
matlok-ai / bampe-weights

#大语言模型#This repository is for profiling, extracting, visualizing and reusing generative AI weights to hopefully build more accurate AI models and audit/scan weights at rest to identify knowledge domains for ...

人工智能blip2foundational-modelsgenerative-aigptqimage-to-image大语言模型safetensorsstable-diffusiontifftransformersblenderblender-python深度学习
Python 9
1 年前
https://static.github-zh.com/github_avatars/Aqirito?size=40
Aqirito / A.L.I.C.E

#大语言模型#A.L.I.C.E (Artificial Labile Intelligence Cybernated Existence). A REST API of A.I companion for creating more complex system

langchainlangchain-python大语言模型text-generationtext-to-speechttsvitsAnime人工智能Genshin ImpactwaifuFastAPIgptqhuggingface-transformerspygmalionREST API
Python 9
4 个月前
https://static.github-zh.com/github_avatars/bobazooba?size=40
bobazooba / shurale

#自然语言处理#Conversation AI model for open domain dialogs

cerebrasChatGPT深度学习深度神经网络gptgpt-4gptqlarge-language-modelsllamallama2大语言模型mistral自然语言处理openaiPyTorchtorchtransformersvicuna
Python 4
2 年前
https://static.github-zh.com/github_avatars/upunaprosk?size=40
upunaprosk / quantized-lm-confidence

#自然语言处理#Code for NAACL paper When Quantization Affects Confidence of Large Language Models?

compressiongptq自然语言处理quantizationefficient-modellarge-language-models大语言模型
Jupyter Notebook 3
6 个月前
https://static.github-zh.com/github_avatars/amajji?size=40
amajji / LLM-Quantization-Techniques-Absmax-Zeropoint-GPTQ-GGUF

#大语言模型#LLM quantization techniques: absmax, zero-point, GPTQ and GGUF

ggmlggufgptqllamacpp大语言模型quantizationquantization-aware-training
Jupyter Notebook 2
10 个月前
https://static.github-zh.com/github_avatars/SujanNeupane42?size=40
SujanNeupane42 / NEPSE-Chatbot-Using-Retrieval-augmented-generation-and-reranking

#大语言模型#This project will develop a NEPSE chatbot using an open-source LLM, incorporating sentence transformers, vector database and reranking.

faissFlaskgptqlangchain大语言模型Pythonretrieval-augmented-generationsentence-transformersvector-database
Jupyter Notebook 2
1 年前
https://static.github-zh.com/github_avatars/SujanNeupane42?size=40
SujanNeupane42 / LLM_Quantization

#自然语言处理#Quantizing LLMs using GPTQ

gptqhuggingface大语言模型机器学习自然语言处理quantization
Jupyter Notebook 0
1 年前
loading...