GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

gptq

Website
Wikipedia
https://static.github-zh.com/github_avatars/intel?size=40
intel / neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

low-precisionpruningsparsityauto-tuningknowledge-distillationquantizationquantization-aware-trainingpost-training-quantizationsmoothquantlarge-language-modelsgptqint8
Python 2.46 k
2 天前
https://static.github-zh.com/github_avatars/ModelCloud?size=40
ModelCloud / GPTQModel

Production ready LLM model compression/quantization toolkit with hw accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

gptqpeftquantizationtransformersvllm
Python 703
14 天前
https://static.github-zh.com/github_avatars/intel?size=40
intel / auto-round

Advanced Quantization Algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA and HPU. Seamlessly integrated with Torchao, Transformers, and vLLM. Export your models effortlessly to autogpt...

gptqquantizationrounding
Python 552
3 天前
https://static.github-zh.com/github_avatars/shm007g?size=40
shm007g / LLaMA-Cult-and-More

#大语言模型#Large Language Models for All, 🦙 Cult and More, Stay in touch !

alpacaChatGPTgptllamaggmlgpt4gptqvicunaPyTorchTensorflowtransformersdeepspeed大语言模型
HTML 445
2 年前
https://static.github-zh.com/github_avatars/bobazooba?size=40
bobazooba / xllm

#大语言模型#🦖 X—LLM: Cutting Edge & Easy LLM Finetuning

alpacacerebrasChatGPT深度学习深度神经网络gptgpt-4gptqlarge-language-modelsllamallama2大语言模型mistralopenaivicunaZephyr RTOSPyTorchtorch
Python 403
2 年前
https://static.github-zh.com/github_avatars/1b5d?size=40
1b5d / llm-api

#大语言模型#Run any Large Language Model behind a unified API

ChatGPTgptqhuggingfacelangchainllamallamacpp大语言模型llm-inference机器学习Python
Python 169
2 年前
https://static.github-zh.com/github_avatars/chenhunghan?size=40
chenhunghan / ialacol

#大语言模型#🪶 Lightweight OpenAI drop-in replacement for Kubernetes

人工智能helmKuberneteslangchain大语言模型PythonopenaicloudnativeggmlgpullamacppCUDAgptqllm-inferencellm-serving
Python 145
1 年前
https://static.github-zh.com/github_avatars/abhinand5?size=40
abhinand5 / gptq_for_langchain

#大语言模型#A guide about how to use GPTQ models with langchain

人工智能gptgptqlangchainlanguage-model大语言模型quantizationwizardlm
Jupyter Notebook 40
2 年前
https://static.github-zh.com/github_avatars/ziwang-com?size=40
ziwang-com / zero-lora

#大语言模型#zero零训练llm调参

gptgptqllama大语言模型lora
31
2 年前
https://static.github-zh.com/github_avatars/tripathiarpan20?size=40
tripathiarpan20 / self-improvement-4all

Private self-improvement coaching with open-source LLMs

faisslangchainPythongptqtransformers
Python 15
1 年前
https://static.github-zh.com/github_avatars/hcd233?size=40
hcd233 / Aris-AI-Model-Server

#大语言模型#An OpenAI Compatible API which integrates LLM, Embedding and Reranker. 一个集成 LLM、Embedding 和 Reranker 的 OpenAI 兼容 API

人工智能embeddingFastAPIgptq大语言模型MLXopenai-compatible-apiragrerankersentence-transformersvllm
Python 15
6 天前
https://static.github-zh.com/github_avatars/chinoll?size=40
chinoll / chatsakura

#大语言模型#ChatSakura:Open-source multilingual conversational model.(开源多语言对话大模型)

gradioPyTorchbloomChatGPTinstruct-gpt大语言模型gptqtransformers
Python 13
2 年前
https://static.github-zh.com/github_avatars/taishan1994?size=40
taishan1994 / LLM-Quantization

#大语言模型#记录量化LLM中的总结。

gptq大语言模型quantizationqwen3
Python 12
7 天前
https://static.github-zh.com/github_avatars/seyf1elislam?size=40
seyf1elislam / LocalLLM_OneClick_Colab

#大语言模型#Run gguf LLM models in Latest Version TextGen-webui

colab-notebookggufgptq大语言模型localllamalocalllmPython
Jupyter Notebook 12
10 个月前
https://static.github-zh.com/github_avatars/matlok-ai?size=40
matlok-ai / bampe-weights

#大语言模型#This repository is for profiling, extracting, visualizing and reusing generative AI weights to hopefully build more accurate AI models and audit/scan weights at rest to identify knowledge domains for ...

人工智能blip2foundational-modelsgenerative-aigptqimage-to-image大语言模型safetensorsstable-diffusiontifftransformersblenderblender-python深度学习
Python 9
2 年前
https://static.github-zh.com/github_avatars/Aqirito?size=40
Aqirito / A.L.I.C.E

#大语言模型#A.L.I.C.E (Artificial Labile Intelligence Cybernated Existence). A REST API of A.I companion for creating more complex system

langchainlangchain-python大语言模型text-generationtext-to-speechttsvitsAnime人工智能Genshin ImpactwaifuFastAPIgptqhuggingface-transformerspygmalionREST API
Python 9
6 个月前
https://static.github-zh.com/github_avatars/bobazooba?size=40
bobazooba / shurale

#自然语言处理#Conversation AI model for open domain dialogs

cerebrasChatGPT深度学习深度神经网络gptgpt-4gptqlarge-language-modelsllamallama2大语言模型mistral自然语言处理openaiPyTorchtorchtransformersvicuna
Python 4
2 年前
https://static.github-zh.com/github_avatars/SujanNeupane42?size=40
SujanNeupane42 / NEPSE-Chatbot-Using-Retrieval-augmented-generation-and-reranking

#大语言模型#This project will develop a NEPSE chatbot using an open-source LLM, incorporating sentence transformers, vector database and reranking.

faissFlaskgptqlangchain大语言模型Pythonretrieval-augmented-generationsentence-transformersvector-database
Jupyter Notebook 3
2 年前
https://static.github-zh.com/github_avatars/upunaprosk?size=40
upunaprosk / quantized-lm-confidence

#自然语言处理#Code for NAACL paper When Quantization Affects Confidence of Large Language Models?

compressiongptq自然语言处理quantizationefficient-modellarge-language-models大语言模型
Jupyter Notebook 3
7 个月前
https://static.github-zh.com/github_avatars/lpalbou?size=40
lpalbou / model-quantizer

#自然语言处理#Effortlessly quantize, benchmark, and publish Hugging Face models with cross-platform support for CPU/GPU. Reduce model size by 75% while maintaining performance.

cross-platformgptqhuggingfaceinference大语言模型机器学习model-compression自然语言处理optimizationPythonPyTorchquantizationtransformers
Python 2
5 个月前
loading...