GitHub Chinese Community

efficient-inference

huawei-noah / Efficient-AI-Backbones

Efficient AI Backbones including GhostNet, TNT and MLP, developed by Huawei Noah's Ark Lab.

convolutional-neural-networks, efficient-inference, imagenet, model-compression, Tensorflow, PyTorch, ghostnet, transformer, pretrained-models, vision-transformer
Python 4.24k
3 months ago

SqueezeAILab / LLMCompiler

#Natural Language Processing# [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling

function-calling, large language model, llm-agent, llm-agents, parallel-function-call, efficient-inference, large-language-models, llama, llama2, llm-framework, natural language processing, transformer
Python 1.7k
1 year ago

snap-research / EfficientFormer

#Computer Science# EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPS 2022]

deep learning, detection, efficient-inference, efficient-neural-networks, PyTorch, semantic-segmentation, transformer, imagenet, transformers
Python 1.05k
2 years ago

huawei-noah / AdderNet

Code for the paper "AdderNet: Do We Really Need Multiplications in Deep Learning?"

PyTorch, imagenet, convolutional-neural-networks, cvpr2020, efficient-inference
Python 959
3 years ago

horseee / DeepCache

[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free

diffusion-models, efficient-inference, model-compression, stable-diffusion
Python 896
1 year ago

VITA-Group / LightGaussian

[NeurIPS 2024 Spotlight] "LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS", Zhiwen Fan, Kevin Wang, Kairun Wen, Zehao Zhu, Dejia Xu, Zhangyang Wang

3d-reconstruction, efficient-inference, gaussian-splatting
Python 706
6 months ago

SqueezeAILab / SqueezeLLM

#Natural Language Processing# [ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization

efficient-inference, large-language-models, large language model, model-compression, natural language processing, post-training-quantization, quantization, text-generation, transformer, llama, localllm
Python 691
10 months ago

Zhen-Dong / Awesome-Quantization-Papers

#Awesome# List of papers related to neural network quantization in recent AI conferences and journals.

quantization, Awesome Lists, papers, neural-networks, model-compression, edge-computing, efficient-inference, diffusion-models, large-language-models
645
3 months ago

liuzhuang13 / slimming

#Computer Science# Learning Efficient Convolutional Networks through Network Slimming, in ICCV 2017.

deep learning, convolutional-neural-networks, efficient-inference
Lua 568
6 years ago

SqueezeAILab / KVQuant

#Natural Language Processing# [NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

compression, efficient-inference, efficient-model, large-language-models, llama, large language model, localllama, localllm, mistral, model-compression, natural language processing, quantization, text-generation, transformer
Python 357
10 months ago

lucidrains / speculative-decoding

#Computer Science# Explorations into some recent techniques surrounding speculative decoding

artificial intelligence, deep learning, efficient-inference, transformers
Python 268
6 months ago

xuyang-liu16 / Awesome-Generation-Acceleration

📚 Collection of awesome generation acceleration resources.

diffusion-models, efficient-deep-learning, efficient-inference, text-to-image, text-to-video, image-generation, video-generation
259
2 months ago

Picovoice / picollm

#Natural Language Processing# On-device LLM Inference Powered by X-Bit Quantization

large language model, compression, efficient-inference, gemma, generative-ai, language-model, language-models, llama, llama2, llama3, mistral, mixtral, model-compression, natural language processing, quantization, self-hosted, llm-inference
Python 247
10 days ago

SYSU-SAIL / SMSR

[CVPR 2021] Exploring Sparsity in Image Super-Resolution for Efficient Inference

super-resolution, sparsity, efficient-inference
Python 239
4 years ago

changlin31 / DS-Net

(CVPR 2021, Oral) Dynamic Slimmable Network

pruning, network-pruning, model-compression, efficient-inference
Python 229
3 years ago

xindongzhang / ELAN

[ECCV2022] Efficient Long-Range Attention Network for Image Super-resolution

efficient-inference, super-resolution, transformer
Python 227
3 years ago

czg1225 / AsyncDiff

[NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising

diffusion-models, distributed-computing, efficient-inference, stable-diffusion, text-to-image, text-to-video
Python 203
4 months ago

liuziwei7 / mobile-id

#Face Recognition# Deep Face Model Compression

computer vision, deep learning, face-recognition, model-compression, efficient-inference
MATLAB 196
7 years ago

cure-lab / DeciWatch

#Computer Science# [ECCV 2022] Official implementation of the paper "DeciWatch: A Simple Baseline for 10x Efficient 2D and 3D Pose Estimation"

3d-pose-estimation, efficient-inference, human-pose-estimation, deep learning, efficiency, efficient-neural-networks, pose-estimation, PyTorch, eccv, eccv2022
Python 180
3 years ago

SimonAytes / SoT

#Large Language Models# Official code repository for Sketch-of-Thought (SoT)

artificial intelligence, efficient-inference, large language model, llm-inference, prompting
Python 120
1 month ago