GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

fp8

Website
Wikipedia
https://static.github-zh.com/github_avatars/NVIDIA?size=40
NVIDIA / TransformerEngine

#计算机科学#A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory ut...

CUDA深度学习gpu机器学习PythonPyTorchfp8jax
Python 2.48 k
3 天前
https://static.github-zh.com/github_avatars/Azure?size=40
Azure / MS-AMP

#计算机科学#Microsoft Automatic Mixed Precision Library

amp深度学习fp8gpuPyTorchtransformer
Python 608
9 个月前
https://static.github-zh.com/github_avatars/intel?size=40
intel / neural-speed

An innovative library for efficient LLM inference via low-bit quantization

cpufp8gpuint8llm-inferencesparsityllamacpp
C++ 349
10 个月前
https://static.github-zh.com/github_avatars/aredden?size=40
aredden / flux-fp8-api

Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x faster on consumer devices.

diffusionfluxfp8PyTorchquantization
Python 269
8 个月前
https://static.github-zh.com/github_avatars/graphcore-research?size=40
graphcore-research / jax-scalify

#大语言模型#JAX Scalify: end-to-end scaled arithmetics

fp8大语言模型jaxlow-precision
Python 16
8 个月前
https://static.github-zh.com/github_avatars/zsxkib?size=40
zsxkib / cog-step-video-t2v

Cog Single GPU Quantized Implementation of Step-Video-T2V

fp8replicate
Python 1
4 个月前
https://static.github-zh.com/github_avatars/umangyadav?size=40
umangyadav / py_fp8

FP8 dtypes enumeration in python

fp8
C++ 0
2 年前