fp8

#Computer Science# A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory ut...

CUDA deep-learning gpu machine-learning Python PyTorch fp8 jax

Python 2.75 k

2 days ago

Azure / MS-AMP

#Computer Science# Microsoft Automatic Mixed Precision Library

amp deep-learning fp8 gpu PyTorch transformer

Python 621

1 year ago

intel / neural-speed

An innovative library for efficient LLM inference via low-bit quantization

cpu fp8 gpu int8 llm-inference sparsity llamacpp

C++ 348

1 year ago

aredden / flux-fp8-api

Flux diffusion model implementation that uses quantized fp8 matmul, with the remaining layers using faster half-precision accumulation, which is ~2x faster on consumer devices.

diffusion flux fp8 PyTorch quantization

Python 280

1 year ago

graphcore-research / jax-scalify

#Large Language Models# JAX Scalify: end-to-end scaled arithmetics

fp8 large-language-models jax low-precision

Python 16

1 year ago

MurrellGroup / Microfloats.jl

Narrow precision floating point types

floating-point fp8

Julia 5

10 days ago

zerfoo / zerfoo

#Computer Science# A modular, accelerator-ready machine learning framework built in Go that speaks float8/16/32/64. Designed with clean architecture, strong typing, and native concurrency for scalable, production-ready ...

autodiff deep-learning distributed-training float16 float8 fp8 Go machine-learning neural-network onnx transformer

Go 4

1 month ago

zsxkib / cog-step-video-t2v

Cog Single GPU Quantized Implementation of Step-Video-T2V

fp8 replicate

Python 1

7 months ago

mukullokhande99 / XR-NPE

Python implementations for multi-precision quantization in computer vision and sensor fusion workloads, targeting the XR-NPE Mixed-Precision SIMD Neural Processing Engine. The code includes visual ine...

fp8 object-detection posit quantization visual-inertial-odometry

Jupyter Notebook 1

1 month ago