#大语言模型#A high-throughput and memory-efficient inference and serving engine for LLMs
#计算机科学#Open deep learning compiler stack for cpu, gpu and specialized accelerators
#大语言模型#Simple, scalable AI model deployment on GPU clusters
#计算机科学#A deep learning package for many-body potential energy representation and molecular dynamics
#计算机科学#Large-scale LLM inference engine
stdgpu: Efficient STL-like Data Structures on the GPU
Dockerfiles for the various software layers defined in the ROCm software platform
Abstraction Library for Parallel Kernel Acceleration 🦙
Main repository for QMCPACK, an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids with full performance p...
Agenium Scale vectorization library for CPUs and GPUs
Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster
Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale te...
HPC solver for nonlinear optimization problems
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm