#大语言模型#vLLM 是一个高效的开源库,用于加速大语言模型推理,通过优化内存管理和分布式处理实现高吞吐量和低延迟。
#计算机科学#Open deep learning compiler stack for cpu, gpu and specialized accelerators
#大语言模型#Simple, scalable AI model deployment on GPU clusters
#计算机科学#A deep learning package for many-body potential energy representation and molecular dynamics
#计算机科学#Large-scale LLM inference engine
stdgpu: Efficient STL-like Data Structures on the GPU
Dockerfiles for the various software layers defined in the ROCm software platform
Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale te...
Abstraction Library for Parallel Kernel Acceleration 🦙
Main repository for QMCPACK, an open-source production level many-body ab initio Quantum Monte Carlo code for computing the electronic structure of atoms, molecules, and solids with full performance p...
Kubernetes (k8s) device plugin to enable registration of AMD GPU to a container cluster
Agenium Scale vectorization library for CPUs and GPUs
HPC solver for nonlinear optimization problems