#Algorithm Practice#A collection of algorithms and data structures implemented in Java
BLAS-like Library Instantiation Software Framework
High-efficiency floating-point neural network inference operators for mobile, server, and Web
Acceleration package for neural networks on multi-core CPUs
Multi-Threaded FP32 Matrix Multiplication on x86 CPUs
#Computer Science#The HPC toolbox: fused matrix multiplication, convolution, data-parallel strided tensor primitives, OpenMP facilities, SIMD, JIT Assembler, CPU detection, state-of-the-art vectorized BLAS for floats a...
#Computer Science#Introduction to PyTorch, covering tensor initialization, operations, indexing, and reshaping.
#Computer Science#A library and extension that provides objects for scientific computing in PHP.
#Computer Science#[DEPRECATED] Moved to ROCm/rocm-libraries repo
💥 Fast matrix-multiplication as a self-contained Python library – no system dependencies!
Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm
#Computer Science#Sparse matrix formats for linear algebra supporting scientific and machine learning applications
DBCSR: Distributed Block Compressed Sparse Row matrix library
Accelerated General (FP32) Matrix Multiplication from scratch in CUDA
[Experimental] LLVM-accelerated Generic Linear Algebra Subprograms