Performance-portable, length-agnostic SIMD with runtime dispatch
Roaring bitmaps in C (and C++), with SIMD (AVX2, AVX-512 and NEON) optimizations: used by Apache Doris, ClickHouse, Redpanda, YDB and StarRocks
Intel Homomorphic Encryption Acceleration Library accelerates modular arithmetic operations used in homomorphic encryption by leveraging AVX512 and IFM52 available on Intel's 3rd Generation Xeon Scala...
A curated list of awesome SIMD frameworks, libraries and software
(REOS) Radar and ElectroOptical Simulation Framework written in Fortran.
The fastest Run-Length-Encoding on the Planet (for x64)
Algorithms for matrix matrix multiplication, dgemm, AVX-256, AVX-512
Benchmark to show which is the fastest memcpy.
DedupBench is a benchmarking tool for data chunking techniques used in data deduplication. DedupBench is designed for extensibility, allowing new chunking techniques to be implemented with minimal add...
Vector Dossier is a CLI tool that statically analyzes vectorization depth of programs and libraries
Running GPGPU-like kernels on CPU with auto-vectorization for SSE/AVX/AVX512 SIMD Architectures