#

avx-512

https://static.github-zh.com/github_avatars/RoaringBitmap?size=40

Roaring bitmaps in C (and C++), with SIMD (AVX2, AVX-512 and NEON) optimizations: used by Apache Doris, ClickHouse, Redpanda, YDB and StarRocks

C 1.69 k
8 天前
simdutf/simdutf
https://static.github-zh.com/github_avatars/simdutf?size=40

Unicode routines (UTF8, UTF16, UTF32) and Base64: billions of characters per second using SSE2, AVX2, NEON, AVX-512, RISC-V Vector Extension, LoongArch64, POWER. Part of Node.js, WebKit/Safari, Ladybi...

C++ 1.51 k
5 天前
https://static.github-zh.com/github_avatars/aff3ct?size=40

Portable wrapper for SIMD and vector instructions written in C++11. Compatible with NEON, SSE, AVX, AVX-512 and SVE (length specific).

C++ 508
3 个月前
https://static.github-zh.com/github_avatars/IntelLabs?size=40

Intel Homomorphic Encryption Acceleration Library accelerates modular arithmetic operations used in homomorphic encryption by leveraging AVX512 and IFM52 available on Intel's 3rd Generation Xeon Scala...

C++ 245
2 个月前
https://static.github-zh.com/github_avatars/awesome-simd?size=40

A curated list of awesome SIMD frameworks, libraries and software

203
1 年前
https://static.github-zh.com/github_avatars/swojtasiak?size=40

A general purpose machine code manipulation library for x86-32 (IA-32) and x86-64 (AMD64) architectures (Assembler, Disassembler, Library).

C 91
2 年前
https://static.github-zh.com/github_avatars/simdutf?size=40

Fast C++ function "is_utf8": checks if the input is valid UTF-8. Made of a single source file. Optimized for ARM NEON, x64 SSE, AVX2 and AVX-512.

C++ 67
1 年前
https://static.github-zh.com/github_avatars/nidud?size=40
Assembly 60
11 天前
https://static.github-zh.com/github_avatars/twest820?size=40

AVX-512 documentation beyond what Intel provides

55
2 年前
https://static.github-zh.com/github_avatars/rainerzufalldererste?size=40
C 37
6 个月前
https://static.github-zh.com/github_avatars/romz-pl?size=40

Algorithms for matrix matrix multiplication, dgemm, AVX-256, AVX-512

C++ 19
8 个月前
https://static.github-zh.com/github_avatars/MamarezaAlipour?size=40
C++ 16
2 个月前
https://static.github-zh.com/github_avatars/jonicho?size=40

A generic and efficient SIMD implementation of MSB Radix Sort with separate key and payload datastreams that supports arbitrary key and payload data types written in C++ accompanied by a bachelor's th...

C++ 15
8 个月前
https://static.github-zh.com/github_avatars/ammarfaizi2?size=40

Benchmark to show which is the fastest memcpy.

Assembly 12
5 年前
https://static.github-zh.com/github_avatars/UWASL?size=40

DedupBench is a benchmarking tool for data chunking techniques used in data deduplication. DedupBench is designed for extensibility, allowing new chunking techniques to be implemented with minimal add...

C++ 10
1 个月前
https://static.github-zh.com/github_avatars/ashvardanian?size=40

Vector Dossier is a CLI tool that statically analyzes vectorization depth of programs and libraries

Jupyter Notebook 10
8 个月前
https://static.github-zh.com/github_avatars/quasilyte?size=40

Utility that was used to generate initial Go AVX-512 encoder test suite.

Assembly 9
6 年前
https://static.github-zh.com/github_avatars/tugrul512bit?size=40

Running GPGPU-like kernels on CPU with auto-vectorization for SSE/AVX/AVX512 SIMD Architectures

C++ 9
2 年前
loading...
Website
Wikipedia