Performance-portable, length-agnostic SIMD with runtime dispatch
Updated Jun 12, 2024 - C++
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE)
TensorFlow binaries supporting AVX, FMA, SSE
SIMD Vector Classes for C++
Up to 200x Faster Inner Products and Vector Similarity — for Python, JavaScript, Rust, C, and Swift, supporting f64, f32, f16 real & complex, i8, and binary vectors using SIMD for both x86 AVX2 & AVX-512 and Arm NEON & SVE 📐
The Vector Optimized Library of Kernels
A simple C library for compressing lists of integers using binary packing
A C++ library to compress and intersect sorted lists of integers using SIMD instructions
Agenium Scale vectorization library for CPUs and GPUs
High performance algorithms in C#: SIMD/SSE, multi-core and faster
Fast decoder for VByte-compressed integers
Fast random number generators: Vectorized (SIMD) version of xorshift128+
High-performance dictionary coding
UME::SIMD: a library for explicit SIMD vectorization.
A fast implementation of single-pattern substring search using SIMD acceleration.
DSP library for signal processing
Collection of incredibly fast hashmaps
(REOS) Radar and Electro-Optical Simulation Framework written in C++.