Performance-portable, length-agnostic SIMD with runtime dispatch
Updated Mar 14, 2023 - C++
TensorFlow binaries supporting AVX, FMA, SSE
C++ wrappers for SIMD intrinsics and parallelized, optimized mathematical functions (SSE, AVX, AVX512, NEON, SVE)
SIMD Vector Classes for C++
The Vector Optimized Library of Kernels
A simple C library for compressing lists of integers using binary packing
A C++ library to compress and intersect sorted lists of integers using SIMD instructions
Agenium Scale vectorization library for CPUs and GPUs
High performance algorithms in C#: SIMD/SSE, multi-core and faster
Fast decoder for VByte-compressed integers
High-performance dictionary coding
Fast random number generators: Vectorized (SIMD) version of xorshift128+
UME::SIMD: a library for explicit SIMD vectorization
A fast implementation of single-pattern substring search using SIMD acceleration.
Fast differential coding functions (using SIMD instructions)
Fast C header-only library for popcnt, pospopcnt, and set algebraic operations
DSP library for signal processing
Renames C# intrinsics to the more compact C/C++ names used across the industry