#
avx512
Here are 83 public repositories matching this topic...
Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
syntax-highlighting
visual-studio
assembly
x86-64
assembler
avx2
nasm
assembly-language-programming
code-completion
avx512
visual-studio-extension
disassembly
masm
-
Updated
Jan 9, 2022 - Python
oneAPI Deep Neural Network Library (oneDNN)
library
performance
deep-neural-networks
deep-learning
cpp
processor
opencl
x64
x86-64
openmp
avx2
amx
sse41
tbb
aarch64
avx512
intel-openmp-runtime
bfloat16
oneapi
onednn
dpcpp
xe-architecture
-
Updated
Jun 8, 2022 - C++
C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512 for x86/x64, VMX(Altivec) and VSX(Power7) for PowerPC, NEON for ARM.
c-plus-plus
machine-learning
arm
neural-network
neon
image-processing
avx
sse
simd
avx2
sse2
sse41
avx512
powerpc
altivec
vsx
ssse3
simd-library
haar-cascade
lbp
-
Updated
Jun 7, 2022 - C++
Performance-portable, length-agnostic SIMD with runtime dispatch
neon
wasm
avx
simd
intrinsics
avx2
simd-programming
avx512
simd-parallelism
simd-instructions
simd-library
sse42
avx-instructions
simd-intrinsics
avx-512
-
Updated
Jun 8, 2022 - C++
lkskstlr
commented
Oct 25, 2021
Hi,
thank you very much for this cool library.
I noticed that simde_mm512_load_ps is missing while simde_mm512_loadu_ps is implemented. However, both seem to exist for Intel intrinsics (see screenshot). Maybe I oversaw one or maybe they do the same anyways?
Thanks
Lukas

cpp
neon
c-plus-plus-11
avx
sse
simd
vectorization
avx512
mathematical-functions
simd-instructions
simd-intrinsics
-
Updated
Jun 5, 2022 - C++
Fast, modern C++ DSP framework, FFT, Sample Rate Conversion, FIR/IIR/Biquad Filters (SSE, AVX, AVX-512, ARM NEON)
audio
cplusplus
dft
cxx
travis-ci
dsp
cpp14
intel
avx
clang
simd
header-only
fast-fourier-transform
cpp17
cplusplus-14
fft
digital-signal-processing
avx512
ser
audio-processing
cplusplus-17
discrete-fourier-transform
-
Updated
May 20, 2022 - C++
SIMD Vector Classes for C++
c-plus-plus
cpp
portable
neon
cpp14
parallel
parallel-computing
avx
sse
cpp11
simd
cpp17
avx2
simd-programming
vectorization
avx512
simd-instructions
simd-vector
data-parallel
-
Updated
May 30, 2022 - C++
Accelerate SHA256 computations in pure Go using AVX512, SHA Extensions for x86 and ARM64 for ARM. On AVX512 it provides an up to 8x improvement (over 3 GB/s per core). SHA Extensions give a performance boost of close to 4x over native.
-
Updated
Jun 17, 2021 - Go
Vectorized SQL for JSON at scale: fast, simple, schemaless
go
json
sql
log
serverless
high-performance
simd
query-engine
avx512
vectorized
schemaless
indexless
avx-512
-
Updated
Jun 7, 2022 - Go
SIMD Library for Evaluating Elementary Functions, vectorized libm and DFT
android
ios
arm
neon
cuda
avx
simd
elementary-functions
sse2
fft
vectorization
math-library
aarch64
avx512
powerpc
vsx
vector-math
s390x
quadruple-precision
sve
-
Updated
Jul 9, 2021 - C
Agenium Scale vectorization library for CPUs and GPUs
hpc
neon
cuda
avx
simd
avx2
sse2
simd-programming
aarch64
avx512
simd-instructions
simd-library
sse42
rocm
cpp20
sve
neon128
cpp20-library
vectorization-library
-
Updated
Oct 21, 2021 - C
Storage for my snippets, toy programs, etc.
-
Updated
Apr 4, 2022 - C++
Open Source Architecture Code Analyzer
python
hpc
latency
assembly
avx
x86
throughput
avx2
performance-analysis
avx512
out-of-order
critical-path
port-mapping
performance-modeling
arm64v8
sve
in-core
loop-carried-dependency
-
Updated
Apr 8, 2022 - Jupyter Notebook
Examples of C# code compiled to GPU by hybridizer
visual-studio
compiler
dotnet
gpu
optimization
parallel
cuda
avx
avx2
vectorization
avx512
hybridizer-essentials
-
Updated
Sep 5, 2019 - C#
SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification
-
Updated
Jan 4, 2022 - C++
python
c
openmp
avx
simd
cosmology
astrophysics
galaxies
large-scale-structure
pair-counting
intrinsics
avx2
avx512
sse42
correlation-functions
-
Updated
Jun 3, 2022 - C
UME::SIMD A library for explicit simd vectorization.
benchmark
cpp
neon
vector
cpp14
avx
cpp11
simd
performance-tuning
cpp17
code-generation
avx2
simd-programming
vectorization
avx512
simd-instructions
altivec
instruction-set-architecture
scalar-types
ume
-
Updated
Jan 19, 2018 - C++
RV: A Unified Region Vectorizer for LLVM
compiler
llvm
openmp
simd
avx2
vectorization
avx512
compiler-optimization
arm-neon
loop-optimization
spmd
outer-loops
function-vectorizer
-
Updated
Apr 19, 2022 - C++
Improve this page
Add a description, image, and links to the avx512 topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the avx512 topic, visit your repo's landing page and select "manage topics."
Though we include Boost JSON as a dependency for benchmarking purposes, we do not include it as part of our benchmarks currently.