#
avx2
Here are 148 public repositories matching this topic...
Visual Studio extension for assembly syntax highlighting and code completion in assembly files and the disassembly window
syntax-highlighting
visual-studio
assembly
x86-64
assembler
avx2
nasm
assembly-language-programming
code-completion
avx512
visual-studio-extension
disassembly
masm
-
Updated
Jul 15, 2021 - Python
oneAPI Deep Neural Network Library (oneDNN)
library
performance
deep-neural-networks
deep-learning
cpp
processor
opencl
x64
x86-64
openmp
avx2
amx
sse41
tbb
aarch64
avx512
intel-openmp-runtime
bfloat16
oneapi
onednn
dpcpp
xe-architecture
-
Updated
Jul 22, 2021 - C++
C++ image processing and machine learning library with using of SIMD: SSE, AVX, AVX-512 for x86/x64, VMX(Altivec) and VSX(Power7) for PowerPC, NEON for ARM.
c-plus-plus
machine-learning
arm
neural-network
neon
image-processing
avx
sse
simd
avx2
sse2
sse41
avx512
powerpc
altivec
vsx
ssse3
simd-library
haar-cascade
lbp
-
Updated
Jul 23, 2021 - C++
Open
Relaxed SIMD support
nemequ
commented
Jul 11, 2021
The WebAssembly people are working on a relaxed SIMD proposal which mostly just provides alternatives for already-implemented functions, but allows for some differences between different implementations (e.g., allowing different results for out-of-range values, NaNs, etc.).
This should be pretty easy issue to resolve; we can mostly just copy the
SIMD Vector Classes for C++
c-plus-plus
cpp
portable
neon
cpp14
parallel
parallel-computing
avx
sse
cpp11
simd
cpp17
avx2
simd-programming
vectorization
avx512
simd-instructions
simd-vector
data-parallel
-
Updated
Jul 8, 2021 - C++
Performance-optimized wheels for TensorFlow (SSE, AVX, FMA, XLA, MPI)
-
Updated
Jul 15, 2019
Native Go version of HighwayHash with optimized assembly implementations on Intel and ARM. Able to process over 10 GB/sec on a single core on Intel CPUs - https://en.wikipedia.org/wiki/HighwayHash
-
Updated
Mar 25, 2021 - Go
C++ SIMD Noise Library
neon
simplex
fractal
sse
simd
noise
cellular
avx2
perlin
perlin-noise
white-noise
noise-library
noise-3d
fastnoise-simd
simplex-noise
fastnoise
-
Updated
Mar 20, 2021 - C++
Fastest Integer Compression
encoding
library
compression
time-series
simd
compressor
integer-compression
intersection
inverted-index
avx2
sse2
-
Updated
Jul 1, 2021 - C
Performance-portable, length-agnostic SIMD with runtime dispatch
-
Updated
Jul 22, 2021 - C++
TurboRLE-Fastest Run Length Encoding
-
Updated
Feb 25, 2020 - C
Storage for my snippets, toy programs, etc.
-
Updated
Feb 18, 2021 - C++
Examples of C# code compiled to GPU by hybridizer
visual-studio
compiler
dotnet
gpu
optimization
parallel
cuda
avx
avx2
vectorization
avx512
hybridizer-essentials
-
Updated
Sep 5, 2019 - C#
Node.js implementation of HighwayHash, Google's fast and strong hash function
-
Updated
Jan 25, 2021 - JavaScript
Fast inference engine for Transformer models
deep-neural-networks
cpp
neon
openmp
parallel-computing
cuda
avx
intrinsics
avx2
neural-machine-translation
opennmt
quantization
gemm
mkl
thrust
transformer-models
onednn
-
Updated
Jul 22, 2021 - C++
Open Source Architecture Code Analyzer
python
hpc
latency
assembly
avx
x86
throughput
avx2
performance-analysis
avx512
out-of-order
critical-path
port-mapping
performance-modeling
arm64v8
sve
in-core
loop-carried-dependency
-
Updated
Jul 21, 2021 - Jupyter Notebook
Agenium Scale vectorization library for CPUs and GPUs
hpc
neon
cuda
avx
simd
avx2
sse2
simd-programming
aarch64
avx512
simd-instructions
simd-library
sse42
rocm
cpp20
sve
neon128
cpp20-library
vectorization-library
-
Updated
Jul 21, 2021 - Python
python
c
openmp
avx
simd
cosmology
astrophysics
galaxies
large-scale-structure
pair-counting
intrinsics
avx2
avx512
sse42
correlation-functions
-
Updated
Jul 22, 2021 - C
TensorFlow wheels built for latest CUDA/CuDNN and enabled performance flags: SSE, AVX, FMA; XLA
-
Updated
Feb 12, 2020
SIMD (SWAR/SSE/SSE4/AVX2/AVX512F/ARM Neon) of Karp-Rabin algorithm's modification
-
Updated
Mar 4, 2020 - C++
Improve this page
Add a description, image, and links to the avx2 topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the avx2 topic, visit your repo's landing page and select "manage topics."
Though we include Boost JSON as a dependency for benchmarking purposes, we do not include it as part of our benchmarks currently.