quantization

UFund-Me / Qbot

[🔥 updating ...] AI automated quantitative trading bot. Qbot is an AI-oriented quantitative investment platform that aims to realize the potential of AI technologies in quantitative investment. 📃 Online docs: https://ufund-me.github.io/Qbot ✨ qbot-mini: https://github.com/Charmve/iQuant

machine-learning quantitative-finance trademarks quantization funds strategies quantitative-trading pytrade

Updated Jun 12, 2023
Jupyter Notebook

huawei-noah / Pretrained-Language-Model

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

pretrained-models quantization knowledge-distillation model-compression large-scale-distributed

Updated May 21, 2023
Python

guillaumekln / faster-whisper

Faster Whisper transcription with CTranslate2

deep-learning inference transformer speech-recognition openai speech-to-text quantization whisper

Updated Jun 10, 2023
Python

aaron-xichen / pytorch-playground

Base pretrained models and datasets in pytorch (MNIST, SVHN, CIFAR10, CIFAR100, STL10, AlexNet, VGG16, VGG19, ResNet, Inception, SqueezeNet)

pytorch quantization pytorch-tutorial pytorch-tutorials

Updated Nov 22, 2022
Python

666DZY666 / micronet

micronet, a model compression and deployment library. Compression: 1. quantization: quantization-aware training (QAT), high-bit (>2b) (DoReFa / Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference), low-bit (≤2b) / ternary and binary (TWN/BNN/XNOR-Net); post-training quantization (PTQ), 8-bit (TensorRT); 2. pruning: normal, reg…

Updated Oct 6, 2021
Python

neuralmagic / deepsparse

Inference runtime offering GPU-class performance on CPUs and APIs to integrate ML into your application

nlp computer-vision ml inference pytorch machinelearning pruning object-detection pretrained-models quantization auto-ml cpus onnx sparsification cpu-inference-api deepsparse-engine

Updated Jun 13, 2023
Python

quic / aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

open-source machine-learning opensource deep-neural-networks compression deep-learning pruning quantization auto-ml network-quantization network-compression

Updated Jun 12, 2023
Python

tensorflow / model-optimization

A toolkit to optimize ML models for deployment for Keras and TensorFlow, including quantization and pruning.

machine-learning sparsity compression deep-learning tensorflow optimization keras ml pruning quantization model-compression quantized-training quantized-neural-networks quantized-networks

Updated Jun 13, 2023
Python

PaddlePaddle / PaddleSlim

PaddleSlim is an open-source library for deep model compression and architecture search.

sparsity compression detection transformer segmentation pruning quantization nas bert tensorrt distillation ernie yolov5 yolov6 yolov7

Updated Jun 14, 2023
Python

huggingface / optimum

🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools

training optimization intel transformers inference pytorch quantization onnx tflite onnxruntime graphcore habana

Updated Jun 14, 2023
Python

intel / neural-compressor

Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool) aims to provide unified APIs for network compression techniques such as low-precision quantization, sparsity, pruning, and knowledge distillation across different deep learning frameworks, in pursuit of optimal inference performance.

sparsity deep-learning pruning quantization knowledge-distillation auto-tuning low-precision quantization-aware-training post-training-quantization smoothquant

Updated Jun 14, 2023
Python

open-mmlab / mmrazor

OpenMMLab Model Compression Toolbox and Benchmark.

detection pytorch classification segmentation pruning darts quantization nas knowledge-distillation spos autoslim

Updated Jun 14, 2023
Python

htqin / awesome-model-quantization

A list of papers, docs, and code about model quantization. This repo aims to provide information for model quantization research, and we are continuously improving the project. PRs adding works (papers, repositories) that the repo has missed are welcome.

awesome deep-learning quantization binarization model-compression model-acceleration binary-network binarized-neural-networks lightweight-neural-network model-quantization efficient-deep-learning

Updated Jun 14, 2023

openvinotoolkit / training_extensions

Train, Evaluate, Optimize, Deploy Computer Vision Models via OpenVINO™

machine-learning computer-vision deep-learning pytorch semi-supervised-learning image-classification object-detection transfer-learning image-segmentation quantization action-recognition automl incremental-learning anomaly-detection hyper-parameter-optimization self-supervised-learning openvino neural-networks-compression datumaro

Updated Jun 14, 2023
Python

OpenNMT / CTranslate2

Fast inference engine for Transformer models

Updated Jun 13, 2023
C++

Here are 429 public repositories matching this topic...

ymcui / Chinese-LLaMA-Alpaca

nebuly-ai / nebuly

kornelski / pngquant

IntelLabs / distiller

IntelLabs / nlp-architect

UFund-Me / Qbot
