Cross-platform, customizable ML solutions for live and streaming media.
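A minimal sketch of running one of MediaPipe's Python solutions (the Hands solution here; the input file name is a placeholder):

import cv2
import mediapipe as mp

# MediaPipe ships task-specific "solutions"; Hands detects hand landmarks.
image = cv2.imread("hand.jpg")  # placeholder input image
with mp.solutions.hands.Hands(static_image_mode=True, max_num_hands=2) as hands:
    # MediaPipe expects RGB frames; OpenCV loads BGR.
    results = hands.process(cv2.cvtColor(image, cv2.COLOR_BGR2RGB))
    if results.multi_hand_landmarks:
        for hand in results.multi_hand_landmarks:
            print(hand.landmark[0])  # wrist landmark (normalized x, y, z)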
Making large AI models cheaper, faster and more accessible
ncnn is a high-performance neural network inference framework optimized for mobile platforms
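A rough sketch of loading and running a model through ncnn's Python bindings (pip install ncnn); the .param/.bin files and the blob names "data"/"output" are placeholders that depend on the exported model:

import numpy as np
import ncnn

net = ncnn.Net()
net.load_param("model.param")  # network structure
net.load_model("model.bin")    # weights

ex = net.create_extractor()
ex.input("data", ncnn.Mat(np.random.rand(3, 224, 224).astype(np.float32)))
ret, out = ex.extract("output")  # returns (status, output Mat)
print(np.array(out).shape)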
Port of OpenAI's Whisper model in C/C++
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
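A minimal sketch of wrapping a PyTorch model with DeepSpeed; the config values are illustrative, and scripts like this are normally launched with the deepspeed CLI launcher rather than plain python:

import torch
import deepspeed

model = torch.nn.Linear(784, 10)
ds_config = {
    "train_batch_size": 32,
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 1},
    "optimizer": {"type": "Adam", "params": {"lr": 1e-3}},
}

# initialize() returns (engine, optimizer, dataloader, lr_scheduler);
# the engine wraps the model and owns backward()/step().
model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model, model_parameters=model.parameters(), config=ds_config
)

x = torch.randn(32, 784, device=model_engine.device).half()
loss = model_engine(x).sum()
model_engine.backward(loss)  # handles loss scaling / gradient accumulation
model_engine.step()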
NVIDIA® TensorRT™, an SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications.
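A sketch of building an engine from an ONNX model with the TensorRT 8.x Python API; "model.onnx" is a placeholder path:

import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
builder = trt.Builder(logger)
network = builder.create_network(
    1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
)
parser = trt.OnnxParser(network, logger)

with open("model.onnx", "rb") as f:
    if not parser.parse(f.read()):
        raise RuntimeError(parser.get_error(0))

config = builder.create_builder_config()
config.set_flag(trt.BuilderFlag.FP16)  # optional: allow FP16 kernels
engine = builder.build_serialized_network(network, config)

with open("model.engine", "wb") as f:
    f.write(engine)  # serialized engine, loaded later by the TensorRT runtime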
Plug-and-play modules to optimize the performance of your AI systems
Hello AI World guide to deploying deep-learning inference networks and deep vision primitives with TensorRT and NVIDIA Jetson.
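A short sketch in the style of the Hello AI World Python examples; "csi://0" assumes a CSI camera attached to the Jetson:

import jetson.inference
import jetson.utils

net = jetson.inference.detectNet("ssd-mobilenet-v2", threshold=0.5)
camera = jetson.utils.videoSource("csi://0")      # or a video file / RTSP URL
display = jetson.utils.videoOutput("display://0")

while display.IsStreaming():
    img = camera.Capture()
    detections = net.Detect(img)  # draws overlays onto img by default
    display.Render(img)
    display.SetStatus("detectNet | {:.0f} FPS".format(net.GetNetworkFPS()))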
Runtime type system for IO decoding/encoding
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
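A sketch of querying a running Triton server with the Python HTTP client; the model name "resnet50" and tensor names "input"/"output" are placeholders that must match the deployed model's configuration:

import numpy as np
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

data = np.random.rand(1, 3, 224, 224).astype(np.float32)
inp = httpclient.InferInput("input", data.shape, "FP32")
inp.set_data_from_numpy(data)

result = client.infer(model_name="resnet50", inputs=[inp])
print(result.as_numpy("output").shape)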
OpenVINO™ Toolkit repository
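A minimal sketch using the OpenVINO Runtime Python API; "model.xml" stands in for an IR file and the input shape is illustrative:

import numpy as np
from openvino.runtime import Core

core = Core()
model = core.read_model("model.xml")           # IR produced by the Model Optimizer
compiled = core.compile_model(model, device_name="CPU")

data = np.random.rand(1, 3, 224, 224).astype(np.float32)
result = compiled([data])[compiled.output(0)]  # one synchronous inference
print(result.shape)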
TNN: a uniform deep learning inference framework for mobile, desktop, and server, developed by Tencent Youtu Lab and Guangying Lab. TNN is distinguished by its cross-platform capability, high performance, model compression, and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and …
An easy to use PyTorch to TensorRT converter
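A sketch of the one-call conversion torch2trt exposes; it traces the model with an example input on a CUDA device (ResNet-18 here is just an example):

import torch
from torch2trt import torch2trt
from torchvision.models import resnet18

model = resnet18(pretrained=True).eval().cuda()
x = torch.randn(1, 3, 224, 224).cuda()

# Conversion traces the model with an example input and returns a TRTModule
# that is called like a regular nn.Module.
model_trt = torch2trt(model, [x], fp16_mode=True)
print(torch.max(torch.abs(model(x) - model_trt(x))))  # conversion error check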
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
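A sketch of an evasion attack with ART: FGSM against a small PyTorch classifier; the model and data are stand-ins for any fitted estimator:

import numpy as np
import torch
from art.estimators.classification import PyTorchClassifier
from art.attacks.evasion import FastGradientMethod

model = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(784, 10))
classifier = PyTorchClassifier(
    model=model,
    loss=torch.nn.CrossEntropyLoss(),
    input_shape=(1, 28, 28),
    nb_classes=10,
    clip_values=(0.0, 1.0),
)

x = np.random.rand(8, 1, 28, 28).astype(np.float32)
attack = FastGradientMethod(estimator=classifier, eps=0.1)
x_adv = attack.generate(x=x)  # perturbed inputs within an eps-ball of x
print(np.abs(x_adv - x).max())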
Pre-trained Deep Learning models and demos (high quality and extremely fast)
TypeDB: a strongly-typed database
LightSeq: A High Performance Library for Sequence Processing and Generation