MEGVII Research

Megvii Research

Continuous Innovation Expands Horizons and Broadens the Mind Leveraging Cutting-Edge Technologies to Create Tangible Value

homepage | bilibili | 知乎 | 中文主页

The following list also includes projects from some of Megvii Research's affiliated organizations:

Foundation Model | Base Detection | 3D team | MegEngine

name	description
ACON	CVPR 2021 Activate or Not: Learning Customized Activation
AGFlow	Learning Optical Flow with Adaptive Graph Reasoning (AGFlow, AAAI-2022).
AnchorDETR	An official implementation of the Anchor DETR.
AngleNAS	Angle-based Search Space Shrinking for Neural Architecture Search
Arch-Net	Arch-Net: Model Distillation for Architecture Agnostic Model Deployment.
AutoAssign	AutoAssign: Differentiable Label Assignment for Dense Object Detection
BBN	BBN: Bilateral-Branch Network with Cumulative Learning for Long-Tailed Visual Recognition.
BEVDepth	BEVDepth is a new 3D object detector with a trustworthy depth estimation.
BasesHomo	"Motion Basis Learning for Unsupervised Deep Homography Estimation with Subspace Projection".
BorderDet	BorderDet: Border Feature for Dense Object Detection
CR-DA-DET	Exploring Categorical Regularization for Domain Adaptive Object Detection (CR-DA-DET).
CREStereo	Official MegEngine implementation of CREStereo(CVPR 2022 Oral).
CamLaserCalibraTool	Extrinsic Calibration of a Camera and 2d Laser
CamOdomCalibraTool	The tool to calibrate extrinsic param between camera and wheel
Co-mining	Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection, AAAI 2021.
CoNR	CoNR: Collaborative Neural Rendering using Anime Character Sheets.
CrowdDetection	Detection in Crowded Scenes: One Proposal, Multiple Predictions
D2C-SR	ECCV2022 "D2C-SR: A Divergence to Convergence Approach for Real-World Image Super-Resolution".
DCLS-SR	"Deep Constrained Least Squares for Blind Image Super-Resolution", CVPR 2022.
DPGN	DPGN: Distribution Propagation Graph Network for Few-shot Learning.
DeFCN	End-to-End Object Detection with Fully Convolutional Network
DenseTeacher	DenseTeacher: Dense Pseudo-Label for Semi-supervised Object Detection
DetNAS	DetNAS: Backbone Search for Object Detection
DisAlign	Distribution Alignment: A Unified Framework for Long-tail Visual Recognition
Docs	MegEngine Documentations
Documentation	MegEngine Official Documentation
DynamicRouting	Learning Dynamic Routing for Semantic Segmentation
ECCV2022-RIFE	Official MegEngine Implementation of Real-Time Intermediate Flow Estimation for Video Frame Interpolation
ECCV2022-RIFE	ECCV2022-Real-Time Intermediate Flow Estimation for Video Frame Interpolation.
ED-Net	A Lightweight Encoder-Decoder Path for Deep Residual Networks.
End-to-end-ASR-Transformer	An end to end ASR Transformer model training repo
FINet	This is the official MegEngine implementation of FINet: Dual Branches Feature Interaction for Partial-to-Partial Point Cloud Registration, AAAI 2022
FQ-ViT	[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer.
FSCE	fewshot object detector, described in our CVPR 2021 paper, FSCE: Few-Shot Object Detection via Contrastive Proposal Encoding.
FSSD_OoD_Detection	Feature Space Singularity for Out-of-Distribution Detection(SafeAI 2021).
FST-Matching	the FST-Matching Model.
FunnelAct	Funnel Activation for Visual Recognition
GFSD	Generalized Few-Shot Object Detection without Forgetting
GeneGAN	Pytorch version of GeneGAN
GyroFlow	The official MegEngine implementation of the ICCV 2021 paper: GyroFlow: Gyroscope-Guided Unsupervised Optical Flow Learning
HDR-Transformer	the ECCV 2022 paper: Ghost-free High Dynamic Range Imaging with Context-aware Transformer.
HINet	HINet: Half Instance Normalization Network for Image Restoration
HomoGAN	[CVPR2022] Unsupervised Homography Estimation with Coplanarity-Aware GAN
Hub	基于旷视研究院领先的深度学习算法，提供满足多业务场景的预训练模型
ICCV2019-LearningToPaint	ICCV2019 - Learning to Paint With Model-based Deep Reinforcement Learning.
ICD	This is the official implementation of the paper "Instance-conditional Knowledge Distillation for Object Detection", based on MegEngine and Pytorch
Iter-E2EDET	"Progressive End-to-End Object Detection in Crowded Scenes".
KD-MVS	Code for ECCV2022 paper 'KD-MVS: Knowledge Distillation Based Self-supervised Learning for Multi-view Stereo'.
KPAFlow	KPA-Flow. Learning Optical Flow with Kernel Patch Attention (CVPR-2022).
LGD	the detection self-distillation framework LGD.
LLA	LLA is the first one-stage detector that surpasses two-stage detectors (e.g., Faster R-CNN) on CrowdHuman dataset
LabelEnc	LabelEnc: A New Intermediate Supervision Method for Object Detection
MABN	Moving Average Batch Normalization
MEMD	Megvii Electric Moped Detector (ONNX based inference).
ML-GCN	Multi-Label Image Recognition with Graph Convolutional Networks, CVPR 2019.
MM2022-ViCoPerceptualHeadGeneration	MM2022 Workshop-Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer.
MOTR	[ECCV2022] MOTR: End-to-End Multiple-Object Tracking with TRansformer
MSCL	[ECCV2022] Motion Sensitive Contrastive Learning for Self-supervised Video Representation.
MSPN	Multi-Stage Pose Network
MegBA	MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment
MegEngine	MegEngine 是一个快速、可拓展、易于使用且支持自动求导的深度学习框架
MegFlow	Efficient ML solution for long-tailed demands
MegPeak	Megpeak is a tool for testing processor peak computation, now support arm, x86 and GPU driven by OpenCL processor.
MegRay	A communication library for deep learning
MegSpot	MegSpot是一款高效、专业、跨平台的图片&视频对比应用
MetaPruning	MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning
Models	采用MegEngine实现的各种主流深度学习模型
NAFNet	The state-of-the-art image restoration model without nonlinear activation functions.
NBNet	NBNet: Noise Basis Learning for Image Denoising with Subspace Projection
NIPS2017-LearningToRunACE	2nd solution of NIPS2017 LearningToRun Competition.
NeRF	NeRF implementation in MegEngine
NeurIPS2021-ML4CO-KIDA	1st Solution For NeurIPS 2021 Competition on ML4CO Dual Task.
OMNet	OMNet: Learning Overlapping Mask for Partial-to-Partial Point Cloud Registration, ICCV 2021, MegEngine implementation
OTA	Optimal Transport Assignment for Object Detection
OdomLaserCalibraTool	Extrinsic Calibration of a Odom and 2d Laser
PCB	CVPR 2022 paper "Relieving Long-tailed Instance Segmentation via Pairwise Class Balance".
PETR	[ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection
PMN	[ACMMM 2022] Learnability Enhancement for Low-light Raw Denoising: Where Paired Real Data Meets Noise Modeling.
PMRID	ECCV2020 - Practical Deep Raw Image Denoising on Mobile Devices
Portraits_Correction	[CVPR2022] Semi-Supervised Wide-Angle Portraits Correction by Multi-Scale Transformer.
RG-SENet_SP-SENet	Delving Deep into Spatial Pooling for Squeeze-and-Excitation Networks.
RLNAS	Neural Architecture Search with Random Labels(RLNAS)
RealFlow	RealFlow: EM-based Realistic Optical Flow Dataset Generation from Videos[ECCV 2022 Oral].
RepLKNet	Official MegEngine implementation of RepLKNet
RepVGG	RepVGG: Making VGG-style ConvNets Great Again (CVPR-2021)
Resource	天元（MegEngine）的周边资源，包括技术文章、活动、最新资讯等。
SOLQ	"SOLQ: Segmenting Objects by Learning Queries", SOLQ is an end-to-end instance segmentation framework with Transformer.
SSQL-ECCV2022	SSQL (Accepted to ECCV2022 oral presentation).
ShuffleNet-Series	ShuffleNet Series by Megvii Research.
SinglePathOneShot	Single Path One-Shot by Megvii Research.
Sobolev_INRs	[ECCV 2022] The official experimental code of "Sobolev Training for Implicit Neural Representations with Approximated Image Derivatives".
Sparsebit	A model compression and acceleration toolbox based pytorch.
TLC	Test-time Local Converter.
TP-LSD	Official implementation of paper "TP-LSD: Tri-points based line segment detector" .
TransMVSNet	(CVPR 2022) TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers.
TreeEnergyLoss	[CVPR2022] Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation.
TreeFilter-Torch	This project provides a cuda implementation for "Learnable Tree Filter for Structure-preserving Feature Transform" (NeurIPS2019) on PyTorch
WeightNet	WeightNet: Revisiting the Design Space of Weight Network
YOLOF	You Only Look One-level Feature (YOLOF), CVPR2021
YOLOX	MegEngine implementation of YOLOX
YOLOX	YOLOX is an anchor-free version of YOLO, with a simpler design but better performance! It aims to bridge the gap between research and industrial communities.
basecls	A codebase & model zoo for pretrained backbone based on MegEngine.
basecore	basecore is a simple repo that provides deep learning frame for MegEngine.
cutlass-bak	modified cutlass
cutlass	CUDA Templates for Linear Algebra Subroutines
cv-master-ex	torch version of instant-ngp, image rendering.
cvpods	The aim of cvpods is to achieve efficient experiments management and smooth tasks-switching
hpargparse	argparse extension for hpman.
hpman	A hyperparameter manager for deep learning experiments.
hpnevergrad	A nevergrad extension for hpman
introduction-neural-3d-reconstruction	Course materials for Introduction to Neural 3D Reconstruction.
juicefs-python	JuiceFS Python SDK.
mdistiller	[CVPR2022] Decoupled Knowledge Distillation
megengine-face-recognition	CV Master , bilibili, megstudio
megfile	Megvii FILE Library - Working with Files in Python.
megvii-pku-dl-course	Homepage for the joint course of Megvii Inc. and Peking University on Deep Learning.
megvii-tsinghua-dl-course	Slides with modifications for a course at Tsinghua University.
mgeconvert	MegEngine到其他框架的转换器
neural-painter	Paint artistic patterns using random neural network.
protoclip	ProtoCLIP in paper Prototypical Contrastive Language Image Pretraining.
pytorch-gym	Deep Deterministic Policy Gradient(DDPG) in bullet Gym using pytorch.
revisitAIRL	[ECCV2022] Revisiting the Critical Factors of Augmentation-Invariant Representation Learning
swin-transformer	Swin-Transformer implementation in MegEngine. This is a showcase for training on GPU with less memory by leveraging MegEngine DTR technique
tf-cpn	Cascade Pyramid Netwrok.
tf-tutorials	Tutorials for deep learning course here.
video_analyst	A series of basic algorithms that are useful for video understanding, including Single Object Tracking (SOT), Video Object Segmentation (VOS) and so on.
zipfls	the ECCV2022 paper: Efficient One Pass Self-distillation with Zipf's Label Smoothing.

If you have questions about this page, please contact zhanghuiying@megvii.com or huangzhewei@megvii.com.

MEGVII Research

Megvii Research

homepage | bilibili | 知乎 | 中文主页

Pinned

Repositories

People

Top languages

Most used topics