Continuous Innovation Expands Horizons and Broadens the Mind Leveraging Cutting-Edge Technologies to Create Tangible Value
The following list also includes projects from some of Megvii Research's affiliated organizations:
Foundation Model | Base Detection | 3D team | MegEngine
| name | description |
|---|---|
| 4K-Face | 4K-Face: A Dataset with Huge Scale-variance Faces |
| AAAI2023-PVD | Official Implementation of PVD:One is All: Bridging the Gap Between Neural Radiance Fields Architectures with Progressive Volume Distillation |
| ACON | None |
| AGFlow | Learning Optical Flow with Adaptive Graph Reasoning (AGFlow, AAAI-2022) |
| AnchorDETR | An official implementation of the Anchor DETR. |
| AngleNAS | None |
| Arch-Net | Arch-Net: Model Distillation for Architecture Agnostic Model Deployment |
| AutoAssign | Pytorch implementation of "AutoAssign: Differentiable Label Assignment for Dense Object Detection" |
| BBN | The official PyTorch implementation of paper BBN: Bilateral-Branch Network with Cumulative Learning for Long-Tailed Visual Recognition |
| BEVDepth | Official code for BEVDepth. |
| BEVStereo | Official code for BEVStereo |
| BasesHomo | The official PyTorch implementation of the paper "Motion Basis Learning for Unsupervised Deep Homography Estimation with Subspace Projection". |
| BorderDet | BorderDet: Border Feature for Dense Object Detection(ECCV2020 Oral) |
| CADDM | Official implementation of ID-unaware Deepfake Detection Model |
| CR-DA-DET | The official PyTorch implementation of paper Exploring Categorical Regularization for Domain Adaptive Object Detection (CR-DA-DET) |
| CREStereo | Official MegEngine implementation of CREStereo(CVPR 2022 Oral). |
| CVPR2023-DMVFN | CVPR2023 (highlight) - A Dynamic Multi-Scale Voxel Flow Network for Video Prediction |
| CVPR2023-UniDistill | UniDistill: A Universal Cross-Modality Knowledge Distillation Framework for 3D Object Detection in Bird's-Eye View |
| CamLaserCalibraTool | Extrinsic Calibration of a Camera and 2d Laser |
| CamOdomCalibraTool | The tool to calibrate extrinsic param between camera and wheel |
| Co-mining | Co-mining: Self-Supervised Learning for Sparsely Annotated Object Detection, AAAI 2021. |
| CoNR | Official implementation of CoNR: Collaborative Neural Rendering using Anime Character Sheets |
| CrowdDetection | None |
| D2C-SR | Official MegEngine implementation of ECCV2022 "D2C-SR: A Divergence to Convergence Approach for Real-World Image Super-Resolution". |
| DCLS-SR | Official PyTorch implementation of the paper "Deep Constrained Least Squares for Blind Image Super-Resolution", CVPR 2022. |
| DPGN | This repository contains the official implementation of DPGN: Distribution Propagation Graph Network for Few-shot Learning. |
| DVN | None |
| DeFCN | End-to-End Object Detection with Fully Convolutional Network |
| DenseTeacher | DenseTeacher: Dense Pseudo-Label for Semi-supervised Object Detection |
| DetNAS | None |
| DisAlign | Implementation of "Distribution Alignment: A Unified Framework for Long-tail Visual Recognition"(CVPR 2021) |
| DiverseBranchBlock | None |
| Docs | MegEngine Documentations |
| Documentation | MegEngine Official Documentation |
| DynamicRouting | Learning Dynamic Routing for Semantic Segmentation |
| ECCV2022-RIFE | Official MegEngine Implementation of Real-Time Intermediate Flow Estimation for Video Frame Interpolation |
| ECCV2022-RIFE | ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation |
| ED-Net | PyTorch implementation of A Lightweight Encoder-Decoder Path for Deep Residual Networks. |
| End-to-end-ASR-Transformer | An end to end ASR Transformer model training repo |
| FINet | This is the official MegEngine implementation of FINet: Dual Branches Feature Interaction for Partial-to-Partial Point Cloud Registration, AAAI 2022 |
| FQ-ViT | [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer |
| FSCE | None |
| FSSD_OoD_Detection | Feature Space Singularity for Out-of-Distribution Detection. (SafeAI 2021) |
| FST-Matching | Official implementation of the FST-Matching Model. |
| FullMatch | Official implementation of FullMatch (CVPR2023) |
| FunnelAct | None |
| GFSD | This project provides an implementation for "Generalized Few-Shot Object Detection without Forgetting" (CVPR2021) on PyTorch. |
| GeneGAN | Pytorch version of GeneGAN |
| GyroFlow | The official MegEngine implementation of the ICCV 2021 paper: GyroFlow: Gyroscope-Guided Unsupervised Optical Flow Learning |
| HDR-Transformer | The official MegEngine implementation of the ECCV 2022 paper: Ghost-free High Dynamic Range Imaging with Context-aware Transformer |
| HINet | None |
| HomoGAN | This is the official implementation of HomoGAN, CVPR2022 |
| Hub | 基于旷视研究院领先的深度学习算法,提供满足多业务场景的预训练模型 |
| ICCV2019-LearningToPaint | ICCV2019 - Learning to Paint With Model-based Deep Reinforcement Learning |
| ICD | This is the official implementation of the paper "Instance-conditional Knowledge Distillation for Object Detection", based on MegEngine and Pytorch. |
| Inference-Demo | 推理样例 |
| Iter-E2EDET | Official implementation of the paper "Progressive End-to-End Object Detection in Crowded Scenes" |
| KD-MVS | Code for ECCV2022 paper 'KD-MVS: Knowledge Distillation Based Self-supervised Learning for Multi-view Stereo' |
| KPAFlow | PyTorch implementation of KPA-Flow. Learning Optical Flow with Kernel Patch Attention (CVPR-2022) |
| LBHomo | This is the official PyTorch implementation of Semi-supervised Deep Large-baseline Homography Estimation with Progressive Equivalence Constraint, AAAI 2023 |
| LGD | Official Implementation of the detection self-distillation framework LGD. |
| LLA | Official implementation of our paper "LLA: Loss-aware Label Assignment for Dense Pedestrian Detection" in Pytorch. |
| LabelEnc | None |
| MABN | None |
| MEMD | Megvii Electric Moped Detector (ONNX based inference) |
| ML-GCN | PyTorch implementation of Multi-Label Image Recognition with Graph Convolutional Networks, CVPR 2019. |
| MM2022-ViCoPerceptualHeadGeneration | MM2022 Workshop-Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer |
| MOTR | [ECCV2022] MOTR: End-to-End Multiple-Object Tracking with TRansformer |
| MOTRv2 | [CVPR2023] MOTRv2: Bootstrapping End-to-End Multi-Object Tracking by Pretrained Object Detectors |
| MSCL | [ECCV2022] Motion Sensitive Contrastive Learning for Self-supervised Video Representation |
| MSPN | Multi-Stage Pose Network |
| MegBA | MegBA: A GPU-Based Distributed Library for Large-Scale Bundle Adjustment |
| MegCC | MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器 |
| MegCat | A Deep Learning Project about cats. |
| MegDiffusion | MegEngine implementation of Diffusion Models. |
| MegEngine-Benchmark | None |
| MegEngine-DMVFN | None |
| MegEngine | MegEngine 是一个快速、可拓展、易于使用且支持自动求导的深度学习框架 |
| MegFlow | Efficient ML solution for long-tailed demands. |
| MegPeak | None |
| MegRay | A communication library for deep learning |
| MegSpot | MegSpot是一款高效、专业、跨平台的图片&视频对比应用 |
| Megvii-BaseDetection | You are welcomed to join us! |
| MetaPruning | None |
| Models | 采用MegEngine实现的各种主流深度学习模型 |
| NAFNet | The state-of-the-art image restoration model without nonlinear activation functions. |
| NBNet | NBNet: Noise Basis Learning for Image Denoising with Subspace Projection |
| NIPS2017-LearningToRunACE | 2nd place solution of NIPS2017 LearningToRun Competition. |
| NeRF | NeRF implementation in MegEngine |
| NeurIPS2021-ML4CO-KIDA | 1st Solution For NeurIPS 2021 Competition on ML4CO Dual Task |
| OMNet | OMNet: Learning Overlapping Mask for Partial-to-Partial Point Cloud Registration, ICCV 2021, MegEngine implementation. |
| OTA | Official implementation of our CVPR2021 paper "OTA: Optimal Transport Assignment for Object Detection" in Pytorch. |
| OccDepth | Maybe the first academic open work on stereo 3D SSC method with vision-only input. |
| OdomLaserCalibraTool | Extrinsic Calibration of a Odom and 2d Laser |
| PCB | Official code for CVPR 2022 paper "Relieving Long-tailed Instance Segmentation via Pairwise Class Balance". |
| PETR | [ECCV2022] PETR: Position Embedding Transformation for Multi-View 3D Object Detection |
| PMN | [ACMMM 2022 Best Paper Runner-Up Award] Learnability Enhancement for Low-light Raw Denoising: Where Paired Real Data Meets Noise Modeling |
| PMRID | ECCV2020 - Practical Deep Raw Image Denoising on Mobile Devices |
| Portraits_Correction | None |
| RG-SENet_SP-SENet | PyTorch implementation of Delving Deep into Spatial Pooling for Squeeze-and-Excitation Networks. |
| RLNAS | None |
| RealFlow | The official implementation of the ECCV 2022 Oral paper: RealFlow: EM-based Realistic Optical Flow Dataset Generation from Videos |
| RepLKNet | Official MegEngine implementation of RepLKNet |
| RepVGG | None |
| Resource | None |
| RevCol | Official Code of Paper "Reversible Column Networks" |
| SMP | None |
| SOLQ | "SOLQ: Segmenting Objects by Learning Queries", SOLQ is an end-to-end instance segmentation framework with Transformer. |
| SSQL-ECCV2022 | PyTorch implementation of SSQL (Accepted to ECCV2022 oral presentation) |
| ShuffleNet-Series | None |
| SimpleDG | This is the training and test code for ECCV2022 workshop NICO challenge |
| SinglePathOneShot | None |
| Sobolev_INRs | [ECCV 2022] The official experimental code of "Sobolev Training for Implicit Neural Representations with Approximated Image Derivatives" |
| Sparsebit | A model compression and acceleration toolbox based on pytorch. |
| TLC | Test-time Local Converter |
| TP-LSD | Official implementation of paper "TP-LSD: Tri-points based line segment detector" . |
| TPS-CVPR2023 | None |
| TransMVSNet | (CVPR 2022) TransMVSNet: Global Context-aware Multi-view Stereo Network with Transformers. |
| Transfer-Learning-Library | Transfer Learning Library for Domain Adaptation, Task Adaptation, and Domain Generalization |
| TreeEnergyLoss | [CVPR2022] Tree Energy Loss: Towards Sparsely Annotated Semantic Segmentation |
| TreeFilter-Torch | Learnable Tree Filter for Structure-preserving Feature Transform |
| US3L-CVPR2023 | PyTorch implementation of US3L (Accepted to CVPR2023) |
| WeightNet | None |
| YOLOF | None |
| YOLOX | MegEngine implementation of YOLOX |
| YOLOX | YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/ |
| awesome-megengine | Awesome Resources about MegEngine |
| basecls | A codebase & model zoo for pretrained backbone based on MegEngine. |
| basecls | A codebase & model zoo for pretrained backbone based on MegEngine. |
| basecore | basecore is a simple repo that provides deep learning frame for MegEngine. |
| basedet | An object detection codebase based on MegEngine. |
| cheat_sheet_for_pytorch_immigrant | 一份给从 PyTorch 迁移过来的用户的小抄 |
| cpp_redis | C++11 Lightweight Redis client: async, thread-safe, no dependency, pipelining, multi-platform |
| cpuinfo | CPU INFOrmation library (x86/x86-64/ARM/ARM64, Linux/Windows/Android/macOS/iOS) |
| cudnn-frontend | None |
| cutlass-bak | modified cutlass |
| cutlass | CUDA Templates for Linear Algebra Subroutines |
| cv-master-ex | torch version of instant-ngp, image rendering |
| cvpods | All-in-one Toolbox for Computer Vision Research. |
| examples | A set of examples around MegEngine |
| flatbuffers | FlatBuffers: Memory Efficient Serialization Library |
| gflags | The gflags package contains a C++ library that implements commandline flags processing. It includes built-in support for standard types such as string and the ability to define flags in the source file in which they are used. Online documentation available at: |
| hpargparse | argparse extension for hpman |
| hpman | A hyperparameter manager for deep learning experiments. |
| hpnevergrad | A nevergrad extension for hpman |
| introduction-neural-3d-reconstruction | Course materials for Introduction to Neural 3D Reconstruction |
| invis | invisible megengine API |
| juicefs-python | JuiceFS Python SDK |
| mdistiller | The official implementation of [CVPR2022] Decoupled Knowledge Distillation https://arxiv.org/abs/2203.08679 |
| megengine-face-recognition | None |
| megenginelite-rs | None |
| megfile | Megvii FILE Library - Working with Files in Python |
| megvii-pku-dl-course | Homepage for the joint course of Megvii Inc. and Peking University on Deep Learning. |
| megvii-tsinghua-dl-course | Slides with modifications for a course at Tsinghua University. |
| mgeconvert | MegEngine到其他框架的转换器 |
| midout | Reduce binary size by removing code blocks |
| mperf-libpfm4 | forked from https://sourceforge.net/p/perfmon2/libpfm4/ci/master/tree/ |
| mperf | mperf是一个面向移动/嵌入式平台的算子性能调优工具箱 |
| neural-painter | Paint artistic patterns using random neural network. |
| oneDNN | oneAPI Deep Neural Network Library (oneDNN) |
| protoclip | Official pytorch implementation of ProtoCLIP in paper Prototypical Contrastive Language Image Pretraining |
| pydata-sphinx-theme | Bootstrap-based sphinx theme from the PyData community |
| pytorch-gym | Implementation of the Deep Deterministic Policy Gradient(DDPG) in bullet Gym using pytorch |
| revisitAIRL | [ECCV2022] Revisiting the Critical Factors of Augmentation-Invariant Representation Learning |
| storage | provide Checkpoint for users. |
| swin-transformer | Swin-Transformer implementation in MegEngine. This is a showcase for training on GPU with less memory by leveraging MegEngine DTR technique. |
| tf-cpn | Cascade Pyramid Netwrok |
| tf-tutorials | Tutorials for deep learning course here: |
| torch2mge | None |
| video_analyst | A series of basic algorithms that are useful for video understanding, including Single Object Tracking (SOT), Video Object Segmentation (VOS) and so on. |
| xopr | Experimental Operator Library for MegEngine |
| zipfls | This repo is the official megengine implementation of the ECCV2022 paper: Efficient One Pass Self-distillation with Zipf's Label Smoothing. |