Open Source

I am dedicated to open source, which I believe is fundamental to the sustainable development of the AI community. Here are some open source projects I've contributed to or maintained, spanning areas from computer vision to deep learning frameworks.

Agentic Paper Reading Tool: Resophy

Read research papers faster with AI. Resophy is an HTML-based AI paper reader with AI Translation & Analysis, Daily arXiv Recommendations, and Vibe Coding Oriented design.

CodeCookbook

Cookbook to Craft Good Code

In this guide, we'll dive into the essentials of crafting great code. We'll cover everything from naming things clearly to tools that make coding better and easier.

MMOCR

OpenMMLab Text Detection, Recognition and Understanding Toolbox.

Scene Text Recognition

Scene Text Recognition Recommendations

A long-maintained project that tracks the latest papers, datasets, algorithms, and SOTA results for scene text recognition.

OCR-SAM

Combines MMOCR with Segment Anything and Stable Diffusion to automatically detect, recognize, and segment text instances, with support for downstream tasks such as text removal and text inpainting.

Efficient Deep Learning

A collection of resources and techniques for efficient deep learning, covering model compression, pruning, quantization, and knowledge distillation.

Text Recognition on Cross Domain Datasets

Improved text recognition algorithms across different text domains, such as scene text, handwriting, and documents, in both Chinese and English.

Structured Dreambooth LoRA

Dreambooth (LoRA) with a well-organized code structure, naively adapted from Diffusers.