-
ShenZhen University
- ShenZhen,China
Highlights
- Pro
Lists (5)
Sort Name ascending (A-Z)
Stars
Rethinking Image Forgery Detection and Localization
This repository contains demos I made with the Transformers library by HuggingFace.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
A Massive Multi-Level Multi-Subject Knowledge Evaluation benchmark
手写文字擦除第1名方案,水印智能消除赛第1名
CnSTD: 基于 PyTorch/MXNet 的 中文/英文 场景文字检测(Scene Text Detection)、数学公式检测(Mathematical Formula Detection, MFD)、篇章分析(Layout Analysis)的Python3 包
Use Deeplabv3+ to achieve a handwriting earser on Chinese papers
An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conv…
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…
Official implementation of Mumpy(BMVC 2024)
Code Release for MViTv2 on Image Recognition.
[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition
A deep learning library for video understanding research.
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Official repository of paper “IML-ViT: Benchmarking Image manipulation localization by Vision Transformer”
Unofficial Implementation for FAST(ICCV 2021)
📜 A minimalist personal website embodying the purity of paper and freshness of snow.
AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓,同时包含工作和科研过程中的新想法、新问题、新资源与新项目
[ACM MM'23] UMMAFormer: A Universal Multimodal-adaptive Transformer Framework For Temporal Forgery Localization
『京墨』开源的中华文化宝典 APP,诗(词)文(名句)、汉字、成语、词语、歇后语、绕口令、传统节日、传统色、节气、人物等。
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…
(NIPS 2022) Rethinking Alignment in Video Super-Resolution Transformers
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Upload a photo of your room to generate your dream room with AI.
[ECCV'2020] STTN: Learning Joint Spatial-Temporal Transformations for Video Inpainting