An Open-Source Python3 tool for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conv…

Jupyter Notebook 1,822 178 Updated Aug 19, 2024

PaddlePaddle / PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 43,008 7,717 Updated Oct 2, 2024

yuxiaoxiangyong / Multilateral-Temporal-view-Pyramid-Transformer-for-Video-Inpainting-Detection

Official implementation of Mumpy(BMVC 2024)

2 Updated Jul 23, 2024

facebookresearch / mvit

Code Release for MViTv2 on Image Recognition.

Python 391 46 Updated Sep 9, 2024

taoyang1122 / adapt-image-models

Forked from amazon-science/adapt-image-models

[ICLR'23] AIM: Adapting Image Models for Efficient Video Action Recognition

Python 267 21 Updated Sep 17, 2023

facebookresearch / pytorchvideo

A deep learning library for video understanding research.

Python 3,293 409 Updated Aug 13, 2024

facebookresearch / SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Python 6,542 1,207 Updated Aug 13, 2024

echonoshy / cgft-llm

Practice to LLM.

Jupyter Notebook 355 64 Updated Oct 1, 2024

facebookresearch / mae

PyTorch implementation of MAE https//arxiv.org/abs/2111.06377

Python 7,203 1,202 Updated Jul 23, 2024

SunnyHaze / IML-ViT

Official repository of paper “IML-ViT: Benchmarking Image manipulation localization by Vision Transformer”

Jupyter Notebook 189 23 Updated Sep 28, 2024

zyx0814 / Pichome

一款图片与媒体文件管理功能强大的开源网盘程序

PHP 828 87 Updated Sep 13, 2024

yuxiaoxiangyong / Frequency-Aware-Spatiotemporal-Transformers-for-Video-Inpainting-Detection

Unofficial Implementation for FAST(ICCV 2021)

1 Updated Jul 23, 2024

Innei / Shiro

📜 A minimalist personal website embodying the purity of paper and freshness of snow.

TypeScript 3,375 719 Updated Oct 5, 2024

315386775 / DeepLearing-Interview-Awesome-2024

AIGC-interview/CV-interview/LLMs-interview面试问题与答案集合仓，同时包含工作和科研过程中的新想法、新问题、新资源与新项目

1,597 165 Updated Sep 29, 2024

ShujinW / Deep-Video-Inpainting-Localization

Under construction

Python 10 Updated Nov 20, 2022

ymhzyj / UMMAFormer

[ACM MM'23] UMMAFormer: A Universal Multimodal-adaptive Transformer Framework For Temporal Forgery Localization

Python 46 1 Updated May 16, 2024

hefengbao / jingmo

『京墨』开源的中华文化宝典 APP，诗（词）文（名句）、汉字、成语、词语、歇后语、绕口令、传统节日、传统色、节气、人物等。

Kotlin 1,683 159 Updated Sep 14, 2024

open-mmlab / mmagic

OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…

Jupyter Notebook 6,890 1,055 Updated Aug 6, 2024