Skip to content
View Hydragon516's full-sized avatar
😁
😁

Highlights

  • Pro

Block or report Hydragon516

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Simple Finetuning Starter Code for Segment Anything

Python 130 17 Updated Apr 13, 2023

Python API for Tuya WiFi smart devices using a direct local area network (LAN) connection or the cloud (TuyaCloud API).

Python 953 172 Updated Aug 5, 2024

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Jupyter Notebook 2,104 150 Updated Jun 6, 2024

[ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation

Python 46 6 Updated Sep 1, 2024

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Python 913 27 Updated Jul 31, 2024

[OpenPAR] An open-source framework for Pedestrian Attribute Recognition, based on PyTorch

Python 69 8 Updated Sep 14, 2024

[ECCV'24] Official Implementation of SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance

Python 95 9 Updated Aug 31, 2024
Python 23 4 Updated Sep 14, 2024

[ECCV 2024] SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution

Python 68 4 Updated Sep 26, 2024
Python 97 7 Updated Feb 13, 2023

PyTorch implementation for the paper Don't Look into the Dark: Latent Codes for Pluralistic Image Inpainting (CVPR2024).

Python 11 2 Updated Jul 2, 2024

[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Python 1,368 114 Updated Jul 17, 2024

[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding

Python 795 60 Updated Jul 6, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,327 969 Updated Oct 5, 2024

Universal Monocular Metric Depth Estimation

Python 588 47 Updated Jul 1, 2024

[ECCV 2024] Unified Embedding Alignment for Open-Vocabulary Video Instance Segmentation

Python 30 1 Updated Jul 11, 2024

[arXiv 2024] Improving Unsupervised Video Object Segmentation via Fake Flow Generation

Python 12 1 Updated Aug 18, 2024

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Jupyter Notebook 896 59 Updated Mar 25, 2023

Latte: Latent Diffusion Transformer for Video Generation.

Python 1,653 177 Updated Sep 28, 2024

[ECCV 2024] ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion

Python 25 Updated Aug 6, 2024

[CVPR24] Official Implementation of 'A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing'

Python 114 10 Updated Jun 18, 2024
JavaScript 3 Updated Sep 12, 2024

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

Jupyter Notebook 408 16 Updated Sep 25, 2024

[CVPR 2024] Exploring Orthogonality in Open World Object Detection

Python 33 3 Updated Jun 11, 2024

This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral, Best Student Paper].

Python 155 12 Updated Oct 3, 2024

AuraSR: GAN-based Super-Resolution for real-world

Python 393 31 Updated Jul 31, 2024

[CVPR 2024 Oral] MemSAM: Taming Segment Anything Model for Echocardiography Video Segmentation.

Python 125 11 Updated Aug 1, 2024

[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)

Python 730 43 Updated Apr 5, 2024

[CVPR 2024] Deformable Convolution v4

Python 486 27 Updated May 17, 2024
Next