Skip to content
View shjo-april's full-sized avatar
๐ŸŽฏ
Focusing
๐ŸŽฏ
Focusing

Block or report shjo-april

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PixArt-ฮฑ: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Python 2,719 174 Updated Aug 1, 2024

MagicAvatar: Multimodal Avatar Generation and Animation

620 33 Updated Aug 29, 2023

ICCV 2023 ่ฎบๆ–‡ๅ’Œๅผ€ๆบ้กน็›ฎๅˆ้›†

2,489 249 Updated Oct 1, 2023

Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds

Python 1,499 101 Updated Jul 22, 2024
Python 7,655 496 Updated Apr 14, 2024

๐Ÿ”Š Text-Prompted Generative Audio Model

Jupyter Notebook 35,517 4,175 Updated Aug 19, 2024

Image to prompt with BLIP and CLIP

Python 2,662 433 Updated May 15, 2024

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Python 2,401 247 Updated Apr 24, 2024

Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.

Python 725 52 Updated May 10, 2022

OVSegmentor, CVPR23

Python 53 4 Updated Apr 22, 2024

Let us control diffusion models!

Python 29,930 2,702 Updated Feb 25, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 14,873 1,377 Updated Sep 5, 2024

Open-Set Grounded Text-to-Image Generation

Python 1,981 147 Updated Mar 6, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 6,427 661 Updated Aug 12, 2024

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 36,979 3,229 Updated Aug 17, 2024

Official Implementation of "CAT-Seg๐Ÿฑ: Cost Aggregation for Open-Vocabulary Semantic Segmentation"

Python 252 25 Updated Apr 11, 2024

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 46,954 5,557 Updated Sep 18, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 36,576 4,516 Updated Sep 25, 2024

IFSeg: Image-free Semantic Segmentation via Vision-Language Model (CVPR 2023)

Python 80 9 Updated Sep 5, 2023

The pytorch implementation of our CVPR 2023 paper "Conditional Image-to-Video Generation with Latent Flow Diffusion Models"

Python 444 42 Updated Jun 18, 2024

An open source implementation of CLIP.

Python 9,917 957 Updated Aug 19, 2024

An open-source framework for training large multimodal models.

Python 3,682 277 Updated Aug 31, 2024

Ultralytics YOLO11 ๐Ÿš€

Python 29,370 5,768 Updated Oct 2, 2024

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Python 1,285 134 Updated Oct 5, 2023

4 bits quantization of LLaMA using GPTQ

Python 2,986 459 Updated Jul 13, 2024

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and ๐Ÿ”œ video, up to 5x faster than OpenAI CLIP and LLaVA ๐Ÿ–ผ๏ธ & ๐Ÿ–‹๏ธ

Python 1,025 61 Updated Oct 1, 2024

[CVPR 2022] CLIMS: Cross Language Image Matching for Weakly Supervised Semantic Segmentation

Python 121 12 Updated Jun 7, 2024

๐Ÿค— Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 132,865 26,498 Updated Oct 2, 2024

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 24,992 3,230 Updated Jul 23, 2024

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

Jupyter Notebook 7,573 792 Updated Dec 8, 2022
Next