Skip to content
View VIROBO-15's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report VIROBO-15

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This repository gives the official implementation of Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models (WACV 2025)

Python 25 3 Updated Sep 18, 2024

[MM24] Official codes and datasets for ACM MM24 paper "Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models".

Python 144 4 Updated Sep 13, 2024

Official implementation of the paper "STEREO: Towards Adversarially Robust Concept Erasing from Text-to-Image Generation Models"

15 Updated Sep 7, 2024

[MICCAI 2024] Official code repository of paper titled "BAPLe: Backdoor Attacks on Medical Foundation Models using Prompt Learning" accepted in MICCAI 2024 conference.

Python 49 Updated Sep 20, 2024

Looking 3D: Anomaly Detection with 2D-3D Alignment (CVPR24)

Python 17 3 Updated Jul 26, 2024

[ECCV 2024] Official code repository of paper titled "Efficient 3D-Aware Facial Image Editing Via Attribute-Specific Prompt Learning"

9 Updated Aug 2, 2024
4 Updated May 28, 2024

Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)

Python 2,652 191 Updated Dec 5, 2023

Official repository of paper titled "How Good is my Video LMM? Complex Video Reasoning and Robustness Evaluation Suite for Video-LMMs".

Python 39 2 Updated Aug 23, 2024
Jupyter Notebook 43 4 Updated Sep 4, 2024

Official Implementations of "Mixed-Type Tabular Data Synthesis with Score-based Diffusion in Latent Space""

Python 78 27 Updated Jul 15, 2024

Bilingual Medical Mixture of Experts LLM

24 1 Updated Aug 15, 2024

MobiLlama : Small Language Model tailored for edge devices

Python 587 42 Updated Mar 3, 2024

📜 [ICDAR 2021] "A Deep Deformable Network for Instance Segmentation of Dense and Uneven Layouts in Handwritten Manuscripts", S P Sharan, Sowmya Aitha, Amandeep Kumar, Abhishek Trivedi, Aaron August…

Python 1 Updated Mar 27, 2023

This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 Oral

Python 446 29 Updated Aug 12, 2024

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Python 4,292 221 Updated Jun 14, 2024

[CVPRW 2024] Official repository of paper titled "Learning to Prompt with Text Only Supervision for Vision-Language Models".

Python 82 3 Updated Aug 21, 2024

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

985 65 Updated Sep 20, 2024

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3,176 192 Updated Sep 20, 2024

[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing

Python 408 29 Updated Jul 25, 2024

PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models

Python 236 11 Updated Jan 2, 2024

Are gradient information useful for pruning of LLMs?

Python 35 8 Updated Apr 22, 2024
Python 37 5 Updated Nov 9, 2023

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python 743 37 Updated Jun 2, 2024

Official Repository for "Generalizing to Unseen Domains in Diabetic Retinopathy Classification". (WACV-24)

Python 3 Updated Oct 30, 2023

Code for Ray Conditioning

Jupyter Notebook 27 3 Updated Feb 9, 2024

The open-source tool for building high-quality datasets and computer vision models

Python 8,118 541 Updated Sep 20, 2024

Official implementation of the paper "FLIP: Cross-domain Face Anti-spoofing with Language Guidance". (ICCV 2023)

Python 62 3 Updated Mar 26, 2024

SA2-Net: Scale-aware Attention Network for Microscopic Image Segmentation (BMVC'23 -- Oral)

Python 17 3 Updated Dec 14, 2023

Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)

1,409 138 Updated Sep 2, 2024
Next