Skip to content
View fistyee's full-sized avatar
😄
I may be slow to respond.
😄
I may be slow to respond.

Block or report fistyee

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results
Python 11 Updated Apr 3, 2024

CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark

Python 617 62 Updated Aug 25, 2024

[Survey] Awesome List of Mixup Augmentation and Beyond (https://arxiv.org/abs/2409.05202)

120 10 Updated Sep 18, 2024

[ACL 2024 Best Paper] Deciphering Oracle Bone Language with Diffusion Models

Python 74 4 Updated Sep 29, 2024

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Python 629 24 Updated Sep 27, 2024

[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model

Python 112 1 Updated Aug 5, 2024

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 882 39 Updated Sep 30, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,849 149 Updated Sep 25, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 11,290 965 Updated Oct 3, 2024

Long Context Transfer from Language to Vision

Python 298 16 Updated Aug 26, 2024

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Python 246 25 Updated Oct 1, 2024

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,405 115 Updated Oct 3, 2024

Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation

Python 147 2 Updated Sep 13, 2024
HTML 2 Updated Apr 16, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,763 2,104 Updated Aug 9, 2024

This may be the simplest implement of DDPM. You can directly run Main.py to train the UNet on CIFAR-10 dataset and see the amazing process of denoising.

Python 1,507 159 Updated Apr 24, 2023

Training and Evaluation Code for "Mixture of Volumetric Primitives for Efficient Neural Rendering"

Python 200 17 Updated Jan 6, 2022
Python 5 Updated Feb 28, 2024

test

Python 4 Updated May 5, 2024

[CVPR'24] Group Anything with Radiance Fields

Python 374 28 Updated Aug 1, 2024

[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"

Python 218 6 Updated Jan 17, 2024

Code release for Image Sculpting: Precise Object Editing with 3D Geometry Control [CVPR 2024]

Python 271 19 Updated Mar 4, 2024

App showcasing multiple real-time diffusion models pipelines with Diffusers

Python 862 101 Updated Jun 21, 2024

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 9,515 682 Updated Jul 25, 2024

Official repo for LayoutGPT

Python 285 20 Updated Apr 10, 2024

Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"

Python 976 69 Updated Jun 6, 2024

[NeurIPS 2023] LMC: Large Model Collaboration with Cross-assessment for Training-Free Open-Set Object Recognition

Python 14 2 Updated May 26, 2024

A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。

TypeScript 75,416 58,873 Updated Oct 3, 2024
Next