fistyee
51 stars written in Python

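Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins, project analysis and self-explanation for Python, C++, and other codebases, PDF/LaTeX paper translation and summarization, parallel querying of multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…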

Python 64,500 7,975 Updated Oct 1, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 31,681 4,711 Updated Oct 2, 2024

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Python 25,398 5,260 Updated Oct 3, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 21,763 2,104 Updated Aug 9, 2024

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Python 9,515 682 Updated Jul 25, 2024

PyTorch code for Vision Transformers training with the Self-Supervised learning method DINO

Python 6,249 905 Updated Jul 3, 2024

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Python 3,997 345 Updated May 6, 2023

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,849 149 Updated Sep 25, 2024

Official PyTorch implementation of "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" (ICLR 2024)

Python 1,557 135 Updated Jan 23, 2024

This may be the simplest implementation of DDPM. You can run Main.py directly to train the UNet on the CIFAR-10 dataset and watch the denoising process.

Python 1,507 159 Updated Apr 24, 2023

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,405 115 Updated Oct 3, 2024

Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"

Python 976 69 Updated Jun 6, 2024

ICCV 2023 Papers: Discover cutting-edge research from ICCV 2023, the leading computer vision conference. Stay updated on the latest in computer vision and deep learning, with code included. ⭐ suppo…

Python 921 42 Updated Sep 1, 2024

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 882 39 Updated Sep 30, 2024

App showcasing multiple real-time diffusion models pipelines with Diffusers

Python 862 101 Updated Jun 21, 2024

Official implementation of the paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Python 848 52 Updated Mar 19, 2024

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Python 630 24 Updated Sep 27, 2024

CAIRI Supervised, Semi- and Self-Supervised Visual Representation Learning Toolbox and Benchmark

Python 617 62 Updated Aug 25, 2024

[CVPR'24] Group Anything with Radiance Fields

Python 374 28 Updated Aug 1, 2024

Video-P2P: Video Editing with Cross-attention Control

Python 374 24 Updated Jul 20, 2024

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high …

Python 301 117 Updated Sep 30, 2024

Long Context Transfer from Language to Vision

Python 299 16 Updated Aug 26, 2024

Official repo for LayoutGPT

Python 285 20 Updated Apr 10, 2024

Code release for Image Sculpting: Precise Object Editing with 3D Geometry Control [CVPR 2024]

Python 271 19 Updated Mar 4, 2024

LiveBench: A Challenging, Contamination-Free LLM Benchmark

Python 246 25 Updated Oct 1, 2024

[ICCV 2023] BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion

Python 242 14 Updated Jul 21, 2024

[ICLR 2024 (Spotlight)] "Frozen Transformers in Language Models are Effective Visual Encoder Layers"

Python 218 6 Updated Jan 17, 2024

Training and Evaluation Code for "Mixture of Volumetric Primitives for Efficient Neural Rendering"

Python 200 17 Updated Jan 6, 2022