Starred repositories
Stable Diffusion web UI
A Gradio web UI for Large Language Models.
Instant voice cloning by MIT and MyShell.
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Modular visual interface for GDB in Python
YOLOv3 in PyTorch > ONNX > CoreML > TFLite
A collaboration friendly studio for NeRFs
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
Stable Diffusion built-in to Blender
A Blender script to procedurally generate 3D spaceships
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
A state-of-the-art-level open visual language model | multimodal pre-trained model
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
Foundational model for human-like, expressive TTS
Python REMote Interface library. Platform independent. In about 100 Kbytes, perfect for your diet.
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Real time transcription with OpenAI Whisper.
A Unified Framework for Surface Reconstruction
Stable Diffusion in TensorFlow / Keras
A Python toolbox for building complex digital hardware
Validated, scalable, community developed variant calling, RNA-seq and small RNA analysis
This repository provides the code and model checkpoints for the research paper "Scalable Pre-training of Large Autoregressive Image Models"