Skip to content
View coder543's full-sized avatar

Highlights

  • Pro

Block or report coder543

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

204 results for source starred repositories
Clear filter

Foundational model for human-like, expressive TTS

Python 3,729 647 Updated Jul 30, 2024

Cast Mac windows to visionOS

Swift 861 41 Updated Aug 28, 2024

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 6,788 521 Updated Jul 17, 2024

This repository provides the code and model checkpoints of the research paper: Scalable Pre-training of Large Autoregressive Image Models

Python 680 42 Updated May 2, 2024

A fast, local neural text to speech system

C++ 5,838 425 Updated Aug 7, 2024

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 3,778 204 Updated Jun 18, 2024

Enhanced ChatGPT Clone: Features Anthropic, AWS, OpenAI, Assistants API, Azure, Groq, o1, GPT-4o, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, langchain, D…

TypeScript 17,651 2,921 Updated Sep 20, 2024

Instant voice cloning by MIT and MyShell.

Python 28,449 2,782 Updated Aug 21, 2024

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 2,857 206 Updated Jul 27, 2024

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 4,732 384 Updated Aug 10, 2024

llama.cpp with BakLLaVA model describes what does it see

Python 378 45 Updated Nov 8, 2023

A programming framework for agentic AI 🤖

Jupyter Notebook 30,963 4,522 Updated Sep 19, 2024

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 5,884 402 Updated May 29, 2024

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Python 4,471 331 Updated Jul 10, 2024

⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains

TypeScript 16,105 1,240 Updated Sep 20, 2024

WebUI for Fine-Tuning and Self-hosting of Open-Source Large Language Models for Coding

JavaScript 1,552 105 Updated Sep 20, 2024
Python 3,333 143 Updated Feb 25, 2024

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 7,649 447 Updated May 3, 2024

Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.

Jupyter Notebook 4,529 494 Updated Sep 17, 2024

Toy Gaussian Splatting visualization in Unity

C# 2,071 233 Updated Aug 10, 2024

Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"

Python 13,710 1,765 Updated Sep 17, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 11,469 1,206 Updated Aug 21, 2024

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,566 756 Updated Feb 11, 2024

Search images with a text or image query, using Open AI's pretrained CLIP model.

Python 196 21 Updated Jan 15, 2022

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,766 1,049 Updated Aug 15, 2024

Simple, open source, lightweight (< 1 KB) and privacy-friendly web analytics alternative to Google Analytics.

Elixir 19,777 1,053 Updated Sep 20, 2024

Spacelift client and CLI

Go 129 34 Updated Sep 20, 2024

♾ Infisical is the open-source secret management platform: Sync secrets across your team/infrastructure, prevent secret leaks, and manage internal PKI

TypeScript 15,084 866 Updated Sep 20, 2024

Driver for the VL53L1 time-of-flight sensor in pure Rust.

Rust 12 3 Updated May 3, 2024
Next