Skip to content
View daswer123's full-sized avatar
😼
Learning and practicing
😼
Learning and practicing

Block or report daswer123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Automatically finds all installed Steam, Epic and Ubisoft games with their respective DLC-related DLL locations on the user's computer, parses SteamCMD, Steam Store and Epic Games Store for user-se…

C# 3,715 183 Updated Aug 20, 2024

An API to transcribe audio with OpenAI's Whisper Large v3!

Python 167 22 Updated Aug 21, 2024

Tag manager and captioner for image datasets

Python 667 31 Updated Aug 4, 2024
Python 1,356 93 Updated Sep 16, 2024

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 3,779 204 Updated Jun 18, 2024

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Python 11,371 851 Updated Sep 18, 2024

A ruby gem to liberate content from Microsoft Word documents

Ruby 1,464 156 Updated May 22, 2024

Agentic components of the Llama Stack APIs

Python 3,238 321 Updated Sep 19, 2024

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 2,413 285 Updated Aug 15, 2024

[ECCV2024 Oral] Clearer anytime frame interpolation & Manipulated interpolation of anything

Python 212 13 Updated Aug 13, 2024

Flowframes Windows GUI for video interpolation using DAIN (NCNN) or RIFE (CUDA/NCNN)

Python 1,465 114 Updated Sep 6, 2024

Multi-Platform Package Manager for Stable Diffusion

C# 4,367 281 Updated Sep 18, 2024

[Siggraph Asia 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"

Python 296 19 Updated Sep 11, 2024

Простой нормализатор текстов перед синтезом речи

Python 19 1 Updated May 13, 2024

🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning

Python 119 24 Updated Jul 15, 2024

Automatic1111 serverless worker.

Dockerfile 76 107 Updated May 18, 2024

Bring portraits to life!

Python 11,782 1,229 Updated Sep 6, 2024

Простой расстановщик ударений с обработкой омографов

Python 88 8 Updated Aug 30, 2024

Understand Human Behavior to Align True Needs

Python 3,278 289 Updated Jul 20, 2024

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 5,286 543 Updated Jul 3, 2024

Kolors Team

Python 3,553 228 Updated Sep 4, 2024

Unofficial implementation of NVIDIA P-Flow TTS paper

Python 210 30 Updated Jul 1, 2024

Unofficial implementation of NVIDIA P-Flow TTS paper

Python 1 Updated Feb 24, 2024

High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

Python 1,611 130 Updated Sep 11, 2024

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Python 2,538 352 Updated Aug 10, 2024

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Python 4,726 384 Updated Aug 10, 2024

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,450 198 Updated Aug 1, 2024

LlamaIndex is a data framework for your LLM applications

Python 35,483 5,006 Updated Sep 19, 2024

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective…

Python 18,279 2,376 Updated Aug 28, 2024
Next