Skip to content
View fsddl2023's full-sized avatar

Block or report fsddl2023

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 33,520 4,080 Updated Aug 16, 2024

Enforce the output format (JSON Schema, Regex etc) of a language model

Python 1,387 60 Updated Sep 7, 2024

InstructLab Command-Line Interface. Use this to chat with a model and execute the InstructLab workflow to train a model using custom taxonomy data.

Python 784 295 Updated Sep 19, 2024

Multi-modal conversational AI (xRx) system

Python 95 11 Updated Sep 19, 2024

A fast multimodal LLM for real-time voice

Python 853 46 Updated Sep 19, 2024

Docker image with Uvicorn managed by Gunicorn for high-performance FastAPI web applications in Python with performance auto-tuning.

Python 2,687 333 Updated Sep 18, 2024

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,568 224 Updated Aug 20, 2024

Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.

Python 506 73 Updated Jan 9, 2024

👔 A collection of cv and resume templates written in LaTeX. Leave an issue if your language is not supported!

TeX 2,835 594 Updated Aug 22, 2023

A multi-platform CLI for offline transcription of speech recordings utilizing state-of-the-art machine learning models.

Python 11 2 Updated Sep 11, 2024

A GUI tool for offline transcription of speech recordings, including speaker diarization, utilizing state-of-the-art machine learning models.

Python 303 20 Updated Sep 12, 2024

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 11,457 1,203 Updated Aug 21, 2024

Faster Whisper transcription with CTranslate2

Python 11,448 952 Updated Aug 21, 2024

The simplest way to serve AI/ML models in production

Python 882 63 Updated Sep 19, 2024

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Python 1,324 84 Updated Sep 6, 2024
Python 424 54 Updated Sep 17, 2024

A guidance language for controlling large language models.

Jupyter Notebook 18,716 1,032 Updated Sep 18, 2024

Instruct LLMs for flat and nested NER. Fine-tuning Llama and Mistral models for instruction named entity recognition. (Instruction NER)

Python 73 8 Updated May 5, 2024

Large-scale LLM inference engine

Python 944 103 Updated Sep 19, 2024

LLM Frontend for Power Users.

JavaScript 7,537 2,169 Updated Sep 19, 2024

Adding guardrails to large language models.

Python 3,886 291 Updated Sep 19, 2024

NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

Python 3,990 365 Updated Sep 18, 2024

The Security Toolkit for managing Generative AI(especially LLMs) and Supervised Learning processes(Learning and Inference).

Python 19 4 Updated Aug 15, 2024

One click templates for inferencing Language Models

Shell 98 11 Updated Sep 10, 2024

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/

Python 11,244 1,232 Updated Sep 19, 2024

A simple FastAPI Server to run XTTSv2

Python 360 80 Updated Jul 21, 2024

A modern hardware definition language and toolchain based on Python

Python 1,526 168 Updated Sep 19, 2024
Python 102 5 Updated Jun 12, 2024

Berkeley's Spatial Array Generator

Scala 778 160 Updated Aug 14, 2024
Next