Stars
A simpler site generator. Transforms a directory of templates (of varying types) into HTML.
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
Fast and memory-efficient exact attention
This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
Curated list of project-based tutorials
Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022)
Audio Dataset for training CLAP and other models
Stable diffusion for real-time music generation
This repository contains demos I made with the Transformers library by HuggingFace.
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Useful resources for Mongolian NLP
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.
A new markup-based typesetting system that is powerful and easy to learn.
A fluent API to FFMPEG (http://www.ffmpeg.org)
Headless TypeScript ORM with a head. Runs on Node, Bun and Deno. Lives on the Edge and yes, it's a JavaScript ORM too 😅
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
A smarter cd command. Supports all major shells.
AI-Powered Photos App for the Decentralized Web 🌈💎✨
A fast directory-first photo gallery website, with rich UI, optimized for running on low resource servers (especially on raspberry pi)