-
Zhejiang University
- Hangzhou
- www.tianchez.com
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
This repository provides tutorials and implementations for various Generative AI Agent techniques, from basic to advanced. It serves as a comprehensive guide for building intelligent, interactive A…
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
A Framework of Small-scale Large Multimodal Models
Efficient Triton Kernels for LLM Training
Open Source framework for voice and multimodal conversational AI
LlamaIndex is a data framework for your LLM applications
🐫 CAMEL: Finding the Scaling Law of Agents. A multi-agent framework. https://www.camel-ai.org
🤖 MLE-Agent: Your intelligent companion for seamless AI engineering and research. 🔍 Integrate with arxiv and paper with code to provide better code/research plans 🧰 OpenAI, Anthropic, Ollama, etc s…
An open-source RAG-based tool for chatting with your documents.
A compact LLM pretrained in 9 days by using high quality data
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Video surveilance footage analyst powered by GPT-4o
COYO-700M: Large-scale Image-Text Pair Dataset
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
✨✨ MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans?
Python Library to evaluate VLM models' robustness across diverse benchmarks
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!
🚀 基于大语言模型和 RAG 的知识库问答系统。开箱即用、模型中立、灵活编排,支持快速嵌入到第三方业务系统。
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 40+ benchmarks
A suite of multimodal language models that are powerful and efficient
A vector search SQLite extension that runs anywhere!
ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].
A Home Assistant integration & Model to control your smart home using a Local LLM