Starred repositories
Guideline following Large Language Model for Information Extraction
Python tools for interacting with Wikidata
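Crawls Chinese Wikipedia using a Scrapy-based hierarchical priority queue method, and automatically extracts structured and semi-structured data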
A new markup-based typesetting system that is powerful and easy to learn.
Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.
Scripts for LLM pre-training and fine-tuning (with or without LoRA, DeepSpeed)
Enforce the output format (JSON Schema, regex, etc.) of a language model
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
llama3 implementation one matrix multiplication at a time
An open-source RAG-based tool for chatting with your documents.
[ACL 2024] IEPile: A Large-Scale Information Extraction Corpus
Unsupervised Natural Language Parsing (Tutorial)
The official implementation of Self-Play Fine-Tuning (SPIN)
Robust recipes to align language models with human and AI preferences
[ACL 2024] LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
[ACL 2024] LooGLE: Long Context Evaluation for Long-Context Language Models
Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
Block Transformer: Global-to-Local Language Modeling for Fast Inference (Official Code)
Vim-fork focused on extensibility and usability
Fast and memory-efficient exact attention