A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

3,266 172 Updated Sep 20, 2024
Python 288 31 Updated Sep 19, 2024
Python 479 57 Updated Sep 16, 2024

Source code for Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 103 13 Updated Aug 6, 2024

Open-source calculator for LLM system requirements.

Python 41 7 Updated Jun 18, 2024

Open-source code for the paper: Retrieval Head Mechanistically Explains Long-Context Factuality

Python 135 13 Updated Aug 2, 2024

Official code for the ACL 2024 paper: Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New Languages.

Python 24 Updated May 22, 2024

I-SHEEP: Iterative Self-enHancEmEnt Paradigm of LLMs through Self-Instruct and Self-Assessment

Python 10 2 Updated Sep 10, 2024

Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication

Python 14 Updated Mar 21, 2024

A repo of open resources and information to help people succeed in a CS PhD and a career in AI/NLP

839 72 Updated Aug 26, 2024
Python 168 12 Updated Oct 3, 2023

Open CS Application | Open-Source CS Application

JavaScript 1,923 221 Updated Sep 2, 2024

Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Python 94 5 Updated Jul 6, 2024

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,443 226 Updated Sep 15, 2024

A repo listing papers related to LLM-based agents

Python 977 72 Updated Aug 1, 2024

Knowledge Circuits in Pretrained Transformers

Python 46 1 Updated Sep 18, 2024

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Shell 25,218 3,153 Updated Sep 12, 2024

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 2,303 223 Updated Nov 26, 2023
Python 1,202 163 Updated Sep 19, 2024

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 339 26 Updated Jun 29, 2024

TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients.

Python 1,562 122 Updated Sep 8, 2024

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,354 102 Updated Sep 20, 2024
Python 284 16 Updated Jun 9, 2024

A recipe for online RLHF.

Python 378 43 Updated Sep 20, 2024

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 649 39 Updated Aug 22, 2024

Official repository for the paper "Weak-to-Strong Extrapolation Expedites Alignment"

Python 62 2 Updated Jun 7, 2024

Tutorials on training and testing retrieval-based models (DrQA & DPR)

Python 51 7 Updated Nov 30, 2020

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 468 28 Updated May 20, 2024

Code for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"

Python 97 12 Updated Aug 16, 2024