xipq

Leo xipq

🐈

3 followers · 46 following

Beijing

Stars

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

3,266 172 Updated Sep 20, 2024

zhentingqi / rStar

Python 288 31 Updated Sep 19, 2024

trotsky1997 / MathBlackBox

Python 479 57 Updated Sep 16, 2024

YuxiXie / MCTS-DPO

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Jupyter Notebook 103 13 Updated Aug 6, 2024

manuelescobar-dev / LLM-System-Requirements

Open-source calculator for LLM system requirements.

Python 41 7 Updated Jun 18, 2024

nightdessert / Retrieval_Head

Open-source code for the paper: Retrieval Head Mechanistically Explains Long-Context Factuality

Python 135 13 Updated Aug 2, 2024

aqweteddy / ChatVector

Official code for the ACL 2024 paper: Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New Languages.

Python 24 Updated May 22, 2024

multimodal-art-projection / I-SHEEP

I-SHEEP: Iterative Self-enHancEmEnt Paradigm of LLMs through Self-Instruct and Self-Assessment

Python 10 2 Updated Sep 10, 2024

yinzhangyue / EoT

Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication

Python 14 Updated Mar 21, 2024

zhangxy-2019 / Self-Alignment-for-Factuality

2 Updated Jul 20, 2024

zhijing-jin / nlp-phd-global-equality

A repo of open resources & information to help people succeed in a CS PhD & a career in AI / NLP

839 72 Updated Aug 26, 2024

dinobby / ReConcile

Python 168 12 Updated Oct 3, 2023

opencsapp / opencsapp.github.io

Open CS Application | 开源CS申请 (open-source CS application)

JavaScript 1,923 221 Updated Sep 2, 2024

deepcs233 / Visual-CoT

Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Python 94 5 Updated Jul 6, 2024

tatsu-lab / alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,443 226 Updated Sep 15, 2024

AGI-Edgerunners / LLM-Agents-Papers

A repo that lists papers related to LLM-based agents

Python 977 72 Updated Aug 1, 2024

zjunlp / KnowledgeCircuits

Knowledge Circuits in Pretrained Transformers

Python 46 1 Updated Sep 18, 2024

OpenBMB / ChatDev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Shell 25,218 3,153 Updated Sep 12, 2024

noahshinn / reflexion

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Python 2,303 223 Updated Nov 26, 2023

allenai / open-instruct

Python 1,202 163 Updated Sep 19, 2024

princeton-nlp / LESS

[ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning

Jupyter Notebook 339 26 Updated Jun 29, 2024

zou-group / textgrad

TextGrad: Automatic "Differentiation" via Text -- using large language models to backpropagate textual gradients.

Python 1,562 122 Updated Sep 8, 2024

EvolvingLMMs-Lab / lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Python 1,354 102 Updated Sep 20, 2024

Re-Align / URIAL

Python 284 16 Updated Jun 9, 2024

RLHFlow / Online-RLHF

A recipe for online RLHF.

Python 378 43 Updated Sep 20, 2024

princeton-nlp / SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 649 39 Updated Aug 22, 2024

chujiezheng / LLM-Extrapolation

Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"

Python 62 2 Updated Jun 7, 2024

efficientqa / retrieval-based-baselines

Tutorials on training and testing retrieval-based models (DrQA & DPR)

Python 51 7 Updated Nov 30, 2020

hkust-nlp / deita

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 468 28 Updated May 20, 2024

chanchimin / RQ-RAG

Code for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"

Python 97 12 Updated Aug 16, 2024

Lists (6)

Camels

IR

Mechanistic Interpretability

NAMs

RL / Self-Evolve

RLHF (discontinued)
