Starred repositories
Related work and background techniques for OpenAI o1
Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping".
LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.
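Similarities: a toolkit for similarity calculation and semantic search. Supports text-to-text, text-to-image, and image-to-image search over hundreds of millions of records; developed in Python 3 and ready to use out of the box.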
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
A curated collection of LLM reasoning and planning resources, including key papers, limitations, benchmarks, and additional learning materials.
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
[CVPR 2024] A world model for autonomous driving.
Building Open-Ended Embodied Agents with Internet-Scale Knowledge
[ICLR 2023] ReAct: Synergizing Reasoning and Acting in Language Models
[NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents
Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024)
[ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
This repo is the official implementation of "MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control"
LaVi-Lab / NaviLLM
Forked from zd11024/NaviLLM
[CVPR 2024] The code for the paper "Towards Learning a Generalist Model for Embodied Navigation"
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Official implementation of the DECKARD Agent from the paper "Do Embodied Agents Dream of Pixelated Sheep?"
Paper list in the survey paper: Toward General-Purpose Robots via Foundation Models: A Survey and Meta-Analysis
[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning, Acting, and Planning in Language Models"
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
[ICML 2024] Improving Factuality and Reasoning in Language Models through Multiagent Debate
Official repository of Learning to Act from Actionless Videos through Dense Correspondences.