Skip to content

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

License

Notifications You must be signed in to change notification settings

zchoi/Awesome-Embodied-Robotics-and-Agent

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 

Repository files navigation

🤖 Awesome Embodied Robotics and Agent Awesome

This is a curated list of "Embodied robotics or agent with Vision-Language Models (VLMs) and Large Language Models (LLMs)" research which is maintained by haonan.

Watch this repository for the latest updates and feel free to raise pull requests if you find some interesting papers!

News🔥

[2025/03/18] Add some popular vision-language action (VLA) models. 🦾
[2024/06/28] Created a new board about agent self-evolutionary research. 🤖
[2024/06/07] Add Mobile-Agent-v2, a mobile device operation assistant with effective navigation via multi-agent collaboration. 🚀
[2024/05/13] Add "Learning Interactive Real-World Simulators"——outstanding paper award in ICLR 2024 🥇.
[2024/04/24] Add "A Survey on Self-Evolution of Large Language Models", a systematic survey on self-evolution in LLMs! 💥
[2024/04/16] Add some CVPR 2024 papers.
[2024/04/15] Add MetaGPT, accepted for oral presentation (top 1.2%) at ICLR 2024, ranking #1 in the LLM-based Agent category. 🚀
[2024/03/13] Add CRADLE, an interesting paper exploring LLM-based agent in Red Dead Redemption II!🎮

Development of Embodied Robotics and Benchmarks

0-video-1.mp4
0-video-2.mp4
0-video-3.mp4

image

  • Video demo and figure from [1] and [2].

Table of Contents 🍃

Methods

Survey

Vision-Language-Action Model

Self-Evolving Agents

Advanced Agent Applications

LLMs with RL or World Model

Planning and Manipulation or Pretraining

Multi-Agent Learning and Coordination

Vision and Language Navigation

Detection

  • DetGPT: Detect What You Need via Reasoning [arXiv 2023]
    Renjie Pi1∗ Jiahui Gao2* Shizhe Diao1∗ Rui Pan1 Hanze Dong1 Jipeng Zhang1 Lewei Yao1 Jianhua Han3 Hang Xu2 Lingpeng Kong2 Tong Zhang1
    1The Hong Kong University of Science and Technology 2The University of Hong Kong 3Shanghai Jiao Tong University

3D Grounding

Interactive Embodied Learning

Rearrangement

Benchmark

Simulator

Others

Acknowledge

[1] Video demo from this project
[2] Figure from this [project][https://robotics-transformer-x.github.io/)

About

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates! 🔥

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published