🧠 Satori: Proactive AR Task Guidance Framework

Satori is a modular framework for developing context-aware augmented reality (AR) task assistants. It enables real-time, step-by-step guidance by integrating egocentric vision, stream-based data pipelines, and language-based reasoning.

Designed for AR devices like Microsoft HoloLens, Satori allows researchers and developers to prototype and deploy intelligent AR systems with reusable components.

📄 Satori: Towards Proactive AR Assistant with Belief-Desire-Intention User Modeling — Accepted to ACM CHI 2025. arXiv

🧭 Features

Stream-based architecture using ptgctl
Modular pipeline system for vision, reasoning, and feedback
Agent abstraction for composing AR task logic
Integration with GPT-4V for multimodal guidance generation
Flexible configuration via YAML

🚀 Getting Started

Requirements

Python 3.9+
PyTorch + torchvision
ptgctl and ptgctl-pipeline
Additional dependencies in requirements.txt

Installation

# Clone the repository
git clone https://github.com/VIDA-NYU/satori-assistance.git
cd satori-assistance

# Create virtual environment (optional but recommended)
python -m venv .venv
source .venv/bin/activate  # on Windows use `.venv\Scripts\activate`

# Install dependencies
pip install -r requirements.txt

Run the Application

python main.py

By default, this launches the main Satori agent using pipeline configurations located in the configs/ folder. You can modify these to define your own tasks and pipeline compositions.

📁 Project Structure

satori-assistance/
├── main.py                     # Main entry point
├── configs/                    # Pipeline and agent configuration files
├── pipelines/                 # Pipeline logic (e.g., belief, desire, guidance)
├── ptgctl_pipeline/           # Stream management and base classes
├── docs/                      # Sphinx documentation
├── requirements.txt           # Python dependencies
└── README.md                  # This file

🔧 Configuration

You can customize the task agent using YAML config files in configs/. These specify:

Pipelines to load (e.g., task control, guidance, vision)
Stream mappings
Optional runtime parameters

Refer to configs/README.md for examples.

📚 Documentation

Full documentation is available at:

📘 Satori Documentation

To build the docs locally:

cd docs
make html

📄 Citation

If you use Satori in your research or applications, please cite our CHI 2025 paper:

@article{li2024satori,
  title={Satori: Towards Proactive AR Assistant with Belief-Desire-Intention User Modeling},
  author={Li, Chenyi and Wu, Guande and Chan, Gromit Yeuk-Yin and Turakhia, Dishita G and Quispe, Sonia Castelo and Li, Dong and Welch, Leslie and Silva, Claudio and Qian, Jing},
  journal={arXiv preprint arXiv:2410.16668},
  year={2024}
}

🧑‍💻 Contributing

We welcome contributions! Please see CONTRIBUTING.md for guidelines and open issues.

🛠️ Maintainers

Developed by VIDA Lab @ NYU
Contact: Guande Wu

📬 License

This project is licensed under the MIT License. See LICENSE for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🧠 Satori: Proactive AR Task Guidance Framework

🧭 Features

🚀 Getting Started

Requirements

Installation

Run the Application

📁 Project Structure

🔧 Configuration

📚 Documentation

📄 Citation

🧑‍💻 Contributing

🛠️ Maintainers

📬 License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
_docs		_docs
configs		configs
docs		docs
pipelines		pipelines
ptgctl_pipeline		ptgctl_pipeline
runtime		runtime
static		static
utils		utils
.gitignore		.gitignore
.gitmodules		.gitmodules
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
main.py		main.py

License

VIDA-NYU/satori-assistance

Folders and files

Latest commit

History

Repository files navigation

🧠 Satori: Proactive AR Task Guidance Framework

🧭 Features

🚀 Getting Started

Requirements

Installation

Run the Application

📁 Project Structure

🔧 Configuration

📚 Documentation

📄 Citation

🧑‍💻 Contributing

🛠️ Maintainers

📬 License

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages