Publication

SlotVLA: Towards Modeling of Object-Relation Representations in Robotic Manipulation

A slot-attention-based VLA framework for compact object-relation representations in robotic manipulation.

ICRA 2026 Accepted 2026

Taisei Hanyu, Nhat Chung, Huy Le, Toan Nguyen, Yuki Ikebe, Anthony Gunderman, Duy Nguyen Ho Minh, Khoa Vo, Tung Kieu, Kashu Yamazaki, Chase Rainwater, Anh Nguyen, Ngan Le

Paper DOI Project

Abstract

SlotVLA introduces object-relation representations for robotic manipulation. The work uses a slot-based visual tokenizer and relation-centric decoding to produce compact object-centric representations for multitask manipulation policies.

The project pairs the model with LIBERO+, a benchmark designed to evaluate fine-grained object-relation reasoning in manipulation tasks with object-centric annotations and temporal tracking.

Abstract

Links