Publications

Papers and preprints spanning multimodal AI, embodied AI, robot learning, and vision-language-action models.


2026

SlotVLA: Towards Modeling of Object-Relation Representations in Robotic Manipulation

Taisei Hanyu, Nhat Chung, Huy Le, Toan Nguyen, Yuki Ikebe, Anthony Gunderman, Duy Nguyen Ho Minh, Khoa Vo, Tung Kieu, Kashu Yamazaki, Chase Rainwater, Anh Nguyen, Ngan Le

A slot-attention-based VLA framework for compact object-relation representations in robotic manipulation.

ICRA 2026 Accepted

Paper Project

2026

CodeGraphVLP: Code-as-Planner Meets Semantic-Graph State for Non-Markovian Vision-Language-Action Models

Khoa Vo, Sieu Tran, Taisei Hanyu, Yuki Ikebe, Duy Nguyen, Bui Duy Quoc Nghi, Minh Vu, Anthony Gunderman, Chase Rainwater, Anh Nguyen, Ngan Le

A non-Markovian VLA framework that combines persistent semantic-graph state with executable code-as-planner reasoning.

arXiv Preprint

Paper

2026

Rethinking Progression of Memory State in Robotic Manipulation: An Object-Centric Perspective

Nhat Chung, Taisei Hanyu, Toan Nguyen, Huy Le, Frederick Bumgarner, Duy Minh Ho Nguyen, Khoa Vo, Kashu Yamazaki, Chase Rainwater, Tung Kieu, Anh Nguyen, Ngan Le

A memory-aware robotic manipulation benchmark and slot-centric VLA model for non-Markovian manipulation tasks.

Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2026)

Paper Project

2025

Clutter-Resistant Vision-Language-Action Models through Object-Centric and Geometry Grounding

Khoa Vo, Taisei Hanyu, Yuki Ikebe, Trong Thang Pham, Nhat Chung, Minh Nhat Vu, Duy Nguyen Ho Minh, Anh Nguyen, Anthony Gunderman, Chase Rainwater, Ngan Le

A VLA framework that uses object-centric and geometry-grounded views for clutter-resistant robot manipulation.

arXiv Preprint

Paper Project

2025

SolarFormer++: Multi-scale Transformer for Solar PV Profiling and Obstruction Localization for Degradation Mitigation

Esteban Duran, Minh Tran, Malachi Massey, Adrian Gracia, Taisei Hanyu, Anh Tran, Roy McCann, Haitao Liao, Jackson Cothren, Meredith Adkins, Chase Rainwater, Ying Huang, Alan Mantooth, Ngan Le

A multi-scale Transformer approach to solar PV profiling and obstruction localization, aimed at mitigating degradation.

IEEE Transactions on Industry Applications Published

Paper Project

2024

AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation

Taisei Hanyu, Kashu Yamazaki, Minh Tran, Roy A. McCann, Haitao Liao, Chase Rainwater, Meredith Adkins, Jackson Cothren, Ngan Le

A multi-resolution Transformer for semantic segmentation of aerial imagery.

Remote Sensing Editor's Choice

Paper Project

2024

Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation

Kashu Yamazaki, Taisei Hanyu, Khoa Vo, Thang Pham, Minh Tran, Gianfranco Doretto, Anh Nguyen, Ngan Le

Real-time open-vocabulary 3D mapping and queryable scene representation using RGB-D observations.

ICRA 2024 Oral

Paper Project

2023

SolarFormer: Multi-scale Transformer for Solar PV Profiling

Adrian de Luis, Minh Tran, Taisei Hanyu, Anh Tran, Liao Haitao, Roy McCann, Alan Mantooth, Ying Huang, Ngan Le

A multi-scale Transformer model for solar PV profiling from aerial imagery.

arXiv Preprint

Paper Project