Taisei Hanyu
News Projects Publications CV

Projects

Projects

Selected work in multimodal AI, embodied AI, robot learning, and vision-language-action models.

ICRA 2026 Accepted 2026

SlotVLA

A slot-attention-based VLA framework for compact object-relation representations in robotic manipulation.

  • Robotics
  • VLA
  • Object-Centric Learning
AAAI 2026 2026

LIBERO-Mem

A non-Markovian manipulation benchmark and slot-centric VLA framework for memory-aware robot policies.

  • Robotics
  • Memory
  • VLA
Under Review 2025

OBEYED-VLA

A VLA framework that separates perceptual grounding from action reasoning for clutter-resistant robot manipulation.

  • Robotics
  • VLA
  • Object Grounding
ICRA 2024 Oral 2024

Open-Fusion

Real-time open-vocabulary 3D mapping and queryable scene representation from RGB-D observations.

  • Robotics
  • 3D Mapping
  • Open Vocabulary
Remote Sensing 2024 2024

AerialFormer

A multi-resolution Transformer architecture for semantic segmentation in aerial imagery.

  • Aerial Imagery
  • Segmentation
  • Transformers
Preprint 2023

SolarFormer

A multi-scale Transformer model for solar photovoltaic profiling from aerial imagery.

  • Aerial Imagery
  • Solar PV
  • Segmentation

© 2026 Taisei Hanyu

EmailAICV ProfileSemantic ScholarGitHubLinkedInXCV