Benchmarking Large Language Model Reasoning in Indoor Robot Navigation

Balci, Emirhan; Sarigul, Mehmet; Ata, Baris

Benchmarking Large Language Model Reasoning in Indoor Robot Navigation

Tarih

2025

Yazarlar

Balci, Emirhan

Sarigul, Mehmet

Ata, Baris

Yayıncı

IEEE

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

This study evaluates the performance of state-of-the-art text-based generative large language models in indoor robot navigation planning, focusing on object, spatial, and common-sense reasoning-centric instructions. Three scenes from the Matterport3D dataset were selected, along with corresponding instruction sequences and routes. Object-labeled semantic maps were generated using the RGB-D images and camera poses of the scenes. The instructions were provided to the models, and the generated robot codes were executed on a mobile robot within the selected scenes. The routes followed by the robot, which detected objects through the semantic map, were recorded. The findings indicate that while the models successfully executed object and spatial-based instructions, some models struggled with those requiring common-sense reasoning. This study aims to contribute to robotics research by providing insights into the navigation planning capabilities of language models.

Açıklama

33rd Conference on Signal Processing and Communications Applications-SIU-Annual

Anahtar Kelimeler

Large Language Models, Robotics, Navigation, Prompt Engineering

Kaynak

2025 33rd Signal Processing and Communications Applications Conference, Siu

Bağlantı

http://dx.doi.org/10.1109/SIU66497.2025.11111749
https://hdl.handle.net/20.500.14669/4614

Koleksiyon

WoS İndeksli Yayınlar Koleksiyonu

Detaylı Öğe Kaydı

Benchmarking Large Language Model Reasoning in Indoor Robot Navigation

Tarih

Yazarlar

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

Erişim Hakkı

Özet

Açıklama

Anahtar Kelimeler

Kaynak

WoS Q Değeri

Scopus Q Değeri

Cilt

Sayı

Künye

Bağlantı

Koleksiyon