Benchmarking Large Language Model Reasoning in Indoor Robot Navigation

Balci, Emirhan; Sarigul, Mehmet; Ata, Baris

Benchmarking Large Language Model Reasoning in Indoor Robot Navigation

dc.authorid	Sarıgül, Mehmet/0000-0001-7323-6864
dc.contributor.author	Balci, Emirhan
dc.contributor.author	Sarigul, Mehmet
dc.contributor.author	Ata, Baris
dc.date.accessioned	2026-02-27T07:33:30Z
dc.date.available	2026-02-27T07:33:30Z
dc.date.issued	2025
dc.description	33rd Conference on Signal Processing and Communications Applications-SIU-Annual
dc.description.abstract	This study evaluates the performance of state-of-the-art text-based generative large language models in indoor robot navigation planning, focusing on object, spatial, and common-sense reasoning-centric instructions. Three scenes from the Matterport3D dataset were selected, along with corresponding instruction sequences and routes. Object-labeled semantic maps were generated using the RGB-D images and camera poses of the scenes. The instructions were provided to the models, and the generated robot codes were executed on a mobile robot within the selected scenes. The routes followed by the robot, which detected objects through the semantic map, were recorded. The findings indicate that while the models successfully executed object and spatial-based instructions, some models struggled with those requiring common-sense reasoning. This study aims to contribute to robotics research by providing insights into the navigation planning capabilities of language models.
dc.identifier.doi	10.1109/SIU66497.2025.11111749
dc.identifier.isbn	979-8-3315-6656-2; 979-8-3315-6655-5
dc.identifier.issn	2165-0608
dc.identifier.uri	http://dx.doi.org/10.1109/SIU66497.2025.11111749
dc.identifier.uri	https://hdl.handle.net/20.500.14669/4614
dc.identifier.wos	WOS:001575462500002
dc.indekslendigikaynak	Web of Science
dc.language.iso	tr
dc.publisher	IEEE
dc.relation.ispartof	2025 33rd Signal Processing and Communications Applications Conference, Siu
dc.relation.ispartofseries	Signal Processing and Communications Applications Conference
dc.relation.publicationcategory	Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı
dc.rights	info:eu-repo/semantics/closedAccess
dc.snmz	KA_20260302
dc.subject	Large Language Models
dc.subject	Robotics
dc.subject	Navigation
dc.subject	Prompt Engineering
dc.title	Benchmarking Large Language Model Reasoning in Indoor Robot Navigation
dc.type	Proceedings Paper

Koleksiyon

WoS İndeksli Yayınlar Koleksiyonu

Benchmarking Large Language Model Reasoning in Indoor Robot Navigation

Dosyalar

Koleksiyon