Scene-Aware AR Assistant
Fabian Blatter
Bachelor's Thesis, March 2024
Supervisors: Manuel Braunschweiler, Yukun Dai, Dr. Fabio Zünd, Prof. Dr. Bob Sumner
Abstract
Virtual assistants are currently available on almost every platform, however they generally are only disembodied voices, with no awareness of the user’s surroundings. Using Augmented Reality (AR) and Natural Language Processing (NLP) technologies, this thesis describes the implementation of an application that lets you have a conversation with an AR assistant. The assistant can interact with its surroundings by looking out the window and telling you the weather, setting reminders or sitting down when requested. We employ the language model GPT-4 to generate responses consisting of fitting sets of actions. With the help of a weather forecast API the assistant is able to answer questions about current data. We conduct a quantitative user study to assess the application’s usability, believability and feeling of co-presence. Further, we draw promising conclusions about the potentials and limitations of using a Large Language Model (LLM) in this context in future works.