Sasha: Creative Goal-Oriented Reasoning in Smart Homes with Large Language Models

Cited by: 7
Authors
King, Evan [1]
Yu, Haoxiang [1]
Lee, Sangsu [1]
Julien, Christine [1]
Affiliations
[1] Univ Texas Austin, Austin, TX 78712 USA
Source
PROCEEDINGS OF THE ACM ON INTERACTIVE MOBILE WEARABLE AND UBIQUITOUS TECHNOLOGIES-IMWUT | 2024, Vol. 8, No. 1
Funding
National Science Foundation (USA)
Keywords
smart environments; pervasive computing; ambient intelligence; large language models; USERS;
DOI
10.1145/3643505
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Smart home assistants function best when user commands are direct and well-specified (e.g., "turn on the kitchen light") or when a hard-coded routine specifies the response. In more natural communication, however, human speech is unconstrained, often describing goals (e.g., "make it cozy in here" or "help me save energy") rather than indicating specific target devices and actions to take on those devices. Current systems fail to understand these under-specified commands since they cannot reason about devices and settings as they relate to human situations. We introduce large language models (LLMs) to this problem space, exploring their use for controlling devices and creating automation routines in response to under-specified user commands in smart homes. We empirically study the baseline quality and failure modes of LLM-created action plans with a survey of age-diverse users. We find that LLMs can reason creatively to achieve challenging goals, but they experience patterns of failure that diminish their usefulness. We address these gaps with Sasha, a smarter smart home assistant. Sasha responds to loosely-constrained commands like "make it cozy" or "help me sleep better" by executing plans to achieve user goals, e.g., setting a mood with available devices or devising automation routines. We implement and evaluate Sasha in a hands-on user study, showing the capabilities and limitations of LLM-driven smart homes when faced with unconstrained user-generated scenarios.
Pages: 38