Scalable POMDP Decision-Making Using Circulant Controllers

Kyle Hollins Wray, Kenneth Czuprynski

    Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

    2 Scopus citations

    Abstract

    This paper presents a novel policy representation for partially observable Markov decision processes (POMDPs) called circulant controllers and a provably efficient gradient-based algorithm for them. A formal mathematical description is provided that leverages circulant matrices for the controller's stochastic node transitions. This structure is particularly effective for capturing decision-making patterns found in real-world domains with repeated periodic behaviors that adapt their cycles based on observation. This includes domains such as bipedal walking over varied terrain, pick-and-place tasks in warehouses, and home healthcare monitoring and medicine delivery in household environments. A performant gradient-based algorithm is presented with a detailed theoretical analysis, formally proving the algorithm's improved performance, as well as circulant controllers' structural properties. Experiments on these domains demonstrate that the proposed controller algorithm outperforms other state-of-the-art POMDP controller algorithms. The proposed novel controller approach is demonstrated on an actual robot performing a navigation task in a real household environment.

    Original language: English (US)
    Title of host publication: 2021 IEEE International Conference on Robotics and Automation, ICRA 2021
    Publisher: Institute of Electrical and Electronics Engineers Inc.
    Pages: 6831-6837
    Number of pages: 7
    ISBN (Electronic): 9781728190778
    State: Published - 2021
    Event: 2021 IEEE International Conference on Robotics and Automation, ICRA 2021 - Xi'an, China
    Duration: May 30 2021 - Jun 5 2021

    Publication series

    Name: Proceedings - IEEE International Conference on Robotics and Automation
    Volume: 2021-May
    ISSN (Print): 1050-4729

    Conference

    Conference: 2021 IEEE International Conference on Robotics and Automation, ICRA 2021
    Country/Territory: China
    City: Xi'an
    Period: 5/30/21 - 6/5/21

    All Science Journal Classification (ASJC) codes

    • Software
    • Artificial Intelligence
    • Electrical and Electronic Engineering
    • Control and Systems Engineering
