TY - JOUR
T1 - Seamlessly Unifying Attributes and Items
T2 - Conversational Recommendation for Cold-start Users
AU - Li, Shijun
AU - Lei, Wenqiang
AU - Wu, Qingyun
AU - He, Xiangnan
AU - Jiang, Peng
AU - Chua, Tat Seng
N1 - Funding Information:
S. Li and W. Lei contributed equally to this research. This work is supported by the National Natural Science Foundation of China (Grant No. 61972372) and the National Key Research and Development Program of China (Grant No. 2020AAA0106000). Authors’ addresses: S. Li and X. He, University of Scinece and Technology of China, 443 Huangshan Road, He Fei, China, 230027; emails: [email protected], [email protected]; W. Lei (corresponding author) and T.-S. Chua, National University of Singapore, 13 Computing Drive, National University of Singapore, Singapore, Republic of Singapore, 117417; emails: [email protected], [email protected]; Q. Wu, University of Virginia, Computer Science 85 Engeers.s way, Charlottesville, VA, United States, 22903; email: [email protected]; P. Jiang, Kuaishou Inc., 6 West Shangdi Road, Haidian District, Beijing, China; email: [email protected]. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]. © 2021 Association for Computing Machinery. 1046-8188/2021/08-ART40 $15.00 https://doi.org/10.1145/3446427
Publisher Copyright:
© 2021 Association for Computing Machinery.
PY - 2021/10
Y1 - 2021/10
N2 - Static recommendation methods like collaborative filtering suffer from the inherent limitation of performing real-time personalization for cold-start users. Online recommendation, e.g., multi-armed bandit approach, addresses this limitation by interactively exploring user preference online and pursuing the exploration-exploitation (EE) trade-off. However, existing bandit-based methods model recommendation actions homogeneously. Specifically, they only consider the items as the arms, being incapable of handling the item attributes, which naturally provide interpretable information of user's current demands and can effectively filter out undesired items. In this work, we consider the conversational recommendation for cold-start users, where a system can both ask the attributes from and recommend items to a user interactively. This important scenario was studied in a recent work [54]. However, it employs a hand-crafted function to decide when to ask attributes or make recommendations. Such separate modeling of attributes and items makes the effectiveness of the system highly rely on the choice of the hand-crafted function, thus introducing fragility to the system. To address this limitation, we seamlessly unify attributes and items in the same arm space and achieve their EE trade-offs automatically using the framework of Thompson Sampling. Our Conversational Thompson Sampling (ConTS) model holistically solves all questions in conversational recommendation by choosing the arm with the maximal reward to play. Extensive experiments on three benchmark datasets show that ConTS outperforms the state-of-the-art methods Conversational UCB (ConUCB) [54] and Estimation - Action - Reflection model [27] in both metrics of success rate and average number of conversation turns.
AB - Static recommendation methods like collaborative filtering suffer from the inherent limitation of performing real-time personalization for cold-start users. Online recommendation, e.g., multi-armed bandit approach, addresses this limitation by interactively exploring user preference online and pursuing the exploration-exploitation (EE) trade-off. However, existing bandit-based methods model recommendation actions homogeneously. Specifically, they only consider the items as the arms, being incapable of handling the item attributes, which naturally provide interpretable information of user's current demands and can effectively filter out undesired items. In this work, we consider the conversational recommendation for cold-start users, where a system can both ask the attributes from and recommend items to a user interactively. This important scenario was studied in a recent work [54]. However, it employs a hand-crafted function to decide when to ask attributes or make recommendations. Such separate modeling of attributes and items makes the effectiveness of the system highly rely on the choice of the hand-crafted function, thus introducing fragility to the system. To address this limitation, we seamlessly unify attributes and items in the same arm space and achieve their EE trade-offs automatically using the framework of Thompson Sampling. Our Conversational Thompson Sampling (ConTS) model holistically solves all questions in conversational recommendation by choosing the arm with the maximal reward to play. Extensive experiments on three benchmark datasets show that ConTS outperforms the state-of-the-art methods Conversational UCB (ConUCB) [54] and Estimation - Action - Reflection model [27] in both metrics of success rate and average number of conversation turns.
UR - http://www.scopus.com/inward/record.url?scp=85108536399&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85108536399&partnerID=8YFLogxK
U2 - 10.1145/3446427
DO - 10.1145/3446427
M3 - Article
AN - SCOPUS:85108536399
SN - 1046-8188
VL - 39
JO - ACM Transactions on Information Systems
JF - ACM Transactions on Information Systems
IS - 4
M1 - 40
ER -