A Large-Scale Dataset of Interactions Between Weibo Users and Platform-Empowered LLM Agent

  • Shaokui Gu
  • Yongjie Yin
  • Qingyuan Gong
  • Fenghua Tong
  • Yipeng Zhou
  • Qiang Duan
  • Yang Chen

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

We release a large-scale dataset that captures interactions between human users and CommentRobert, an LLM-based social media agent on Weibo. The dataset contains Weibo posts in which users actively mention the LLM agent account @CommentRobert, indicating that the users are interested in interacting with the platform-empowered LLM agent. The dataset contains 557,645 interactions from 304,400 unique users over 17 months. We detail our data collection methodology, user attributes, and content characteristics, underscoring the dataset's value in examining real-world human-LLM agent interactions. Our analysis offers insights into the demographic and behavioral traits of users interested in the selected LLM agent, interaction dynamics between humans and the agent, and linguistic patterns in comments. These interactions provide a unique lens through which to explore how humans perceive, trust, and communicate with LLMs. This dataset enables further research into modeling human intent understanding, improving LLM agent design, and studying the evolution of human-LLM agent relationships. Potential applications also include long-term user engagement prediction and AI-generated comment detection on social platforms. This constructed dataset is available at https://zenodo.org/records/16921462.

Original language: English (US)
Title of host publication: CIKM 2025 - Proceedings of the 34th ACM International Conference on Information and Knowledge Management
Publisher: Association for Computing Machinery, Inc
Pages: 6392-6396
Number of pages: 5
ISBN (Electronic): 9798400720406
DOIs
State: Published - Nov 10 2025
Event: 34th ACM International Conference on Information and Knowledge Management, CIKM 2025 - Seoul, Korea, Republic of
Duration: Nov 10 2025 → Nov 14 2025

Publication series

Name: CIKM 2025 - Proceedings of the 34th ACM International Conference on Information and Knowledge Management

Conference

Conference: 34th ACM International Conference on Information and Knowledge Management, CIKM 2025
Country/Territory: Korea, Republic of
City: Seoul
Period: 11/10/25 → 11/14/25

All Science Journal Classification (ASJC) codes

  • Information Systems and Management
  • Computer Science Applications
  • Information Systems
