TY - GEN
T1 - Query hidden attributes in social networks
AU - Nazi, Azade
AU - Thirumuruganathan, Saravanan
AU - Hristidis, Vagelis
AU - Zhang, Nan
AU - Shaban, Khaled
AU - Das, Gautam
PY - 2015/1/26
Y1 - 2015/1/26
N2 - Micro blogs and collaborative content sites such as Twitter and Amazon are popular among millions of users who generate huge numbers of tweets, posts, and reviews every day. Despite their popularity, these sites only provide rudimentary mechanisms to navigate their sites, programmatically or through a browser, like a keyword search interface or a get-neighbors (e.g., Friends) interface. Many interesting queries cannot be directly answered by any of these interfaces, e.g., Find Twitter users in Los Angeles that have tweeted the word 'diabetes' in the last year. Note that the Twitter programming interface does not allow conditions on the user's home location. In this paper, we introduce the novel problem of querying hidden attributes in micro blogs and collaborative content sites by leveraging the existing search mechanisms offered by those sites. We model these data sources as heterogeneous graphs and their two key access interfaces, Local Search and Content Search, which search through keywords and neighbors respectively. We show which of these two approaches is better for which types of hidden attribute searches. We conduct experiments on Twitter, Amazon, and Rate MDs to evaluate the performance of the search approaches.
AB - Micro blogs and collaborative content sites such as Twitter and Amazon are popular among millions of users who generate huge numbers of tweets, posts, and reviews every day. Despite their popularity, these sites only provide rudimentary mechanisms to navigate their sites, programmatically or through a browser, like a keyword search interface or a get-neighbors (e.g., Friends) interface. Many interesting queries cannot be directly answered by any of these interfaces, e.g., Find Twitter users in Los Angeles that have tweeted the word 'diabetes' in the last year. Note that the Twitter programming interface does not allow conditions on the user's home location. In this paper, we introduce the novel problem of querying hidden attributes in micro blogs and collaborative content sites by leveraging the existing search mechanisms offered by those sites. We model these data sources as heterogeneous graphs and their two key access interfaces, Local Search and Content Search, which search through keywords and neighbors respectively. We show which of these two approaches is better for which types of hidden attribute searches. We conduct experiments on Twitter, Amazon, and Rate MDs to evaluate the performance of the search approaches.
UR - http://www.scopus.com/inward/record.url?scp=84936886989&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84936886989&partnerID=8YFLogxK
U2 - 10.1109/ICDMW.2014.113
DO - 10.1109/ICDMW.2014.113
M3 - Conference contribution
T3 - IEEE International Conference on Data Mining Workshops, ICDMW
SP - 886
EP - 891
BT - Proceedings - 14th IEEE International Conference on Data Mining Workshops, ICDMW 2014
A2 - Zhou, Zhi-Hua
A2 - Wang, Wei
A2 - Kumar, Ravi
A2 - Toivonen, Hannu
A2 - Pei, Jian
A2 - Zhexue Huang, Joshua
A2 - Wu, Xindong
PB - IEEE Computer Society
T2 - 14th IEEE International Conference on Data Mining Workshops, ICDMW 2014
Y2 - 14 December 2014
ER -