TY - GEN
T1 - Real time search user behavior
AU - Jansen, Bernard J.
AU - Campbell, Gerry
AU - Gregg, Matthew
PY - 2010
Y1 - 2010
N2 - Real time search is an increasingly important area of information seeking on the Web. In this research, we analyze 1,005,296 user interactions with a real time search engine over a 190 day period. We investigate aggregate usage of the search engine, such as number of users, queries, and terms. We also investigate the structure of queries and terms submitted by these users. The results are compared to Web searching on traditional search engines. Results show that 60% of the traffic comes from the engine's application program interface, indicating that real time search is heavily leveraged by other applications. Of the queries, 30% were unique (used only once in the entire dataset). The most frequent query accounted for 0.003% of the query set. Less than 8% of the terms were unique. The most frequently used terms accounted for only 0.03% of the total terms. Concerning search topics, the most used terms dealt with technology, entertainment, and politics, reflecting both the temporal nature of the queries and, perhaps, an early adopter user-based. Sexual queries were quite low, relative to traditional Web search. Searchers of real time content often repeat queries overtime, perhaps indicating long term interest in a topic. We discuss the implications for search engines and information providers as real time content increasingly enters the main stream.
AB - Real time search is an increasingly important area of information seeking on the Web. In this research, we analyze 1,005,296 user interactions with a real time search engine over a 190 day period. We investigate aggregate usage of the search engine, such as number of users, queries, and terms. We also investigate the structure of queries and terms submitted by these users. The results are compared to Web searching on traditional search engines. Results show that 60% of the traffic comes from the engine's application program interface, indicating that real time search is heavily leveraged by other applications. Of the queries, 30% were unique (used only once in the entire dataset). The most frequent query accounted for 0.003% of the query set. Less than 8% of the terms were unique. The most frequently used terms accounted for only 0.03% of the total terms. Concerning search topics, the most used terms dealt with technology, entertainment, and politics, reflecting both the temporal nature of the queries and, perhaps, an early adopter user-based. Sexual queries were quite low, relative to traditional Web search. Searchers of real time content often repeat queries overtime, perhaps indicating long term interest in a topic. We discuss the implications for search engines and information providers as real time content increasingly enters the main stream.
UR - http://www.scopus.com/inward/record.url?scp=77953100508&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77953100508&partnerID=8YFLogxK
U2 - 10.1145/1753846.1754086
DO - 10.1145/1753846.1754086
M3 - Conference contribution
AN - SCOPUS:77953100508
SN - 9781605589312
T3 - Conference on Human Factors in Computing Systems - Proceedings
SP - 3961
EP - 3966
BT - CHI 2010 - The 28th Annual CHI Conference on Human Factors in Computing Systems, Conference Proceedings and Extended Abstracts
T2 - 28th Annual CHI Conference on Human Factors in Computing Systems, CHI 2010
Y2 - 10 April 2010 through 15 April 2010
ER -