TY - GEN
T1 - Automatic class labeling for CiteSeerX
AU - Kashireddy, Surya Dhairya
AU - Gauch, Susan
AU - Billah, Syed Masum
PY - 2013
Y1 - 2013
N2 - The CiteSeerx project at the University of Arkansas uses a browsing interface is based on the Association for Computing Machinery's Computing Classification System (ACM CCS). CCS contains just 369 categories whereas the CiteSeerx database contains over 2 million documents. This results in more than 6500 documents per category, far too many to browse. To address this problem, we are exploring ways to automatically expand the CCS ontology. Previous work has focused on using clustering to automatically identify the new clas-ses. This work focuses on how to label the subclasses in a se-mantically meaningful way to that they can sup-port user browsing. We develop methods based on text mining from the subclass members to extract class la-bels. We evaluate three methods by comparing the suggested labels with human-assigned labels for existing categories.
AB - The CiteSeerx project at the University of Arkansas uses a browsing interface is based on the Association for Computing Machinery's Computing Classification System (ACM CCS). CCS contains just 369 categories whereas the CiteSeerx database contains over 2 million documents. This results in more than 6500 documents per category, far too many to browse. To address this problem, we are exploring ways to automatically expand the CCS ontology. Previous work has focused on using clustering to automatically identify the new clas-ses. This work focuses on how to label the subclasses in a se-mantically meaningful way to that they can sup-port user browsing. We develop methods based on text mining from the subclass members to extract class la-bels. We evaluate three methods by comparing the suggested labels with human-assigned labels for existing categories.
UR - http://www.scopus.com/inward/record.url?scp=84893237840&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84893237840&partnerID=8YFLogxK
U2 - 10.1109/WI-IAT.2013.35
DO - 10.1109/WI-IAT.2013.35
M3 - Conference contribution
AN - SCOPUS:84893237840
SN - 9781479929023
T3 - Proceedings - 2013 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013
SP - 241
EP - 245
BT - Proceedings - 2013 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013
T2 - 2013 12th IEEE/WIC/ACM International Conference on Web Intelligence, WI 2013
Y2 - 17 November 2013 through 20 November 2013
ER -