TY - GEN
T1 - The Manually Annotated Sub-Corpus
T2 - 48th Annual Meeting of the Association for Computational Linguistics, ACL 2010
AU - Ide, Nancy
AU - Baker, Collin
AU - Fellbaum, Christiane
AU - Passonneau, Rebecca Jane
PY - 2010/12/1
Y1 - 2010/12/1
AB - The Manually Annotated Sub-Corpus (MASC) project provides data and annotations to serve as the base for a community-wide annotation effort of a subset of the American National Corpus. The MASC infrastructure enables the incorporation of contributed annotations into a single, usable format that can then be analyzed as it is or ported to any of a variety of other formats. MASC includes data from a much wider variety of genres than existing multiply-annotated corpora of English, and the project is committed to a fully open model of distribution, without restriction, for all data and annotations produced or contributed. As such, MASC is the first large-scale, open, community-based effort to create much-needed language resources for NLP. This paper describes the MASC project, its corpus and annotations, and serves as a call for contributions of data and annotations from the language processing community.
UR - http://www.scopus.com/inward/record.url?scp=84859942549&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84859942549&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84859942549
SN - 9781617388088
T3 - ACL 2010 - 48th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference
SP - 68
EP - 73
BT - ACL 2010 - 48th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference
Y2 - 11 July 2010 through 16 July 2010
ER -