TY - JOUR
T1 - Automatic analysis of thematic structure in written English
AU - Park, Kwanghyun
AU - Lu, Xiaofei
N1 - Publisher Copyright:
© John Benjamins Publishing Company.
PY - 2015
Y1 - 2015
N2 - This paper proposes and describes a computational system for the automatic analysis of thematic structure, as defined in Systemic Functional Linguistics, in written English. The system takes an English text as input and produces as output an analysis of the thematic structure of each sentence in the text. The system is evaluated using data from The Wall Street Journal section of the Penn Treebank (Marcus et al. 1993) and the British Academic Written English corpus (Gardner & Nesi 2013). An experiment using these data shows that the system achieves a high degree of reliability in regard to both identifying theme-rheme boundaries and determining several of the linguistic properties of the identified themes, including syntactic nodes, theme function, markedness, mood types, and theme roles. To illustrate how the system is used, we describe an example application designed to compare collections of novice and expert academic writing in terms of thematic structure.
AB - This paper proposes and describes a computational system for the automatic analysis of thematic structure, as defined in Systemic Functional Linguistics, in written English. The system takes an English text as input and produces as output an analysis of the thematic structure of each sentence in the text. The system is evaluated using data from The Wall Street Journal section of the Penn Treebank (Marcus et al. 1993) and the British Academic Written English corpus (Gardner & Nesi 2013). An experiment using these data shows that the system achieves a high degree of reliability in regard to both identifying theme-rheme boundaries and determining several of the linguistic properties of the identified themes, including syntactic nodes, theme function, markedness, mood types, and theme roles. To illustrate how the system is used, we describe an example application designed to compare collections of novice and expert academic writing in terms of thematic structure.
UR - http://www.scopus.com/inward/record.url?scp=84926450719&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84926450719&partnerID=8YFLogxK
U2 - 10.1075/ijcl.20.1.04par
DO - 10.1075/ijcl.20.1.04par
M3 - Article
AN - SCOPUS:84926450719
SN - 1384-6655
VL - 20
SP - 81
EP - 101
JO - International Journal of Corpus Linguistics
JF - International Journal of Corpus Linguistics
IS - 1
ER -