Automatic analysis of syntactic complexity in second language writing

Research output: Contribution to journal > Review article > peer-review

531 Scopus citations


We describe a computational system for automatic analysis of syntactic complexity in second language writing using fourteen different measures that have been explored or proposed in studies of second language development. The system takes a written language sample as input and produces fourteen indices of syntactic complexity of the sample based on these measures. The system is designed with advanced second language proficiency research in mind, and is therefore developed and evaluated using college-level second language writing data from the Written English Corpus of Chinese Learners (Wen et al. 2005). Experimental results show that the system achieves very high reliability on unseen test data from the corpus. We illustrate how the system is used in an example application to investigate whether, and to what extent, each of these measures significantly differentiates between proficiency levels.
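The abstract does not list the fourteen measures or say how they are computed. As a rough illustration of the general idea only, the sketch below assumes that counts of production units (words, sentences, clauses, T-units, dependent clauses) have already been obtained from a parsed sample, and derives a few ratio-style indices from them. The counts, the index names, and the function `complexity_indices` are hypothetical and are not taken from the system described here.

```python
# Hypothetical unit counts for one writing sample. In a real pipeline these
# would come from a syntactic parser; the numbers below are made up.
COUNTS = {
    "words": 300,
    "sentences": 15,
    "clauses": 30,
    "t_units": 20,
    "dependent_clauses": 10,
}

# Each index is a ratio of two counts: (numerator key, denominator key).
# The names are illustrative labels, not the system's own measure set.
INDEX_DEFINITIONS = {
    "mean_length_of_sentence": ("words", "sentences"),
    "mean_length_of_t_unit": ("words", "t_units"),
    "mean_length_of_clause": ("words", "clauses"),
    "clauses_per_sentence": ("clauses", "sentences"),
    "clauses_per_t_unit": ("clauses", "t_units"),
    "dependent_clauses_per_clause": ("dependent_clauses", "clauses"),
}


def complexity_indices(counts):
    """Return one ratio per defined index; 0.0 when the denominator is zero."""
    indices = {}
    for name, (numerator_key, denominator_key) in INDEX_DEFINITIONS.items():
        denominator = counts.get(denominator_key, 0)
        numerator = counts.get(numerator_key, 0)
        indices[name] = numerator / denominator if denominator else 0.0
    return indices


if __name__ == "__main__":
    for name, value in complexity_indices(COUNTS).items():
        print(f"{name}: {value:.2f}")
```

Keeping the index definitions in a table separate from the counting step means that further ratios can be added without touching the arithmetic; the difficult part in practice, identifying clauses and T-units reliably in learner text, is outside the scope of this sketch.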

Original language: English (US)
Pages (from-to): 474-496
Number of pages: 23
Journal: International Journal of Corpus Linguistics
Issue number: 4
State: Published - 2010

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Linguistics and Language

