Abstract
We predict discourse segment boundaries from linguistic features of utterances, using a corpus of spoken narratives as data. We present two methods for developing segmentation algorithms from training data: hand tuning and machine learning. When multiple types of features are used, results approach human performance on an independent test set (both methods), and using cross-validation (machine learning).
Original language | English (US) |
---|---|
Pages (from-to) | 108-115 |
Number of pages | 8 |
Journal | Proceedings of the Annual Meeting of the Association for Computational Linguistics |
Volume | 1995-June |
State | Published - 1995 |
Event | 33rd Annual Meeting of the Association for Computational Linguistics, ACL 1995 - Cambridge, United States Duration: Jun 26 1995 → Jun 30 1995 |
All Science Journal Classification (ASJC) codes
- Computer Science Applications
- Linguistics and Language
- Language and Linguistics