TY - JOUR
T1 - Prediction of a hotspot pattern in keyword search results
AU - Gao, Jie
AU - Radeva, Axinia
AU - Shen, Chuyao
AU - Wang, Shiqi
AU - Wang, Qianbo
AU - Passonneau, Rebecca J.
N1 - Funding Information:
The research reported here was supported by the Intelligence Advanced Research Projects Activity (IARPA) via Department of Defense U.S. Army Research Laboratory (DOD/ARL), contract number W911NF-12-C-0012 . The U.S. Government is authorized to reproduce and distribute reprints for governmental purposes notwithstanding any copyright annotation thereon. Disclaimer: The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, expressed or implied, of IARPA, DOD/ARL, or the U.S. Government.
Publisher Copyright:
© 2018 Elsevier Ltd
PY - 2018/3
Y1 - 2018/3
N2 - This paper identifies and models a phenomenon observed across low-resource languages in keyword search results from speech retrieval systems where the speech recognition has high error rate, due to very limited training data. High confidence correct detections (HCCDs) of keywords are rare, yet often succeed one another closely in time. We refer to these close sequences of HCCDs as keyword hotspots. The ability to predict keyword hotspots could support speech retrieval, and provide new insights into the behavior of speech recognition systems. We treat hotspot prediction as a binary classification task on all word-sized time intervals in an audio file of a telephone conversation, using prosodic features as predictors. Rare events that follow this pattern are often modeled as a self-exciting point process (SEPP), meaning the occurrence of a rare event excites a following one. To label successive points in time as occurring within a hotspot or not, we fit a SEPP function to the distribution of HCCDs in the keyword search output. Two major learning challenges are that the size of the positive class is very small, and the training and test data have dissimilar distributions. To address these challenges, we develop a novel data selection framework that chooses training data with good generalization properties. Results exhibit superior generalization performance.
AB - This paper identifies and models a phenomenon observed across low-resource languages in keyword search results from speech retrieval systems where the speech recognition has high error rate, due to very limited training data. High confidence correct detections (HCCDs) of keywords are rare, yet often succeed one another closely in time. We refer to these close sequences of HCCDs as keyword hotspots. The ability to predict keyword hotspots could support speech retrieval, and provide new insights into the behavior of speech recognition systems. We treat hotspot prediction as a binary classification task on all word-sized time intervals in an audio file of a telephone conversation, using prosodic features as predictors. Rare events that follow this pattern are often modeled as a self-exciting point process (SEPP), meaning the occurrence of a rare event excites a following one. To label successive points in time as occurring within a hotspot or not, we fit a SEPP function to the distribution of HCCDs in the keyword search output. Two major learning challenges are that the size of the positive class is very small, and the training and test data have dissimilar distributions. To address these challenges, we develop a novel data selection framework that chooses training data with good generalization properties. Results exhibit superior generalization performance.
UR - http://www.scopus.com/inward/record.url?scp=85032953962&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85032953962&partnerID=8YFLogxK
U2 - 10.1016/j.csl.2017.10.005
DO - 10.1016/j.csl.2017.10.005
M3 - Article
AN - SCOPUS:85032953962
SN - 0885-2308
VL - 48
SP - 80
EP - 102
JO - Computer Speech and Language
JF - Computer Speech and Language
ER -