TY - JOUR
T1 - Speech-based text entry for mobile handheld devices
T2 - An analysis of efficacy and error correction techniques for server-based solutions
AU - Price, Kathleen J.
AU - Sears, Andrew
N1 - Funding Information:
The authors thank Aether Systems, Inc. for their support of this research. This material is based upon work supported by the National Science Foundation under Grants IIS–9910607 and IIS–0121570. Any opinions, findings and conclusions or recommendations expressed in this material are those of the authors and do not necessarily reflect the views of the National Science Foundation (NSF).
PY - 2005
Y1 - 2005
N2 - As handheld devices become ubiquitous and the tasks performed become multipurpose in nature, efficient data entry techniques are necessary. This research evaluated several speech-based text entry solutions for handheld devices using server-based speech recognition. Because server-based solutions introduce network delays, an analysis of the relationship among network delays, number of recognition errors, how fast users can correct errors, and overall data entry rates was performed. The analysis and empirical results confirm the importance of minimizing recognition errors. This suggests that a server-based approach that makes more computing resources available may prove effective. Results from two empirical studies are presented. The first compares two error correction mechanisms: a multitap and soft keyboard solution. The second employs a longitudinal investigation of the effects of experience on text entry rates. Users attained an effective mean text entry rate as high as 25.3 words per min, which is higher than or comparable to data entry rates reported for other input techniques for handheld devices. The results of this research have implications for researchers and designers of automatic speech recognition systems and mobile devices.
AB - As handheld devices become ubiquitous and the tasks performed become multipurpose in nature, efficient data entry techniques are necessary. This research evaluated several speech-based text entry solutions for handheld devices using server-based speech recognition. Because server-based solutions introduce network delays, an analysis of the relationship among network delays, number of recognition errors, how fast users can correct errors, and overall data entry rates was performed. The analysis and empirical results confirm the importance of minimizing recognition errors. This suggests that a server-based approach that makes more computing resources available may prove effective. Results from two empirical studies are presented. The first compares two error correction mechanisms: a multitap and soft keyboard solution. The second employs a longitudinal investigation of the effects of experience on text entry rates. Users attained an effective mean text entry rate as high as 25.3 words per min, which is higher than or comparable to data entry rates reported for other input techniques for handheld devices. The results of this research have implications for researchers and designers of automatic speech recognition systems and mobile devices.
UR - http://www.scopus.com/inward/record.url?scp=32644449281&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=32644449281&partnerID=8YFLogxK
U2 - 10.1207/s15327590ijhc1903_1
DO - 10.1207/s15327590ijhc1903_1
M3 - Article
AN - SCOPUS:32644449281
SN - 1044-7318
VL - 19
SP - 279
EP - 304
JO - International Journal of Human-Computer Interaction
JF - International Journal of Human-Computer Interaction
IS - 3
ER -