TY - GEN
T1 - Hybrid methods for POS guessing of Chinese unknown words
AU - Lu, Xiaofei
PY - 2005
Y1 - 2005
AB - This paper describes a hybrid model that combines a rule-based model with two statistical models for the task of POS guessing of Chinese unknown words. The rule-based model is sensitive to the type, length, and internal structure of unknown words, and the two statistical models utilize contextual information and the likelihood for a character to appear in a particular position of words of a particular length and POS category. By combining models that use different sources of information, the hybrid model achieves a precision of 89%, a significant improvement over the best result reported in previous studies, which was 69%.
UR - http://www.scopus.com/inward/record.url?scp=84859912482&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84859912482&partnerID=8YFLogxK
U2 - 10.3115/1628960.1628962
DO - 10.3115/1628960.1628962
M3 - Conference contribution
AN - SCOPUS:84859912482
SN - 1932432515
SN - 9781932432510
T3 - ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference
SP - 1
EP - 6
BT - ACL-05 - 43rd Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference
PB - Association for Computational Linguistics (ACL)
T2 - 43rd Annual Meeting of the Association for Computational Linguistics, ACL-05
Y2 - 25 June 2005 through 30 June 2005
ER -