ACBiMA: Advanced chinese bi-characterword morphological analyzer

Ting Hao Kenneth Huang, Yun Nung Chen, Lingpeng Kong

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations

Abstract

While morphological information has been demonstrated to be useful for various Chinese NLP tasks, there is still a lack of complete theories, category schemes, and toolkits for Chinese morphology. This paper focuses on the morphological structures of Chinese bi-character words, where a corpus were collected based on a welldefined morphological type scheme covering both Chinese derived words and compound words. With the corpus, a morphological analyzer is developed to classify Chinese bi-character words into the defined categories, which outperforms strong baselines and achieves about 66% macro F-measure for compound words, and effectively covers derived words.

Original languageEnglish (US)
Title of host publicationProceedings of the 8th SIGHAN Workshop on Chinese Language Processing, SIGHAN 2015 - co-located with 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, ACL IJCNLP 2015
EditorsLiang-Chih Yu, Zhifang Sui, Yue Zhang, Vincent Ng
PublisherAssociation for Computational Linguistics (ACL)
Pages26-31
Number of pages6
ISBN (Electronic)9781941643570
StatePublished - 2015
Event8th SIGHAN Workshop on Chinese Language Processing, SIGHAN 2015 - Beijing, China
Duration: Jul 30 2015Jul 31 2015

Publication series

NameProceedings of the 8th SIGHAN Workshop on Chinese Language Processing, SIGHAN 2015 - co-located with 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing, ACL IJCNLP 2015

Conference

Conference8th SIGHAN Workshop on Chinese Language Processing, SIGHAN 2015
Country/TerritoryChina
CityBeijing
Period7/30/157/31/15

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Computer Science Applications
  • Education
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'ACBiMA: Advanced chinese bi-characterword morphological analyzer'. Together they form a unique fingerprint.

Cite this