TY - GEN
T1 - Deep learning based parts of speech tagger for Bengali
AU - Kabir, Md Fasihul
AU - Abdullah-Al-Mamun, Khandaker
AU - Huda, Mohammad Nurul
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/11/28
Y1 - 2016/11/28
N2 - This paper describes the Part of Speech (POS) tagger for Bengali Language. Here, POS tagging is the process of assigning the part of speech tag or other lexical class marker to each and every word in a sentence. In many Natural Language Processing (NLP) applications, POS tagging is considered as the one of the basic necessary tools. Identifying the ambiguities in language lexical items is the challenging objective in the process of developing an efficient and accurate POS Tagger. Different methods of automating the process have been developed and employed for Bengali. In this paper, we report about our work on building POS tagger for Bengali using the Deep Learning. Bengali is a morphologically rich language and our taggers make use of morphological and contextual information of the words. It is observed from the experiments based on Linguistic Data Consortium (LDC) catalog number LDC2010T16 and ISBN 1-58563-561-8 corpus that 93.33% accuracy is obtained for Bengali POS tagger using the Deep Learning.
AB - This paper describes the Part of Speech (POS) tagger for Bengali Language. Here, POS tagging is the process of assigning the part of speech tag or other lexical class marker to each and every word in a sentence. In many Natural Language Processing (NLP) applications, POS tagging is considered as the one of the basic necessary tools. Identifying the ambiguities in language lexical items is the challenging objective in the process of developing an efficient and accurate POS Tagger. Different methods of automating the process have been developed and employed for Bengali. In this paper, we report about our work on building POS tagger for Bengali using the Deep Learning. Bengali is a morphologically rich language and our taggers make use of morphological and contextual information of the words. It is observed from the experiments based on Linguistic Data Consortium (LDC) catalog number LDC2010T16 and ISBN 1-58563-561-8 corpus that 93.33% accuracy is obtained for Bengali POS tagger using the Deep Learning.
UR - http://www.scopus.com/inward/record.url?scp=85007174596&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85007174596&partnerID=8YFLogxK
U2 - 10.1109/ICIEV.2016.7760098
DO - 10.1109/ICIEV.2016.7760098
M3 - Conference contribution
AN - SCOPUS:85007174596
T3 - 2016 5th International Conference on Informatics, Electronics and Vision, ICIEV 2016
SP - 26
EP - 29
BT - 2016 5th International Conference on Informatics, Electronics and Vision, ICIEV 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 5th International Conference on Informatics, Electronics and Vision, ICIEV 2016
Y2 - 13 May 2016 through 14 May 2016
ER -