TY - GEN
T1 - A Corpus Based N-gram Hybrid Approach of Bengali to English Machine Translation
AU - Rahman, Mohammad Masudur
AU - Kabir, Md Faisal
AU - Huda, Mohammad Nurul
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018/7/2
Y1 - 2018/7/2
N2 - Machine translation means automatic translation which is performed using computer software. There are several approaches to machine translation, some of them need extensive linguistic knowledge while others require enormous statistical calculations. This paper presents a hybrid method, integrating corpus based approach and statistical approach for translating Bengali sentences into English with the help of N-gram language model. The corpus based method finds the corresponding target language translation of sentence fragments, selecting the best match text from the bilingual corpus to acquire knowledge while the N-gram model rearranges the sentence constituents to get an accurate translation without employing external linguistic rules. A variety of Bengali sentences, including various structures and verb tenses are considered to translate through the new system. The performance of the proposed system is evaluated in terms of adequacy, fluency, WER, and BLEU score. The assessment scores are compared with other conventional approaches as well as with Google Translate, a well-known free machine translation service by Google. It has been found that experimental results of the work provide higher scores over Google Translate and other methods with less computational cost.
AB - Machine translation means automatic translation which is performed using computer software. There are several approaches to machine translation, some of them need extensive linguistic knowledge while others require enormous statistical calculations. This paper presents a hybrid method, integrating corpus based approach and statistical approach for translating Bengali sentences into English with the help of N-gram language model. The corpus based method finds the corresponding target language translation of sentence fragments, selecting the best match text from the bilingual corpus to acquire knowledge while the N-gram model rearranges the sentence constituents to get an accurate translation without employing external linguistic rules. A variety of Bengali sentences, including various structures and verb tenses are considered to translate through the new system. The performance of the proposed system is evaluated in terms of adequacy, fluency, WER, and BLEU score. The assessment scores are compared with other conventional approaches as well as with Google Translate, a well-known free machine translation service by Google. It has been found that experimental results of the work provide higher scores over Google Translate and other methods with less computational cost.
UR - http://www.scopus.com/inward/record.url?scp=85062859647&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85062859647&partnerID=8YFLogxK
U2 - 10.1109/ICCITECHN.2018.8631938
DO - 10.1109/ICCITECHN.2018.8631938
M3 - Conference contribution
AN - SCOPUS:85062859647
T3 - 2018 21st International Conference of Computer and Information Technology, ICCIT 2018
BT - 2018 21st International Conference of Computer and Information Technology, ICCIT 2018
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 21st International Conference of Computer and Information Technology, ICCIT 2018
Y2 - 21 December 2018 through 23 December 2018
ER -