Parsing-based Machine Translation using an Open Source Toolkit: Joshua for Tamil Language
Author(s):
B.P.SREEJA, G.SARATHA DEVI
Keywords:
Corpus, SCFGs
Abstract
Joshua is an open source toolkit for statistical machine translation. It implements all of the algorithms required for synchronous context-free grammars (SCFGs): chart parsing, n-gram language model integration, beam and cube pruning, and k-best extraction. The toolkit also implements suffix-array grammar extraction and minimum error rate training, and it uses parallel and distributed computing techniques for scalability. This paper demonstrates that the toolkit achieves state-of-the-art translation performance on the Tamil-English translation task.
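The chart parsing over an SCFG named in the abstract can be illustrated with a minimal Python sketch. It is not Joshua's implementation or API: the grammar, the placeholder tokens (s_subj, t_verb, and so on), and the function name translate are all hypothetical, and the sketch omits rule scores, language model integration, pruning, and k-best extraction. It only shows how a rule that pairs a source ordering with a different target ordering lets a CKY-style parser reorder a verb-final source sentence into verb-medial target order.

```python
from collections import defaultdict

# Hypothetical toy SCFG. Tokens are placeholders, not real Tamil or English.
# Lexical rules: source token -> list of (nonterminal, target token).
LEXICAL_RULES = {
    "s_subj": [("NP", "t_subj")],
    "s_obj": [("NP", "t_obj")],
    "s_verb": [("V", "t_verb")],
}

# Binary rules: (lhs, left child, right child, swap_on_target).
# swap_on_target=True means the target side emits the children in reverse order.
BINARY_RULES = [
    ("VP", "NP", "V", True),    # VP -> <NP V, V NP>
    ("S", "NP", "VP", False),   # S  -> <NP VP, NP VP>
]


def translate(source_tokens, goal="S"):
    """CKY-style parse of the source side, building the target side in parallel.

    Returns the target string for the first derivation found that covers the
    whole input with the goal symbol, or None if no such derivation exists.
    """
    n = len(source_tokens)
    # chart[(i, j)][label] holds the target tokens for source span [i, j).
    chart = defaultdict(dict)

    # Fill width-1 spans from the lexical rules.
    for i, token in enumerate(source_tokens):
        for label, target in LEXICAL_RULES.get(token, []):
            chart[(i, i + 1)][label] = [target]

    # Combine adjacent spans bottom-up, from narrow to wide.
    for width in range(2, n + 1):
        for i in range(0, n - width + 1):
            j = i + width
            for k in range(i + 1, j):
                for lhs, left, right, swap in BINARY_RULES:
                    if left in chart[(i, k)] and right in chart[(k, j)]:
                        left_tgt = chart[(i, k)][left]
                        right_tgt = chart[(k, j)][right]
                        combined = right_tgt + left_tgt if swap else left_tgt + right_tgt
                        # Keep only the first derivation per label (no scoring here).
                        chart[(i, j)].setdefault(lhs, combined)

    result = chart[(0, n)].get(goal)
    return " ".join(result) if result is not None else None


if __name__ == "__main__":
    print(translate(["s_subj", "s_obj", "s_verb"]))
```

In a full decoder each chart entry would carry a model score and language model state, and pruning would limit how many entries survive per span; the sketch keeps a single unscored derivation per label to stay short.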
Article Details
Unique Paper ID: 142612
Publication Volume & Issue: Volume 2, Issue 4
Page(s): 106 - 110