Parsing-based Machine Translation using an Open Source Toolkit: Joshua for Tamil Language

  • Unique Paper ID: 142612
  • Volume: 2
  • Issue: 4
  • PageNo: 106-110
  • Abstract:
  • Joshua, an open source toolkit for statistical machine translation. It implements all of the algorithms required for synchronous context free grammars (SCFGs): chart-parsing, n-gram language model integration, beam-and cube-pruning and k-best extraction. The toolkit also implements suffix-array grammar extraction and minimum error rate training. It uses parallel and distributed computing techniques for scalability. In this paper, it is demonstrated that the toolkit achieves state of the art translation performance on the Tamil -English translation task.

Cite This Article

  • ISSN: 2349-6002
  • Volume: 2
  • Issue: 4
  • PageNo: 106-110

Parsing-based Machine Translation using an Open Source Toolkit: Joshua for Tamil Language

Related Articles