Comparing Rule-Based Translation to Syntax Tree Diagram Translation

Main Article Content

heri heryono

Abstract

Machine translation has developed more effectively to be specified as modern linguistics characterization. It indicates the formalization of translating process by machine within humanly approaching. Recently, in order to help people communicate indirectly through written, machine translation has its own system to break and match the codes through comprehensive database of words pocket. In linguistics perspective, machine translation always be assisted by its definite performance through derivative computational system. The problem appears when users need to translate ambiguous sentence from English to Bahasa Indonesia or vice versa. The paper used rule-based system which contains particular parts in order to run the process of translating; starts from tokenization, pre-processing, reordering and morphological process which lead to output accuracy. And used syntactic diagram translation, which was carried out manually by generating sentences into small units. The result of this research showed that RBS runs by standard linguistics system; and the output of translating refer to basic translation without giving any alternative translation nor meaning. While using syntactic diagram, there are possibility to get alternative result not only by linguistics features but also based on signification meaning of both translations. Those techniques run by considering linguistics features in order to help users to get translation output better.

Article Details

How to Cite
heryono, heri. (2018). Comparing Rule-Based Translation to Syntax Tree Diagram Translation. Journal of Informatics Information System Software Engineering and Applications (INISTA), 1(1). https://doi.org/10.20895/inista.v1i1.8
Section
Articles

References

[1] A. Burchardt, K. Harris, G. Rehm, and H. Uszkoreit. Towards a Systematic and Human-Informed Paradigm for High-Quality Machine Translation. –In: Proceedings of the LREC 2016 Workshop “Translation Evaluation: From Fragmented Tools and Data Sets to an Integrated Ecosystem”. Translation Evaluation: From Fragmented Tools and Data Sets to an Integrated Ecosystem (LREC-2016), located at International Co, 2016.
[2] Akmajian, Adrian, Richard A. Demers, and Robert M. Harnish. 1979. Linguistics, an introduction to language and communication. Cambridge, Mass: MIT Press.

[3] Baker, L. (1998). English Syntax. London: The MIT Press

[4] Chiang, David. 2005. A hierarchical phrase-based model for statistical machine transla- tion. In Proceedings of ACL
[5] Crystal, D. (2008). A Dictionary of Linguistics and Phonetics (sixth edition). Oxford: Blackwell

[6] D. A. Cruse. 1986. Lexical Semantics. Cambridge University Press

[7] G. Sangavi, K. Mrinalini, P. Vijayalakshmi, ”Analysis on Bilingual Machine Translation Systems for English and Tamil”, International Conference on Computation of Power, Energy Information and Communication (ICCPEIC), 2016.
[8] Joordens, S., & Besner, D. (1992). Priming effects that span an intervening unrelated word: Implications for models of memory representation and retrieval. Journal of Experimental Psychology: Learning, Memory, and Cognition, 18, 483-491.

[9] Kate, J,. Rohit., Mooney, J,. Raymond. Using String-Kernels for Learning Semantic Parser. In Proceedings of the Joint 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (COLING/ACL 2006), pp. 913-920, Sydney, Australia, July 2006.
[10] Kavirajan, B., Kumar, Anan. Improving the Rules based Machine Translation System using Sentence Simplification (English to Tamil). ICACI, September 2017.
[11] Plisson, Joel., Lavrac, Nad., Mladenic, Dunja. A Rule based Approach to Word Lemmatization. Department of Knowledge Technologies Jožef Stefan Institute.
[12] Robert A. Hall, Jr., An Essay on Language. Philadelphia: 1962.
[13] Russell, S. & Norvig, P. Artificial intelligence: a modern approach. Pearson Education, Ltd. 2004.
[14] Shao, Yan. Tokenization and Sentence Segmentation. Department of Linguistics and Philology, Uppsala University, 29 March 2017.
[15] Tala, F. Z. (2003). A Study of Stemming Effects on Information Retrieval in Bahasa Indonesia. M.S. thesis. M.Sc. Thesis. Master of Logic Project. Institute for Logic, Language and Computation. Universiteti van Amsterdam the Netherlands.
[16] Xu, J. (2007). A Comparative Study of Categories in Generative Grammar and Parts of Speech in Traditional Grammar. Research on Chinese as a Second Language, 3.
[17] Yule, G. (2010). The Study of Language. Cambridge: Cambridge University Press

[18] West, C. and Zimmerman, D. (1983) “small insults: a study of interruptions in crosssex conversations between unacquainted persons’. In Thorne, B., Kramarae, C. And Henley, N (eds) (1983) Language, Gender and Society. Rowley: Newbury House. 103- 17.