Can Machine Translate Dialogue Acts: Evidence from Translating Dialogues from English to Arabic
Abstract
Machine translation advances translation quality at morphological, syntactic, and semantic levels. The pragmatic level of machine translation is also evolving, but challenges remain due to cultural and contextual issues on the one hand and machine translation deficiencies on the other. While computational studies have made strides in automating translation tasks, linguistic-oriented research in this area remains sparse. In response to this gap, this study seeks to assess the effectiveness of Neural Machine Translation, as exemplified by Google Translate, in translating dialogue acts inherent in natural English conversations into Arabic, drawing upon Austin's theory of speech acts and leveraging a corpus of authentic sources[1]. Our findings highlight certain challenges in the machine’s identification of the performative functions of the utterances in conversations, viz. directives, expressives and representatives. Such challenges emanate from specific linguistic features of English conversations (e.g., idiomatic expressions, polysemous words, and deixis) and the lack of contextual information in everyday discourse. These challenges ultimately impede the faithful representation of speakers' intentions in the translated output.
Keywords: Neural Machine Translation; dialogue acts; English; Arabic translation quality
[1] https://www.eslfast.com/robot/ and https://helenadailyenglish.com/english-conversations-in-real-life-with-common-phrases-meaning-example
Full Text:
PDFReferences
Al-Haj, H. & Lavie, A. (2012). The impact of Arabic morphological segmentation on broad-coverage English-to-Arabic statistical machine translation. Machine Translation, 26, 3-24.
Agerfalk, P. J (2010). Getting pragmatic. European Journal of Information Systems, 19(3), 251-256.
Almahasees, Z. & Mustafa, Z. (2017). Machine translation quality of Khalil Gibran's the Prophet. AWEJ for translation & Literary Studies, 1(4), 151-159.
Austin, J. (1962). How to do Things with Words. Oxford University Press.
Bahdanau, D., Cho, K. & Bengio, Y. (2014). Neural machine translation by jointly learning to align and translate. arXiv, [1409.0473].
Brown, P. & Levinson, S. C. (1987). Politeness: Some universals in language usage (Vol. 4). Cambridge University Press.
Cambria, E. & White, B. (2014). Jumping NLP curves: A review of natural language processing research. IEEE Computational intelligence magazine, 9(2), 48-57.
Cerisara, C., Kr´al, P. & Lenc, L. (2017). On the effects of using word2vec representations in neural networks for dialogue act recognition. Computer Speech & Lang. 47, 175–193 https://doi.org/10.1016/j.csl.2017.07.009
Cheng, Z., Jiang, Z., Yin, Y., Li, N. & Gu, Q. (2021). A unified target-oriented sequence-to-sequence model for emotion-cause pair extraction. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 29, 2779-2791.
Clay, S., Vandyke, D. & Kaplan, J. (2016). Towards a unified approach to dialogue act prediction. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (ACL). Berlin, Germany.
Colombo, P., Chapuis, E., Manica, M., Vignon, E., Varni, G. & Clavel, C. (2020). Guiding Attention in Sequence-to-Sequence Models for Dialogue Act Prediction. Proceedings of the AAAI Conference on Artificial Intelligence, 34(05), pp.7594-7601.
Duran N. & Battle S. (2018). Probabilistic word association for dialogue act classification with recurrent neural networks. In E Pimenidis & C. Jayne (Eds.), Engineering Applications of Neural Networks. EANN 2018. Communications in Computer and Information Science(pp-229-239). Springer
Elmadany, A., Mubarak, H. & Magdy, W. (2017 May). An Arabic speech-act and sentiment corpus of tweets. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018) European Language Resources Association (ELRA), Torino, Italy)
Elmadany, A. A., Abdou, S. M. & Gheith, M. (2015). Towards understanding egyptian arabic dialogues. arXiv preprint arXiv:1509.03208.
Ji, Y., Haffari, G. & Eisenstein, J. (2016). A latent variable recurrent neural network for discourse relation language models. NAACL-HLT , San Diego, USA.
Joukhadar, A., Saghergy, H. & Kweider, L. (2019). Arabic dialogue act recognition for textual chatbot systems. The First International Workshop on NLP Solutions for Under-Resourced Languages (NSURL 2019). Trento: Association for Computational Linguistics. Retrieved from https://aclanthology.org/2019.nsurl-1.7
Kalchbrenner, N. & Blunsom, P. (2013). Recurrent convolutional neural networks for discourse compositionality. arXiv preprint arXiv:1306.3584.
Keizer, S. (2001). A Bayesian approach to dialogue act classification. In BI-DIALOG 2001: Proc. of the 5th Workshop on Formal Semantics and Pragmatics of Dialogue (pp. 210-218).
Kim, M., & Kim, H. (2018). An integrated neural network model for identifying speech acts, predictors, and sentiments of dialogue utterances. Pattern Recognition Letters, 101, 1-5. DOI: 10.1016/j.patrec.2017.11.009
Kumar R. (2011). Making machine translations polite: The problematic speech acts. In C. Singh, G. Lehal, J. Sengupta , D. Sharma & V. Goyal (Eds), Information systems for Indian languages. ICISIL 2011. Communications in Computer and Information Science (185-190). Springer,
Li, J., Sun, A., Han, J. & Li, C. (2020). A survey on deep learning for named entity recognition. IEEE Transactions on Knowledge and Data Engineering, 34(1), 50-70.
Lee, J. & Dernoncourt, F. (2016). Sequential short-text classification with recurrent and convolutional neural networks. arXiv preprint arXiv:1603.03827.
Li, D. (2012). The pragmatic construction of word meaning in utterances. Journal of Chinese Language and Computing 18(3), 121–137
Martinková, P. (2013). Means of coherence and cohesion in spoken and written discourse. In Proceedings of the 2nd Central European Conference in Linguistics for Postgraduate Studies (pp. 167–181), Brno, Czech
Müllerová, O. (2011). Výstavba mluvených projevů, jejich členění a syntax. In Mluvená čeština: hledání funkčního rozpětí (pp. 129–172). Praha: Academia.
Popescu-Belis, A. (2019). Context in neural machine translation: A review of models and evaluations. arXiv preprint arXiv:1901.09115.
Sennrich, R., Haddow, B. & Birch, A. (2015). Improving neural machine translation models with monolingual data. arXiv preprint arXiv:1511.06709.
Schegloff, E. (2007). Sequence organization in interaction: A Primer in conversation analysis I. Cambridge University Press.
Searle, J. (1969). Speech acts: An essay in the philosophy of language. Cambridge University Press
Searle, J. (1976). A classification of illocutionary Acts’. Language in Society, 5(1), 1-23.
Sennrich, R., Haddow, B. & Birch, A. (2016). Controlling politeness in neural machine translation via side constraints. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human language technologies (pp. 35-40).
Shen, S. & Lee, H. Y. (2016). Neural attention models for sequence classification: Analysis and application to key term extraction and dialogue act detection. arXiv preprint arXiv:1604.00077.
Thomas, M. A., Abraham, D. S. & Liu, D. (2020). An Assessment of federated machine learning for translational research. in r. luppicini(eds.), interdisciplinary approaches to digital transformation and innovation (pp. 123-142). IGI Global.
Soori, H., & Awab, S. (2015). Verb noun Collocations in Arabic and their pattern-based acquisition. In Proceedings of the 16th International Conference on Computational Linguistics and Intelligent Systems (COLING). Beijing, China.
Tiwari, G., Sharma, A., Sahotra, A. & Kapoor, R. (2020, July). English-Hindi neural machine translation-LSTM seq2seq and ConvS2S. In 2020 International Conference on Communication and Signal Processing (ICCSP) (pp. 871-875). IEEE.
Wang, Y., Tsai, P., Goodman, D. & Lin, M. (2010). Agreement, acknowledgment, and alignment: The discourse-pragmatic functions of hao and dui in Taiwan Mandarin conversation. Discourse Studies, 12(2), 241-267. DOI: 10.1177/1461445609346922
Yusof, N. M., Darus, S. & Ab Aziz, M. J. (2017). Evaluating intelligibility in human translation and machine translation. 3L, Language, Linguistics, Literature, 23(4), 251-264.
DOI: http://dx.doi.org/10.17576/3L-2023-2904-05
Refbacks
- There are currently no refbacks.
eISSN : 2550-2247
ISSN : 0128-5157