Uncovering Factors Influencing Millennials’ Use of Non-Standard Words in Twitter

Nur Nashatul Nasuha Nazman, Su-Hie Ting, Kee-Man Chuah

Abstract


Social media communication has its own language features and one aspect is modified spelling of standard words. Social media users use shortened words with full awareness of the meanings, and new non-standard words are constantly added to the repertoire of social media language. A pertinent question is whether social media users learn these non-standard words to use or whether they also contribute to the vocabulary used in social media communication. The study examined Malaysian millennials’ use of non-standard words in Twitter and their reasons for shortening words. For the non-standard words, data were collected from 200 active Twitter users whereas data on reasons for shortening words were collected from 30 users. The results showed that the Malaysian millennials frequently used non-standard spelling of words. The three top words were “ni” (this), “nak” (want), and X (negation). The main reasons for the Twitter users to shorten words were the 280-character limit per tweet, user convenience, and characteristics of words. The Twitter users felt free to create new spellings of standard words at times for fun, but most of the time, they use the common non-standard words. The Malaysian millennials reported that they were inclined to shorten long and complex words, and words with many vowels. The study suggests that Twitter users balance between speed in communication and preservation of meaning when using non-standard words.

 

Keywords: Social media, Twitter, non-standard words, word formation processes, shortening of words.


https://doi.org/10.17576/JKMJC-2022-3804-22


Full Text:

PDF

References


Ahmed, B. (2015). Lexical normalisation of Twitter Data. Paper presented 2015 Science and Information Conference (SAI), 2015, pp. 326-328. https://doi.org/jn96

Ariffin, S. N. A. N., & Tiun, S. (2018). Part-of-speech tagger for Malay social media texts. GEMA Online® Journal of Language Studies, 18(4), 17-22. http://dx.doi.org/10.17576/gema-2018-1804-09

Athirah, R. R., Soon, L. K., & Haw, S. C. (2019). Exploring edit distance for normalising out-of-vocabulary Malay words on social media. In R. Černý, J. Kočí and V. Kočí (Eds.), MATEC Web of Conferences (pp. 1-7). Les Ulis, France: EDP Sciences.

Bisera, K., & Elena, S. (2018). Internet memes and their socio-linguistic features. European Journal of Literature, Language and Linguistics Studies, 2(4), 158-169.

Chekima, K., & Rayner, A. (2017). Sentiment analysis of Malay social media text. In R. Alfred, H. Iida, A. A. A. Ibrahim, & Y. Lim (Eds.), Proceedings of the Fourth International Conference on Computational Science and Technology (pp. 205-219). Singapore: Springer.

Choi, D., Kim, J., & Kim, P. (2013). A method for normalizing non-standard words in online social network services: A case study on Twitter. In Vinh, P., Alagar, V., Vassev, E., Khare, A. (Eds.), Context-aware systems and applications (ICCASA 2013, Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, Vol. 128, pp. 359-368). Cham.: Springer. https://doi.org/10.1007/978-3-319-05939-6_35

Crystal, D. (2011). Internet linguistics: A student guide. New York: Routledge.

Dixon, S. (2022, March 29). Twitter: Distribution of global audiences 2021, by age group. Statista. https://www.statista.com/statistics/283119/age-distribution-of-global-twitter-users/

Dunlap, J. C., & Lowenthal, P. R. (2009). Tweeting the night away: Using Twitter to enhance social presence. Journal of Information Systems Education, 20(2), 129-135.

Flint, E., Ford, E., Thomas, O., Caines, A., & Buttery, P. (2017). A text normalisation system for non-standard English words. Proceedings of the 3rd Workshop on Noisy User Generated Text, pp. 107-115. https://doi.org/10.18653/v1/W17-4414

Fontein, D., & Copp, E. (2018, December 19). Words and phrases to ban from your social media vocabulary. Hootsuite. https://blog.hootsuite.com/words-and-phrases-to-ban-from-your-social-media-vocabulary/

Foster, H. (2018). How is social media changing the English language? Language Services Direct. https://www.languageservicesdirect.co.uk/social-media-changing-english-language/

Fugard, A. J., & Potts, H. W. (2015). Supporting thinking on sample sizes for thematic analyses: A quantitative tool. International Journal of Social Research Methodology, 18(6), 669-684.

Galebole, W. (2020). Factors leading to spelling change of English language words on Whatsapp. American Journal of Communication, 2(1), 26-40.

Gligorić, K., Anderson, A., & West, R. (2018, April 10). How constraints affect content: The case of Twitter’s switch from 140 to 280 characters. Association for the Advancement of Artificial Intelligence. https://www.cs.toronto.edu/~ashton/pubs/twitter-constraints-icwsm2018.pdf

Golbeck, J. (2016). Negativity and anti-social attention seeking among narcissists on Twitter: A linguistic analysis. First Monday, 21(3), 1-10.

Han, B., & Baldwin, T. (2011). Lexical normalisation of short text messages: Makn sens a# Twitter. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 368-378, Stroudsburg, PA: Association for Computational Linguistics. https://aclanthology.org/P11-1038

Haroon, R. (2021, July 18). “Mulut Puaka” wreaking havoc on social media with their whining. The New Straits Times.

https://www.nst.com.my/opinion/columnists/2021/07/709340/mulut-puaka-wreaking-havoc-social-media-their-whining

Honeycutt, C., & Herring, S. C. (2009). Beyond microblogging: Conversation and collaboration via Twitter. Conference Proceeding HICSS 2009 42nd Hawaii International Conference on System Sciences, pp. 1-10.

Hong, L., Convertino, G., & Chi, E. H. (2011). Language matters in Twitter: A large scale study. Proceedings of the International AAAI Conference on Web and Social Media, 5(1), 518-521. https://doi.org/10.1609/icwsm.v5i1.14184

Hughes, D. J., Rowe, M., Batey, M., & Lee, A. (2012). A tale of two sites: Twitter vs. Facebook and the personality predictors of social media usage. Computers in Human Behavior, 28(2), 561-569.

Izazi, Z. Z., & Tengku-Sepora, T. M. (2020). Slangs on social media: Variations among Malay language users on Twitter. Pertanika Journal of Social Sciences & Humanities, 28(1), 46-58.

Isaac, M. (2017, September 26). Twitter to test doubling tweet length to 280 characters. The New York Times. https://www.nytimes.com/2017/09/26/technology/twitter-280-characters.html?smid=url-share

Jansen, B. J., Zhang, M., Sobel, K., & Chowdury, A. (2009). Twitter power: Tweets as electronic word of mouth. Journal of the American Society for Information Science and Technology, 60(11), 2169-2188.

Jaidka, K., Zhou, A., & Lelkes, Y. (2019). Brevity is the soul of Twitter: The constraint affordance and political discussion. Journal of Communication, 69(4), 345-372.

Kamus Dewan (4th ed.). (2007). Kuala Lumpur, Malaysia: Dewan Bahasa dan Pustaka.

Kern, M. L., Park, G., Eichstaedt, J. C., Schwartz, H. A., Sap, M., Smith, L. K., & Ungar, L. H. (2016). Gaining insights from social media language: Methodologies and challenges. Psychological Methods, 21(4), 1-17.

Lusetti, M., Ruzsics, T., Göhring, A., Samardžić, T., & Stark, E. (2018). Encoder-decoder methods for text normalization. Proceedings of the Fifth Worksop on NLP for Similar Languages, Varieties and Dialects (VarDial 2018), pp. 18-28, Santa Fe, New Mexico, USA, Association for Computational Linguistics. https://aclanthology.org/W18-3902

Liu, F., Weng, F., Wang, B., & Liu, Y. (2011). Insertion, deletion, or substitution?: Normalizing text messages without pre-categorization nor supervision. Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pp. 71-76, Portland, Oregon, USA. Association for Computational Linguistics. https://aclanthology.org/P11-2013

Mitchell, L., Frank, M. R., Harris, K. D., Dodds, P. S., & Danforth, C. M. (2013). The geography of happiness: Connecting Twitter sentiment and expression, demographics, and objective characteristics of place. PloS One, 8(5), e64417.

Nazman, N., Chuah, K. M., & Ting, S. H. (2020). Tryna b Kewl: Textual analytics of distorted words among Malaysian millennials on Twitter. International Journal of Innovative Technology and Exploring Engineering, 9(3), 3522-3526.

Nisa, R. F. (2016). Word formation process of non-standard vocabulary in Twitter statuses of Indonesian dangdut singers. Lantern, 5(3).

Nkhata, L., & Jimaima, H. (2020). Neologisms: A morphological analysis of social media discourses on the Zambian online media. Multidisciplinary Journal of Language and Social Sciences Education, 4, 66-93.

Ortiz-Ospina, E. (2019, September 18). The rise of social media. Our World in Data. https://ourworldindata.org/rise-of-social-media

Reed, J. (2014). How social media is changing language. OUP Blog. https://blog.oup.com/2014/06/social-media-changing-language/

Saloot, A. M., Idris, N., Aw, A., & Thorleuchter, D. (2014). Twitter corpus creation: The case of a Malay Chat-style-text Corpus (MCC). Digital Scholarship in the Humanities, 31(2), 227-243. https://doi.org/10.1093/llc/fqu066

Sayce, D. (2022). The number of tweets per day. DSayce.

Schmied, J. (2012). Social digital discourse: New challenges for corpus-and sociolinguistics Topics in Linguistics, 10, 43-56.

Seward, Z. M. (2013). The first-ever hashtag, @-reply and retweet, as Twitter users invented them. Quartz. https://qz.com/135149/the-first-ever-hashtag-reply-and-retweet-as-twitter-users-invented-them

Sriram, B., Fuhry, D., Demir, E., Ferhatosmanoglu, H., & Demirbas, M. (2010). Short text classification in Twitter to improve information filtering. Proceedings of 33ed International ACM SIGIR conference on Research and development in information retrieval, pp. 841-842.

StatCounter Global Stats. (2020). Social Media Stats Malaysia, Jan-Dec 2020. https://gs.statcounter.com/social-media-stats/all/malaysia/2020

Statista. (2022). Forecast of the number of internet users in Malaysia from 2010 to 2025. https://www.statista.com/forecasts/1146231/internet-users-in-malaysia

Stone, C. B., & Wang, Q. (2019). From conversations to digital communication: The mnemonic consequences of consuming and producing information via social media. Topics in Cognitive Science, 11(4), 774-793.

Suparta, I. W., Qomariana, Y., & Rahayuni, N. K. S. (2016). Word formation on Twitter posts. Jurnal Humanis, 18(2), 6-23.


Refbacks

  • There are currently no refbacks.


e-ISSN: 2289-1528