ENHANCING HEALTHCARE COMMUNICATION: A YORUBA-TO-ENGLISH NLP MODEL FOR DOCTOR-PATIENT INTERACTIONS USING MACHINE TRANSLATION AND SPEECH-TO-TEXT

  • Adefehinti T.O
  • Idowu A.O
  • Awodun M.A
  • Tenibiaje M.O
Keywords: Natural Language Processing (NLP), Yoruba-to-English translation, Speech-to-Text, Health Care Communication

Abstract

The problem of language barriers in healthcare is another problem in multilingual areas such as Nigeria where more than 500 languages are used. These obstacles inhibit effective communication between the medical staff and the patients, which results in misdiagnoses, poor adherence to treatment, and deteriorated outcomes. This study comes up with a Yoruba-to-English Natural Language Processing (NLP) model to handle this problem, and the study targets at improving the doctor-patient communication in the Yoruba-speaking areas. The model combines two major elements: Yoruba Speech Recognition (YSR) in order to turn Yoruba speech into text and Yoruba-English Machine Translation (YEMT) in order to translate the transcribed Yoruba text into fluent English. The model was trained by use of a transformer-based architecture on a dataset of Yoruba healthcare interactions, giving a Word Error Rate (WER) of 10.1% and a Sentence Accuracy of 77.0% on the test set. The findings prove that the model can enhance healthcare communication through minimizing the language barrier, and eventually patient care, diagnosis, and treatment compliance. Although informal speech and tonal differences may be a problem, the research can offer an encouraging way of a low-resource language in healthcare setting, which can trigger further studies and advancement in multilingual NLP systems. The study is part of the emerging body of research in NLP among African languages that provides a pragmatic framework which can be extended to other multilingual healthcare environment.

References

Albrecht, U., V; Behrends, M., Schmeer, R., Matthies, H.K., & Von Jan, U. (2023). Usage of Multilingual Mobile Translation Applications in Clinical Settings. Journal of Health Information Science.
Ajagbe, S.A. (2024). Developing Nigeria Multilingual Languages Speech Datasets for Antenatal Orientation. In: Florez, H., Astudillo, H. (eds) Applied Informatics. ICAI 2024. Communications in Computer and Information Science, vol 2237. Springer, Cham.
Albrecht, J., et al. (2023). Language Barriers in Healthcare. Journal of Healthcare Communication.
Babatunde, Akinbowale Nathaniel, et al. (2024). Speech-to-Text Hybrid English to Yoruba SMS Translator. Innovative Computing Review, 4.1, 15-36.
Devlin, N.J., Shah, K.K., Feng, Y., Mulhern, B., van Hout, B. (2018). Valuing Health-Related Quality of Life: An EQ-5D-5L Value Set for England. Health Economics, 27, 7-22.
Kale, R., Ojo, J., & Eze, T. (2023). Cultural Communication in Healthcare: Language Barriers and Patient Satisfaction. African Journal of Health Careers.
Kale, A., et al. (2023). Implications of Language Barriers for Healthcare. Oman Medical Journal, 35(2), e122. doi: 10.5001/omj.2020.40.
Meyer, J., et al. (2024). Challenges and Opportunities in NLP for Underrepresented Languages in Healthcare. Journal of AI and Healthcare.
Rahmon, M., et al. (2024). Development of a Yoruba-to-English NLP Model for Healthcare Communication. International Journal of NLP and Healthcare Research.
Shamsi, H., Almutairi, A., Mashrafi, S., & Kalbani, T. (2024). Implications of Language Barriers for Healthcare: A Systematic Review. OMAN Medical Journal.
Umeokafor, N., Eze, T., & Alabi, A. (2022). Addressing Language Barriers with NLP in African Healthcare Systems. Nigerian Medical Journal.
Adelani, D., et al. (2021). Named Entity Recognition for African Languages. Transactions of the Association for Computational Linguistics.
Hossain, E., et al. (2023). Natural Language Processing in Electronic Health Records in Relation to Healthcare Decision-Making: A Systematic Review. Computers in Biology and Medicine.
Meyer, R., et al. (2024). Improving NLP for Code-Switched Speech in Under-Resourced Languages Using Out-of-Domain Data. Journal of Natural Language Processing.
Saskia Locke, et al. (2021). Natural Language Processing in Medicine: Understanding the Societal Impacts. Information, Communication & Society.
William Leeson, et al. (2019). Natural Language Processing (NLP) in Qualitative Public Health Research: A Proof-of-Concept Study. International Journal of Qualitative Methods.
Shamsi, H., et al. (2024). Sentiment Analysis in Healthcare: Enhancing Patient Care through Language Processing. OMAN Medical Journal.
Vieira, L. N., O'Hagan, M., & O'Sullivan, C. (2020). Understanding the Societal Impacts of Machine Translation: A Critical Review of the Literature on Medical and Legal Use Cases. Information, Communication & Society, 24(11), 1515-1532.
Published
2026-05-31