Systematic Literature Review on Ontology-based Indonesian Question Answering System

Fadhila Tangguh Admojo, Adidah Lajis, Haidawati Nasir

Abstract


Question-Answering (QA) systems, at the intersection of natural language processing, information retrieval, and knowledge representation, aim to provide efficient responses to natural language queries. While these systems have seen extensive development for English, languages like Indonesian present unique challenges and opportunities. This literature review examines the state of ontology-based Indonesian QA systems and highlights their critical challenges. The first challenge lies in sentence understanding, variation, and complexity. Most systems rely on syntactic analysis and struggle to grasp sentence semantics. Complex sentences, especially in Indonesian, pose difficulties in parsing, semantic interpretation, and knowledge extraction. Addressing these linguistic intricacies is pivotal for accurate responses. Secondly, template-based SPARQL query construction, commonly used in Indonesian QA systems, suffers from semantic gaps and inflexibility. Advanced techniques such as semantic matching algorithms and dynamic template generation can bridge these gaps and adapt to evolving ontologies. Thirdly, lexical gaps and ambiguity hinder QA systems: vocabulary mismatches between user queries and ontology labels remain difficult to bridge. Strategies such as synonym expansion, word embedding, and ontology enrichment must be explored further to overcome them. Lastly, the review discusses the potential of developing multi-domain ontologies to broaden the knowledge coverage of QA systems. While this presents complex linguistic and ontological challenges, it offers the advantage of answering diverse user queries across multiple domains. In sum, this literature review identifies crucial challenges in developing ontology-based Indonesian QA systems and suggests innovative approaches to address them.
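To make the second and third challenges concrete, the following is a minimal sketch of template-based SPARQL construction combined with a small synonym table to bridge a lexical gap. It is illustrative only and is not taken from any of the reviewed systems: the `ex:` namespace, the property names (`ketinggian`, `lokasi`), the synonym table, and the function `build_query` are all hypothetical assumptions.

```python
import re

# Hypothetical synonym table: question vocabulary -> ontology property name.
# A real system might derive this from a thesaurus, embeddings, or ontology labels.
SYNONYMS = {
    "tinggi": "ketinggian",
    "ketinggian": "ketinggian",
    "letak": "lokasi",
    "lokasi": "lokasi",
}

# One fixed question pattern: <question word> <property word> <entity name>?
QUESTION_PATTERN = re.compile(
    r"^(?:berapa|di mana|apa) (?P<prop>\w+) (?P<entity>.+?)\?*$",
    re.IGNORECASE,
)

# SPARQL template with two slots; the ex: namespace is an assumed placeholder.
SPARQL_TEMPLATE = (
    "PREFIX ex: <http://example.org/ontology#>\n"
    "PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>\n"
    "SELECT ?answer WHERE {{\n"
    '  ?s rdfs:label "{entity}"@id .\n'
    "  ?s ex:{prop} ?answer .\n"
    "}}"
)


def build_query(question):
    """Return a SPARQL query string, or None if the question cannot be mapped."""
    match = QUESTION_PATTERN.match(question.strip())
    if match is None:
        return None  # the sentence does not fit the template (inflexibility)
    prop = SYNONYMS.get(match.group("prop").lower())
    if prop is None:
        return None  # no ontology property for this word (lexical gap)
    # Escape characters that would break out of the SPARQL string literal.
    entity = match.group("entity").replace("\\", "\\\\").replace('"', '\\"')
    return SPARQL_TEMPLATE.format(entity=entity, prop=prop)


if __name__ == "__main__":
    print(build_query("Berapa tinggi Gunung Merapi?"))
```

In this sketch, "Berapa tinggi Gunung Merapi?" and "Berapa ketinggian Gunung Merapi?" yield the same query because both words map to one property, whereas a question with an unlisted word or a different sentence structure returns `None`. That failure mode is the inflexibility and lexical gap the review attributes to template-based approaches.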


DOI: http://dx.doi.org/10.17977/um018v6i22023p129-144


Copyright (c) 2023 Knowledge Engineering and Data Science

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
