Constructing Qur’an Recitation Classification using Alexnet Algorithm

Harits Ar Rosyid; Dzulkifli Abdullah; Mohammed S. Alqahtani

doi:10.17977/um018v7i22024p152-163

Abstract

The growing demand for accurate and efficient Qur'an recitation classification highlights the limitations of existing models, particularly in assisting the memorization process. This study addresses these challenges by applying the AlexNet Convolutional Neural Network architecture, widely recognized for its effectiveness in image classification, to classify Qur'an recitations, using Mel-Frequency Cepstral Coefficients (MFCC) as the feature extraction method. The research proceeds in several stages: data collection, preprocessing (audio segmentation by verse), data augmentation, feature extraction, classification with the AlexNet architecture, and performance evaluation. The results show that the combination of MFCC and AlexNet yields promising accuracy in classifying Surah Al-Ikhlas recitations, suggesting its potential for automatic reading correction. The approach improves significantly over traditional methods, contributing to more effective tools for Qur'an memorization assistance. Future work could explore its application in other contexts and address potential challenges related to varying audio quality.
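To make the feature extraction stage concrete, the following is a minimal sketch of how MFCCs can be computed from an audio signal: framing, windowing, power spectrum, mel filterbank, logarithm, and discrete cosine transform. It is an illustration only, not the authors' implementation. The frame length, hop size, number of filters, number of coefficients, and the synthetic test tone are all assumptions chosen for the example, and a naive DFT is used for readability where a real pipeline would typically rely on an FFT or an audio library.

```python
import math

def hz_to_mel(f):
    # Common mel-scale formula (one of several variants in use).
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def power_spectrum(frame):
    """Hamming-window one frame; return its one-sided power spectrum (naive DFT)."""
    n = len(frame)
    win = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
           for i, x in enumerate(frame)]
    spec = []
    for k in range(n // 2 + 1):
        re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(win))
        im = -sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(win))
        spec.append((re * re + im * im) / n)
    return spec

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale up to the Nyquist frequency."""
    top = hz_to_mel(sample_rate / 2)
    mels = [top * i / (n_filters + 1) for i in range(n_filters + 2)]
    bins = [int(math.floor((n_fft + 1) * mel_to_hz(m) / sample_rate)) for m in mels]
    bank = []
    for j in range(1, n_filters + 1):
        left, centre, right = bins[j - 1], bins[j], bins[j + 1]
        filt = [0.0] * (n_fft // 2 + 1)
        for k in range(left, centre):       # rising edge
            filt[k] = (k - left) / (centre - left)
        for k in range(centre, right):      # falling edge
            filt[k] = (right - k) / (right - centre)
        bank.append(filt)
    return bank

def dct2(values, n_coeffs):
    """Unnormalized DCT-II, keeping the first n_coeffs coefficients."""
    n = len(values)
    return [sum(v * math.cos(math.pi * c * (i + 0.5) / n) for i, v in enumerate(values))
            for c in range(n_coeffs)]

def mfcc(signal, sample_rate, frame_len=256, hop=128, n_filters=20, n_coeffs=13):
    """Return one list of n_coeffs MFCCs per frame (all defaults are assumptions)."""
    bank = mel_filterbank(n_filters, frame_len, sample_rate)
    features = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        spec = power_spectrum(signal[start:start + frame_len])
        # Small constant avoids log(0) when a filter captures no energy.
        log_energies = [math.log(sum(f * s for f, s in zip(filt, spec)) + 1e-10)
                        for filt in bank]
        features.append(dct2(log_energies, n_coeffs))
    return features

# Hypothetical input: 0.1 s of a 440 Hz tone sampled at 8 kHz.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 440 * t / sample_rate) for t in range(800)]
features = mfcc(tone, sample_rate)
print(len(features), "frames x", len(features[0]), "coefficients")
```

The resulting frames-by-coefficients matrix is the kind of two-dimensional input that can be treated like an image and passed to a convolutional classifier such as AlexNet; how the study resizes or stacks these features is not stated in the abstract.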

