Automatic Recognition of Arabic Poetry Meter from Speech Signal using Long Short-term Memory and Support Vector Machine
The recognition of poetry meter in spoken lines is a natural language processing application that aims to identify the pattern of stressed and unstressed syllables in a line of a poem. The state of the art includes only a few works on the automatic recognition of Arud meters, all of which are text-based models; none is voice-based. Poetry meter recognition is not easy for an ordinary reader and is very difficult for a listener, so it is usually performed manually by experts. This paper proposes a model to detect the poetry meter from a single spoken line (“Bayt”) of an Arabic poem. The dataset consists of 230 samples collected from 10 Arabic poems, covering three meters and read by two speakers. Linear prediction cepstrum coefficient and Mel frequency cepstral coefficient (MFCC) features are extracted and used as time-series input to the proposed long short-term memory (LSTM) classifier. In addition, a global feature set, computed from statistics of the features across all frames, is fed to the support vector machine (SVM) classifier. The results show that the SVM model achieves the highest accuracy in the speaker-dependent approach, improving results by 3% compared to the state-of-the-art studies, whereas in the speaker-independent approach, the LSTM with MFCC features outperforms the other proposed models.
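The global feature set described above can be illustrated with a minimal sketch. It pools a variable-length sequence of frame-level feature vectors (such as MFCC frames) into one fixed-length vector that a classifier like an SVM could accept. The abstract says only that "some statistics" are used, so the choice of mean, standard deviation, minimum, and maximum here is an assumption, as are the function name `global_features` and the toy input values.

```python
from statistics import mean, pstdev


def global_features(frames):
    """Pool a sequence of per-frame feature vectors into one fixed-length vector.

    frames: list of equal-length lists, one list of coefficients per frame.
    The four statistics below are an illustrative assumption, not the
    paper's confirmed feature set.
    """
    if not frames:
        raise ValueError("need at least one frame")
    pooled = []
    # zip(*frames) regroups the values so each tuple holds one coefficient
    # across all frames.
    for coefficient_track in zip(*frames):
        pooled.extend([
            mean(coefficient_track),
            pstdev(coefficient_track),  # population standard deviation
            min(coefficient_track),
            max(coefficient_track),
        ])
    return pooled


# Two toy "spoken lines" of different durations, 2 coefficients per frame.
short_line = [[1.0, 4.0], [3.0, 8.0]]
long_line = [[0.0, 1.0], [2.0, 1.0], [4.0, 1.0]]

short_vec = global_features(short_line)
long_vec = global_features(long_line)

# Both vectors have the same length regardless of the number of frames.
print(len(short_vec), len(long_vec))
```

The pooled vector length depends only on the number of coefficients per frame, which is why lines of different durations can share one fixed-input classifier. The LSTM path, by contrast, consumes the frame sequence directly and so keeps the temporal order that pooling discards.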
Copyright (c) 2020 Abdulbasit K. Al-Talabani
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.