Automatic Recognition of Arabic Poetry Meter from Speech Signal using Long Short-term Memory and Support Vector Machine

Abdulbasit K. Al-Talabani

Abstract


The recognition of the poetry meter in spoken lines is a natural language processing application that aims to identify a stressed and unstressed syllabic pattern in a line of a poem. Stateof-the-art studies include few works on the automatic recognition of Arud meters, all of which are text-based models, and none is voice based. Poetry meter recognition is not easy for an ordinary reader, it is very difficult for the listener and it is usually performed manually by experts. This paper proposes a model to detect the poetry meter from a single spoken line (“Bayt”) of an Arabic poem. Data of 230 samples collected from 10 poems of Arabic poetry, including three meters read by two speakers, are used in this work. The work adopts the extraction of linear prediction cepstrum coefficient and Mel frequency cepstral coefficient (MFCC) features, as a time series input to the proposed long short-term memory (LSTM) classifier, in addition to a global feature set that is computed using some statistics of the features across all of the frames to feed the support vector machine (SVM) classifier. The results show that the SVM model achieves the highest accuracy in the speakerdependent approach. It improves results by 3%, as compared to the state-of-the-art studies, whereas for the speaker-independent approach, the MFCC feature using LSTM exceeds the other proposed models.


Keywords


Speech processing; Long short-term memory; Support vector machine; Prosody; Cepstral features

Full Text:

PDF

References


Abuata, B. and Al-Omari, A., 2018. A rule-based algorithm for the detection of arud meter in CLASSICAL Arabic poetry. International Arab Journal of Information Technology, 15(4), pp. 1-5.

Al-Falahi, A. Ramdani, M. and Bellafkih, M., 2017, Machine learning for authorship attribution in Arabic poetry. International Journal of Future Computer and Communication, 6(2), p. 486.

Almuhareb, A. Alkharashi, I. Al-Saud, L. and Altuwaijri, H., 2013. Recognition of Classical Arabic Poems. In: 2nd Workshop on Computational Linguistics for Literature, pp. 9-16.

Alnagdawi, M., Rashideh, H. and Aburumman, F., 2013. Finding Arabic poem meter using context free grammar. Journal of Communication and Computer Engineering, 3(1), pp. 52-59.

Araújo, L. 2010, Computer-based Assessment (CBA) of Foreign language speaking skills. JRC Scientific and Technical Reports, 1, p. 165. Arberry, J., 1965. Arabic Poetry. A Primer for Students. Cambridge University Press, Cambridge.

Hirjee, H. and Brown, D., 2010, Using automated rhyme detection to characterize rhyming style in rap music. Empirical Musicology Review, 5(4), pp. 121-145.

Hochreiter, S. and Schmidhuber, J., 1997. Long short-term memory, Neural computation, 9(8), pp. 1735-1780.

Ismail, A., Eladawy, M., Keshk, H. and Saleh, S., 2010. Expert system for testing the harmony of Arabic poetry. Journal of Engineering Sciences, 1, pp. 401-411.

Kurta, A. and Kara, M., 2012. An algorithm for the detection and analysis of Arud meter in Diwan poetry. Turk Journal of Electrical Engineering and Computer Science, 20(6), pp. 948-963.

Lipton, C., Berkowitz, J. and Elkan, C., 2015. A Critical Review of Recurrent Neural Networks for Sequence Learning, arXiv Preprint arXiv: 1506.00019. Available from: https://www.arxiv.org/abs/1506.00019.

Morris, H., 1966. On the metrics of pre-islamic Arabic poetry. Quarterly Progress Report of the Research, Laboratory of Electronics, 83, pp. 113-116.

Rao, K. and Koolagudi, S., 2013, Robust Emotion Recognition using Spectral and Prosodic Features. Springer Science and Business Media. Berlin, Germany, pp. 23-24.

Reynolds, A., 1995. Speaker identification and verification using Gaussian mixture speaker models. Speech Communication, 17(1-2), pp. 91-108. Available from: https://www.sciencedirect.com/science/article/abs/pii/016763939500009D.

Rutledge, J.C., 1995. Fundamentals of speech recognition, by lawrence rabiner and bing-hwang juang. Analysis of Biomedical Engineering, 23, pp. 526-526.

Sarangi, S.K. and Saha, G., 2020, Improved speech-signal based frequency warping scale for cepstral feature in robust speaker verification system. Journal of Signal Processing Systems, 1, pp. 1-14.

Scott, H. 2010. Pegs, Cords, and Ghuls: Meter of Classical Arabic Poetry. Swarthmore College Department of Linguistics. Available from: https://www.scholarship.tricolib.brynmawr.edu/handle/10066/6864.

Sønderby, K., Sønderby, K., Nielsen, H. and Winther, O., 2015. Convolutional LSTM Networks for Subcellular Localization of Proteins. International Conference on Algorithms for Computational Biology, Springer, pp. 68-80.

Stoetzer, W., 1989. Theory and Practice in Arabic Metrics. Leiden, Het Oosters Institute. Vapnik, V., 1995, The Nature of Statistical Learning Theory. Springer-Verlag, NewYork. Available from: https://www.springer.com/gp/book/9780387987804.

Wells, J.R., Ting, K.M. and Naiwala, C.P., 2012, December. A Non-time Series Approach to Vehicle Related Time Series Problems. Vol. 134. In: Proceedings of the 10th Australasian Data Mining Conference, Australian Computer Society, Inc., pp. 61-70.

Yousef, W.A., Ibrahime, O.M., Madbouly, T.M. and Mahmoud, M.A., 2019. Learning meters of Arabic and English poems with Recurrent Neural Networks: a step forward for language understanding and synthesis, arXiv preprint arXiv:1905.05700. Available from: https://www.arxiv.org/ abs/1905.05700.

Zhang, L. and Gao, J., 2017, A comparative study to understanding about poetics based on natural language processing. Open Journal of Modern Linguistics, 7(5), pp. 229-237.




DOI: http://dx.doi.org/10.14500/aro.10631
View Counter: Abstract | 180 | and PDF | 129 |

Article Metrics

Metrics Loading ...

Metrics powered by PLOS ALM

Refbacks

  • There are currently no refbacks.


Copyright (c) 2020 Abdulbasit K. Al-Talabani

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

  
 


ARO Journal is an OAJ supported by Koya University, it has no article submission/processing charges (APCs).
© 2013-2020, Koya University is a public University accredited by the Ministry of Higher Education and Scientific Research, KRG - F.R. Iraq.