Music Genre Classification Using Audio Data

Sowmya Natarajan, Shivansh Saxena, Ishan Bhardwaj, Hussain Hakeem

Dept. of Electronics & Communication Engineering, S.R.M. Institute of Science & Technology, Chennai, India.

ABSTRACT: Music genre classification automatically assigns music tracks to genres based on their audio features. The first step is feature extraction, which captures attributes such as rhythm, timbre, and pitch. Algorithms such as K-Nearest Neighbors (K-NN), Support Vector Machines (SVM), and Random Forests are commonly used for the classification task itself. K-NN relies on similarity measures between tracks, SVM separates genres by finding optimal hyperplanes, and Random Forests combine multiple decision trees for robust classification. These methods improve music recommendation and organization systems by enabling efficient genre identification. In our experiments, Random Forest achieved the highest accuracy at 87%, followed by SVM at 71% and K-NN at 27%.

KEYWORDS: Music genre classification, Support Vector Machines (SVM), Random Forests
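
The two stages described in the abstract, feature extraction followed by similarity-based classification, can be illustrated with a minimal sketch. It is not the authors' implementation: the synthetic "tracks", the two features (zero-crossing rate and RMS energy), the sample rate, and all function names are assumptions chosen so the example runs with the Python standard library alone. Its accuracy on toy data says nothing about the figures reported above.

```python
import math
import random
from collections import Counter

SAMPLE_RATE = 8000  # assumed sample rate in Hz, for illustration only


def synth_track(kind, rng, n=2000):
    """Toy stand-in for a track: a low or high sine tone plus mild noise."""
    freq = rng.uniform(100, 200) if kind == "low" else rng.uniform(1000, 1500)
    return [
        math.sin(2 * math.pi * freq * t / SAMPLE_RATE) + rng.gauss(0, 0.05)
        for t in range(n)
    ]


def extract_features(signal):
    """Return (zero-crossing rate, RMS energy) for one signal."""
    crossings = sum(1 for a, b in zip(signal, signal[1:]) if (a < 0) != (b < 0))
    zcr = crossings / (len(signal) - 1)
    rms = math.sqrt(sum(x * x for x in signal) / len(signal))
    return (zcr, rms)


def knn_predict(train, query, k=3):
    """Majority vote among the k training points closest to the query
    (Euclidean distance in feature space)."""
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]


rng = random.Random(0)  # fixed seed so the run is repeatable
kinds = ["low", "high"]  # hypothetical two-class "genre" labels
train = [(extract_features(synth_track(k, rng)), k) for k in kinds for _ in range(10)]
test = [(extract_features(synth_track(k, rng)), k) for k in kinds for _ in range(5)]

correct = sum(knn_predict(train, feats) == label for feats, label in test)
accuracy = correct / len(test)
print(f"toy K-NN accuracy: {accuracy:.2f}")
```

In a fuller pipeline, an SVM or Random Forest would take the place of `knn_predict` and operate on the same feature vectors, and the features would be computed from real audio rather than synthetic tones.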

Source: Proceedings of the 16th International Conference on Business Information Security BISEC’2025

sowmyan1@srmist.edu.in

ss0885@srmist.edu.in

ib8884@srmist.edu.in

hh1834@srmist.edu.in

DOI: 10.46793/BISEC25.466N