Download PDFOpen PDF in browserTransfer Learning Using Musical/Non-Musical Mixtures for Multi-Instrument RecognitionEasyChair Preprint 107275 pages•Date: August 16, 2023AbstractDatasets for most music information retrieval tend to be relatively small. However, in deep learning, insufficient training data often leads to poor performance. Typically, this problem is approached by transfer learning (TL) and data augmentation. In this work, we compare various of these methods for the task of multi-instrument recognition. A convolutional neural network (CNN) is able to identify eight instrument families and seven specific instruments from polyphonic music recordings. Training is conducted in two phases: After pre-training with a music tagging dataset, the CNN is retrained using multi-track data. Experimenting with different TL methods suggests that training the final fully-connected layers from scratch while fine-tuning the convolutional backbone yields the best performance. Two different mixing strategies - musical and non-musical mixing -- are investigated. It turns out that a blend of both mixing strategies works best for multi-instrument recognition. Keyphrases: Convolutional Neural Network, Multi-Instrument Recognition, ransfer Learning
|