Emmanouil Benetos

Publication List

[You can also visit my Queen Mary Research Online Webpage for PDFs of (most of) my papers]

Books and Book Chapters

L. Liu and E. Benetos, “From audio to music notation”, in Handbook of Artificial Intelligence for Music, E. Miranda (ed.), pp. 693-714, Springer, 2021 (ISBN: 978-3-030-72115-2).
E. Benetos, D. Stowell, and M. D. Plumbley, “Approaches to complex sound scene analysis”, in Computational Analysis of Sound Scenes and Events, T. Virtanen, M. D. Plumbley, and D. P. W. Ellis (eds.), pp, 215-242, Springer, 2018 (ISBN: 978-3-319-63450-0).
X. Serra, M. Magas, E. Benetos, M. Chudy, S. Dixon, A. Flexer, E. Gómez, F. Gouyon, P. Herrera, S. Jorda, O. Paytuvi, G. Peeters, J. Schlüter, H. Vinet, G. Widmer, “Roadmap for Music Information ReSearch”, G. Peeters (ed.), Creative Commons BY-NC-ND 3.0 license, 2013 (ISBN: 978-2-9540351-1-6).
ROADMAP WIKI
E. Benetos, S. Siatras, C. Kotropoulos, N. Nikolaidis, and I. Pitas, “Movie analysis with emphasis to dialogue and action scene detection”, in Multimodal Processing and Interaction: Audio, Video, Text, P. Maragos, A. Potamianos and P. Gros (eds.), Springer-Verlag, 2008 (ISBN: 978-0-387-76315-6).

Journal Papers

D. Edwards, S. Dixon, E. Benetos, A. Maezawa, Y. Kusaka, “A data-driven analysis of robust automatic piano transcription”, IEEE Signal Processing Letters, vol. 31, pp. 681-685, 2024.
postprint
S. Singh, C. J. Steinmetz, E. Benetos, H. Phan, and D. Stowell, “ATGNN: audio tagging graph neural network”, IEEE Signal Processing Letters, vol. 31, pp. 825-829, 2024.
postprint
Y. Li, W. Cao, W. Xie, J. Li, and E. Benetos, “Few-shot class-incremental audio classification using dynamically expanded classifier with self-attention modified prototypes”, IEEE Transactions on Multimedia, vol. 26, pp. 1346-1360, 2024.
D. Edwards, S. Dixon, and E. Benetos, “PiJAMA: Piano Jazz with Automatic MIDI Annotations”, Transactions of the International Society for Music Information Retrieval, vol. 6, no. 1, pp. 89-102, 2023.
A. Ragano, E. Benetos, and A. Hines, “Automatic quality assessment of digitized and restored sound archives”, Journal of the Audio Engineering Society, vol. 70, no. 4, pp. 252-270, April 2022.
C. Wang, E. Benetos, V. Lostanlen, and E. Chew, “Adaptive scattering transforms for playing technique recognition”, IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 30, pp. 1407-1421, March 2022.
postprint
E. Benetos, A. Ragano, D. Sgroi, and A. Tuckwell, “Measuring National Mood with Music: Using Machine Learning to Construct a Measure of National Valence from Audio Data”, Behavior Research Methods, Feb. 2022.
postprint
A. Terenzi, N. Ortolani, I. Nolasco, E. Benetos, and S. Cecchi, “Comparison of feature extraction methods for sound-based classification of honey bee activity”, IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 30, pp. 112-122, 2022.
postprint
A. Holzapfel, E. Benetos, A. Killick, and R. Widdess, “Humanities and engineering perspectives on music transcription”, Digital Scholarship in the Humanities, Oct. 2021.
C. Lordelo, E. Benetos, S. Dixon, S. Ahlbäck, and P. Ohlsson, “Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation”, IEEE Signal Processing Letters, vol. 28, pp. 81-85, 2021.
Supplementary webpage postprint
B. Chettri, E. Benetos, and B. L. Sturm, “Dataset artefacts in anti-spoofing systems: a case study on the ASVspoof 2017 benchmark”, IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 28, pp. 3018-3028, Nov. 2020.
postprint
B. Chettri, T. Kinnunen, and E. Benetos, “Deep generative variational autoencoding for replay spoof detection in automatic speaker verification”, Computer Speech and Language, vol. 63, article no. 101092, Sept. 2020.
Download: postprint
A. Ycart, L. Liu, E. Benetos, and M. T. Pearce, “Investigating the Perceptual Validity of Evaluation Metrics for Automatic Piano Music Transcription”, Transactions of the International Society for Music Information Retrieval, vol. 3, no. 1, pp. 68-81, June 2020.
A. Ycart and E. Benetos, “Learning and evaluation methodologies for polyphonic music sequence prediction with LSTMs”, IEEE/ACM Transactions on Audio, Speech and Language Processing, vol. 28, pp. 1328-1341, Apr. 2020.
Download: postprint
M. A. Martínez Ramírez, E. Benetos, and J. D. Reiss, “Deep learning for black-box modeling of audio effects”, Applied Sciences, vol. 10, no. 2, Jan. 2020.
Q. Zhou, Z. Feng, and E. Benetos, “Adaptive noise reduction for sound event detection using subband-weighted NMF”, Sensors, vol. 19, no. 14, July 2019.
E. Covas and E. Benetos, “Optimal Neural Network Feature Selection for Spatial-Temporal Forecasting”, Chaos, vol. 29, no. 6, June 2019.
Download: postprint
E. Benetos, S. Dixon, Z. Duan, and S. Ewert, “Automatic Music Transcription: An Overview”, IEEE Signal Processing Magazine, vol. 36, no. 1, pp. 20-30, Jan. 2019.
Download: postprint
J. J. Valero-Mas, E. Benetos, and J. M. Iñesta, “A supervised classification approach for note tracking in polyphonic piano transcription”, Journal of New Music Research, vol. 47, no. 3, pp. 249-263, June 2018.
Download: postprint
H. Ali and S. N. Tran, E. Benetos and A. S. d'Avila Garcez, “Speaker recognition with hybrid features from a deep belief network”, Neural Computing and Applications, vol. 29, no. 6, pp. 13-19, March 2018.
Download: postprint
M. Panteli, E. Benetos, and S. Dixon, “A review of manual and computational approaches for the study of world music corpora”, Journal of New Music Research, vol. 47, no. 2, pp. 176-189, March 2018.
Download: postprint
A. Mesaros, T. Heittola, E. Benetos, P. Foster, M. Lagrange, T. Virtanen, and M. D. Plumbley, “Detection and Classification of Acoustic Scenes and Events: Outcome of the DCASE 2016 Challenge”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 26, no. 2, pp. 379-393, Feb. 2018.
Challenge website
Download: postprint
M. Panteli, E. Benetos, and S. Dixon, “A computational study on outliers in world music”, PLoS ONE, vol. 12, no. 12, article no. e0189399, Dec. 2017.
Download: paper
A. McLeod, R. Schramm, M. Steedman, and E. Benetos, “Automatic Transcription of Polyphonic Vocal Music”, Applied Sciences, vol. 7, no. 12, article no. 1285, Dec. 2017.
Download: paper
E. Benetos, G. Lafay, M. Lagrange and M. D. Plumbley, “Polyphonic Sound Event Tracking using Linear Dynamical Systems”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 6, pp. 1266-1277, Jun. 2017.
postprint code
D. Stowell, E. Benetos, and L. F. Gill, “On-bird Sound Recordings: Automatic Acoustic Recognition of Activities and Contexts”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 25, no. 6, pp. 1193-1206, Jun. 2017.
postprint
S. Abdallah, E. Benetos, N. Gold, S. Hargreaves, T. Weyde, and D. Wolff, “The Digital Music Lab: A Big Data Infrastructure for Digital Musicology”, ACM Journal on Computing and Cultural Heritage, vol. 10, no. 1, pp. 2:1-2:21, April 2017.
postprint project website
G. Lafay, M. Lagrange, M. Rossignol, E. Benetos, and A. Roebel, “A morphological model for simulating acoustic scenes and its application to sound event detection”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 24, no. 10, pp. 1854-1864, Oct. 2016.
Download: postprint
S. Sigtia, E. Benetos, and S. Dixon, “An End-to-End Neural Network for Polyphonic Piano Music Transcription”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol.24, no. 5, pp. 927-939, May 2016.
Download: postprint
E. Benetos and A. Holzapfel, “Automatic transcription of Turkish microtonal music”, Journal of the Acoustical Society of America, vol. 138, no. 4, pp. 2118-2130, Oct. 2015.
Download: paper
CODE
Copyright (2015) Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America. The article appeared in the Journal of the Acoustical Society of America, 138(4):2118-2130 and may be found at http://link.aip.org/link/?JAS/138/2118
D. Stowell, D. Giannoulis, E. Benetos, M. Lagrange and M. D. Plumbley, “Detection and classification of acoustic scenes and events”, IEEE Transactions on Multimedia, vol. 17, no. 10, pp. 1733-1746, Oct. 2015.
Download: paper
Link: IEEE DCASE Challenge website (incl. datasets and code)
D. Tidhar, S. Dixon, E. Benetos, and T. Weyde, “The temperament police”, Early Music, vol. 42, no. 4, pp. 579-590, Nov. 2014.
Download: paper
E. Benetos, S. Dixon, D. Giannoulis, H. Kirchhoff, and A. Klapuri, “Automatic music transcription: challenges and future directions”, Journal of Intelligent Information Systems, vol. 41, no. 3, pp. 407-434, Dec. 2013.
Download: postprint
The final publication is available at http://link.springer.com/article/10.1007/s10844-013-0258-3.
E. Benetos and S. Dixon, “Multiple-instrument polyphonic music transcription using a temporally constrained shift-invariant model”, Journal of the Acoustical Society of America, vol. 133, no. 3, pp. 1727-1741, Mar. 2013.
Download: paper
Copyright (2013) Acoustical Society of America. This article may be downloaded for personal use only. Any other use requires prior permission of the author and the Acoustical Society of America. The article appeared in the Journal of the Acoustical Society of America, 133 (3):1727-1741 and may be found at http://link.aip.org/link/?JAS/133/1727
E. Benetos and S. Dixon, “A shift-invariant latent variable model for automatic music transcription”, Computer Music Journal, vol. 36, no. 4, pp. 81-94, Winter 2012.
Download: paper
Copyright (2013) Massachusetts Institute of Technology.
E. Benetos and S. Dixon, “Joint multi-pitch detection using harmonic envelope estimation for polyphonic music transcription”, IEEE Journal on Selected Topics in Signal Processing, vol. 5, no. 6, pp. 1111-1123, Oct. 2011.
Download: postprint
A. Anglade, E. Benetos, M. Mauch, and S. Dixon, “Improving music genre classification using automatically induced harmony rules”, Journal of New Music Research, vol. 39, no. 4, pp. 327-339, Dec. 2010.
Download: postprint
E. Benetos and Y. Stylianou, “Auditory spectrum-based pitched instrument onset detection”, IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 8, pp. 1968-1977, Nov. 2010.
Download: postprint
E. Benetos and C. Kotropoulos, “Non-negative tensor factorization applied to music genre classification”, IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, no. 8, pp. 1955-1967, Nov. 2010.
Download: postprint
M. Kotti, E. Benetos, and C. Kotropoulos, “Computationally efficient and robust BIC-based speaker segmentation”, IEEE Transactions on Audio, Speech, and Language Processing, vol. 16, no. 5, pp. 920-933, July 2008.
Download: postprint
M. Kotti, E. Benetos, C. Kotropoulos, and I. Pitas, “A neural network approach to audio-assisted movie dialogue detection”, Neurocomputing, vol. 71, pp. 157-166, Dec. 2007.
Download: postprint

Peer-reviewed Conference Papers

A. Xompero, M. Bontonou, J.-M. Arbona, E. Benetos, A. Cavallaro, "Explaining models relating objects and privacy", in 3rd Explainable AI for Computer Vision (XAI4CV) Workshop at CVPR 2024, accepted.
Z. Deng, Y. Ma, Y. Liu, R. Guo, G. Zhang, W. Chen, W. Huang, E. Benetos, "MusiLingo: bridging music and text with pre-trained language models for music captioning and query response", in 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), accepted.
preprint
Y. Li, R. Yuan, G. Zhang, Y. Ma, X. Chen, H. Yin, C. Xiao, C. Lin, A. Ragni, E. Benetos, N. Gyenge, R. Dannenberg, R. Liu, W. Chen, G. Xia, Y. Shi, W. Huang, Z. Wang, Y. Guo, J. Fu, "MERT: acoustic music understanding model with large-scale self-supervised training", in 12th International Conference on Learning Representations (ICLR), accepted.
preprint
J. Liang, H. Phan, E. Benetos, "Learning from taxonomy: multi-label few-shot classification for everyday sound recognition", in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 771-775, April 2024.
postprint
E. Postolache, G. Mariani, L. Cosmo, E. Benetos, E. Rodolà, "Generalized multi-source inference for text conditioned music diffusion models", in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 6980-6984, April 2024.
postprint
D. Li, Y. Ma, W. Wei, Q. Kong, Y. Wu, M. Che, F. Xia, E. Benetos, W. Li, "MERTech: Instrument playing technique detection using self-supervised pretrained model with multi-task finetuning", in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 521-525, April 2024.
postprint
R. Yuan, Y. Ma, Y. Li, G. Zhang, X. Chen, H. Yin, L. Zhuo, Y. Liu, J. Huang, Z. Tian, B. Deng, N. Wang, C. Lin, E. Benetos, A. Ragni, N. Gyenge, R. Dannenberg, W. Chen, G. Xia, W. Xue, S. Liu, S. Wang, R. Liu, Y. Guo, J. Fu, "MARBLE: Music Audio Representation Benchmark for Universal Evaluation", 37th Conf. Neural Information Processing Systems (NeurIPS), Dec. 2023.
O. Deb, E. Benetos, and P. Torr, "Remaining-useful-life prediction and uncertainty quantification using LSTM ensembles for aircraft engines", in NeurIPS Workshop on Advancing Neural Network Training (WANT): Computational Efficiency, Scalability, and Resource Optimization, Dec. 2023.
postprint
I. Manco, B. Weck, S. Doh, Y. Zhang, D. Bogdanov, Y. Wu, K. Chen, P. Tovstogan, E. Benetos, E. Quinton, G. Fazekas, J. Nam, and M. Won, "The Song Describer Dataset: a corpus of audio captions for music-and-language evaluation", in NeurIPS Machine Learning for Audio Workshop, Dec. 2023.
A. Ragano, E. Benetos and A. Hines, "Learning Music Representations with wav2vec 2.0", 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS 2023), Dec. 2023.
postprint
C. Papaioannou, E. Benetos, and A. Potamianos, "From West to East: Who can understand the music of the others better?", 24th International Society for Music Information Retrieval Conference (ISMIR), Nov. 2023.
postprint
Y. Ma, R. Yuan, Y. Li, G. Zhang, C. Lin, X. Chen, A. Ragni, H. Yin, E. Benetos, N. Gyene, R. Liu, G. Xia, R. Dannenberg, Y. Guo, J. Fu, "On the effectiveness of speech self-supervised learning for music", 24th International Society for Music Information Retrieval Conference (ISMIR), Nov. 2023.
postprint
L. Zhuo, R. Yuan, J. Pan, Y. Ma, Y. Li, G. Zhang, S. Liu, R. Dannenberg, J. Fu, C. Lin, E. Benetos, W. Chen, W. Xue, Y. Guo, "LyricWhiz: Robust Multilingual Lyrics Transcription by Whispering to ChatGPT", 24th International Society for Music Information Retrieval Conference (ISMIR), Nov. 2023.
postprint
C. Vahidi, S. Singh, G. Fazekas, E. Benetos, D. Stowell, H. Phan, M. Lagrange, "Perceptual musical similarity metric learning with graph neural networks", IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct. 2023.
postprint
S. Sarkar, L. Thorpe, E. Benetos, M. Sandler, "Leveraging synthetic data for improving chamber ensemble separation", IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Oct. 2023. Best student paper award
postprint
J. Liang, X. Liu, H. Liu, H. Phan, E. Benetos, M. Plumbley, W. Wang, "Adapting language-audio models as few-Shot audio learners", 24th Annual Conference of the International Speech Communication Association (INTERSPEECH), Aug. 2023.
postprint
A. Ragano, E. Benetos, M. Chinen, H. B. Martinez, C. K. A. Reddy, J. Skoglund, A. Hines, "A Comparison Of Deep Learning MOS Predictors For Speech Synthesis Quality", Irish Signals & Systems Conference (ISSC), June 2023.
postprint
A. Ragano, E. Benetos, and A. Hines, "Audio quality assessment of vinyl music collections using self-supervised learning", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), June 2023.
postprint
I. Manco, E. Benetos, G. Fazekas, and E. Quinton, "Contrastive Audio-Language Learning for Music", in 23rd International Society for Music Information Retrieval Conference (ISMIR), Dec. 2022.
postprint
S. Sarkar, E. Benetos, and M. Sandler, "EnsembleSet: a new high quality dataset for chamber ensemble separation", in 23rd International Society for Music Information Retrieval Conference (ISMIR), Dec. 2022.
postprint
L. Liu, Q. Kong, V. Morfi, and E. Benetos, "Performance MIDI-to-score conversion by neural beat tracking", in 23rd International Society for Music Information Retrieval Conference (ISMIR), Dec. 2022. Best paper award
postprint
K. T. Mai, T. Davies, L. D. Griffin, and E. Benetos, "Explaining the decisions of anomalous sound detectors", in 7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), Nov. 2022.
postprint
J. Liang, H. Phan, and E. Benetos, "Leveraging label hierarchies for few-shot everyday sound recognition", in 7th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), Nov 2022.
postprint
Y. Ozaki, J. Kuroyanagi, J. McBride, P. Proutskova, A. Tierney, P. Pfordresher, E. Benetos, F. Liu, P. E. Savage, "Similarities and differences in a cross-linguistic sample of song and speech recordings", in Joint Conference on Language Evolution (JCoLE), pp. 569-572, Sept. 2022.
postprint
S. Singh, H. Phan, and E. Benetos, "Hypernetworks for sound event detection: a proof-of-concept", in 30th European Signal Processing Conference (EUSIPCO 2022), pp. 429-433, Sept. 2022.
C. Wang, E. Benetos, S. Wang, and E. Versace, "Joint Scattering for Automatic Chick Call Recognition", in 30th European Signal Processing Conference (EUSIPCO 2022), pp. 195-199, Sept. 2022.
H. Daikoku, S. Ding, E. Benetos, A. L. C. Wood, T. Shimizono, U. S. Sanne, S. Fujii, P. E. Savage, "Agreement among human and automated estimates of similarity in a global music sample", in 10th International Workshop on Folk Music Analysis (FMA 2022), June 2022.
J. Huang, E. Benetos, S. Ewert, "Improving lyrics alignment through joint pitch detection", in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 451-455, May 2022.
postprint
I. Manco, E. Benetos, E. Quinton, G. Fazekas, "Learning music audio representations via weak language supervision", in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 456-460, May 2022.
postprint
L. Ou, Z. Guo, E. Benetos, J. Han, Y. Wang, "Exploring transformer's potential on automatic piano transcription", in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 776-780, May 2022.
postprint
C. Lordelo, E. Benetos, S. Dixon, and S. Ahlbäck, "Pitch-informed instrument assignment using a deep convolutional network with multiple kernel shapes", in 22nd International Society for Music Information Retrieval Conference (ISMIR), pp. 389-395, Nov. 2021.
Y. Ozaki, J. McBride, E. Benetos, P. Pfordresher, J. Six, A. Tierney, P. Proutskova, E. Sakai, H. Kondo, H. Fukatsu, S. Fujii, and P. Savage, "Agreement among human and annotated transcriptions of global songs", in 22nd International Society for Music Information Retrieval Conference (ISMIR), pp. 500-508, Nov. 2021.
R. P. P. Bodo, E. Benetos and M. Queiroz, "A framework for music similarity and cover song identification", in 15th International Symposium on Computer Music Multidisciplinary Research (CMMR), pp. 205-214, Nov. 2021.
K. O'Hanlon, E. Benetos, and S. Dimon, "Detecting cover songs with pitch class key-invariant networks", in IEEE International Workshop on Machine Learning for Signal Processing (MLSP), Oct. 2021.
postprint
S. Sarkar, E. Benetos, and M. Sandler, "Vocal harmony separation using time-domain neural networks", in 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH), pp. 3515-3519, Aug. 2021.
H. L. Bear, V. Morfi, and E. Benetos, "An evaluation of data augmentation methods for sound scene geotagging", in 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH), pp. 581-585, Aug. 2021.
Y. Zhao, C. Wang, G. Fazekas, E. Benetos, and M. Sandler, "Violinist identification based on vibrato features", in 29th European Signal Processing Conference (EUSIPCO), pp. 381-385, Aug. 2021.
postprint
I. Manco, E. Benetos, E. Quinton and G. Fazekas, "MusCaps: generating captions for music audio", in International Joint Conference on Neural Networks (IJCNN), July 2021.
postprint
K. W. Cheuk, Y.-J. Luo, E. Benetos and D. Herremans, "Revisiting the onsets and frames model with additive attention", in International Joint Conference on Neural Networks (IJCNN), July 2021.
postprint
A. Ragano, E. Benetos, and A. Hines, "More for Less: Non-Intrusive Speech Quality Assessment with Limited Annotations", in 13th International Conference on Quality of Multimedia Experience (QoMEX), pp. 103-108, June 2021.
postprint
L. Liu, V. Morfi, and E. Benetos, "Joint multi-pitch detection and score transcription for polyphonic piano music", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 281-285, June 2021.
postprint
S. Singh, H. L. Bear, and E. Benetos, "Prototypical networks for domain adaptation in acoustic scene classification", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 346-350, June 2021.
postprint
V. Subramanian, S. Gururani, E. Benetos, and M. D. Sandler, "Anomalous behaviour in loss-gradient based interpretability methods", in ICLR Robust and Reliable Machine Learning in the Real World Workshop (RobustML), May 2021.
postprint
K. W. Cheuk, Y.-J. Luo, E. Benetos, and D. Herremans, "The effect of spectrogram reconstructions on automatic music transcription: an alternative approach to improve transcription accuracy", in Proc. 25th International Conference on Pattern Recognition (ICPR2020), pp. 9091-9098, Jan. 2021.
postprint
B. Chettri, T. Kinnunen, and E. Benetos, "Subband modeling for spoofing detection in automatic speaker verification", in Proc. Odyssey 2020: The Speaker and Language Recognition Workshop, pp. 341-348, Nov. 2020.
postprint
A. Pankajakshan, H. L. Bear, V. Subramanian, and E. Benetos, "Memory controlled sequential self attention for sound recognition", in Proc. 21st Annual Conference of the International Speech Communication Association (INTERSPEECH), Oct. 2020.
A. Ragano, E. Benetos, and A. Hines, "Development of a speech quality database under uncontrolled conditions", in Proc. 21st Annual Conference of the International Speech Communication Association (INTERSPEECH), Oct. 2020.
S. Mishra, E. Benetos, B. L. Sturm and S. Dixon, "Reliable local explanations for machine listening", International Joint Conference on Neural Networks (IJCNN), July 2020.
postprint
A. Ragano, E. Benetos, and A. Hines, "Audio Impairment Recognition Using a Correlation-Based Feature Representation", in Proc. 12th International Conference on Quality of Multimedia Experience (QoMEX), May 2020.
postprint presentation video
V. Subramanian, A. Pankajakshan, E. Benetos, N. Xu, S. McDonald, and M. Sandler, "A study on the transferability of adversarial attacks in sound event classification", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 301-305, May 2020.
postprint
C. Wang, V. Lostanlen, E. Benetos, and E. Chew, "Playing technique recognition by joint time–frequency scattering", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 881-885, May 2020.
postprint
M. A. Martínez Ramírez, E. Benetos, and J. D. Reiss, "Modeling plate and spring reverberation using a DSP-informed deep neural network", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 241-245, May 2020.
postprint
W. Wei, H. Zhu, E. Benetos, and Y. Wang, "A-CRNN: a domain adaptation model for sound event detection", IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 276-280, May 2020.
postprint
A. Holzapfel and E. Benetos, "Automatic music transcription and ethnomusicology: a user study", in Proc. 20th International Society for Music Information Retrieval Conference (ISMIR), pp. 678-684, Nov. 2019.
postprint
A. Ycart, D. Stoller, and E. Benetos, "A comparative study of neural models for polyphonic music sequence transduction", in Proc. 20th International Society for Music Information Retrieval Conference (ISMIR), pp. 470-477, Nov. 2019.
postprint
A. Ycart, A. McLeod, E. Benetos, and K. Yoshii, "Blending acoustic and language model predictions for automatic music transcription", in Proc. 20th International Society for Music Information Retrieval Conference (ISMIR), pp. 454-461, Nov. 2019.
postprint
C. Wang, E. Benetos, V. Lostanlen, and E. Chew, "Adaptive time-frequency scattering for periodic modulation recognition in music signals", in Proc. 20th International Society for Music Information Retrieval Conference (ISMIR), pp. 809-815, Nov. 2019.
postprint
A. Pankajakshan, H. L. Bear, and E. Benetos, "Onsets, activity, and events: a multi-task approach for polyphonic sound event modelling", in Proc. 4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), pp. 174-178, Oct. 2019.
postprint
V. Subramanian, E. Benetos, and M. Sandler, "Robustness of Adversarial Attacks in Sound Event Classification", in Proc. 4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), pp. 239-243, Oct. 2019.
postprint
S. Singh, A. Pankajakshan, and E. Benetos, "Audio tagging using a linear noise modelling layer", in Proc. 4th Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), pp. 234-238, Oct. 2019.
postprint
H. L. Bear, T. Heittola, A. Mesaros, E. Benetos, and T. Virtanen, "City classification from multiple real-world sound scenes", in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 11-15, Oct. 2019.
postprint
A. Pankajakshan, H. L. Bear, and E. Benetos, "Polyphonic sound event and sound activity detection: a multi-task approach", in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 318-322, Oct. 2019.
postprint
C. Lordelo, E. Benetos, S. Dixon, and S. Ahlbäck, "Investigating kernel shapes and skip connections for deep learning-based harmonic-percussive separation", in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 40-44, Oct. 2019.
postprint
H. L. Bear, I. Nolasco, and E. Benetos, "Towards joint sound scene and polyphonic sound event recognition", in Proc. 20th Annual Conference of the International Speech Communication Association (INTERSPEECH), pp. 4594-4598, Sept. 2019.
postprint
B. Chettri, D. Stoller, V. Morfi, M. Martinez, E. Benetos and B. L. Sturm, "Ensemble models for spoofing detection in automatic speaker verification", in Proc. 20th Annual Conference of the International Speech Communication Association (INTERSPEECH), pp. 1018-1022, Sept. 2019.
postprint
M. A. Martínez Ramírez, E. Benetos, and J. D. Reiss, "A general-pupose deep learning approach to model time-varying audio effects", in Proc. 22nd International Conference on Digital Audio Effects (DAFx), Sept. 2019.
postprint
A. Ragano, E. Benetos, and A. Hines, "Adapting the Quality of Experience Framework for Audio Archive Evaluation", in Proc. 11th International Conference on Quality of Multimedia Experience (QoMEX), June 2019.
postprint
S. Mishra, D. Stoller, E. Benetos, B. Sturm and S. Dixon, "GAN-based generation and automatic selection of explanations for neural networks", in Proc. ICLR 2019 Workshop on Safe Machine Learning: Specification, Robustness and Assurance (SafeML), May 2019.
C. Wang, E. Benetos, X. Meng, and E. Chew, "HMM-based glissando detection for recordings of Chinese bamboo flute", in Proc. 16th Sound and Music Computing Conference (SMC), May 2019.
postprint
I. Nolasco, A. Terenzi, S. Cecchi, S. Orcioni, H. L. Bear, and E. Benetos, "Audio-based identification of beehive states", in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 8256-8260, May 2019.
postprint
S. S. R. Phaye, E. Benetos, and Y. Wang, "SubSpectralNet - Using sub-spectrogram based convolutional neural networks for acoustic scene classification", in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 825-829, May 2019.
postprint
F. Lins, M. Johann, E. Benetos, and R. Schramm, "Automatic transcription of diatonic harmonica recordings", in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 256-260, May 2019.
postprint
B. Chettri, S. Mishra, B. Sturm, and E. Benetos, "Analysing the predictions of a CNN-based replay spoofing detection system", in Proc. 2018 IEEE Spoken Language Technology Workshop (SLT), pp. 92-97, Dec. 2018.
postprint
H. L. Bear and E. Benetos, "An extensible cluster-graph taxonomy for open set sound scene analysis", in Proc. Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), pp. 183-187, Nov. 2018.
webpage
I. Nolasco and E. Benetos, "To bee or not to bee: investigating machine learning approaches for beehive sound recognition", in Proc. Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE), pp. 133-137, Nov. 2018.
dataset
code
B. Chettri, B. Sturm, and E. Benetos, "Analysing replay spoofing countermeasure performance under varied conditions", in Proc. IEEE International Workshop on Machine Learning for Signal Processing (MLSP), Sep. 2018.
postprint
A. Ycart and E. Benetos, "Polyphonic music sequence transduction with meter-constrained LSTM networks", in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 386-390, April 2018.
postprint
E. Nakamura, E. Benetos, K. Yoshii, and S. Dixon, "Towards Complete Polyphonic Music Transcription: Integrating Multi-Pitch Detection and Rhythm Quantization", in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 101-105, April 2018.
A. Ycart and E. Benetos, "A study on LSTM networks for polyphonic music sequence modelling", 18th International Society for Music Information Retrieval Conference (ISMIR), pp. 421-427, Oct. 2017.
R. Schramm, A. McLeod, M. Steedman, and E. Benetos, "Multi-pitch detection and voice assignment for a capella recording of multiple singers", 18th International Society for Music Information Retrieval Conference (ISMIR), pp. 552-559, Oct. 2017.
code
G. Lafay, E. Benetos, and M. Lagrange, "Sound event detection in synthetic audio: analysis of the DCASE 2016 task results", in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), pp. 11-15, Oct. 2017.
postprint
E. Benetos, "Polyphonic note and instrument tracking using linear dynamical systems", in Proc. 2017 AES International Conference on Semantic Audio, June 2017.
postprint
R. Schramm and E. Benetos, "Automatic transcription of a cappella recordings from multiple singers", in Proc. 2017 AES International Conference on Semantic Audio, June 2017.
Best paper award
postprint
J. J. Valero-Mas, E. Benetos, and J. M. Iñesta, "Assessing the relevance of onset information for note tracking in piano music transcription", in Proc. 2017 AES International Conference on Semantic Audio, June 2017.
A. J. Russell, E. Benetos and A. S. d'Avila Garcez, "On the Memory Properties of Recurrent Neural Models", in Proc. 2017 International Joint Conference on Neural Networks (IJCNN), pp. 2596-2603, May 2017.
J. J. Valero-Mas, E. Benetos, and J. M. Iñesta, "Classification-based Note Tracking for Automatic Music Transcription", in Proc. 9th International Workshop on Machine Learning and Music (MML), pp. 61-65, Sep. 2016.
postprint
S. Abdallah, E. Benetos, N. Gold, S. Hargreaves, T. Weyde, and D. Wolff, "Digital Music Lab: A Framework for Analysing Big Music Data", in Proc. 24th European Signal Processing Conference (EUSIPCO), pp. 1118-1122, Aug. 2016.
project webpage postprint
A. Holzapfel and E. Benetos, "The sousta corpus: beat-informed automatic transcription of traditional dance tunes", in 17th International Society for Music Information Retrieval Conference (ISMIR), pp. 531-537, Aug. 2016.
supplementary webpage
T. Cheng, M. Mauch, E. Benetos and S. Dixon, "An attack/decay model for piano transcription", in 17th International Society for Music Information Retrieval Conference (ISMIR), pp. 584-590, Aug. 2016.
M. Panteli, E. Benetos, and S. Dixon, "Learning a feature space for similarity in world music", in 17th International Society for Music Information Retrieval Conference (ISMIR), pp. 538-544, Aug. 2016.
Best student paper award
code
M. Panteli, E. Benetos, and S. Dixon, "Automatic detection of outliers in world music collections",in International Conference on Analytical Approaches to World Music (AAWM), June 2016.
postprint
E. Benetos, G. Lafay, M. Lagrange, and M. D. Plumbley, "Detection of overlapping acoustic events using a temporally-constrained probabilistic model", in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 6450-6454, Mar. 2016.
code postprint
E. Benetos and T. Weyde, "An efficient temporally-constrained probabilistic model for multiple-instrument music transcription", in 16th International Society for Music Information Retrieval Conference (ISMIR), pp. 701-707, Oct. 2015.
code
M. Rossignol, M. Lagrange, G. Lafay, and E. Benetos, "Alternate level clustering for drum transcription", in Proc. 23rd European Signal Processing Conference (EUSIPCO), pp. 2068-2072, Sep. 2015.
S. Abdallah, A. Alencar-Brayner, E. Benetos, S. Cottrell, J. Dykes, N. Gold, A. Kachkaev, M. Mahey, D. Tidhar, A. Tovell, T. Weyde, and D. Wolff, "Automatic transcription and pitch analysis of the British Library World & Traditional Music Collection", in Proc. 5th International Workshop on Folk Music Analysis (FMA), pp. 10-12, June 2015.
postprint
S. Sigtia, E. Benetos, N. Boulanger-Lewandowski, T. Weyde, A. d'Avila Garcez, and S. Dixon, "A hybrid recurrent neural network for music transcription", in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 2061-2065, Apr. 2015.
postprint
E. Benetos, R. Badeau, T. Weyde, and G. Richard, "Template adaptation for improving automatic music transcription", in Proc. 15th International Society for Music Information Retrieval Conference (ISMIR), pp. 175-180, Oct. 2014.
VIDEO
S. Sigtia, E. Benetos, S. Cherla, T. Weyde, A. d'Avila Garcez, and S. Dixon, "An RNN-based music language model for improving automatic music transcription", in Proc. 15th International Society for Music Information Retrieval Conference (ISMIR), pp. 53-58, Oct. 2014.
D. Wolff, D. Tidhar, E. Benetos, E. Dumon, S. Cherla, and T. Weyde, "Incremental dataset definition for large scale musicological research", in Proc. 1st International Digital Libraries for Musicology workshop (DLfM), pp. 25-32, Sep. 2014.
postprint
T. Weyde, S. Cottrell, J. Dykes, E. Benetos, D. Wolff, D. Tidhar, A. Kachkaev, M. Plumbley, S. Dixon, M. Barthet, N. Gold, S. Abdallah, M. Mahey, A. Tovell and A. Alencar-Brayner, "Big Data for Musicology", in Proc. 1st International Digital Libraries for Musicology workshop (DLfM), pp. 85-87, Sep. 2014.
postprint
S. Tran, E. Benetos and A. Garcez, "Learning motion-difference features using Gaussian restricted Boltzmann machines for efficient human action recognition", in Proc. 2014 International Joint Conference on Neural Networks (IJCNN), pp. 2123-2129, July 2014.
postprint
E. Benetos and A. Holzapfel, "Incorporating pitch class profiles for improving automatic transcription of Turkish makam music", in Proc. 4th International Workshop on Folk Music Analysis (FMA), pp. 15-20, June 2014.
postprint
E. Benetos, S. Ewert, and T. Weyde, "Automatic transcription of pitched and unpitched sounds from polyphonic music", in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 3131-3135, May 2014.
postprint code MIDI ANNOTATIONS
D. Giannoulis, E. Benetos, A. Klapuri, and M.D. Plumbley, "Improving instrument recognition in polyphonic music through system integration", in Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp. 5259-5263, May 2014.
postprint
E. Benetos, A. Jansson, and T. Weyde, "Improving automatic music transcription through key detection", in Proc. AES 53rd Int. Conf. on Semantic Audio, 7 pages, Jan. 2014.
postprint
E. Benetos and T. Weyde, "Explicit duration hidden Markov models for multiple-instrument polyphonic music transcription", in Proc. 14th International Society for Music Information Retrieval Conference (ISMIR), pp. 269-274, Nov. 2013.
E. Benetos and A. Holzapfel, "Automatic transcription of Turkish makam music", 14th International Society for Music Information Retrieval Conference (ISMIR), pp. 355-360, Nov. 2013.
SOUND EXAMPLES
R. De Valk, T. Weyde, and E. Benetos, "A machine learning approach to voice separation in lute tablature", 14th International Society for Music Information Retrieval Conference (ISMIR), pp. 555-560, Nov. 2013.
D. Giannoulis, E. Benetos, D. Stowell, M. Rossignol, M, Lagrange, and M.D. Plumbley, "Detection and classification of acoustic scenes and events: an IEEE AASP challenge", in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), 4 pages, Oct. 2013.
CHALLENGE WEBSITE
E. Benetos, S. Cherla, and T. Weyde, "An efficient shift-invariant model for polyphonic music transcription", 6th Int. Workshop on Machine Learning and Music, 4 pages, Sep. 2013.
CODE
D. Giannoulis, D. Stowell, E. Benetos, M. Rossignol, M, Lagrange, and M.D. Plumbley, "A database and challenge for acoustic scene classification and event detection", in Proc. 21st European Signal Processing Conf., 5 pages, Sep. 2013.
Paper won the SoundSoftware Reproducible Research Prize for a conference submission.
CHALLENGE WEBSITE (incl. datasets and code)
E. Benetos, S. Dixon, D. Giannoulis, H. Kirchhoff, and A. Klapuri, "Automatic music transcription: breaking the glass ceiling", in Proc. 13th Int. Society for Music Information Retrieval Conf., pp. 379-384, Oct. 2012.
E. Benetos, M. Lagrange, and S. Dixon, "Characterisation of acoustic scenes using a temporally-constrained shift-invariant model", in Proc. 15th Int. Conf. Digital Audio Effects, pp. 317-323, Sep. 2012.
E. Benetos, A. Klapuri, and S. Dixon, "Score-informed transcription for automatic piano tutoring", in Proc. 20th European Signal Processing Conf., pp. 2153-2157, Aug. 2012.
CODE DATASET
E. Benetos and S. Dixon, "Temporally-constrained convolutive probabilistic latent component analysis for multi-pitch detection", in Proc. Int. Conf. Latent Variable Analysis and Signal Separation, pp. 364-371, Mar. 2012.
WEBPAGE
S. Dixon, D. Tidhar, and E. Benetos, "The temperament police: The truth, the ground truth and nothing but the truth", in Proc. 12th Int. Society for Music Information Retrieval Conf., pp. 281-286, Oct. 2011.
WEBPAGE
E. Benetos and S. Dixon, "A temporally-constrained convolutive probabilistic model for pitch detection ", in Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 133-136, Oct. 2011.
WEBPAGE PRESENTATION VIDEO
E. Benetos and S. Dixon, "Multiple-instrument polyphonic music transcription using a convolutive probabilistic model", in Proc. 8th Sound and Music Computing Conf., pp. 19-24, Jul. 2011.
WEBPAGE
L. Mearns, E. Benetos, and S. Dixon, "Automatically detecting key modulations in J.S. Bach chorale recordings", in Proc. 8th Sound and Music Computing Conf., pp. 25-32, Jul. 2011.
WEBPAGE
E. Benetos and S. Dixon, "Polyphonic music transcription using note onset and offset detection", in Proc. 2011 Int. Conf. Acoustics, Speech, and Signal Processing, pp. 37-40, May 2011.
WEBPAGE PRESENTATION VIDEO
E. Benetos and S. Dixon, "Multiple-F0 estimation of piano sounds exploiting spectral structure and temporal evolution", in Proc. ISCA Tutorial and Research Workshop on Statistical and Perceptual Audition, pp. 13-18, Sep. 2010.
E. Benetos, A. Holzapfel, and Y. Stylianou, "Pitched instrument onset detection based on auditory spectra", in Proc. 2009 Int. Symposium Music Information Retrieval, pp. 105-110, Oct. 2009.
Y. Panagakis, E. Benetos, and C. Kotropoulos, "Music genre classification: a multilinear approach" in Proc. 2008 Int. Symposium Music Information Retrieval, pp. 583-588, Sep. 2008.
E. Benetos and C. Kotropoulos, "A tensor-based approach for automatic music genre classification" in Proc. 16th European Signal Processing Conf., Aug. 2008.
D. Spachos , A. Zlantintsi, V. Moschou, P. Antonopoulos, E. Benetos, M. Kotti, K. Tzimouli, C. Kotropoulos, N. Nikolaidis, P. Maragos, and I. Pitas, "MUSCLE movie-database: a multimodal corpus with rich annotation for dialogue and saliency detection" in Proc. 6th Language Resources and Evaluation Conference, pp. 16-19, May 2008.
V. Moschou, M. Kotti, E. Benetos, and C. Kotropoulos, “Systematic comparison of BIC-based speaker segmentation systems”, in Proc. Int. Workshop Multimedia Signal Processing, pp. 66-69, Oct. 2007.
M. Kotti, E. Benetos, and C. Kotropoulos, “Neural network-based movie dialogue detection”, in Proc. 10th Int. Conf. Engineering Applications of Neural Networks, Aug. 2007.
E. Benetos, M. Kotti, and C. Kotropoulos, “Large scale musical instrument identification”, in Proc. 4th Sound and Music Computing Conf., pp. 283-286, July 2007.
E. Benetos, C. Kotropoulos, T. Lidy, and A. Rauber, “Testing supervised classifiers based on non-negative matrix factorization to musical instrument classification”, in Proc.14th European Signal Processing Conf., Sep. 2006.
M. Kotti, L. G. Martins, E. Benetos, J. Cardoso, and C. Kotropoulos, “Automatic speaker segmentation using multiple features and distance measures: a comparison on three approaches”, in Proc. IEEE Int. Conf. Multimedia & Expo, pp. 1101-1104, July 2006.
E. Benetos, M. Kotti, and C. Kotropoulos, “Applying supervised classifiers on non-negative matrix factorization to musical instrument classification”, in Proc. IEEE Int. Conf. Multimedia & Expo, pp. 2105-2108, July 2006.
E. Benetos, M. Kotti, and C. Kotropoulos, “Musical instrument classification using non-negative matrix factorization and subset feature selection”, in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, vol. V, pp. 221-224, May 2006.
M. Kotti, E. Benetos, and C. Kotropoulos, “Automatic speaker change detection with the Bayesian information criterion using MPEG-7 and a fusion scheme”, in Proc. IEEE Int. Symposium on Circuits & Systems, pp. 1856-1859, May 2006.
E. Benetos, M. Kotti, and C. Kotropoulos, ‘’Musical instrument classification using non-negative matrix factorization algorithms’’, in Proc. IEEE Int. Symposium on Circuits & Systems, pp. 1844-1847, May 2006.
M. Kotti, E. Benetos, C. Kotropoulos, and L. G. Martins, “Speaker change detection using BIC: a comparison on two datasets”, in Proc. International Symposium on Communications, Control and Signal Processing, March 2006.
E. Benetos, M. Kotti, and C. Kotropoulos, “Application of non-negative matrix factorisation to musical instrument classification”, in Proc. 2nd Int. Symp. Communications, Control and Signal Processing, Marrakech, March 2006.
E. Benetos, M. Kotti, C. Kotropoulos, J.J. Burred, G. Eisenberg, M. Haller and T. Sikora, “Comparison of subspace analysis-based and statistical model-based algorithms for musical instrument classification”, in Proc. of 2nd Workshop On Immersive Communication And Broadcast Systems, Berlin, Oct. 2005.

Patents

K. O'Hanlon, E. Benetos, and S. Dixon, “Detecting Cover Songs with Key-Invariant Pitch Class Networks”, UK patent application 2113253.5, filing date 16 Sept. 2021.
M. A. Martinez Ramirez, E. Benetos, and J. D. Reiss, “Time-varying and nonlinear audio processing using deep neural networks”, United States Patent Application 20230197043, filing date 5 Dec. 2020.

Policy Submissions / White Papers

D. Stowell, E. Benetos, B. Sturm, and L. Tokarchuk, "Centre for Intelligent Sensing - written evidence (ALG0036)", Algorithms in decision-making inquiry, Science and Technology Committee (UK House of Commons), April 2017.
"New opportunities in signal processing", UK Engineering and Physical Sciences Research Council (EPSRC) report", Feb. 2017. (contributor)

Theses

E. Benetos, "Automatic transcription of polyphonic music exploiting temporal evolution," PhD Thesis, School of Electronic Engineering and Computer Science, Queen Mary University of London, UK, Dec. 2012.
VIDEO EURASIP REPOSITORY QMUL REPOSITORY
E. Benetos, "Musical genre classification using tensor analysis," MSc Thesis, Computer Science Dept., Aristotle Univ. Thessaloniki, Greece, June 2007 (in Greek with English abstract).

Tutorials

E. Benetos and R. Schramm, "Music information retrieval and automatic music transcription," in Workshop on Electronic Music and Computer Music, Porto Alegre, Brazil, Sep. 2017.
VIDEO
Z. Duan and E. Benetos, "Automatic music transcription," in 16th International Society for Music Information Retrieval Conference, Oct. 2015.

Abstracts / Demos / Posters

S. Elisha, E. Benetos, J. Karlgren, M. Beguerisse-Diaz, "Audio-Based Computational Analysis of Podcast Expressivity", UK Speech Conference, Sept. 2022.
Y. Ozaki, J. Kuroyanagi, J. McBride, P. Proutskova, A. Tierney, P. Pfordresher, E .Benetos, and P. Savage, "Cross-cultural similarities and differences in a global sample of song and speech recordings", 7th International Conference on Analytical Approaches to World Music (AAWM), June 2022.
S. Sarkar, E. Benetos, and M. Sandler, "Monotimbral Ensemble Separation using Permutation Invariant Training", in Proc. Music Demixing Workshop, Nov. 2021.
Y. Ozaki, J. McBride, E. Benetos, P. Pfordresher, J. Six, A. Tierney, P. Proutskova, E. Sakai, H. Kondo, H. Fukatsu, S. Fujii, and P. Savage, "Reliability of automated and human transcriptions of non-Western music", 16th International Conference on Music Perception and Cognition / The 11th Triennial Conference of ESCOM (ICMPC16-ESCOM11), July 2021.
H. Daikoku, S. Ding, E. Benetos, A. L. C. Wood, S. Fujii, and P. E. Savage, "Human and automated judgements of musical similarity in a global sample", 16th International Conference on Music Perception and Cognition / The 11th Triennial Conference of ESCOM (ICMPC16-ESCOM11), July 2021.
D. Sgroi, E. Benetos, A. Ragano, and A. Tuckwell, "Measuring National Happiness with Music", Monash-Warwick-Zurich Text-as-Data Workshop, Feb. 2021.
A. Ragano, E. Benetos, and A. Hines, "Context-aware audio QoE: a case study on the Apollo 11 audio archive", DMRN+14: Digital Music Research Network One-day Workshop, Dec. 2019.
L. Liu and E. Benetos, "Automatic music accompaniment with a chroma-based music data representation", DMRN+14: Digital Music Research Network One-day Workshop, Dec. 2019.
R. P. P. Bodo, E. Benetos and M. Queiroz, "The impact of dataset modifications on music similarity measures", DMRN+14: Digital Music Research Network One-day Workshop, Dec. 2019.
C. Cannam, E. Benetos, M. Mauch, M. E. P. Davies, S. Dixon, C. Landone, M. Levy, M. Mauch, K. Noland, and D. Stowell, "MIREX 2019: Vamp plugins from the Centre for Digital Music", Music Information Retrieval Evaluation eXchange (MIREX), Nov. 2019.
A. Ycart and E. Benetos, "Polyphonic music sequence classification with LSTM networks", Music Information Retrieval Evaluation eXchange (MIREX), Nov. 2019.
C. Wang, E. Benetos, and E. Chew, "CBF-periDB: A Chinese Bamboo Flute Dataset for Periodic Modulation Analysis ", 20th International Society for Music Information Retrieval Conference (ISMIR) Late Breaking Demo session, Nov. 2019.

M. Mcloughlin, S. Wang, D. Stowell, E. Benetos and E. Versace, "A System for Robot-Chick Vocal interactions", 2nd International Workshop on Vocal Interactivity in-and-between Humans, Animals and Robots, Aug. 2019.
A. Ragano, E. Benetos, and A. Hines, "Data-driven Quality Prediction for Digitally Restored Audio Archives", DMRN+13: Digital Music Research Network One-day Workshop, Dec. 2018.
H. L. Bear, D. Stoller, Y. Li, E. Demirel, W. Gui, E. Benetos, and S. Dixon, "A Multi-modal Approach for Learning from Singing", DMRN+13: Digital Music Research Network One-day Workshop, Dec. 2018.
C. Wang, E. Benetos and E. Chew, "Characterising Glissando and Flutter-tongue Techniques in Recordings of Chinese Bamboo Flute", DMRN+13: Digital Music Research Network One-day Workshop, Dec. 2018.
C. Wang, E. Benetos, X. Meng, and E. Chew, "HMM-based glissando detection for recordings of Chinese bamboo flute", 19th International Society for Music Information Retrieval Conference, Late-Breaking Demos Session, Sep. 2018.
A. Ycart and E. Benetos, "A-MAPS: Augmented MAPS Dataset with Rhythm and Key Annotations", 19th International Society for Music Information Retrieval Conference, Late-Breaking Demos Session, Sep. 2018.
C. Cannam, E. Benetos, M. E. P. Davies, S. Dixon, C. Landone, M. Levy, M. Mauch, K. Noland, and D. Stowell, "MIREX 2017: Vamp plugins from the Centre for Digital Music", Music Information Retrieval Evaluation eXchange (MIREX), Oct. 2017.
R. Schramm, Helena S. Nunes, and E. Benetos, "A score-informed approach for pitch visualisation of a cappella vocal quartet performances", In Proc. 16th Brazilian Symposium on Computer Music, September 2017.
M. Panteli, E. Benetos, and S. Dixon, "A review of computational approaches for the analysis of world music recordings", 20th Congress of the International Musicological Society, March 2017.
A. Ycart and E. Benetos, "Towards a Music Language Model for Audio Analysis", DMRN+11: Digital Music Research Network One-day Workshop, Dec. 2016.
R. Schramm and E. Benetos, "Automatic transcription of vocal quartets", DMRN+11: Digital Music Research Network One-day Workshop, Dec. 2016.
C. Cannam, E. Benetos, M. Mauch, M. E. P. Davies, S. Dixon, C. Landone, K. Noland, and D. Stowell, "MIREX 2016: Vamp plugins from the Centre for Digital Music", Music Information Retrieval Evaluation eXchange (MIREX), Aug. 2016.
L. Gill, W. Goymann, D. Stowell, E. Benetos, and M. Gahr, "Determining wild jackdaw call types and contexts via microphone backpacks", 16th congress of the International Society for Behavioral Ecology (ISBE), July 2016.
T. Weyde, S. Cottrell, J. Dykes, E. Benetos, D. Wolff, A. Kachkaev, S. Dixon, S. Hargreaves, M. Barthet, N. Gold, S. Abdallah, D. Tidhar, M. Plumbley, "The Digital Music Lab: a Big Data infrastructure for digital musicology", 16th International Society for Music Information Retrieval Conference, Demos and Late Breaking News Session, Oct. 2015.
E. Benetos and T. Weyde, "Multiple-F0 estimation and note tracking for MIREX 2015 using a sound state-based spectrogram factorization model", Music Information Retrieval Evaluation eXchange (MIREX), Oct. 2015.
C. Cannam, E. Benetos, M. Mauch, M. E. P. Davies, S. Dixon, C. Landone, K. Noland, and D. Stowell, "MIREX 2015: Vamp plugins from the Centre for Digital Music", Music Information Retrieval Evaluation eXchange (MIREX), Oct. 2015.
E. Benetos, "Machine Listening: Extracting Meaningful Information from Sound", RAEng Research Forum, Sep. 2015.
E. Benetos, "Matrix factorization methods for environmental sound analysis", Listening in the Wild 2015 workshop, Aug. 2015.
A. Leroi, M. Mauch, P. Savage, E. Benetos, J. P. Bello, M. Panteli, J. Six, and T. Weyde, "The deep history of music project", in Proc. 5th International Workshop on Folk Music Analysis, pp. 83-84, Jun. 2015.
E. Benetos and T. Weyde, "Multiple-F0 estimation and note tracking for MIREX 2014 using a variable-Q transform", Music Information Retrieval Evaluation eXchange (MIREX), Oct. 2014.
C. Cannam, E. Benetos, M. Mauch, M. E. P. Davies, S. Dixon, C. Landone, K. Noland, and D. Stowell, "MIREX 2014: Vamp plugins from the Centre for Digital Music", Music Information Retrieval Evaluation eXchange (MIREX), Oct. 2014.
T. Weyde, S. Cottrell, E. Benetos, D. Wolff, D. Tidhar, J. Dykes, M. Plumbley, S. Dixon, M. Barthet, N. Gold, S. Abdallah, and M. Mahey, "Digital Music Lab - A Framework for Analysing Big Music Data", European Conference on Data Analysis, Jul. 2014.
E. Benetos, "Automatic music transcirption [in Greek]", AΩ Magazine, vol. 63, Mar. 2014.
English Version
DML Consortium, "The DML Research Project: Digital Music Lab - Analysing Big Music Data", Big Data: Challenges and Applications Workshop, London, UK, Feb. 2014.
F. Wiering and E. Benetos, "Digital musicology and MIR: papers, projects, and challenges", ISMIR 2013 Late-breaking session, Nov. 2013.
E. Benetos and T. Weyde, "Multiple-F0 estimation and note tracking for MIREX 2013 using an efficient latent variable model", Music Information Retrieval Evaluation eXchange (MIREX), Nov. 2013.
RESULTS WEBPAGE
Submitted system reaches high scores and ranks first for the Multiple-F0 Estimation & Tracking task
D. Stowell, D. Giannoulis, E. Benetos, D. Barchiesi, and M. D. Plumbley, "Machine listening: automatic analysis of soundscapes at C4DM", in Symposium on Acoustic Ecology, Nov. 2013.
E. Benetos, "Acoustic identification of bird species using probabilistic latent component analysis", in Proc. ICML 2013 Workshop on Machine Learning for Bioacoustics, pp. 77-78, June 2013.
WORKSHOP WEBSITE CHALLENGE RESULTS
D. Giannoulis, E. Benetos, D. Stowell, M. Rossignol, M, Lagrange, and M.D. Plumbley, "Detection and classification of acoustic scenes and events", Technical Report, EECSRR-13-01, Queen Mary University of London, Mar. 2013.
CHALLENGE WEBSITE
D. Giannoulis, E. Benetos, D. Stowell, M. Rossignol, M, Lagrange, and M.D. Plumbley, "Detection and classification of acoustic scenes and events - an IEEE AASP challenge", DMRN+7: Digital Music Research Network One-day Workshop, Dec. 2012.
CHALLENGE WEBSITE POSTER
MIReS Consortium (contributor), "MIReS roadmap: challenges for discussion", Late-breaking session, 13th Int. Society for Music Information Retrieval Conf., Oct. 2012.
PROJECT WEBSITE
E. Benetos and S. Dixon, "Multiple-F0 estimation and note tracking for MIREX 2012 using a shift-invariant latent variable model", Music Information Retrieval Evaluation eXchange (MIREX), Oct. 2012.
RESULTS WEBPAGE
E. Benetos and S. Dixon, "Automatic music transcription using probabilistic latent variable models", Semantic Media Project Meeting, Oct. 2012.
E. Benetos and S. Dixon, "Polyphonic music transcription by modelling the temporal evolution of sounds", DMRN+6: Digital Music Research Network One-day Workshop, Dec. 2011.
E. Benetos and S. Dixon, "Multiple-F0 estimation and note tracking using a convolutive probabilistic model", Music Information Retrieval Evaluation eXchange (MIREX), Oct. 2011.
RESULTS WEBPAGE
E. Benetos and S. Dixon, "Transcription prelude", in 12th Int. Society for Music Information Retrieval Conference Concert, Oct. 2011.
WEBPAGE VIDEO
Audio! Magazine #3 (contributor), P. Curzon, M. Barthet, and S. Dixon (eds.), Queen Mary University of London, May 2011.
E. Benetos and S. Dixon, "Multiple fundamental frequency estimation using spectral structure and temporal evolution rules", Music Information Retrieval Evaluation eXchange (MIREX), Aug. 2010.
RESULTS WEBPAGE
A. Rauber, N. Sebe, P. Joly, T. Lidy, J. Frank, C. Snoek, T. Foures, C. Kotropoulos, E. Benetos, B. Christmas, A. Yeredor, “Content analysis showcase and evaluation web portal (CASEWP),” in ACM Int. Conf. Image and Video Retrieval, July 2007.
PROJECT REPORT

Invited Talks

E. Benetos, "Machine learning paradigms for music and audio understanding," Telecom Paris ListenLab 4th semi-annual workshop, April 2024.
E. Benetos, "Learning methodologies for music and audio data," Dynamics, Data and Deep Learning Workshop, March 2024.
E. Benetos, "Learning methodologies for music and audio data," BAAI Music & Audio Processing Workshop invited talk, June 2023.
E. Benetos, "Learning methodologies for music and audio data," ByteDance SAMI invited talk, Oct. 2021.
E. Benetos, "Learning methodologies for music and audio data," invited talk, Huawei MTI Forum, March 2021.
E. Benetos, "Machine learning for machine listening," Invited talk at the Institute of Applied Data Science colloquium, London, UK, November 2019.
SLIDES
E. Benetos, "The Digital Music Lab project: summary and perspectives," Invited talk at Digital Musicology and Libraries: Challenges and Opportunities study day, London, UK, July 2019.
E. Benetos, "Music informatics & computational musicology: A case study in automatic music transcription," Keynote talk at First Annual TROMPA Workshop for Music Scholars, London, UK, April 2019.
E. Benetos, "Automatic music transcription," Tutorial at National University of Singapore, Singapore, January 2019.
VIDEO
E. Benetos, "Signal processing methods for sound recognition," Invited talk at National University of Singapore, Singapore, January 2019.
VIDEO
E. Benetos, "Automatic transcription of world music collections," Keynote talk at 8th International Workshop on Folk Music Analysis, Thessaloniki, Greece, June 2018.
E. Benetos, "Automatic Music Transcription: Representations and Categorical (mis)Conceptions," Fifth International Conference on Analytical Approaches to World Music, Thessaloniki, Greece, June 2018.
E. Benetos, "Automatic transcription of world music collections," International Symposium on Computational Ethnomusicological Archiving, Hamburg, Germany, Dec. 2017.
VIDEO
E. Benetos, "Music informatics & computational musicology: A case study in automatic music transcription," Keynote talk at Symposium for Digital Musicology, London, UK, Sep. 2017.
VIDEO
E. Benetos, "Multiple-timbre note tracking using linear dynamical systems," 5th Joint Meeting of the Acoustical Society of America and the Acoustical Society of Japan, Honolulu, Hawaii, USA, Nov. 2016. Abstract published at Journal of the Acoustical Society of America, vol. 140, no. 4, pt. 2, pp. 3039, Oct. 2016.
E. Benetos, "Matrix Decomposition Methods for Audio Analysis," Intelligent Sensing Summer School, London, UK, Sep. 2016.
VIDEO
E. Benetos, "Automatic music transcription using matrix decomposition methods," Music Informatics and Cognition Workshop, Edinburgh, UK, Jul. 2016.
VIDEO
E. Benetos, "Matrix Decomposition Methods for Audio Analysis," Audio Analytic Tech Talk, Cambridge, UK, Feb. 2016.
VIDEO
E. Benetos, "Automatic music transcription using spectrogram factorization methods," in Proc. 3rd Vienna Talk on Music Acoustics, pp. 297, Vienna, Austria, Sep. 2015.
E. Benetos, "Spectrogram factorization methods for music and audio analysis," Centre for Vision, Speech, and Signal Processing, University of Surrey, Mar. 2015.
E. Benetos and T. Weyde, "Instrument transcription and instrumentation recognition," Workshop on Musical Timbre, Télécom ParisTech, Nov. 2014.
D. Wolff and E. Benetos, "The DML Project: Objectives and Methodology," British Library, Jul. 2013.
E. Benetos, "Music informatics / music signal analysis research at City University London," Signal and Image Processing Dept., Télécom ParisTech, Paris, France, Nov. 2013.
E. Benetos, S. Cherla, and D. Wolff, "The Music Informatics Research Group (MIRG)," Music Tech Fest, London, UK, May 2013.
VIDEO
E. Benetos, "Non-negative matrix factorization: algorithms, extensions, and applications," Department of Computer Science, City University London, UK, Mar. 2013.
E. Benetos and S. Dixon, "Polyphonic music transcription using shift-invariant latent variable models," Signal and Image Processing Dept., Télécom ParisTech, Paris, France, Nov. 2011.
S. Dixon, D. Tidhar, M. Mauch, and E. Benetos, "Automatic estimation of harpsichord inharmonicity and temperament," Institute of Musical Research, University of London, UK, Oct. 2011.

Categories:

Publication List

Books and Book Chapters

Journal Papers

Peer-reviewed Conference Papers

Patents

Policy Submissions / White Papers

Theses

Tutorials

Abstracts / Demos / Posters

Invited Talks