Keren Shao, Ke Chen, Taylor Berg-Kirkpatrick, & Shlomo Dubnov. (2023). Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction. Proceedings of the 24th International Society for Music Information Retrieval Conference, 657‑663. Full publication Download publication Abstract: In deep learning research, many melody extraction models rely on redesigning neural network architectures to improve performance. […]
Publications
Toward Real-Time Recognition of Instrumental Playing Techniques for Mixed Music: A Preliminary Analysis
Nicolas Brochec, Tsubasa Tanaka. Toward Real-Time Recognition of Instrumental Playing Techniquesfor Mixed Music: A Preliminary Analysis. International Computer Music Conference (ICMC 2023),Oct 2023, Shenzhen, China. ffhal-04263718f Full publication Download PDF Abstract: In contemporary mixed music, real-time digital sound processing is often applied to live instrumental performances. However, switching between sound effects often relies on manual […]
L’hybridité vue à partir du sujet. Le cas de Charles Kely Zana-Rotsy et l’open gasy
Marc Chemillier et Yuri Prado, L’hybridité vue à partir du sujet : le cas de Charles Kely Zana-Rotsy et l’ »open gasy », Cahiers d’ethnomusicologie, n° 36, 2023, pp. 125-143 (preprint). Full publication: Full text document will be published online on October 2024. Abstract: Bien que les musiques du monde, en tant qu’elles représentent particulièrement bien les processus d’échanges […]
Learning Sub-Dimensional HRTF Representations Towards Individualization Applications – Traditional and Deep Learning Approaches
Devansh Zurale, Shlomo Dubnov, Learning Sub-Dimensional HRTF Representations Towards Individualization Applications-Traditional and Deep Learning Approaches, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Mohonk Mountain House, New Paltz, NY, USA, 2023 Full publication Abstract: Individualized Head Related Transfer Functions (HRTFs) are indispensable in order to accurately reproduce spatial audio over headphones. […]
Accessible Co-Creativity with AI through Language and Voice Input
Basica, C., Verma, P., Alessandrini, P., & Berceanu, A. (2023). Accessible Co-Creativity with AI through Language and Voice Input. The International Conference on AI and Musical Creativity @ The University of Sussex (AIMC), 2023. Full publication Abstract: This project introduces a set of tools for humans to co-create music with AI-based architectures. The aim is […]
Spatial Upsampling of Sparse Head Related Transfer Functions – A VQ-VAE & Transformer based approach
Devansh Zurale, Shlomo Dubnov, Spatial Upsampling of Sparse Head Related Transfer Functions – A VQ-VAE & Transformer based Approach, Audio Engineering Society: AES 2023 International Conference on Spatial and Immersive Audio, U. of Huddersfield, U.K., 2023. Read full publication Abstract: With the increasing demand for AR/VR technologies, enabling accurate reproduction of binaural spatial audio through […]
Characterizing and Interpreting Music Expressivity through Rhythm and Loudness Simplices
Paul Lascabettes, Elaine Chew & Isabelle Bloch. Characterizing and Interpreting Music Expressivity through Rhythm and Loudness Simplices, International Computer Music Conference (ICMC 2023), Shenzen, China 2023. Full publication Download publication Abstract: Characterizing and interpreting expressivity in performed music remains an open problem. In this paper, we explore the novel representation of recorded performances of triple […]
Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction
Read full publication. Published by Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction. Abstract: In deep learning research, many melody extraction models rely on redesigning neural network architectures to improve performance. In this paper, we propose an input feature modification and a training objective modification based on two assumptions. First, harmonics in […]
MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies
Ke Chen, K., Wu, Y., Liu, H., Nezhurina, M., Berg-Kirkpatrick, T., and Dubnov, S., MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies, arXiv preprint,, 2023. doi:10.48550/arXiv.2308.01546. Full publication Download publication Abstract: Diffusion models have shown promising results in cross-modal generation tasks, including text-to-image and text-to-audio generation. However, generating music, as a special type […]
Improvised Musical Interaction with Creative Agents
Marco Fiorini, Improvised Musical Interaction with Creative Agents, Aalborg University Copenhagen, Juin 2023 Full publication Download publication Abstract: This thesis was written as the final project of the Master of Science in Sound and Music Computing at Aalborg University Copenhagen. The presented work has been carried out in the Music Representations team at IRCAM – […]