Conferences Publications

Spatial Upsampling of Sparse Head Related Transfer Functions – A VQ-VAE & Transformer based approach

Devansh Zurale, Shlomo Dubnov, Spatial Upsampling of Sparse Head Related Transfer Functions – A VQ-VAE & Transformer based Approach, Audio Engineering Society: AES 2023 International Conference on Spatial and Immersive Audio, U. of Huddersfield, U.K., 2023. Read full publication Abstract:  With the increasing demand for AR/VR technologies, enabling accurate reproduction of binaural spatial audio through […]

Publications

Characterizing and Interpreting Music Expressivity through Rhythm and Loudness Simplices

Paul Lascabettes, Elaine Chew & Isabelle Bloch. Characterizing and Interpreting Music Expressivity through Rhythm and Loudness Simplices, International Computer Music Conference (ICMC 2023), Shenzen, China 2023. Full publication Download publication Abstract: Characterizing and interpreting expressivity in performed music remains an open problem. In this paper, we explore the novel representation of recorded performances of triple […]

Publications

Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction

Read full publication. Published by Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction. Abstract: In deep learning research, many melody extraction models rely on redesigning neural network architectures to improve performance. In this paper, we propose an input feature modification and a training objective modification based on two assumptions. First, harmonics in […]

Publications

MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies

Ke Chen, K., Wu, Y., Liu, H., Nezhurina, M., Berg-Kirkpatrick, T., and Dubnov, S., MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies, arXiv preprint,, 2023. doi:10.48550/arXiv.2308.01546. Full publication Download publication Abstract: Diffusion models have shown promising results in cross-modal generation tasks, including text-to-image and text-to-audio generation. However, generating music, as a special type […]

Publications

Multitrack music transformer

Read full publication. Published by Hao-Wen Dong, Ke Chen, Shlomo Dubnov, Julian McAuley, Taylor Berg-Kirkpatrick. Abstract: Existing approaches for generating multitrack music with transformer models have been limited in terms of the number of instruments, the length of the music segments and slow inference. This is partly due to the memory requirements of the lengthy […]

Publications

Observing Musical Communities Dedicated to Improvisation and Duet Practice on TikTok Using Web Scraping

Marc Chemillier (CAMS-EHESS), Yohann Rabearivelo (CAMS-EHESS), Rémi Jaylet (Télécom Paris), contact: chemilli@ehess.fr Read full publication Abstract: In this paper we present a research dealing with musical improvisation practices on TikTok as part of a project devoted to the development of a music improvisation software. Finding musicians to experiment with led us to explore the musical collaborations […]

Conferences Publications

Large-Scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation

Yusong Wu, K. Chen, T. Zhang, Y. Hui, T. Berg-Kirkpatrick and S. Dubnov, « Large-Scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation, » ICASSP 2023 – 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, Rhodes Island, Greece, 2023, pp. 1-5, doi: 10.1109/ICASSP49357.2023.10095969. Full publication Abstract: Contrastive learning has shown remarkable success in the […]

Conferences Publications

Multitrack Music Transformer

HAo-Wen Dong, K. Chen, S. Dubnov, J. McAuley and T. Berg-Kirkpatrick, « Multitrack Music Transformer, » ICASSP 2023 – 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece, 2023, pp. 1-5, doi: 10.1109/ICASSP49357.2023.10094628. Full publication Abstract: Existing approaches for generating multitrack music with transformer models have been limited in terms of the […]

Publications

Switching Machine Improvisation Models by Latent Transfer Entropy Criteria

Shlomo Dubnov, Vignesh Gokul, Gérard Assayag. Switching Machine Improvisation Models by Latent Transfer Entropy Criteria. Physical Sciences Forum, 2023, 5 (1), pp.49. ff10.3390/psf2022005049ff.ffhal-04010744 Full publication Abstract: Music improvisation is the ability of musical generative systems to interact with either another music agent or a human improviser. This is a challenging task, as it is not trivial […]