Publications

MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies

Ke Chen, K., Wu, Y., Liu, H., Nezhurina, M., Berg-Kirkpatrick, T., and Dubnov, S., MusicLDM: Enhancing Novelty in Text-to-Music Generation Using Beat-Synchronous Mixup Strategies, arXiv preprint,, 2023. doi:10.48550/arXiv.2308.01546. Full publication Download publication Abstract: Diffusion models have shown promising results in cross-modal generation tasks, including text-to-image and text-to-audio generation. However, generating music, as a special type […]

Software

Somax2 – A Distributed Co-Creative System for Human-Machine Co-Improvisation

Marco Fiorini, Mikhail Malt. Somax2 – A Distributed Co-Creative System for Human-Machine Co-Improvisation. Proceedings of the Second International Conference on Hybrid Human-Artificial Intelligence HHAI 2023, Jun 2023, Munich, Germany. ⟨hal-04444997⟩ Full publication Download PDF Abstract: Somax2 is a multi-agent interactive system, based on machine-listening, machine learning and generative units, performing live machine co-improvisation with musicians. […]

Conferences Workshops

Performance, contrôle et instrumentalité dans REACH

avec Mikhail Malt, chercheur à l’Ircam-STMS et Marco Fiorini, doctorant à l’Ircam-STMS. Après une brève présentation du fonctionnement de l’environnement Somax2, nous vous proposerons une réflexion en action des sujets de la performance, du contrôle et de l’instrumentalisé dans le cadre de l’utilisation d’algorithmes d’I.A. en improvisation musicale. Nous questionnerons notamment le cas non idiomatique en musique […]

Publications

Observing Musical Communities Dedicated to Improvisation and Duet Practice on TikTok Using Web Scraping

Marc Chemillier (CAMS-EHESS), Yohann Rabearivelo (CAMS-EHESS), Rémi Jaylet (Télécom Paris), contact: chemilli@ehess.fr Read full publication Abstract: In this paper we present a research dealing with musical improvisation practices on TikTok as part of a project devoted to the development of a music improvisation software. Finding musicians to experiment with led us to explore the musical collaborations […]

Conferences Publications

Large-Scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation

Yusong Wu, K. Chen, T. Zhang, Y. Hui, T. Berg-Kirkpatrick and S. Dubnov, « Large-Scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation, » ICASSP 2023 – 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, Rhodes Island, Greece, 2023, pp. 1-5, doi: 10.1109/ICASSP49357.2023.10095969. Full publication Abstract: Contrastive learning has shown remarkable success in the […]

Conferences Publications

Multitrack Music Transformer

HAo-Wen Dong, K. Chen, S. Dubnov, J. McAuley and T. Berg-Kirkpatrick, « Multitrack Music Transformer, » ICASSP 2023 – 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece, 2023, pp. 1-5, doi: 10.1109/ICASSP49357.2023.10094628. Full publication Abstract: Existing approaches for generating multitrack music with transformer models have been limited in terms of the […]