Yusong Wu, K. Chen, T. Zhang, Y. Hui, T. Berg-Kirkpatrick and S. Dubnov, "Large-Scale Contrastive Language-Audio Pretraining with Feature Fusion and Keyword-to-Caption Augmentation," ICASSP 2023 – 2023 IEEE International Conference on Acoustics, Speech and Signal Processing, Rhodes Island, Greece, 2023, pp. 1-5, doi: 10.1109/ICASSP49357.2023.10095969. Abstract: Contrastive learning has shown remarkable success in the […]
Devansh Zurale, Shlomo Dubnov, "Spatial Upsampling of Sparse Head Related Transfer Functions – A VQ-VAE & Transformer based Approach," Audio Engineering Society: AES 2023 International Conference on Spatial and Immersive Audio, University of Huddersfield, U.K., 2023. Abstract: With the increasing demand for AR/VR technologies, enabling accurate reproduction of binaural spatial audio through […]