{"id":1063,"date":"2023-09-15T11:15:44","date_gmt":"2023-09-15T09:15:44","guid":{"rendered":"https:\/\/reach.ircam.fr\/?p=1063"},"modified":"2024-03-09T17:07:06","modified_gmt":"2024-03-09T16:07:06","slug":"learning-sub-dimensional-hrtf-representations-towards-individualization-applications-traditional-and-deep-learning-approaches","status":"publish","type":"post","link":"https:\/\/reach.ircam.fr\/index.php\/2023\/09\/15\/learning-sub-dimensional-hrtf-representations-towards-individualization-applications-traditional-and-deep-learning-approaches\/","title":{"rendered":"Learning Sub-Dimensional HRTF Representations Towards Individualization Applications &#8211; Traditional and Deep Learning Approaches"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"1063\" class=\"elementor elementor-1063\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-ae07d4f elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"ae07d4f\" data-element_type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-5360a12\" data-id=\"5360a12\" data-element_type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-06a5401 elementor-widget elementor-widget-text-editor\" data-id=\"06a5401\" data-element_type=\"widget\" data-widget_type=\"text-editor.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t\t<p><span data-sheets-root=\"1\" data-sheets-value=\"{&quot;1&quot;:2,&quot;2&quot;:&quot;Devansh Zurale, Shlomo Dubnov, Learning Sub-Dimensional HRTF Representations Towards Individualization Applications-Traditional and Deep Learning Approaches, IEEE Workshop on Applications of Signal 
Processing to Audio and Acoustics (WASPAA), Mohonk Mountain House, New Paltz, NY, USA, 2023&quot;}\" data-sheets-userformat=\"{&quot;2&quot;:6915,&quot;3&quot;:{&quot;1&quot;:0},&quot;4&quot;:{&quot;1&quot;:2,&quot;2&quot;:16777215},&quot;11&quot;:4,&quot;12&quot;:0,&quot;14&quot;:{&quot;1&quot;:2,&quot;2&quot;:1136076},&quot;15&quot;:&quot;Arial, Helvetica, sans-serif&quot;}\">Devansh Zurale, Shlomo Dubnov, Learning Sub-Dimensional HRTF Representations Towards Individualization Applications-Traditional and Deep Learning Approaches, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Mohonk Mountain House, New Paltz, NY, USA, 2023<\/span><\/p><p><a href=\"https:\/\/ieeexplore.ieee.org\/document\/10248076\">Full publication<\/a><\/p><p><strong>Abstract<\/strong>: Individualized Head Related Transfer Functions (HRTFs) are indispensable in order to accurately reproduce spatial audio over headphones. Encoding the high-dimensional HRTFs to a sub-dimensional space has proven to be useful in many previous research efforts in predicting individualized HRTFs. In this work, we provide a comparative study of some traditional methods such as Principal Component Analysis (PCA) or Multi-Layer Perceptron (MLP) based Autoencoders and the more recent generative deep learning approaches such as a Convolutional Neural Network (CNN) based Vector Quantized Variational Autoencoder (VQ-VAE) for learning HRTF representations. We further demonstrate the benefits of using 3D-CNNs for this task to learn correlations between neighboring HRTFs, along both spatial and frequency dimensions. To this end, we provide evidence suggesting that such a 3D-CNN based approach enables the derived latent space to encode features more representative of the individuality of the HRTFs while also allowing for the representations to be significantly more compact. 
Finally, we also explore the advantages of such robust representations towards downstream applications of predicting Individualized HRTFs.<\/p>\t\t\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>Devansh Zurale, Shlomo Dubnov, Learning Sub-Dimensional HRTF Representations Towards Individualization Applications-Traditional and Deep Learning Approaches, IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA), Mohonk Mountain House, New Paltz, NY, USA, 2023 Full publication Abstract: Individualized Head Related Transfer Functions (HRTFs) are indispensable in order to accurately reproduce spatial audio over headphones. [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":1065,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[52,46],"tags":[],"class_list":["post-1063","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-conferences","category-publications-research"],"aioseo_notices":[],"blog_post_layout_featured_media_urls":{"thumbnail":["https:\/\/reach.ircam.fr\/wp-content\/uploads\/2024\/03\/F8Jqh4yWIAA1Uzf-150x150.jpeg",150,150,true],"full":["https:\/\/reach.ircam.fr\/wp-content\/uploads\/2024\/03\/F8Jqh4yWIAA1Uzf.jpeg",1743,632,false]},"categories_names":{"52":{"name":"Conferences","link":"https:\/\/reach.ircam.fr\/index.php\/category\/research\/conferences\/"},"46":{"name":"Publications","link":"https:\/\/reach.ircam.fr\/index.php\/category\/research\/publications-research\/"}},"tags_names":[],"comments_number":"0","wpmagazine_modules_lite_featured_media_urls":{"thumbnail":["https:\/\/reach.ircam.fr\/wp-content\/uploads\/2024\/03\
/F8Jqh4yWIAA1Uzf-150x150.jpeg",150,150,true],"cvmm-medium":["https:\/\/reach.ircam.fr\/wp-content\/uploads\/2024\/03\/F8Jqh4yWIAA1Uzf-300x300.jpeg",300,300,true],"cvmm-medium-plus":["https:\/\/reach.ircam.fr\/wp-content\/uploads\/2024\/03\/F8Jqh4yWIAA1Uzf-305x207.jpeg",305,207,true],"cvmm-portrait":["https:\/\/reach.ircam.fr\/wp-content\/uploads\/2024\/03\/F8Jqh4yWIAA1Uzf-400x600.jpeg",400,600,true],"cvmm-medium-square":["https:\/\/reach.ircam.fr\/wp-content\/uploads\/2024\/03\/F8Jqh4yWIAA1Uzf-600x600.jpeg",600,600,true],"cvmm-large":["https:\/\/reach.ircam.fr\/wp-content\/uploads\/2024\/03\/F8Jqh4yWIAA1Uzf-1024x632.jpeg",1024,632,true],"cvmm-small":["https:\/\/reach.ircam.fr\/wp-content\/uploads\/2024\/03\/F8Jqh4yWIAA1Uzf-130x95.jpeg",130,95,true],"full":["https:\/\/reach.ircam.fr\/wp-content\/uploads\/2024\/03\/F8Jqh4yWIAA1Uzf.jpeg",1743,632,false]},"_links":{"self":[{"href":"https:\/\/reach.ircam.fr\/index.php\/wp-json\/wp\/v2\/posts\/1063","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/reach.ircam.fr\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/reach.ircam.fr\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/reach.ircam.fr\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/reach.ircam.fr\/index.php\/wp-json\/wp\/v2\/comments?post=1063"}],"version-history":[{"count":4,"href":"https:\/\/reach.ircam.fr\/index.php\/wp-json\/wp\/v2\/posts\/1063\/revisions"}],"predecessor-version":[{"id":1068,"href":"https:\/\/reach.ircam.fr\/index.php\/wp-json\/wp\/v2\/posts\/1063\/revisions\/1068"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/reach.ircam.fr\/index.php\/wp-json\/wp\/v2\/media\/1065"}],"wp:attachment":[{"href":"https:\/\/reach.ircam.fr\/index.php\/wp-json\/wp\/v2\/media?parent=1063"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/reach.ircam.fr\/index.php\/wp-json\/wp\/v2\/categories?post=1063"},{"taxonomy":"post_
tag","embeddable":true,"href":"https:\/\/reach.ircam.fr\/index.php\/wp-json\/wp\/v2\/tags?post=1063"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}