Publications

Peer-Reviewed Journals and Book Chapters

H. Wierstorf, C. Hold, A. Raake – Listener Preference for Wave Field Synthesis, Stereophony, and Different Mixes in Popular Music, Journal of the Audio Engineering Society, 66(5) pp. 385-396, 2018, doi:10.17743/jaes.2018.0019

T. Bentsen, T. May, A. A. Kressner and T. Dau – The benefit of combining a deep neural network architecture with ideal ratio mask estimation in computational speech segregation to improve speech intelligibility, PLoS ONE, 13(5) e0196924, 2018, doi:10.1371/journal.pone.0196924

G. Bustamante, P. Danès, T. Forgue, A. Podlubne and J. Manhès – An information based feedback control for audio-motor binaural localization, Autonomous Robots, 42(2) pp. 477-490, 2018, doi:10.1007/s10514-017-9639-8

T. Bentsen, A. A. Kressner, T. Dau and T. May – The impact of exploiting spectro-temporal context in computational speech segregation, Journal of the Acoustical Society of America, 143(1) pp. 248-259, 2018, doi:10.1121/1.5020273

N. Ma, T. May and G. J. Brown – Exploiting Deep Neural Networks and Head Movements for Robust Binaural Localisation of Multiple Sources in Reverberant Environments, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25(12) pp. 2444-2453, 2017, doi:10.1109/TASLP.2017.2750760

N. Deshpande and J. Braasch – Blind localization and segregation of two sources including a binaural head movement model, Journal of the Acoustical Society of America, 142(1) EL113, 2017, doi:10.1121/1.4986800

I. Trowitzsch, J. Mohr, Y. Kashef and K. Obermayer – Robust Detection of Environmental Sounds in Binaural Auditory Scenes, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 25(6) pp. 1344-1356, 2017, doi:10.1109/TASLP.2017.2690573

H. Wierstorf, A. Raake and S. Spors – Assessing localization accuracy in sound field synthesis, Journal of the Acoustical Society of America, 141(2) pp. 1111-1119, 2017, doi:10.1121/1.4976061

J. Braasch and J. Blauert – Auditory perception in rooms, N. Xiang (Ed.), Architectural Acoustics Handbook, pp. 173–196, 2017

F. Winter, J. Ahrens and S. Spors – On Analytic Methods for 2.5-D Local Sound Field Synthesis Using Circular Distributions of Secondary Sources, IEEE/ACM Transactions on Audio, Speech, and Language Processing, 24(5) pp. 914-926, 2016, doi:10.1109/TASLP.2016.2531902

H. Relaño-Iborra, T. May, J. Zaar, Ch. Scheidiger and T. Dau – Predicting speech intelligibility based on a correlation metric in the envelope power spectrum domain, Journal of the Acoustical Society of America, 140(4) pp. 2670-2679, 2016, doi:10.1121/1.4964505

A. A. Kressner, T. May and Ch. J. Rozell – Outcome measures based on classification performance fail to predict the intelligibility of binary-masked speech, Journal of the Acoustical Society of America, 139(6) pp. 3033-3036, 2016, doi:10.1121/1.4952439

U. Remes, A. Ramírez López, L. Juvela, K. Palomäki, G. J. Brown, P. Alku and M. Kurimo – Comparing human and automatic speech recognition in a perceptual restoration experiment, Computer Speech and Language, 35 pp. 14-31, 2016, doi:10.1016/j.csl.2015.06.005

J. Blauert and A. Raake – Can current room-acoustics indices specify the quality of experience in concert halls?, Psychomusicology: Music, Mind, and Brain, 25(3) pp. 253-255, 2015, doi:10.1037/pmu0000074

S. Keronen, H. Kallasjoki, K. J. Palomäki, G. J. Brown and J. F. Gemmeke – Feature enhancement of reverberant speech by distribution matching and non-negative matrix factorization, EURASIP Journal on Advances in Signal Processing, pp. 76-86, 2015, doi:10.1186/s13634-015-0259-1

S. Argentieri, P. Danès, P. Souères – A survey on sound source localization in robotics: From binaural to array processing methods, Computer Speech and Language, 34(1) pp. 87-112, 2015, doi:10.1016/j.csl.2015.03.003

R. Saeidi, R. Astudillo and D. Kolossa – Uncertain LDA: Including observation uncertainties in discriminative transforms, IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(7) pp. 1479-1488, 2015, doi:10.1109/TPAMI.2015.2481420

J. Blauert and A. Raake – Komplexe instrumentelle Sound-Qualitätsbeurteilung als Ausgangspunkt für Schätzungen des Kulturgrades der Rezipienten ‒ ein Versuch (Complex instrumental judgements on sound quality as a starting point for estimations of the cultural level), Schmidt, W.G. (Ed.) Die Natur-Kultur-Grenze in Kunst und Wissenschaft, pp. 193‒214, 2014

T. May, T. Dau – Computational speech segregation based on an auditory-inspired modulation analysis, Journal of the Acoustical Society of America, 136(6) pp. 3350-3359, 2014, doi:10.1121/1.4901711

Conference Proceedings and Presentations

A. Raake, H. Wierstorf, C. Hold – Die Mischung machts: Einfluss von Mix und Wiedergabe auf die Präferenz von Hörern bei Popmusik (Mix it: influence of mix and presentation of pop music on listener preference), German Annual Conference on Acoustics (DAGA), Munich, 2018

C. Schymura, J. Rios, D. Kolossa – Monte Carlo Exploration for Active Binaural Localization, IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, 2017

H. Meutzner, N. Ma, R. Nickel, C. Schymura, D. Kolossa – Improving Audio-Visual Speech Recognition using Deep Neural Networks with Dynamic Stream Reliability Estimates, IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, 2017

J. Blauert, Th. Walther – Aufmerksam hören (Attentive listening), German Annual Conference on Acoustics (DAGA), Kiel, 2017

C. Hold, L. Nagel, H. Wierstorf and A. Raake – Positioning of Musical Foreground Parts in Surrounding Sound Stages, AES Int. Conf. on Audio for Virtual and Augmented Reality, Los Angeles, 2016

A. Raake and H. Wierstorf – Assessment of audio quality and experience using binaural-hearing models, Proceedings of the 22nd International Congress on Acoustics (ICA), Buenos Aires, 2016

B. Cohen-Lhyver – Multimodal fusion and inference using binaural audition and vision, Proceedings of the 22nd International Congress on Acoustics (ICA), Buenos Aires, 2016

J. Braasch, T. Pastore, N. Deshpande and J. Blauert – A Precedence-effect model with top-down processing stages based on visual cues, Proceedings of the 22nd International Congress on Acoustics (ICA), Buenos Aires, 2016

N. Deshpande and J. Braasch – Source-blind binaural source segregation utilizing head movement, Proceedings of the 22nd International Congress on Acoustics (ICA), Buenos Aires, 2016

J. Braasch and N. Deshpande – A binaural model to segregate sound sources in the presence of early reflections using a multi-source precedence-effect model, Proceedings of the 22nd International Congress on Acoustics (ICA), Buenos Aires, 2016

Th. Walther and J. Blauert – Simulating cognitive feedback in the context of binaural scene analysis, Proceedings of the 22nd International Congress on Acoustics (ICA), Buenos Aires, 2016

J. Blauert – The advent of Communication Acoustics in retrospect, Proceedings of the 22nd International Congress on Acoustics (ICA), Buenos Aires, 2016

J. Käsbach, M. Hahmann, T. May and T. Dau – Assessing the contribution of binaural cues for apparent source width perception via a functional model, Proceedings of the 22nd International Congress on Acoustics (ICA), Buenos Aires, 2016

F. Winter and S. Spors – On Fractional Delay Interpolation for Local Wave Field Synthesis, European Signal Processing Conference (EURASIP), pp. 2415-2419, Budapest, 2016

N. Hahn and S. Spors – Comparison of Continuous Measurement Techniques for Spatial Room Impulse Responses, European Signal Processing Conference (EURASIP), pp. 1638-1642, Budapest, 2016

G. Bustamante, P. Danès, T. Forgue and A. Podlubne – A One-step-ahead Information-based Feedback Control for Binaural Active Localization, European Signal Processing Conference (EURASIP), Budapest, 2016

N. Hahn, F. Winter and S. Spors – Local Wave Field Synthesis by Spatial Band-limitation in the Circular/Spherical Harmonics Domain, 140th AES Convention, Paris, 2016

C. Hold, H. Wierstorf and A. Raake – The Difference Between Stereophony and Wave Field Synthesis in the Context of Popular Music, 140th AES Convention, Paris, 2016

H. Wierstorf – Perceptual assessment of spatial sound: the Two!Ears Project, 140th AES Convention (invited talk), Paris, 2016

F. Winter, H. Wierstorf, A. Podlubne, T. Forgue, J. Manhès, M. Herrb, S. Spors, A. Raake and P. Danès – Database of Binaural Room Impulse Responses of an Apartment-Like Environment, 140th AES Convention, Paris, 2016

St. Zeiler, H. Meutzner, A. H. Abdelaziz and D. Kolossa – Introducing the Turbo-Twin-HMM for Audio-Visual Speech Enhancement, Proceedings of Interspeech, San Francisco, 2016

N. Ma and G.J. Brown – Speech localisation in a multitalker mixture by humans and machines, Proceedings of Interspeech, pp. 1149-1152, San Francisco, 2016

Y. Guo, X. Wang, C. Wu, Q. Fu, N. Ma and G.J. Brown – A robust dual-microphone speech source localization algorithm for reverberant environments, Proceedings of Interspeech, pp. 1063-1066, San Francisco, 2016

Th. Bentsen, T. May, A. A. Kressner and T. Dau – Comparing the influence of spectro-temporal integration in computational speech segregation, Proceedings of Interspeech, San Francisco, 2016

St. Zeiler, R. Nickel, N. Ma, G.J. Brown and D. Kolossa – Robust audiovisual speech recognition using noise-adaptive linear discriminant analysis, IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, 2016

G. Bustamante, P. Danès, T. Forgue, A. Podlubne – Towards information-based feedback control for binaural active localization, IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, 2016

Th. Walther, J. Blauert and A. Raake – System zur Simulation von kognitivem Feedback im Kontext auditiver Szenenanalyse und auditiver Qualitätsbeurteilung (A system for the simulation of cognitive feedback in the context of auditory scene analysis and auditory sound-quality assessment), German Annual Conference on Acoustics (DAGA), Aachen, 2016

C. Hold, H. Wierstorf, A. Raake – Tonmischung für Stereophonie und Wellenfeldsynthese im Vergleich, German Annual Conference on Acoustics (DAGA), Aachen, 2016

C. Schymura, J. D. R. Grajales, D. Kolossa – Active localization of sound sources with binaural models, German Annual Conference on Acoustics (DAGA), Aachen, 2016

H. Wierstorf, A. Raake – Auf dem Weg zu binauraler Modellierung mit Kognition: das Two!Ears Modell, German Annual Conference on Acoustics (DAGA), Aachen, 2016

F. Winter, S. Spors – A comparison of sound field synthesis techniques for non-smooth secondary source distributions, German Annual Conference on Acoustics (DAGA), pp. 1463-1466, Aachen, 2016

N. Hahn and S. Spors – Analysis of time-varying system identification using the Normalized Least Mean Square algorithm in the context of data-based binaural synthesis, German Annual Conference on Acoustics (DAGA), pp. 1012-1015, Aachen, 2016

B. Cohen-Lhyver – Modulating the Auditory Turn-to Reflex on the Basis of Multimodal Feedback Loops: the Dynamic Weighting Model, IEEE ROBIO, Zhuhai, 2015

F. Winter and S. Spors – Physical Properties of Local Wave Field Synthesis using Linear Loudspeaker Arrays, 138th Convention of the Audio Engineering Society, Warsaw, 2015

J. Blauert – Psychoakustik aus perzeptionistischer Sicht (Psychoacoustics from a perceptualist’s point of view), Proceedings of the 18th Annual Meeting of the German Society of Audiology (DGA) (invited plenary keynote lecture), 2015

J. Braasch, T. Pastore, N. Deshpande and J. Blauert – A bi-modal model to simulate auditory expectation for reverberation time and direct-to-reverberant energy from visual feedback, 169th Meeting of the Acoustical Society of America (invited talk), Pittsburgh, 2015

N. Ma, R. Marxer, J. Barker and G. J. Brown – Exploiting synchrony spectra and deep neural networks for noise-robust automatic speech recognition, ASRU Workshop on the CHiME-3 Challenge, 2015

A. Raake, H. Wierstorf and J. Blauert – Audioqualitätsbeurteilung: Ein Fall für TWO!EARS, German Annual Conference on Acoustics (DAGA), Nuremberg, 2015

H. Wierstorf, Ch. Ende and A. Raake – Klangverfärbung in der Wellenfeldsynthese – Experimente und Modellierung, German Annual Conference on Acoustics (DAGA), Nuremberg, 2015

F. Winter and S. Spors – Parameter analysis for range extrapolation of head-related transfer functions using virtual local wave field synthesis, German Annual Conference on Acoustics (DAGA), Nuremberg, 2015

N. Hahn and S. Spors – Modal bandwidth reduction in data-based binaural synthesis including translatory head-movements, German Annual Conference on Acoustics (DAGA), Nuremberg, 2015

E. Teret, J. Braasch and M. Torben Pastore – The influence of signal type on the internal auditory representation of a room, Journal of the Acoustical Society of America, 2015

F. Winter and S. Spors – Physical properties of local wave field synthesis using circular loudspeaker arrays, Proceedings of EuroNoise, Maastricht, 2015

N. Hahn and S. Spors – Sound field synthesis of virtual cylindrical waves using circular and spherical loudspeaker arrays, Proceedings of the 138th Convention of the Audio Engineering Society, 2015

F. Winter and S. Spors – Physical properties of local wave field synthesis using linear loudspeaker arrays, Proceedings of the 138th Convention of the Audio Engineering Society, 2015

V. Erbes, M. Geier, S. Weinzierl, and S. Spors – Database of single-channel and binaural room impulse responses of a 64-channel loudspeaker array, Proceedings of the 138th Convention of the Audio Engineering Society, 2015

N. Hahn and S. Spors – Continuous measurement of impulse responses on a circle using a uniformly moving microphone, European Signal Processing Conference, 2015

N. Ma, G.J. Brown and T. May – Robust localisation of multiple speakers exploiting deep neural networks and head movements, Proceedings of Interspeech, pp. 3302-3306, Dresden, 2015

C. Schymura, F. Winter, D. Kolossa, S. Spors – Binaural Sound Source Localisation and Tracking using a Dynamic Spherical Head Model, Proceedings of Interspeech, Dresden, 2015

N. Ma, G. J. Brown and J. A. Gonzalez – Exploiting top-down source models to improve binaural localisation of multiple sources in reverberant environments, Proceedings of Interspeech, Dresden, 2015

N. Ma, G.J. Brown and T. May – Exploiting deep neural networks and head movements for binaural localisation of multiple speakers in reverberant conditions, Proceedings of Interspeech, Dresden, 2015

T. May, T. Bentsen and T. Dau – The role of temporal resolution in modulation based speech segregation, Proceedings of Interspeech, pp. 170-174, Dresden, 2015

G. Bustamante, A. Portello, P. Danès – A Three-Stage Framework to Active Source Localization from a Binaural Head, IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), Invited Paper, Brisbane, Australia, 2015

T. May, N. Ma, G.J. Brown – Robust localisation of multiple speakers exploiting head movements and multi-conditional training of binaural cues, Proceedings of IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), pp. 2679-2683, 2015

N. Ma, T. May, H. Wierstorf, G.J. Brown – A machine-hearing system exploiting head movements for binaural sound localisation in reverberant conditions, Proceedings of IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP), pp. 2699-2703, 2015

G. Manfredi, M. Devy and D. Sidobre – Textured Object Recognition: Balancing Model Robustness and Complexity, 16th Int. Conf. on Computer Analysis of Images and Patterns (CAIP), 2015

A. Raake, H. Wierstorf and J. Blauert – A case for Two!Ears in audio-quality assessment, Proceedings of Forum Acusticum, Krakow, 2014

J. Käsbach, T. May, G. Oskarsdottir, Ch.-H. Jeong and J. Chang – The effect of interaural-time-difference fluctuations on apparent source width, Proceedings of Forum Acusticum, Krakow, 2014

F. Winter, F. Schultz, S. Spors – Localization Properties of Data-based Binaural Synthesis including Translatory Head-Movements, Forum Acusticum, Krakow, 2014

H. Wierstorf, S. Spors – Predicting localization accuracy for stereophonic downmixes in Wave Field Synthesis, Forum Acusticum, Krakow, 2014

T. Walther, B. Cohen-Lhyver – Multimodal feedback in auditory-based active scene exploration, Forum Acusticum, Krakow, 2014

C. Schymura, T. Walther, D. Kolossa, N. Ma, G.J. Brown – Binaural Sound Source Localisation using a Bayesian-network-based Blackboard System and Hypothesis-driven Feedback, Forum Acusticum, Krakow, 2014

A. Raake, H. Wierstorf – A case for TWO!EARS in audio quality assessment, Forum Acusticum, Krakow, 2014

J. Blauert, D. Kolossa, P. Danès – Feedback Loops in Engineering Models of Binaural Listening, Proceedings of Meetings on Acoustics, 21(1), 2014

A. Raake, J. Blauert – Listening and Assessing with binaural models, EAA Joint Symposium on Auralization and Ambisonics, Berlin, 2014

A. Raake et al. – Integral interactive model of auditory perception and experience, Jahrestagung der Deutschen Gesellschaft für Akustik, Oldenburg, 2014

T. May, T. Gerkmann – Generalization of supervised learning for binary mask estimation, IEEE IWAENC, pp. 154-158, Juan-les-Pins, France, 2014

A. Portello, G. Bustamante, P. Danès, J. Piat, J. Manhès – Active Localization of an Intermittent Sound Source from a Moving Binaural Sensor, Forum Acusticum, Krakow, 2014

A. Portello, G. Bustamante, P. Danès, A. Mifsud – Localization of Multiple Sources from a Binaural Head in a Known Noisy Environment, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Chicago, 2014

A. Raake, J. Blauert – Umfassende Modellierung der Vorgänge bei der Beurteilung von Sound-Qualität, 28. Tonmeistertagung, Cologne, 2014

Final project report

Two!Ears Final Project Report

Project deliverables

D1.1 First database of audio-visual scenarios
D1.2 Intermediate database of audio-visual scenarios
D1.3 Final database of audio-visual scenarios
D2.1 Specification of software architecture, including bottom-up/top-down interfaces
D2.2 Extension of the monaural model (The Auditory Front-End Framework User Manual)
D2.3 Extension of the binaural model (The Auditory Front-End Framework User Manual)
D2.4 Extension to a dynamic binaural model (Evaluation and progress report)
D3.1 Specification of software architecture
D3.2 Progress report on software architecture
D3.3 Implementation of software architecture
D3.4 Progress report on feature selection and semantic labelling
D3.5 Report on evaluation of expert system
D4.1 Feedback-loop selection and listing
D4.2 Specification of feedback loops and implementation progress
D4.3 Final integration & evaluation report
D5.1 First intermediate report on hardware/software integration & robotics test bed
D5.2 Second intermediate report on hardware/software integration & robotics test bed
D5.3 Final report on hardware/software integration & robotics test bed
D6.1.1 Scene-model framework for auditory scene analysis
D6.1.2 Intermediate report on software for analysis of dynamic auditory scenes
D6.1.3 Final report and evaluated software for analysis of dynamic auditory scenes
D6.2.1 QoE test method specification
D6.2.2 QoE model software, first version
D6.2.3 QoE model software, final version

Related publications from other projects

T. May, T. Dau – Requirements for the evaluation of computational speech segregation systems, Journal of the Acoustical Society of America, 136(6) EL398, 2014, doi:10.1121/1.4901133

Two!Ears software components and databases

Here we provide links to the collection of software modules and databases created by the Two!Ears consortium.

Main repository of the Two!Ears Auditory Model:
Two!Ears Auditory Model

Single software modules:
Two!Ears Binaural Simulator
Two!Ears Auditory Front End
Two!Ears Blackboard System

Publicly available data from psychoacoustic experiments and acoustical measurements:
Two!Ears Database

Other software components and databases

This section links to additional databases and open-source software collections to which members of our consortium have contributed outside the Two!Ears project.

Auditory Modelling Toolbox
Sound Field Synthesis Toolbox
SOFA (Spatially Oriented Format for Acoustics)
Head-related transfer functions (HRTFs) in the horizontal plane with a resolution of 1 degree