180 likes | 302 Vues
This paper presents a framework for user-driven quality enhancement in audio signal processing, discussed at the 134th AES Convention in Rome, Italy. The talk outlines the problem definition, application areas such as immersive speech and gaming, and preliminary results. The framework utilizes interactive evolutionary algorithms to adapt audio processing settings based on user preferences, tackling issues like user fatigue and time constraints. Its applications span immersive audio experiences, forensic audio, and gaming audio, indicating potential for practical implementations in various fields.
E N D
134th AES CONVENTION Rome, Italy – 2013 May 4-7 Convention Paper8823 User-Driven Quality Enhancement for Audio Signal Processing D. Comminiello, S. Scardapane, M. Scarpiniti, A. Uncini
2013 May 4-7 Outline of the Talk • (1) • Problem Definition • Our Framework • (2) • Applications • Immersive Speech, Games… • (3) • Preliminary Results • Conclusions
2013 May 4-7 Understanding the User Development Stage ??? Personal Judgement Audio Processor
2013 May 4-7 Evaluation Procedures • ObjectiveIndexes • PsychoacusticModels • SubjectiveTests
2013 May 4-7 ClassicalApproach
Possible Enhanced Audio 2013 May 4-7 OurApproach
2013 May 4-7 Interactive EvolutionaryAlgorithm Subjective Evaluation «Reproduction» New Pool Selection Pool of PossibleSettings
2013 May 4-7 MainDrawbacks User Fatigue Time Constraint User Discrimination Partial Ordering Fast convergenceobtained with fewpossible fitness values
2013 May 4-7 OurProposal IEC hasbeenseldomlyusedin audio processing applications. Webelieveit to be of practicalinterest for a wide range of tasks. Thisis the mainreasonwe are proposingthisframework.
2013 May 4-7 Applications – Games Audio for Games
2013 May 4-7 Applications – Forensic Audio Forensic Audio (Image property of SoundAndSound)
2013 May 4-7 Applications – Immersive Experience Immersive Audio (Image property of Integrated Media Systems Center)
2013 May 4-7 Interactive AEC
2013 May 4-7 Test Setup Five signalsdistorted by female voice. Echocancellationthroughaffine projectionalgorithm (APA) with 4 parameters. A standard GA minimizesnormalizedmisalignment: An IGA shouldminimizeuser’spreferences.
2013 May 4-7 Test Setup - Workflow OriginalSignals Standard Minimization IEC Minimization Set 2 Set 1 SubjectiveComparison
2013 May 4-7 Results
2013 May 4-7 Conclusions Goodresults of ourframework on AEC. User fatigueis the maindrawback to be confronted. Severalapplicationsawaits in the future. Possiblecombination of objective and subjectivemeasurements.