Journal article, Lecture Notes in Computer Science, Year: 2010

Multimodal Source Separation

Abstract

The work of Bernstein and Benoît has confirmed that it is advantageous to use multiple senses, for example to employ both audio and visual modalities, in speech perception. As a consequence, looking at the speaker's face can be useful to better hear a speech signal in a noisy environment and to extract it from competing sources, as originally identified by Cherry, who posed the so-called "Cocktail Party" problem. To exploit the intrinsic coherence between audition and vision within a machine, the method of blind source separation (BSS) is particularly attractive.

Dates and versions

hal-00463627, version 1 (13-03-2010)

Cite

Bertrand Rivet, Jonathon Chambers. Multimodal Source Separation. Lecture Notes in Computer Science, 2010, Advances in Nonlinear Speech Processing. International Conference on Nonlinear Speech Processing, NOLISP 2009, Vic, Spain, June 25-27, 2009, Revised Selected Papers, LNAI 5933, pp.1-11. ⟨10.1007/978-3-642-11509-7⟩. ⟨hal-00463627⟩