inria-00326529, version 1
A Lightweight Speech Detection System for Perceptive Environments
Dominique Vaufreydaz 1Rémi Emonet 1Patrick Reignier 1
3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms (2006)
Résumé : In this paper, we address the problem of speech activity detection in multimodal perceptive environments. Such space may contain many different microphones (lapel, distant or table top). Thus, we need a generic speech activity detector in order to cope with different speech conditions (from closetalking to noisy distant speech). Moreover, as the number of microphones in the room can be high, we also need a very light system. The speech activity detector presented in this article works efficiently on dozens of microphones in parallel. We will see that even if its absolute score of the evaluation is not perfect (30% and 40% of error rate respectively on the two tasks), its accuracy is good enough in the context we are using it.
- 1 : PRIMA (INRIA Grenoble Rhône-Alpes / LIG Laboratoire d'Informatique de Grenoble)
- INRIA – Université Joseph Fourier - Grenoble I – Institut polytechnique de Grenoble (Grenoble INP) – Université Pierre-Mendès-France - Grenoble II – CNRS : UMR5217
- Domaine : Informatique/Informatique ubiquitaire
Informatique/Son
- inria-00326529, version 1
- http://hal.inria.fr/inria-00326529
- oai:hal.inria.fr:inria-00326529
- Contributeur : Dominique Vaufreydaz
- Soumis le : Vendredi 3 Octobre 2008, 12:41:14
- Dernière modification le : Vendredi 3 Octobre 2008, 16:35:03