ViVoVAD: a Voice Activity Detection Tool based on Recurrent Neural Networks

  • Pablo Gimeno Jordán Universidad de Zaragoza
  • Ignacio Viñals Bailo
  • Alfonso Ortega Giménez
  • Antonio Miguel Artiaga
  • Eduardo Lleida Solano

Resumen

Voice Activity Detection (VAD) aims to distinguish
correctly those audio segments containing human
speech. In this paper we present our latest approach
to the VAD task that relies on the modelling
capabilities of Bidirectional Long Short Term
Memory (BLSTM) layers to classify every frame in
an audio signal as speech or non-speech

Publicado
2019-05-20
Sección
Artículos (Tecnologías de la Información y las Comunicaciones)