Pseudo-determined blind source separation for ad-hoc microphone networks


Abstract

We propose a pseudo-determined blind source separation framework that exploits the information from a large number of microphones in an ad-hoc network to extract and enhance sound sources in a reverberant scenario. After compensating for the time offsets and sampling rate mismatch between (asynchronous) signals, we interpret as a determined M × M mixture the over-determined M × N mixture, where M > N is the number of microphones and N is the number of sources. Next, we propose a pseudodetermined mixture model that can apply an M × M independent component analysis (ICA) directly to the M-channel recordings. Moreover, we propose a reference-based permutation alignment scheme that aligns the permutation of the ICA outputs and classifies them into target channels, which contain the N sources, and nontarget channels, which contain reverberation residuals. Finally, using the signals from nontarget channels, we estimate in each target channel the power spectral density of the noise component that we suppress with a spectral postfilter. Interestingly, we also obtain late-reverberation suppression as byproduct. Experiments show that each processing block improves incrementally source separation and that the performance of the proposed pseudodetermined separation improves as the number of microphones increases.

Reference

This page shows audio demos comparing the following algorithms.

  • Applied to asynchronous signal directly: AsyBSS
  • Determined source separation: DBSS
  • Over-determined source separation: BFBSS, SSBSS, MOBSS, ROBSS
  • Post-filtering: POST, UMMSE, BENCHMARK

The experiment conditions is as below.

  • 8 independent microphones in an ad-hoc network
  • Sampling rate: 16kHz
  • Signal duration: 20s
  • Reverberation time: 800ms
  • The audio data is downloaded from http://sisec.inria.fr/sisec-2015/2015-asynchronous-recordings-of-speech-mixtures/

The processing results for different number of sources are as below.

2 sources

Algorithm Ch 1 Ch 2
Clean microphone signal
Mixed microphone signal
AsyBSS
DBSS
BFBSS
SSBSS
MOBSS
ROBSS
POST
UMMSE
BENCHMARK


3 sources

Algorithm Ch 1 Ch 2 Ch 3
Clean microphone signal
Mixed microphone signal
AsyBSS
DBSS
BFBSS
SSBSS
MOBSS
ROBSS
POST
UMMSE
BENCHMARK

4 sources

Algorithm Ch 1 Ch 2 Ch 3 Ch 4
Clean microphone signal
Mixed microphone signal
AsyBSS
DBSS
BFBSS
SSBSS
MOBSS
ROBSS
POST
UMMSE
BENCHMARK
This page is maintained by Lin Wang
Last modification: | Created: 30/11/2016