Spectral analysis for non-stationary audio

A. Meynard and B. Torrésani

Institut de Mathématiques de Marseille (I2M)

Aix-Marseille Université




Abstract :

A new approach for the analysis of non-stationary signals is proposed, with a focus on audio applications. Following earlier contributions, non-stationarity is modeled via stationarity-breaking operators acting on Gaussian stationary random signals. The focus is here on time warping and amplitude modulation, and an approximate maximum-likelihood approach based on suitable approximations in the wavelet transform domain is developed. This papers provides theoretical analysis of the approximations, and describes and analyses a corresponding estimation algorithm. The latter is tested and validated on synthetic as well as real audio signal.


Keywords :

Non-stationary signals, deformation, wavelet analysis, local spectrum, Doppler effect


Link to the manuscript :

Spectral analysis for non-stationary audio


Link to the code :

JEFAS algorithm


Supplementary data :

Illustrations for the 3 test scripts provided with the paper :

Validation on a synthetic signal : starting from a random stationary gaussian process with given power spectrum, a non-stationary signal is generated by transforming it with given amplitude modulation and time warping. The simulation is used to provide quantitative assessment for the precision of estimators.

Doppler example : the signal is a sound originating from a moving racing car, recorded from a fixed location. Time warping is generated from Doppler effect, and amplitude modulation results from the variation of distance between the car and the recording device.

Singing female voice : the signal is a recording from a soprano singing a simple melody. Time warping models pitch variations.

Dolphin sound : the signal is a recording of a dolphin sound.

Wind sound : the signal is a recording of a wind sound.


Sound files :

Example Original Unwarped Unwarped and demodulated
Synthetic
Doppler
Female voice
Dolphin
Wind

Cross-synthesis :

To synthesize new sounds, we start from a stationarized sound obtained with JEFAS and we apply the deformation estimated from another nonstationarysound.

Cross-synthesis  : Matlab file enabling the cross-synthesis.


Deformation
Female voice Wind Racing car
Spectrum
Female voice
Wind
Racing car




A. Meynard and B. Torrésani

Last update: May 24, 2018