Institut de Mathématiques de Marseille (I2M)
Aix-Marseille Université
Abstract :
A new approach for the analysis of non-stationary signals is proposed, with a focus on audio applications. Following earlier contributions, non-stationarity is modeled via stationarity-breaking operators acting on Gaussian stationary random signals. The focus is here on time warping and amplitude modulation, and an approximate maximum-likelihood approach based on suitable approximations in the wavelet transform domain is developed. This papers provides theoretical analysis of the approximations, and describes and analyses a corresponding estimation algorithm. The latter is tested and validated on synthetic as well as real audio signal.
Keywords :
Non-stationary signals, deformation, wavelet analysis, local spectrum, Doppler effect
Link to the manuscript :
Spectral analysis for non-stationary audio
Link to the code :
Supplementary data :
Illustrations for the 3 test scripts provided with the paper :
Validation on a synthetic signal : starting from a random stationary gaussian process with given power spectrum, a non-stationary signal is generated by transforming it with given amplitude modulation and time warping. The simulation is used to provide quantitative assessment for the precision of estimators.
Doppler example : the signal is a sound originating from a moving racing car, recorded from a fixed location. Time warping is generated from Doppler effect, and amplitude modulation results from the variation of distance between the car and the recording device.
Singing female voice : the signal is a recording from a soprano singing a simple melody. Time warping models pitch variations.
Dolphin sound : the signal is a recording of a dolphin sound.
Wind sound : the signal is a recording of a wind sound.
Sound files :
| Example | Original | Unwarped | Unwarped and demodulated |
| Synthetic | |||
| Doppler | |||
| Female voice | |||
| Dolphin | |||
| Wind |
Cross-synthesis :
To synthesize new sounds, we start from a stationarized sound obtained with JEFAS and we apply the deformation estimated from another nonstationarysound.
Cross-synthesis : Matlab file enabling the cross-synthesis.
| Deformation | ||||
|---|---|---|---|---|
| Female voice | Wind | Racing car | ||
Spectrum
|
Female voice | |||
| Wind | ||||
| Racing car | ||||