paddlespeech.s2t.frontend.augmentor.online_bayesian_normalization module

Contain the online bayesian normalization augmentation model.

class paddlespeech.s2t.frontend.augmentor.online_bayesian_normalization.OnlineBayesianNormalizationAugmentor(rng, target_db, prior_db, prior_samples, startup_delay=0.0)[source]

Bases: AugmentorBase

Augmentation model for adding online bayesian normalization.

Parameters:

rng (random.Random) -- Random generator object.
target_db (float) -- Target RMS value in decibels.
prior_db (float) -- Prior RMS estimate in decibels.
prior_samples (int) -- Prior strength in number of samples.
startup_delay -- Default 0.0s. If provided, this function will accrue statistics for the first startup_delay seconds before applying online normalization.

Methods

`__call__`(x[, uttid, train])	Call self as a function.
`transform_audio`(audio_segment)	Normalizes the input audio using the online Bayesian approach.
`transform_feature`(spec_segment)	Adds various effects to the input audo feature segment.

transform_audio(audio_segment)[source]

Normalizes the input audio using the online Bayesian approach.

Note that this is an in-place transformation.

Parameters:: audio_segment (AudioSegment|SpeechSegment) -- Audio segment to add effects to.