paddleaudio.datasets.gtzan module
- class paddleaudio.datasets.gtzan.GTZAN(mode='train', seed=0, n_folds=5, split=1, feat_type='raw', **kwargs)[source]
Bases:
AudioClassificationDataset
The GTZAN dataset consists of 1000 audio tracks each 30 seconds long. It contains 10 genres, each represented by 100 tracks. The dataset is the most-used public dataset for evaluation in machine listening research for music genre recognition (MGR).
- Reference:
Musical genre classification of audio signals https://ieeexplore.ieee.org/document/1021072/
Methods
alias of
META_INFO
- archieves = [{'url': 'http://opihi.cs.uvic.ca/sound/genres.tar.gz', 'md5': '5b3d6dddb579ab49814ab86dba69e7c7'}]
- audio_path = 'genres'
- label_list = ['blues', 'classical', 'country', 'disco', 'hiphop', 'jazz', 'metal', 'pop', 'reggae', 'rock']
- meta = 'genres/input.mf'
- meta_info
alias of
META_INFO