paddlespeech.vector.io.dataset_from_json module

class paddlespeech.vector.io.dataset_from_json.JSONDataset(json_file: str, feat_type: str = 'raw', **kwargs)[source]

Bases: Dataset

dataset from json file.

class paddlespeech.vector.io.dataset_from_json.meta_info(utt_id: str, duration: float, wav: str, start: int, stop: int, record_id: str)[source]

Bases: object

the audio meta info in the vector JSONDataset Args:

utt_id (str): the segment name duration (float): segment time wav (str): wav file path start (int): start point in the original wav file stop (int): stop point in the original wav file lab_id (str): the record id

duration: float

record_id: str

start: int

stop: int

utt_id: str

wav: str