paddlespeech.vector.io.dataset_from_json module
- class paddlespeech.vector.io.dataset_from_json.JSONDataset(json_file: str, feat_type: str = 'raw', **kwargs)[source]
Bases:
Dataset
dataset from json file.
- class paddlespeech.vector.io.dataset_from_json.meta_info(utt_id: str, duration: float, wav: str, start: int, stop: int, record_id: str)[source]
Bases:
object
the audio meta info in the vector JSONDataset Args:
utt_id (str): the segment name duration (float): segment time wav (str): wav file path start (int): start point in the original wav file stop (int): stop point in the original wav file lab_id (str): the record id
- duration: float
- record_id: str
- start: int
- stop: int
- utt_id: str
- wav: str