data_juicer.format.json_formatter module
- class data_juicer.format.json_formatter.JsonFormatter(dataset_path, suffixes=None, **kwargs)
Bases: LocalFormatter
This class loads and formats JSON-type files.
The default suffixes are ['.json', '.jsonl', '.json.gz', '.jsonl.gz', '.json.zst', '.jsonl.zst'].
- SUFFIXES = ['.json', '.jsonl', '.json.gz', '.jsonl.gz', '.json.zst', '.jsonl.zst']
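
To illustrate what a suffix list like this is for, the following is a minimal standard-library sketch of filtering candidate files by suffix. It is not Data-Juicer's implementation: the helper `filter_by_suffix` and the example paths are hypothetical, and the real formatter may match suffixes differently. Only the suffix values themselves come from the documented `SUFFIXES` attribute.

```python
from pathlib import Path

# Values copied from the documented SUFFIXES attribute.
SUFFIXES = ['.json', '.jsonl', '.json.gz', '.jsonl.gz', '.json.zst', '.jsonl.zst']


def filter_by_suffix(paths, suffixes=None):
    """Hypothetical helper: keep paths whose file name ends with an allowed suffix.

    Falls back to SUFFIXES when no suffixes are given, mirroring the
    documented `suffixes=None` default (an assumption about its meaning).
    """
    allowed = tuple(suffixes) if suffixes else tuple(SUFFIXES)
    # str.endswith accepts a tuple and returns True if any entry matches.
    return [p for p in paths if Path(p).name.endswith(allowed)]


# Hypothetical file names, used only for illustration.
candidates = ['data/a.json', 'data/b.jsonl.gz', 'data/c.csv', 'data/d.json.bak']

selected = filter_by_suffix(candidates)
jsonl_gz_only = filter_by_suffix(candidates, suffixes=['.jsonl.gz'])
```

Matching on the end of the whole file name, instead of on `Path.suffix`, keeps compound suffixes such as `.jsonl.gz` intact, since `Path.suffix` would report only `.gz`.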