data_juicer.format.parquet_formatter module
- class data_juicer.format.parquet_formatter.ParquetFormatter(dataset_path, suffixes=None, **kwargs)
Bases: LocalFormatter
This class loads and formats parquet-type files.
The default suffixes are ['.parquet'].
- SUFFIXES = ['.parquet']
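The role of the suffix list can be illustrated with a small standalone sketch. It is not the library's implementation: `find_files_with_suffixes` and `DEFAULT_SUFFIXES` are hypothetical names introduced here, and the sketch only assumes that a formatter of this kind selects files under `dataset_path` whose extension appears in `suffixes`, falling back to `['.parquet']` when `suffixes` is `None`.

```python
import tempfile
from pathlib import Path

# Assumption: mirrors the documented class attribute SUFFIXES = ['.parquet'].
DEFAULT_SUFFIXES = ['.parquet']


def find_files_with_suffixes(dataset_path, suffixes=None):
    """Hypothetical helper: return files under dataset_path with a matching suffix.

    This is an illustration of suffix-based filtering, not data_juicer code.
    """
    suffixes = suffixes or DEFAULT_SUFFIXES
    root = Path(dataset_path)
    if root.is_file():
        # A single file is accepted only if its extension is in the list.
        return [str(root)] if root.suffix in suffixes else []
    # For a directory, walk it recursively and keep matching files.
    return sorted(
        str(p) for p in root.rglob('*') if p.is_file() and p.suffix in suffixes
    )


def demo():
    """Create a throwaway directory and show which file names are selected."""
    with tempfile.TemporaryDirectory() as tmp:
        for name in ('a.parquet', 'b.parquet', 'notes.txt'):
            (Path(tmp) / name).touch()
        return [Path(p).name for p in find_files_with_suffixes(tmp)]


if __name__ == '__main__':
    print(demo())
```

With the library installed, the documented signature suggests that the class itself would be constructed as `ParquetFormatter(dataset_path)`, with `suffixes` left at `None` to use the default list.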