data_juicer.format.csv_formatter module#

class data_juicer.format.csv_formatter.CsvFormatter(dataset_path, suffixes=None, **kwargs)[source]#

Bases: LocalFormatter

The class is used to load and format csv-type files.

Default suffixes is [‘.csv’]

SUFFIXES = ['.csv']#
__init__(dataset_path, suffixes=None, **kwargs)[source]#

Initialization method.

Parameters:
  • dataset_path – a dataset file or a dataset directory

  • suffixes – files with specified suffixes to be processed

  • kwargs – extra args