data_juicer_agents.tools.context.list_dataset_fields#
list_dataset_fields tool package.
- class data_juicer_agents.tools.context.list_dataset_fields.ListDatasetFieldsInput(*, filter_prefix: str | None = None, include_descriptions: bool = True)[source]#
Bases:
BaseModelInput for list_dataset_fields.
This tool lists all dataset-related configuration fields recognized by Data-Juicer, including their types, default values, and descriptions. Use this before build_dataset_spec to discover advanced dataset options such as export_type, export_shard_size, load_dataset_kwargs, suffixes, or modality special tokens.
- filter_prefix: str | None#
- include_descriptions: bool#
- model_config = {}#
Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].
- data_juicer_agents.tools.context.list_dataset_fields.list_dataset_fields(*, filter_prefix: str | None = None, include_descriptions: bool = True) Dict[str, Any][source]#
List dataset-related configuration fields from Data-Juicer.
This function lists all available dataset configuration parameters from Data-Juicer, including their types, default values, and descriptions.
- Parameters:
filter_prefix – Optional filter to show only parameters matching this prefix
include_descriptions – Whether to include parameter descriptions
- Returns:
Dict containing configuration information and available parameters