data_juicer_agents.tools.plan.build_dataset_spec#

build_dataset_spec tool package.

class data_juicer_agents.tools.plan.build_dataset_spec.BuildDatasetSpecInput(*, intent: str, dataset_path: str, export_path: str, dataset_profile: ~typing.Dict[str, ~typing.Any], modality_hint: str = '', text_keys_hint: ~typing.List[str] = <factory>, image_key_hint: str = '', audio_key_hint: str = '', video_key_hint: str = '', image_bytes_key_hint: str = '')[源代码]#

基类:BaseModel

intent: str#
dataset_path: str#
export_path: str#
dataset_profile: Dict[str, Any]#
modality_hint: str#
text_keys_hint: List[str]#
image_key_hint: str#
audio_key_hint: str#
video_key_hint: str#
image_bytes_key_hint: str#
model_config = {}#

Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].

data_juicer_agents.tools.plan.build_dataset_spec.build_dataset_spec(*, user_intent: str, dataset_path: str, export_path: str, dataset_profile: Dict[str, Any] | None = None, modality_hint: str = '', text_keys_hint: Iterable[Any] | None = None, image_key_hint: str = '', audio_key_hint: str = '', video_key_hint: str = '', image_bytes_key_hint: str = '') Dict[str, Any][源代码]#