data_juicer_agents.tools.plan.build_dataset_spec#
build_dataset_spec tool package.
- class data_juicer_agents.tools.plan.build_dataset_spec.BuildDatasetSpecInput(*, intent: str, dataset_path: str, export_path: str, dataset_profile: ~typing.Dict[str, ~typing.Any], modality_hint: str = '', text_keys_hint: ~typing.List[str] = <factory>, image_key_hint: str = '', audio_key_hint: str = '', video_key_hint: str = '', image_bytes_key_hint: str = '')[源代码]#
基类:
BaseModel- intent: str#
- dataset_path: str#
- export_path: str#
- dataset_profile: Dict[str, Any]#
- modality_hint: str#
- text_keys_hint: List[str]#
- image_key_hint: str#
- audio_key_hint: str#
- video_key_hint: str#
- image_bytes_key_hint: str#
- model_config = {}#
Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].
- data_juicer_agents.tools.plan.build_dataset_spec.build_dataset_spec(*, user_intent: str, dataset_path: str, export_path: str, dataset_profile: Dict[str, Any] | None = None, modality_hint: str = '', text_keys_hint: Iterable[Any] | None = None, image_key_hint: str = '', audio_key_hint: str = '', video_key_hint: str = '', image_bytes_key_hint: str = '') Dict[str, Any][源代码]#