data_juicer.utils.job.common module#
DataJuicer Job Utilities - Common Functions
Shared utilities for job stopping and monitoring operations.
- class data_juicer.utils.job.common.JobUtils(job_id: str, work_dir: str = None, base_dir: str = None)[源代码]#
基类:
objectCommon utilities for DataJuicer job operations.
- __init__(job_id: str, work_dir: str = None, base_dir: str = None)[源代码]#
Initialize job utilities.
- 参数:
job_id -- The job ID to work with
work_dir -- Work directory that already includes job_id (preferred)
base_dir -- Base directory containing job outputs (deprecated, use work_dir instead)
- extract_process_thread_ids() Dict[str, Set[int]][源代码]#
Extract process and thread IDs from event logs. Returns a dict with 'process_ids' and 'thread_ids' sets.
- find_processes_by_ids(process_ids: Set[int]) List[Process][源代码]#
Find running processes by their PIDs.