imgdiff_difference_area_generator_mapper#

Generates and filters bounding boxes for image pairs based on similarity, segmentation, and text matching.

This operator processes image pairs to identify and filter regions with significant differences. It uses a sequence of operations:

Filters out image pairs with large differences.
Segments the images to identify potential objects.
Crops sub-images based on bounding boxes.
Determines if the sub-images contain valid objects using image-text matching.
Filters out sub-images that are too similar.
Removes overlapping bounding boxes.
Uses Hugging Face models for similarity and text matching, and FastSAM for segmentation.
Caches intermediate results in DATA_JUICER_ASSETS_CACHE.
Returns the filtered bounding boxes in the MetaKeys.bbox_tag field.

基于相似性、分割和文本匹配生成并过滤图像对的边界框。

此算子处理图像对以识别和过滤具有显著差异的区域。它使用一系列操作：

Type 算子类型: mapper

Tags 标签: cpu

🔧 Parameter Configuration 参数配置#

name 参数名	type 类型	default 默认值	desc 说明
`image_pair_similarity_filter_args`	typing.Optional[typing.Dict]	`{}`	Arguments for image pair similarity filter.
`image_segment_mapper_args`	typing.Optional[typing.Dict]	`{}`	Arguments for image segmentation mapper.
`image_text_matching_filter_args`	typing.Optional[typing.Dict]	`{}`	Arguments for image-text matching filter.
`args`		`''`
`kwargs`		`''`

not available 暂无