ImageHash

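The ImageHash pipeline generates perceptual image hashes. These hashes can be used to detect near-duplicate images. This method does not rely on machine learning models and is not intended to find conceptually similar images.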

Example

The following shows a simple example using this pipeline.

from txtai.pipeline import ImageHash

# Create and run pipeline
ihash = ImageHash()
ihash("Path to image file")

See the link below for a more detailed example.

Notebook	Description
Near duplicate image detection	Identify duplicate and near-duplicate images
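
Near-duplicate detection with perceptual hashes usually comes down to counting how many bits differ between two hashes. The following is a minimal sketch in plain Python, assuming hashes are equal-length hex strings such as those returned with `strings=True`. The helper names `hamming` and `isduplicate`, the threshold of 5 bits, and the sample hash values are illustrative assumptions, not part of txtai.

```python
def hamming(hash1, hash2):
    """Counts the differing bits between two equal-length hex hash strings."""
    if len(hash1) != len(hash2):
        raise ValueError("Hashes must have the same length")

    # XOR leaves a 1 bit wherever the two hashes disagree
    return bin(int(hash1, 16) ^ int(hash2, 16)).count("1")


def isduplicate(hash1, hash2, threshold=5):
    """Treats two images as near-duplicates when few bits differ (threshold is an assumption)."""
    return hamming(hash1, hash2) <= threshold


# Made-up 64-bit hashes for illustration, not output from real images
original = "ffd8c0c0e0f0f8fc"
resized = "ffd8c0c0e0f0f8fe"
different = "0027333f1f0f0703"

print(hamming(original, resized))
print(isduplicate(original, resized))
print(isduplicate(original, different))
```

A suitable threshold depends on the hashing algorithm, the hash size, and the kind of edits that should still count as duplicates, so it is worth tuning on real data.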

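Configuration-driven example

Pipelines can be run with Python or with configuration. In configuration, a pipeline is instantiated using the lower case name of its class. Configuration-driven pipelines are run with workflows or the API.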

config.yml

# Create pipeline using lower case class name
imagehash:

# Run pipeline with workflow
workflow:
  imagehash:
    tasks:
      - action: imagehash

Run with Workflows

from txtai import Application

# Create and run pipeline with workflow
app = Application("config.yml")
list(app.workflow("imagehash", ["Path to image file"]))

Run with API

CONFIG=config.yml uvicorn "txtai.api:app" &

curl \
  -X POST "http://localhost:8000/workflow" \
  -H "Content-Type: application/json" \
  -d '{"name":"imagehash", "elements":["Path to image file"]}'

Methods

Python documentation for the pipeline.

`__init__(algorithm='average', size=8, strings=True)`

Creates an ImageHash pipeline.

Parameters:

Name	Description	Default
`algorithm`	image hashing algorithm (average, perceptual, difference, wavelet, color)	`'average'`
`size`	hash size	`8`
`strings`	outputs hex strings if True (default), otherwise the pipeline returns numpy arrays	`True`
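
To make the `algorithm` and `size` parameters more concrete, here is a minimal sketch of the general average hash idea in plain Python. It is not the txtai implementation. It assumes the image has already been converted to grayscale and scaled down to a `size` x `size` grid, and the function name `averagehash` and the sample grids are hypothetical.

```python
def averagehash(pixels):
    """
    Sketch of average hashing over an already downscaled grayscale grid.

    pixels: size x size list of rows with brightness values (0-255)
    """
    flat = [value for row in pixels for value in row]
    mean = sum(flat) / len(flat)

    # One bit per pixel: 1 when brighter than the mean, 0 otherwise
    bits = "".join("1" if value > mean else "0" for value in flat)

    # Pack the bits into a zero-padded hex string, 4 bits per character
    width = (len(bits) + 3) // 4
    return format(int(bits, 2), f"0{width}x")


# Hypothetical 8x8 image: bright left half, dark right half
image = [[200] * 4 + [50] * 4 for _ in range(8)]

# Same image with slightly shifted brightness, as might happen after re-encoding
altered = [[value + 3 for value in row] for row in image]

print(averagehash(image))
print(averagehash(altered))
```

Under these assumptions, an 8 x 8 grid yields 64 bits, or 16 hex characters. Because each bit only records whether a pixel is above or below the mean, a uniform brightness shift leaves the hash unchanged, which is what makes this kind of hash useful for near-duplicate detection.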

Source code in txtai/pipeline/image/imagehash.py

def __init__(self, algorithm="average", size=8, strings=True):
    """
    Creates an ImageHash pipeline.

    Args:
        algorithm: image hashing algorithm (average, perceptual, difference, wavelet, color)
        size: hash size
        strings: outputs hex strings if True (default), otherwise the pipeline returns numpy arrays
    """

    if not PIL:
        raise ImportError('ImageHash pipeline is not available - install "pipeline" extra to enable')

    self.algorithm = algorithm
    self.size = size
    self.strings = strings

`__call__(images)`

Generates perceptual image hashes.