GlinerGraphTransformer#

class langchain_experimental.graph_transformers.gliner.GlinerGraphTransformer(allowed_nodes: List[str], allowed_relationships: List[str] | Dict[str, Any], gliner_model: str = 'urchade/gliner_mediumv2.1', glirel_model: str = 'jackboyla/glirel_beta', entity_confidence_threshold: float = 0.1, relationship_confidence_threshold: float = 0.1, device: str = 'cpu', ignore_self_loops: bool = True)[source]#

一个用于将文档转换为图结构的转换器类使用GLiNER和GLiREL模型。

该类利用GLiNER进行命名实体识别，并利用GLiREL从文本文档中提取关系，将其转换为图格式。提取的实体和关系根据指定的置信度阈值和允许的类型进行过滤。

For more details on GLiNER and GLiREL, visit their respective repositories:: GLiNER: urchade/GLiNER GLiREL: jackboyla/GLiREL

Parameters:

allowed_nodes (List[str]) – 用于实体提取的允许节点类型列表。
allowed_relationships (Union[List[str], Dict[str, Any]]) – 允许的关系类型列表或包含关系提取额外配置的字典。
gliner_model (str) – 使用的预训练GLiNER模型的名称。默认是“urchade/gliner_mediumv2.1”。
glirel_model (str) – 使用的预训练GLiREL模型的名称。默认值为“jackboyla/glirel_beta”。
entity_confidence_threshold (float) – 用于过滤提取实体的置信度阈值。默认值为0.1。
relationship_confidence_threshold (float) – 用于过滤提取关系的置信度阈值。默认值为0.1。
device (str) – 用于模型推理的设备（‘cpu’ 或 ‘cuda’）。默认值为“cpu”。
ignore_self_loops (bool) – 是否忽略源节点和目标节点相同的关系。默认值为 True。

方法

`__init__`(allowed_nodes, allowed_relationships)
`convert_to_graph_documents`(documents)	将一系列文档转换为图形文档。
`process_document`(document)

__init__(allowed_nodes: List[str], allowed_relationships: List[str] | Dict[str, Any], gliner_model: str = 'urchade/gliner_mediumv2.1', glirel_model: str = 'jackboyla/glirel_beta', entity_confidence_threshold: float = 0.1, relationship_confidence_threshold: float = 0.1, device: str = 'cpu', ignore_self_loops: bool = True) → None[源代码]#

Parameters:

allowed_nodes (列表[字符串])
allowed_relationships (List[str] | Dict[str, Any])
gliner_model (str)
glirel_model (str)
entity_confidence_threshold (float)
relationship_confidence_threshold (float)
device (str)
ignore_self_loops (bool)

Return type:

无

convert_to_graph_documents(documents: Sequence[Document]) → List[GraphDocument][source]#

将一系列文档转换为图形文档。

Parameters:

documents (Sequence[Document]) – 原始文档。
kwargs – 额外的关键字参数。

Returns:

将文档转换为图形。

Return type:

序列[GraphDocument]

process_document(document: Document) → GraphDocument[source]#

Parameters:: 文档 (Document)
Return type:: GraphDocument