
`langchain_experimental.data_anonymizer.deanonymizer_matching_strategies`.fuzzy_matching_strategy

langchain_experimental.data_anonymizer.deanonymizer_matching_strategies.fuzzy_matching_strategy(text: str, deanonymizer_mapping: Dict[str, Dict[str, str]], max_l_dist: int = 3) → str

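Fuzzy matching strategy for deanonymization.

It uses fuzzy matching to locate the anonymized entities in the text, then replaces every anonymized entity with its original entity.

Examples of matching:

Kaenu Reves -> Keanu Reeves
John F. Kennedy -> John Kennedy

Parameters

text (str) – The text to deanonymize.
deanonymizer_mapping (Dict[str, Dict[str, str]]) – Mapping between anonymized entities and original entities.
max_l_dist (int) – Maximum Levenshtein distance between an anonymized entity and a text segment for the segment to be considered a match.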

Return type

str
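
The idea behind the `max_l_dist` threshold can be illustrated with a small standard-library sketch. This is not the library's implementation: the helper names `levenshtein`, `best_window`, and `fuzzy_deanonymize_sketch` are hypothetical, the brute-force window search is chosen for readability rather than speed, and the sketch replaces only the single closest occurrence of each entity. It also assumes the outer keys of the mapping are entity types such as `"PERSON"` and that each inner dictionary maps an anonymized value to its original value.

```python
from typing import Dict, Optional, Tuple


def levenshtein(a: str, b: str) -> int:
    """Classic dynamic-programming edit distance between two strings."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def best_window(text: str, pattern: str, max_l_dist: int) -> Optional[Tuple[int, int]]:
    """Return (start, end) of the closest text window within max_l_dist, or None.

    Ties on distance are broken by preferring a window whose length is
    closest to the pattern length, then the earliest start (a heuristic).
    """
    n = len(pattern)
    best = None  # (distance, length difference, start, end)
    for length in range(max(1, n - max_l_dist), n + max_l_dist + 1):
        for start in range(len(text) - length + 1):
            d = levenshtein(pattern, text[start:start + length])
            if d > max_l_dist:
                continue
            candidate = (d, abs(length - n), start, start + length)
            if best is None or candidate < best:
                best = candidate
    return None if best is None else (best[2], best[3])


def fuzzy_deanonymize_sketch(
    text: str,
    deanonymizer_mapping: Dict[str, Dict[str, str]],
    max_l_dist: int = 3,
) -> str:
    """Replace the closest fuzzy match of each anonymized entity with its original."""
    for entities in deanonymizer_mapping.values():
        for anonymized, original in entities.items():
            span = best_window(text, anonymized, max_l_dist)
            if span is not None:
                start, end = span
                text = text[:start] + original + text[end:]
    return text


# Hypothetical mapping: the anonymized name "Keanu Reeves" stands in for "Slim Shady".
mapping = {"PERSON": {"Keanu Reeves": "Slim Shady"}}
restored = fuzzy_deanonymize_sketch("Are you Kaenu Reves?", mapping)
print(restored)
```

In this sketch the misspelled segment "Kaenu Reves" lies at Levenshtein distance 3 from "Keanu Reeves", so it is accepted with the default `max_l_dist=3` and rejected with any smaller threshold.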

Examples using fuzzy_matching_strategy

Reversible data anonymization with Microsoft Presidio