Google语音转文字加载器#

class langchain_community.document_loaders.google_speech_to_text.GoogleSpeechToTextLoader(project_id: str, file_path: str, location: str = 'us-central1', recognizer_id: str = '_', config: RecognitionConfig | None = None, config_mask: FieldMask | None = None)[source]#

自版本0.0.32起已弃用：请改用:class:`~langchain_google_community.SpeechToTextLoader`。在langchain-community==1.0之前不会移除。

Google Cloud Speech-to-Text 音频转录的加载器。

它使用Google Cloud Speech-to-Text API来转录音频文件，并根据指定的格式将转录的文本加载到一个或多个文档中。

要使用，您应该安装google-cloud-speech python包。

音频文件可以通过Google Cloud Storage的URI或本地文件路径来指定。

有关Google Cloud Speech-to-Text的详细解释，请参阅产品文档。 https://cloud.google.com/speech-to-text

初始化GoogleSpeechToTextLoader。

Parameters:

project_id (str) – Google Cloud 项目 ID。
file_path (str) – 一个Google云存储URI或本地文件路径。
location (str) – 语音转文本识别器的位置。
recognizer_id (str) – 语音转文本识别器ID。
config (可选[RecognitionConfig]) – 识别选项和功能。更多信息： https://cloud.google.com/python/docs/reference/speech/latest/google.cloud.speech_v2.types.RecognitionConfig
config_mask (可选[FieldMask]) – 在此识别请求期间，覆盖识别器default_recognition_config中值的配置字段列表。更多信息： https://cloud.google.com/python/docs/reference/speech/latest/google.cloud.speech_v2.types.RecognizeRequest

方法

`__init__`(project_id, file_path[, location, ...])	初始化 GoogleSpeechToTextLoader。
`alazy_load`()	文档的懒加载器。
`aload`()	将数据加载到Document对象中。
`lazy_load`()	一个用于文档的懒加载器。
`load`()	将音频文件转录并将转录内容加载到文档中。
`load_and_split`([text_splitter])	加载文档并将其分割成块。

__init__(project_id: str, file_path: str, location: str = 'us-central1', recognizer_id: str = '_', config: RecognitionConfig | None = None, config_mask: FieldMask | None = None)[source]#

初始化GoogleSpeechToTextLoader。

Parameters:

project_id (str) – Google Cloud 项目 ID。
file_path (str) – 一个Google云存储URI或本地文件路径。
location (str) – 语音转文本识别器的位置。
recognizer_id (str) – 语音转文本识别器ID。
config (可选[RecognitionConfig]) – 识别选项和功能。更多信息： https://cloud.google.com/python/docs/reference/speech/latest/google.cloud.speech_v2.types.RecognitionConfig
config_mask (可选[FieldMask]) – 在此识别请求期间，覆盖识别器default_recognition_config中值的配置字段列表。更多信息： https://cloud.google.com/python/docs/reference/speech/latest/google.cloud.speech_v2.types.RecognizeRequest

async alazy_load() → AsyncIterator[Document]#

文档的懒加载器。

Return type:: AsyncIterator[Document]

async aload() → list[Document]#

将数据加载到Document对象中。

Return type:: 列表[Document]

lazy_load() → Iterator[Document]#

文档的懒加载器。

Return type:: 迭代器[文档]

load() → List[Document][source]#

转录音频文件并将转录内容加载到文档中。

它使用Google Cloud Speech-to-Text API来转录音频文件，并阻塞直到转录完成。

Return type:: 列表[文档]

load_and_split(text_splitter: TextSplitter | None = None) → list[Document]#

加载文档并将其分割成块。块以文档形式返回。

不要重写此方法。它应该被视为已弃用！

Parameters:: text_splitter (可选[TextSplitter]) – 用于分割文档的TextSplitter实例。默认为RecursiveCharacterTextSplitter。
Returns:: 文档列表。
Return type:: 列表[Document]