#3909 How do I use a custom model with faster-whisper? For example, what are the steps to use kotoba-tech/kotoba-whisper-v2.2?

103.173* Posted at: 3 months ago

How do I use a custom model with faster-whisper? For example, what are the steps to use kotoba-tech/kotoba-whisper-v2.2?

7 Replies

Floor #1 DeepSeek reply at 3 months ago

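To use a custom model with faster-whisper, download the model files manually and place them in the correct directory. For the kotoba-tech/kotoba-whisper-v2.2 model, the steps are:

Create the model folder: inside the models folder in the pyVideoTrans root directory, create a folder named models--kotoba-tech--kotoba-whisper-v2.2.
Download the model files: open the model's HuggingFace page (https://huggingface.co/kotoba-tech/kotoba-whisper-v2.2/tree/main) and download all the model files listed there (.json, .bin, .txt, and so on) into the folder created in the previous step.
Select it in the software: start pyVideoTrans and, on the "Translate video or audio" screen, choose the Huggingface_ASR channel from the "Speech recognition" drop-down list. Then find and select kotoba-tech/kotoba-whisper-v2.2 in the model drop-down list on the right.

Note: manual download is the recommended approach when automatic download fails or the network is unreliable. Make sure the folder name matches the HuggingFace repository path exactly, prefixed with models-- and with / replaced by --.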
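
The naming rule above can be sketched in a few lines of Python. This is only an illustration: the helper names model_folder_name and ensure_model_folder are hypothetical and not part of pyVideoTrans, and the models root path is whatever your installation uses.

```python
from pathlib import Path


def model_folder_name(repo_id: str) -> str:
    """Map a HuggingFace repo id (owner/name) to the folder name used in this thread."""
    owner, name = repo_id.split("/", 1)
    # Rule from the thread: prefix with "models--" and replace "/" with "--".
    return f"models--{owner}--{name}"


def ensure_model_folder(models_root: str, repo_id: str) -> Path:
    """Create the model folder under models_root if it is missing and return its path."""
    folder = Path(models_root) / model_folder_name(repo_id)
    folder.mkdir(parents=True, exist_ok=True)
    return folder
```

For example, ensure_model_folder("models", "kotoba-tech/kotoba-whisper-v2.2") creates models/models--kotoba-tech--kotoba-whisper-v2.2 relative to the current directory.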


Floor #2 103.173* reply at 3 months ago

How should the folder for Jim6789/kotoba-whisper-bilingual-ct2 be named?

Floor #3 DeepSeek reply at 3 months ago

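The model folder should be named models--Jim6789--kotoba-whisper-bilingual-ct2.

Steps:

Inside the models folder in the pyVideoTrans root directory, create a folder named models--Jim6789--kotoba-whisper-bilingual-ct2.
Open the model's HuggingFace page (https://huggingface.co/Jim6789/kotoba-whisper-bilingual-ct2/tree/main) and download all the model files listed there (.json, .bin, .txt, and so on) into that folder.
In the software, choose the Huggingface_ASR channel, then find and select Jim6789/kotoba-whisper-bilingual-ct2 in the model list.

The naming rule is to prefix the HuggingFace model path with models-- and replace / with --.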
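
If downloading each file by hand is tedious, the download step could also be scripted with the huggingface_hub package. The sketch below is an assumption-laden illustration rather than a documented pyVideoTrans workflow: the thread only describes manual download, the helper names download_plan and download_model are hypothetical, and it assumes pyVideoTrans accepts files placed directly in the folder, just as with a manual download.

```python
from pathlib import Path


def download_plan(models_root: str, repo_id: str, allow_patterns=None) -> dict:
    """Build the keyword arguments for huggingface_hub.snapshot_download."""
    # Folder name follows the rule from the thread: models--<owner>--<name>.
    folder = Path(models_root) / ("models--" + repo_id.replace("/", "--"))
    return {
        "repo_id": repo_id,
        "local_dir": str(folder),
        # None downloads every file; pass e.g. ["*.json", "*.bin", "*.txt"] to filter.
        "allow_patterns": allow_patterns,
    }


def download_model(models_root: str, repo_id: str, allow_patterns=None) -> str:
    """Download the repository files into the model folder and return its path."""
    # Imported lazily so the helper above works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(**download_plan(models_root, repo_id, allow_patterns))
```

Calling download_model("models", "Jim6789/kotoba-whisper-bilingual-ct2") needs network access to huggingface.co, so it does not help when the network itself is the problem.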


Floor #4 103.173* reply at 3 months ago

chickenrice0721/whisper-large-v2-translate-zh-v0.2-st-ct2

Floor #5 DeepSeek reply at 3 months ago

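The model folder should be named models--chickenrice0721--whisper-large-v2-translate-zh-v0.2-st-ct2.

Steps:

Inside the models folder in the pyVideoTrans root directory, create a folder named models--chickenrice0721--whisper-large-v2-translate-zh-v0.2-st-ct2.
Open the model's HuggingFace page (https://huggingface.co/chickenrice0721/whisper-large-v2-translate-zh-v0.2-st-ct2/tree/main) and download all the model files listed there (.json, .bin, .txt, and so on) into that folder.
In the software, choose the Huggingface_ASR channel, then find and select chickenrice0721/whisper-large-v2-translate-zh-v0.2-st-ct2 in the model list.

The naming rule is to prefix the HuggingFace model path with models-- and replace / with --.

See the related documentation: "Adding new models from HuggingFace" and "HuggingFace_ASR speech recognition channel".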

Floor #6 103.173* reply at 3 months ago

chickenrice0721/whisper-large-v2-translate-zh-v0.2-st-ct2
How should I fill in this model? Is it an OpenAI model? Can faster-whisper use it directly?

Floor #7 DeepSeek reply at 3 months ago

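This model is in faster-whisper format; it is not an original OpenAI Whisper model. It can be used directly in the faster-whisper (local) channel.

Steps:

Inside the models folder in the pyVideoTrans root directory, create a folder named models--chickenrice0721--whisper-large-v2-translate-zh-v0.2-st-ct2.
Open the model's HuggingFace page (https://huggingface.co/chickenrice0721/whisper-large-v2-translate-zh-v0.2-st-ct2/tree/main) and download all the model files listed there (.json, .bin, .txt, and so on) into that folder.
In the "Speech recognition" drop-down list on the main screen, choose the faster-whisper (local) channel.
In the model drop-down list on the right, find and select chickenrice0721/whisper-large-v2-translate-zh-v0.2-st-ct2.

The naming rule is to prefix the HuggingFace model path with models-- and replace / with --.

See the related documentation: "Adding new models from HuggingFace" and "faster and openai recognition modes".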
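
Outside pyVideoTrans, a model in faster-whisper format can also be loaded from a local folder with the faster-whisper Python package itself. The sketch below is a minimal illustration under stated assumptions: the helper names are hypothetical, checking for model.bin is only a rough heuristic for a CTranslate2-converted model, and the device and compute_type values are examples to adapt to your hardware.

```python
from pathlib import Path


def looks_like_ct2_model(folder: Path) -> bool:
    """Rough heuristic: CTranslate2-converted models ship their weights as model.bin."""
    return (folder / "model.bin").is_file()


def transcribe_with_local_model(model_dir: str, audio_path: str) -> list[str]:
    """Transcribe audio_path with a faster-whisper model stored in model_dir."""
    if not looks_like_ct2_model(Path(model_dir)):
        raise FileNotFoundError(f"no model.bin found in {model_dir}")

    # Imported lazily so the check above works without faster-whisper installed.
    from faster_whisper import WhisperModel

    # device and compute_type are example values (assumption), not recommendations.
    model = WhisperModel(model_dir, device="cpu", compute_type="int8")
    segments, _info = model.transcribe(audio_path)
    # segments is a lazy generator; iterating it runs the actual transcription.
    return [f"[{s.start:.2f} -> {s.end:.2f}] {s.text}" for s in segments]
```

A folder that fails the model.bin check may hold a model in another format, which would fit the Huggingface_ASR channel rather than the faster-whisper (local) channel.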