#3557 d:\pyvideotrans>uv run cli.py --task vtv \ --name "D:\cs.mp4" \ --source_language_code "en" \ --target_language_co

154.3* Posted at: 4 months ago

d:\pyvideotrans>uv run cli.py --task vtv \
--name "D:\cs.mp4" \
--source_language_code "en" \
--target_language_code "zh-cn" \
--tts_type 8 \ # switch to F5-TTS
--voice_role "clone" \ # select clone mode for the voice role
--translate_type 1 \
--subtitle_type 1 \
--voice_autorate \
--video_autorate \
--recogn2pass \ # enable second-pass recognition
--enable_diariz True \ # enable speaker diarization
--nums_diariz -1 # auto-detect the number of speakers

1 Reply

Floor #1 DeepSeek replied 4 months ago

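The command-line parameters --enable_diariz True and --nums_diariz -1 are not formatted correctly, which causes the command to fail.

Solution:

The boolean parameter --enable_diariz takes True or False directly as its value, with nothing else in between. It should be written as --enable_diariz True.
The value -1 for the integer parameter --nums_diariz is valid and means the number of speakers is detected automatically.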
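
As a rough illustration of how these two values would be read, here is a minimal sketch built on Python's standard argparse module. It assumes the options are declared roughly like this; `str_to_bool` and `build_parser` are hypothetical names for this example and are not taken from pyvideotrans, whose cli.py may handle these options differently.

```python
import argparse


def str_to_bool(text: str) -> bool:
    """Hypothetical helper: map 'True'/'False' (any case) to a bool."""
    lowered = text.strip().lower()
    if lowered in ("true", "1", "yes"):
        return True
    if lowered in ("false", "0", "no"):
        return False
    raise argparse.ArgumentTypeError(f"expected True or False, got {text!r}")


def build_parser() -> argparse.ArgumentParser:
    # Assumed stand-in for the two options discussed above;
    # the real cli.py may declare them differently.
    parser = argparse.ArgumentParser(prog="cli-sketch")
    parser.add_argument("--enable_diariz", type=str_to_bool, default=False)
    parser.add_argument("--nums_diariz", type=int, default=-1)
    return parser


# argparse accepts "-1" as a value here because the parser defines
# no option that itself looks like a negative number.
args = build_parser().parse_args(
    ["--enable_diariz", "True", "--nums_diariz", "-1"]
)
print(args.enable_diariz, args.nums_diariz)  # True -1
```

Under these assumptions, both values parse cleanly once each one directly follows its option name.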

The corrected command should be:

uv run cli.py --task vtv \
  --name "D:\cs.mp4" \
  --source_language_code "en" \
  --target_language_code "zh-cn" \
  --tts_type 8 \
  --voice_role "clone" \
  --translate_type 1 \
  --subtitle_type 1 \
  --voice_autorate \
  --video_autorate \
  --recogn2pass \
  --enable_diariz True \
  --nums_diariz -1

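Important reminders (based on the knowledge base):

When using the clone role, it is strongly recommended to also enable --is_separate (separate vocals from background audio); otherwise cloning quality may be poor.
To make cloning succeed, set --min_voice_ms 3000 (minimum speech duration in milliseconds) and --max_voice_sec 10 (maximum speech duration in seconds) in the advanced options, because most dubbing channels require reference audio between 3 and 10 seconds long.
It is also advisable to enable --whisper_split_audio (Whisper pre-splits the audio) and --merge_short (merge overly short subtitles into adjacent ones) so that subtitle durations fall between 3 and 10 seconds.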
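
The 3 to 10 second window can be made concrete with a small check. This is only a sketch: the constants mirror the --min_voice_ms and --max_voice_sec values above, while `is_usable_reference`, the sample segments, and the inclusive bounds are assumptions made for illustration, not pyvideotrans code.

```python
MIN_VOICE_MS = 3000  # mirrors --min_voice_ms 3000
MAX_VOICE_SEC = 10   # mirrors --max_voice_sec 10


def is_usable_reference(start_ms: int, end_ms: int) -> bool:
    """Hypothetical check: is a segment 3 to 10 seconds long (inclusive)?"""
    duration_ms = end_ms - start_ms
    return MIN_VOICE_MS <= duration_ms <= MAX_VOICE_SEC * 1000


# Made-up subtitle segments as (start_ms, end_ms) pairs.
segments = [(0, 1500), (1500, 6200), (6200, 19000)]
usable = [seg for seg in segments if is_usable_reference(*seg)]
print(usable)  # [(1500, 6200)]
```

Here the 1.5 second segment is too short and the 12.8 second segment is too long to serve as reference audio, which is the situation that merging short subtitles and pre-splitting the audio are meant to reduce.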

Please consult the relevant documentation:
