[cuda:0 float16 storage viewed as 1x1500x1280], [[50258, 50266, 50359]]; kwargs: length_penalty=1, repetition_penalty=1, no_repeat_ngram_size=0, max_length=448, return_scores=True, return_no_speech_prob=True, suppress_blank=True, suppress_tokens=(1, 2, 7, 8, 9, 10, 14, 25, 26, 27, 28, 29, 31, 58, 59, 60, 61, 62, 63, 90, 91, 92, 93, 359, 503, 522, 542, 873, 893, 902, 918, 922, 931, 1350, 1853, 1982, 2460, 2627, 3246, 3253, 3268, 3536, 3846, 3961, 4183, 4667, 6585, 6647, 7273, 9061, 9383, 10428, 10929, 11938, 12033, 12331, 12562, 13793, 14157, 14635, 15265, 15618, 16553, 16604, 18362, 18956, 20075, 21675, 22520, 26130, 26161, 26435, 28279, 29464, 31650, 32302, 32470, 36865, 42863, 47425, 49870, 50254, 50258, 50358, 50359, 50360, 50361), max_initial_timestamp_index=50, beam_size='5', patience=1:FasterAvg

Error during the speech recognition stage: Traceback (most recent call last):

File "videotrans\recognition\_base.py", line 64, in run

File "videotrans\recognition\_average.py", line 107, in _exec

RuntimeError: generate(): incompatible function arguments. The following argument types are supported:

1. (self: ctranslate2._ext.Whisper, features: ctranslate2._ext.StorageView, prompts: Union[List[List[str]], List[List[int]]], *, asynchronous: bool = False, beam_size: int = 5, patience: float = 1, num_hypotheses: int = 1, length_penalty: float = 1, repetition_penalty: float = 1, no_repeat_ngram_size: int = 0, max_length: int = 448, return_scores: bool = False, return_logits_vocab: bool = False, return_no_speech_prob: bool = False, max_initial_timestamp_index: int = 50, suppress_blank: bool = True, suppress_tokens: Optional[List[int]] = [-1], sampling_topk: int = 1, sampling_temperature: float = 1) -> Union[List[ctranslate2._ext.WhisperGenerationResult], List[ctranslate2._ext.WhisperGenerationResultAsync]]

Invoked with: , 0.132202 -0.92041 0.912598 ... -0.10022 -0.580566 -0.477783

[cuda:0 float16 storage viewed as 1x1500x1280], [[50258, 50266, 50359]]; kwargs: length_penalty=1, repetition_penalty=1, no_repeat_ngram_size=0, max_length=448, return_scores=True, return_no_speech_prob=True, suppress_blank=True, suppress_tokens=(1, 2, 7, 8, 9, 10, 14, 25, 26, 27, 28, 29, 31, 58, 59, 60, 61, 62, 63, 90, 91, 92, 93, 359, 503, 522, 542, 873, 893, 902, 918, 922, 931, 1350, 1853, 1982, 2460, 2627, 3246, 3253, 3268, 3536, 3846, 3961, 4183, 4667, 6585, 6647, 7273, 9061, 9383, 10428, 10929, 11938, 12033, 12331, 12562, 13793, 14157, 14635, 15265, 15618, 16553, 16604, 18362, 18956, 20075, 21675, 22520, 26130, 26161, 26435, 28279, 29464, 31650, 32302, 32470, 36865, 42863, 47425, 49870, 50254, 50258, 50358, 50359, 50360, 50361), max_initial_timestamp_index=50, beam_size='5', patience=1

The above exception was the direct cause of the following exception:

Traceback (most recent call last):

File "videotrans\task\job.py", line 77, in run

File "videotrans\task\_speech2text.py", line 93, in recogn

File "videotrans\recognition\__init__.py", line 237, in run

File "videotrans\recognition\_base.py", line 67, in run

videotrans.configure._except.SpeechToTextError: generate(): incompatible function arguments. The following argument types are supported:

Invoked with: , 0.132202 -0.92041 0.912598 ... -0.10022 -0.580566 -0.477783

=====

Windows-10-10.0.26100-SP0

version:v3.78

frozen:True

language:zh
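
Reading the log above, one likely cause is that `beam_size` arrives as the string `'5'`, while the supported signature lists `beam_size: int = 5`. Bound functions of this kind reject a `str` where an `int` is expected instead of converting it. Below is a minimal sketch of a workaround, assuming the options are collected in a dict before being passed to `generate()`. The helper name `normalize_generate_kwargs` is hypothetical and is not part of videotrans or ctranslate2; the key names are taken from the signature printed in the traceback.

```python
# Hypothetical helper: cast numeric options to the types listed in the
# generate() signature shown in the traceback. Not part of any library.
INT_KEYS = (
    "beam_size", "num_hypotheses", "no_repeat_ngram_size",
    "max_length", "max_initial_timestamp_index", "sampling_topk",
)
FLOAT_KEYS = (
    "patience", "length_penalty", "repetition_penalty", "sampling_temperature",
)


def normalize_generate_kwargs(kwargs):
    """Return a copy of kwargs with numeric options cast to int or float."""
    fixed = dict(kwargs)
    for key in INT_KEYS:
        if key in fixed:
            fixed[key] = int(fixed[key])    # e.g. '5' -> 5
    for key in FLOAT_KEYS:
        if key in fixed:
            fixed[key] = float(fixed[key])  # e.g. 1 -> 1.0
    return fixed


opts = normalize_generate_kwargs({"beam_size": "5", "patience": 1})
print(opts)  # {'beam_size': 5, 'patience': 1.0}
```

If the value comes from a settings file or a UI text field, checking that the stored `beam_size` is a plain number (without quotes) may avoid the error without any code change.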
