cns00
15th February 2025, 19:27
I have some questions about whisper ai.
I am following https://www.youtube.com/watch?v=ABFqbY_rmEk to install whisper ai. I have Windows 11 on my laptop.
I did the 5 steps in the video. Assume that the audio file is test.wav. I said:
whisper test.wav
It is downloading large-v3-turbo.pt since this is the first time that I ran the command. It is 1.5gb. The problem is that my internet is horrible. The download is failing. It reached 18% and then the download fails and I get an error message that says:
RuntimeError: Model has been downloaded but the SHA256 checksum does not not match. Please retry loading the model.
I tried it again. It reached 34% and failed. I tried it again. It reached 55% and failed. I even tested it by using a VPN and still no luck. In the end the download got stuck on say 34% and it's not proceeding. The download got stuck 3 times.
I saw https://github.com/openai/whisper/discussions/1027. They said that the issue was fixed if you deleted:
"C:\Users\.cache\torch\whisperx-vad-segmentation.bin"
"C:\Users\USER-NAME.cache\torch\whisperx-vad-segmentation.bak"
I don't have those files. What do I do? How do I download large-v3-turbo.pt?
I have subtitle files for the non English parts of my movies in which lines show before\after a person starts speaking and they go away before\after a person stops speaking and the delays are noticeable.
I want to make the lines show exactly when a person starts speaking and they go away exactly when a person stops speaking. Can whisper do that?
Assume that I have an English movie that has French dialogue and I have a subtitle file for the French part. ChatGPT said that I can use whisper to generate a subtitle file for the movie and then the timing in that subtitle file will be used to fix the timing of my subtitle file and the command is:
python sync_subtitles.py my_movie.mp4 my_subtitles.srt
https://drive.google.com/file/d/1ShP5MmmOrr1CCuHPkPHhZgRKOVBzWyMt/view?usp=sharing is sync_subtitles.py that ChatGPT made.
After that it said this:
https://imgur.com/a/nCcZBm6
I said yes and it made for me a modified file which is modified_sync_subtitles.py and it is https://drive.google.com/file/d/1bvLnCeG9WUfXOydFCQWiUamj_-aS7sA_/view?usp=sharing. The modified script does this:
https://imgur.com/a/k0ql3OM
I want to ask is the coding in both scripts correct? Which script should I use?
I am following https://www.youtube.com/watch?v=ABFqbY_rmEk to install whisper ai. I have Windows 11 on my laptop.
I did the 5 steps in the video. Assume that the audio file is test.wav. I said:
whisper test.wav
It is downloading large-v3-turbo.pt since this is the first time that I ran the command. It is 1.5gb. The problem is that my internet is horrible. The download is failing. It reached 18% and then the download fails and I get an error message that says:
RuntimeError: Model has been downloaded but the SHA256 checksum does not not match. Please retry loading the model.
I tried it again. It reached 34% and failed. I tried it again. It reached 55% and failed. I even tested it by using a VPN and still no luck. In the end the download got stuck on say 34% and it's not proceeding. The download got stuck 3 times.
I saw https://github.com/openai/whisper/discussions/1027. They said that the issue was fixed if you deleted:
"C:\Users\.cache\torch\whisperx-vad-segmentation.bin"
"C:\Users\USER-NAME.cache\torch\whisperx-vad-segmentation.bak"
I don't have those files. What do I do? How do I download large-v3-turbo.pt?
I have subtitle files for the non English parts of my movies in which lines show before\after a person starts speaking and they go away before\after a person stops speaking and the delays are noticeable.
I want to make the lines show exactly when a person starts speaking and they go away exactly when a person stops speaking. Can whisper do that?
Assume that I have an English movie that has French dialogue and I have a subtitle file for the French part. ChatGPT said that I can use whisper to generate a subtitle file for the movie and then the timing in that subtitle file will be used to fix the timing of my subtitle file and the command is:
python sync_subtitles.py my_movie.mp4 my_subtitles.srt
https://drive.google.com/file/d/1ShP5MmmOrr1CCuHPkPHhZgRKOVBzWyMt/view?usp=sharing is sync_subtitles.py that ChatGPT made.
After that it said this:
https://imgur.com/a/nCcZBm6
I said yes and it made for me a modified file which is modified_sync_subtitles.py and it is https://drive.google.com/file/d/1bvLnCeG9WUfXOydFCQWiUamj_-aS7sA_/view?usp=sharing. The modified script does this:
https://imgur.com/a/k0ql3OM
I want to ask is the coding in both scripts correct? Which script should I use?