We're looking at creating about 400 3-5 minute videos this year. Our current process is to record ourselves talking over a PowerPoint for the lesson videos.
Instead, I'm thinking of using WellSaidLabs text-to-voice or Descript Overdub with a trained voice that can speak the scripts to an audio file. Then drop the correlating audio file onto each slide and export it to video. Done.
Modifying the lesson in the future will be as easy as editing the PowerPoint or dropping in a new audio file.
There's some resistance to using an AI voice. Do any of you have experience with AI text-to-speech that sounds realistic? What about using Descript Overdub? Videos I've seen where people have trained it with their voice sound impressive.
I want to focus on writing good content and automating the production part as much as possible. Do you have any tips or ideas to help make this happen?