Iām currently making YouTube Shorts by manually scrolling in incognito until I find a video idea, then taking screenshots of each scene, using Google image search to find the original source video, downloading the original, transcribing it, and bringing everything into ChatGPT to rewrite the script with the same hook/tone/pacing before generating a new voiceover in ElevenLabs. This process is slow and repetitive, and I want a single tool/app that can automate most of it end-to-end (scene detection + first-frame extraction, source matching, transcript + script rewrite, and voiceover generation), ideally with a simple workflow where I upload a link or video and get a ready-to-use rewritten script and VO assets back.
No solutions yet. Be the first to share yours!
Developer role required to post solutions.
No comments yet
Be the first to share your thoughts!