Automated Shorts Generator ▶️

Tested And Build on Legion 5 15ACH6

Nvidia RTX 3050 4GB VRAM GPU
16GB RAM
AMD Ryzen 7 5800H CPU
Windows 11

This is still under development and take time to build it properly.

🔴 You can watch demo here: Automated Short Generator | Development Version

Until then if you want to test it, then follow this steps:

conda create -n automated-short-generator python==3.11
conda activate automated-short-generator
pip install torch==2.5.0 torchvision==0.20.0 torchaudio==2.5.0 --index-url https://download.pytorch.org/whl/cu118
pip install -e .
pip install -r requirements.txt
python main.py

This will open PyQt6 UI where we can do our work.

From here generate your google gemini api key: Google AI Studio

Flow 🔄️

Used gemini model to create the short text of what we want to talk.
Then used F5-TTS which will take those text and user has to just provide one 15 second reference audio (either english or chinese) then F5-TTS will convert the text into the human like audio based on the reference audio provided.
And you will be able to download the audio
Added option to merge this audio with the video and even provide hard subtitles for the video
Using whisper large-v2 model for generating subtitles
Using FFmpeg I am merging the generated audio and the subtitles.srt file created by whisper in one single video.

To-Do List 🎯

Prompt Improving, to generate response text only for the short form video content.
UI Improvement
Provide both option
- As go everything in one flow
- Choose any one tool to use
Proper Subtitles management with good font and proper size
Proper length synchronization between audio and the input video

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
ckpts		ckpts
data		data
src		src
tests		tests
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
README.md		README.md
config.py		config.py
main.py		main.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
ruff.toml		ruff.toml
sample_audio.mp3		sample_audio.mp3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Automated Shorts Generator ▶️

Tested And Build on Legion 5 15ACH6

Flow 🔄️

To-Do List 🎯

About

Releases

Packages

Languages

RushabhShahPrograms/automated-short-generator

Folders and files

Latest commit

History

Repository files navigation

Automated Shorts Generator ▶️

Tested And Build on Legion 5 15ACH6

Flow 🔄️

To-Do List 🎯

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages