RedditVideoMakerBot

Commit Graph

Author	SHA1	Message	Date
Abdessamad Haddouche	076b65f04c	feat: pro caption system with WhisperX word-level alignment Core changes: - utils/caption_renderer.py: new single-responsibility rendering engine - Three display modes: aligned, single, multi - 8-direction stroke technique for clean text outlines - Transparent PNG overlays (no more solid box) - utils/whisper_aligner.py: WhisperX forced alignment module - Word-level timestamps from any TTS audio - Graceful fallback to single mode if unavailable - utils/imagenarator.py: refactored as thin orchestrator - Delegates to caption_renderer - Saves timing_map.json for final_video sync - utils/sentiment_map.py: added STYLE_MAP with display_mode per sentiment - utils/sentiment.py: stores sentiment in settings for downstream use - TTS/engine_wrapper.py: runs WhisperX after each TTS save - video_creation/final_video.py: reads timing_map, handles absolute + fraction timing - video_creation/screenshot_downloader.py: clean imagemaker call Assets: - fonts/: added Montserrat, Nunito, Oswald, Raleway, Lato, Anton font families Dependencies: - requirements.txt: updated with all current dependencies	2 months ago
Abdessamad Haddouche	af0940045c	feat: sentiment-aware video pipeline with DeepSeek, metadata generation, and per-video folder structure SENTIMENT DETECTION (utils/sentiment.py) - Integrate DeepSeek API using OpenAI-compatible SDK to classify each Reddit post - Detect sentiment from post title + body (first 500 chars) into 8 labels: sad, happy, angry, mysterious, funny, dramatic, wholesome, scary - Override in-memory config per post (background_video, background_audio, voice) - Falls back to 'dramatic' label if DeepSeek API fails or is unavailable - Can be enabled/disabled via config.toml [deepseek] enabled = true/false SENTIMENT MAPS (utils/sentiment_map.py) - BACKGROUND_MAP: maps each sentiment to optimal background video + audio pair - OPENAI_VOICE_MAP: maps each sentiment to best-fit OpenAI TTS voice - ELEVENLABS_VOICE_MAP: maps each sentiment to best-fit ElevenLabs voice (fully mapped to real voices: Adam, George, Harry, Callum, Jessica, Brian, Laura, Matilda) - All overrides are in-memory only — config.toml is never modified METADATA GENERATION (utils/sentiment.py) - Single DeepSeek API call generates both sentiment + social media metadata - Generates per-platform content: * YouTube: title (max 70 chars) + full description * TikTok: caption (max 150 chars) with hashtags * Instagram: caption with hashtags * Facebook: caption * Hashtags: list of relevant tags - Falls back to basic title-based metadata if DeepSeek fails - Saves metadata.json inside each video's output folder RESULTS FOLDER RESTRUCTURE (video_creation/final_video.py) - Changed output structure from results/{subreddit}/{filename}.mp4 - New structure: results/{actual_subreddit}/{thread_id}_{sentiment}/video.mp4 - Each video now has its own isolated folder containing: * video.mp4 * metadata.json * thumbnail.png (if thumbnail generation is enabled) * OnlyTTS/video.mp4 (if enable_extra_audio is enabled) SUBREDDIT TRACKING (reddit/subreddit.py) - Added thread_subreddit field to reddit_object using submission.subreddit.display_name - Posts from r/AmItheAsshole now save to results/AmItheAsshole/ - Posts from r/tifu now save to results/tifu/ - Posts from r/confession now save to results/confession/ - Previously all posts were grouped under the combined subreddit string PIPELINE INTEGRATION (main.py) - Added apply_sentiment_config() call between post fetching and video generation - Sentiment detection runs before TTS and background selection - Controlled by settings.config['deepseek']['enabled'] flag CONFIG CHANGES (config.toml + utils/.config.template.toml) - Added [deepseek] section with api_key and enabled fields - elevenlabs_voice_name changed from optional=false to optional=true - Prevents prompt appearing when ElevenLabs is not the selected TTS provider	2 months ago
Abdessamad Haddouche	7c679b8136	fix: update ElevenLabs integration and config - Fix ElevenLabs voice lookup by name using voice_id instead of name string - Update model from eleven_multilingual_v1 to eleven_multilingual_v2 (free tier) - Remove hardcoded voice options restriction in config template - Update default voice to Sarah - Enable ffmpeg verbose output for better error debugging	2 months ago
cyteon	d531c34b53	blocked words	4 months ago
Jason Cameron	902ff00cb0	chore: release 3.4.0 (#2426 ) Co-authored-by: Jason Cameron <git@jasoncameron.dev> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: github-actions <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: RequieMa <57754488+RequieMa@users.noreply.github.com> Co-authored-by: prafful <praffulsharma1230@gmail.com> Co-authored-by: M. Sanaullah <thebolly@gmail.com> Co-authored-by: embee <emilien.bev.com@gmail.com> Co-authored-by: Rodrigo <55567123+rodrigodasilv@users.noreply.github.com> Co-authored-by: Emilien Bevierre <44171454+emilienbev@users.noreply.github.com> Co-authored-by: tkhmielnitzky <tkhmielnitzky@gmail.com> Co-authored-by: bnfone <89687390+bnfone@users.noreply.github.com> Co-authored-by: Cyteon <129582290+Cyteon@users.noreply.github.com>	9 months ago
github-actions	53db79ab29	fixup: Format Python code with Black	2 years ago
Jason	9e60d83580	Fmt using black & isort	2 years ago
Jason	a4f0022a5a	Fix: AttributeError: 'FreeTypeFont' object has no attribute 'getsize'	2 years ago
cyteon	0522d195da	some fixes	2 years ago
Kristian	903081fca3	Revert "meme support" This reverts commit `b508f2af73`.	2 years ago
Kristian	d42ff50a35	meme support	2 years ago
github-actions	bbf5a8265d	fixup: Format Python code with Black	2 years ago
Jason	35fac14447	fix: fixed the GUI chore: reformatted and optimized imports. Co-authored-by: Jan Tumpa <jtumpa@gmail.com>	2 years ago
Jo	1b146ab1f1	fix: Fixes #1812 , random_voice True/False acceptance	2 years ago
Jason Cameron	8859e01905	Fix random voice prompt closes #1999	2 years ago
github-actions	57e7b55fa7	fixup: Format Python code with Black	3 years ago
Simon	3ad42ba126	Refactor FFmpeg download and fix watermark bug	3 years ago
github-actions	7dd8b2a3e8	fixup: Format Python code with Black	3 years ago
Simon	a8046a8290	Version 3.2	3 years ago
Simon	280b125505	Fix PR	3 years ago
Syed Aman Raza	0a1ba17d0e	Merge branch 'develop' into dev	3 years ago
Simon	cab5359e08	Reformat	3 years ago
Simon	8e58cd67ec	Little fix	3 years ago
Simon	1dec673841	Update all the requirements, reformat a bit and fix the story mode character limit	3 years ago
Simon	59cbf94207	Merge pull request #1663 from liamb13/reddit-redesign Reddit redesign	3 years ago
Simon	53ab45bba9	Merge branch 'develop' into elevenlabs	3 years ago
Simon	f7bc316bfc	Merge pull request #1578 from liamb13/zoom Adds zoom function	3 years ago
electro199	65cc5a4074	added validater , auto spacy model dowloader	3 years ago
Simon	19b44b1302	Add random_voice	3 years ago
electro199	2ada32a84f	deleted unused	3 years ago
electro199	7ce04d7d7f	removed the transition	3 years ago
electro199	8e19b9fb9a	Merge branch 'develop' of https://github.com/elebumm/RedditVideoMakerBot into dev	3 years ago
electro199	10d002f4e0	removed opacity	3 years ago
Xpl0itU	47f762eb0d	Fix adding ffmpeg to path This fixes adding ffmpeg to the path by adding it to the user path instead of the system path, which doesn't require admin privileges to do	3 years ago
electro199	4a83cd0f6f	better err handle and run.bat	3 years ago
electro199	7486e04b26	Merge branch 'develop' of https://github.com/elebumm/RedditVideoMakerBot into dev	3 years ago
liamb13	c935d865ca	Create playwright.py util file	3 years ago
liamb13	9a90363f56	Update utils/.config.template.toml Co-authored-by: Simon <65854503+OpenSourceSimon@users.noreply.github.com>	3 years ago
liamb13	facae5efd5	Update utils/.config.template.toml Co-authored-by: Simon <65854503+OpenSourceSimon@users.noreply.github.com>	3 years ago
liamb	d19dfac8a3	adds elevenlabs	3 years ago
Lucas	7613ac59e7	added only no copyright songs	3 years ago
Lucas	778d9c0c37	changing allow_only_tts to enable_extra_audio	3 years ago
Lucas	e488ef6e0c	better explanation on toml and new default value	3 years ago
Lucas de Almeida	472abed599	Merge branch 'develop' into master	3 years ago
Lucas	d9ff36b034	Merge with develop branch and no pytube required	3 years ago
Lucas	1f48c53a74	new toml config and added background audio feature	3 years ago
Lucas	1abfc4b321	2 samples with backgroundMusic and without it	3 years ago
Lucas	5a103e76cd	rendering video and audio	3 years ago
Lucas	a21c17ef55	composing the video but with no audio	3 years ago
Lucas	ba5e8b8987	adaptations in filePath and toml for audio option	3 years ago

1 2 3 4 5

240 Commits (076b65f04c39efe3d2a6b90e6bed620b86381341)