Showing results for Video combiner
GitHub Repo
https://github.com/mbzuai-oryx/Video-ChatGPT
mbzuai-oryx/Video-ChatGPT
[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
GitHub Repo
https://github.com/BB31420/AI-Auto-Video-Generator
BB31420/AI-Auto-Video-Generator
An AI-powered storytelling video generator that takes user input as a story prompt, generates a story using OpenAI's GPT-3, creates images using OpenAI's DALL-E, adds voiceover using ElevenLabs API, and combines the elements into a video.
GitHub Repo
https://github.com/nnx0r/vidage
nnx0r/vidage
Your solution to full-screen background video & image combined.
GitHub Repo
https://github.com/amosyuen/FFmpegVideoRecorder
amosyuen/FFmpegVideoRecorder
Customizable Android video recorder library that can combine multiple videos
GitHub Repo
https://github.com/KranX/Vangers
KranX/Vangers
The video game that combines elements of the racing and role-playing genres.
GitHub Repo
https://github.com/jeremy-friesen/flutter-video-chat
jeremy-friesen/flutter-video-chat
flutter-video-chat is a simple front-end combining text-messaging (using Google Firebase) and video chat (using Agora.io for Flutter).
GitHub Repo
https://github.com/davide-coccomini/Combining-EfficientNet-and-Vision-Transformers-for-Video-Deepfake-Detection
davide-coccomini/Combining-EfficientNet-and-Vision-Transformers-for-Video-Deepfake-Detection
Code for Video Deepfake Detection model from "Combining EfficientNet and Vision Transformers for Video Deepfake Detection" presented at ICIAP 2021.
GitHub Repo
https://github.com/wjy5446/Real-time-video-stitching
wjy5446/Real-time-video-stitching
:telescope: This is a framework that combines multiple frames acquired from moving cameras
GitHub Repo
https://github.com/win4r/VideoFinder-Llama3.2-vision-Ollama
win4r/VideoFinder-Llama3.2-vision-Ollama
VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objects or people within video content. By combining the capabilities of Llama Vision model with a streamlined web interface, it enables real-time, frame-by-frame video analysis with natural language descriptions.
GitHub Repo
https://github.com/HA6Bots/TikTok-Compilation-Video-Generator