Moozonian
Web Images Developer News Books Maps Shopping Moo-AI Generate Art
Showing results for Video combiner
GitHub Repo https://github.com/mbzuai-oryx/Video-ChatGPT

mbzuai-oryx/Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.
GitHub Repo https://github.com/BB31420/AI-Auto-Video-Generator

BB31420/AI-Auto-Video-Generator

An AI-powered storytelling video generator that takes user input as a story prompt, generates a story using OpenAI's GPT-3, creates images using OpenAI's DALL-E, adds voiceover using ElevenLabs API, and combines the elements into a video.
GitHub Repo https://github.com/nnx0r/vidage

nnx0r/vidage

Your solution to full-screen background video & image combined.
GitHub Repo https://github.com/amosyuen/FFmpegVideoRecorder

amosyuen/FFmpegVideoRecorder

Customizable Android video recorder library that can combine multiple videos
GitHub Repo https://github.com/KranX/Vangers

KranX/Vangers

The video game that combines elements of the racing and role-playing genres.
GitHub Repo https://github.com/jeremy-friesen/flutter-video-chat

jeremy-friesen/flutter-video-chat

flutter-video-chat is a simple front-end combining text-messaging (using Google Firebase) and video chat (using Agora.io for Flutter).
GitHub Repo https://github.com/davide-coccomini/Combining-EfficientNet-and-Vision-Transformers-for-Video-Deepfake-Detection

davide-coccomini/Combining-EfficientNet-and-Vision-Transformers-for-Video-Deepfake-Detection

Code for Video Deepfake Detection model from "Combining EfficientNet and Vision Transformers for Video Deepfake Detection" presented at ICIAP 2021.
GitHub Repo https://github.com/wjy5446/Real-time-video-stitching

wjy5446/Real-time-video-stitching

:telescope: This is a framework that combines multiple frames acquired from moving cameras
GitHub Repo https://github.com/win4r/VideoFinder-Llama3.2-vision-Ollama

win4r/VideoFinder-Llama3.2-vision-Ollama

VideoFinder is an advanced video analysis tool powered by multimodal AI, designed to help users easily locate and identify specific objects or people within video content. By combining the capabilities of Llama Vision model with a streamlined web interface, it enables real-time, frame-by-frame video analysis with natural language descriptions.
GitHub Repo https://github.com/HA6Bots/TikTok-Compilation-Video-Generator

HA6Bots/TikTok-Compilation-Video-Generator

A system of bots that collects clips automatically via custom made filters, lets you easily browse these clips, and puts them together into a compilation video ready to be uploaded straight to any social media platform. Full VPS support is provided, along with an accounts system so multiple users can use the bot at once. This bot is split up into three separate programs. The server. The client. The video generator. These programs perform different functions that when combined creates a very powerful system for auto generating compilation videos.