OpenAI API and Whisper based Video Translation
AI-powered tool to turn long videos into short, viral-ready clips. Combines transcription, speaker diarization, scene detection & 9:16 resizing — perfect for creators & smart automation.
Scribe is a Python script that transcribes audio and video files using OpenAI Whisper and exports the transcriptions as PDF documents, enhanced by the gpt-3.5-turbo model.
#大语言模型#An end-to-end project: YouTube Video to Notes Transcription application using Google Gemini.
AI-powered tool to turn long videos into short, viral-ready clips. Combines transcription, speaker diarization, scene detection & 9:16 resizing — perfect for creators & smart automation.
Python package to scrape webpages and transcribe video content from a video sharing platform.
Convert YouTube playlists & videos to clean Markdown. Features multiple transcript sources (official, Whisper, yt-dlp), batch processing, smart formatting, and progress tracking. Perfect for research ...
#计算机科学#AI-powered transcription for audio & video with Whisper — self-hosted, fast, and open-source.
A subtitle generator for videos up to 10GB, automatically transcribing and translating spoken content into Brazilian Portuguese. Ideal for multilingual content, this tool creates accurate `.srt` files...
Generate High Quality Midjourney Prompts and Transcripts from Videos Using Python and Cohere.ai
Whisper ASR Transcription Project
A Multimodal Attention-Based Deep Learning Framework For Real-Time Activity Recognition At The Edge
Offline audio/video transcriber using Whisper, saving to .txt or .srt. Ensures privacy, no external servers used.
Automatically transcribe video recordings into plain text.
📼 A streamlit web interface designed to extract words from video/audio files into text • Python, FFmpeg, Whisper, YT-DLP
#自然语言处理#Self-hosted AI-powered transcription platform with speaker diarization, search, and collaboration features. Built with Svelte, FastAPI, and Docker for easy deployment.
YouTube Video Summarizer: a flask based user interface which will make a request to a backend REST API where it will perform NLP and respond with a summarized version of a YouTube transcript.
Speech-To-Text (STT) project