W A Flask-based web application that lets users upload a video file, automatically extracts the audio, transcribes speech into text using Facebook’s Wav2Vec2 ASR model, and summarizes it with the BART text summarization model. The app displays both the complete transcript and a concise summary in the browser, providing an end-to-end video-to-text processing solution.
A simple Flask application that allows users to upload a video file and:
moviepy
facebook/wav2vec2-base-960h
(ASR model)facebook/bart-large-cnn
(text summarization model)Both the full transcript and the summary are displayed in the browser after processing.
.mp4
, .mov
, .avi
, etc.)librosa
git clone https://github.com/sujit-codezen/video_summary.git cd video_summary
python -m venv venv source venv/bin/activate # On Windows: venv\Scripts\activate pip install -r requirements.txt
pip install -r requirements.txt
python app.py
Task | Model | Source |
---|---|---|
Transcription | facebook/wav2vec2-base-960h |
Hugging Face 🤗 |
Summarization | facebook/bart-large-cnn |
Hugging Face 🤗 |