
An ML-based program that transcribes online video lectures/meetings and adds generated highlights/topics and summaries of the video lecture.

Transcribo

An ML-based program that transcribes online video lectures/meetings and adds highlights/topics generated from the video lecture.

View Demo · Report Bug · Request Feature

Table of Contents
  1. About The Project
  2. Getting Started
  3. Usage
  4. Contributing
  5. License
  6. Contact
  7. Acknowledgements

About The Project

[Product Name Screen Shot]

Transcribo transcribes online video lectures/meetings and adds highlights/topics related to the content of the source. This project can be useful in many contexts.

For example:

  • If one needs keywords for a YouTube video, it can provide subtitles as well as topics for better reach.
  • Students can use the returned document as a reference or as side notes for studying.
  • Reading a plain transcript of the spoken words is tedious, so the program highlights the content to show which parts may interest the reader.

Of course, this project is at its initial stage, but with more data and more model validation it can turn out to be quite useful.

Built With

This project is entirely based on Python. The following packages and models were used:

  • Scikit-learn
  • MoviePy and Pydub
  • SpeechRecognition (Google Web Speech API)
  • Latent Dirichlet Allocation
  • NumPy
  • Gensim
  • TfidfVectorizer
  • NLTK

Getting Started

Prerequisites

I assume you are using either Anaconda or Google Colab to run the Python notebooks.

pip install -r requirements.txt

or download the environment file for the Anaconda prompt:

https://drive.google.com/file/d/1mYBmfRa5E3BshmDhCVgsDhNcqoQOWqqr/view?usp=sharing

Installation

Just set your video title and the files_path variable, and you are good to go!
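For illustration, a minimal configuration might look like the following. Only the variable names come from the description above; the folder, title, and file extension are hypothetical.

```python
# Hypothetical values; only the variable names follow the notebook's description.
video_title = "lecture_01"         # title of the video to process
files_path = "/content/videos/"    # folder containing the video file

# Full path to the source video (extension assumed to be .mp4 for this example).
video_file = files_path + video_title + ".mp4"
```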

System Implementation

The project is built on Flask, which handles all of the data flow smoothly.

The first real task of the program is converting the supplied video file to a .wav audio file for transcription, which we did with a package called moviepy. The resulting audio clip is then passed through Google's Speech Recognition API in a loop to cover the whole recording. The transcription from the API is saved to a local .txt file; we tried appending the whole transcription to a single string, but ran into runtime issues (basically running out of RAM).

Although Google's API is one of the best speech-recognition services available, it does not punctuate the text in its output. We therefore used "punctuator", a package wrapping a pre-trained Theano-based punctuation model, to punctuate our text.

For summarization we went through several options and chose scikit-learn's TfidfVectorizer. The process sounds complicated but is quite simple: first split the text into sentences using the restored punctuation, then tokenize each word with NLTK's word_tokenize. Next, compute the average frequency of words while removing stopwords/common words, then use those frequencies to calculate the importance of the words in each sentence and, from that, which sentences are most valuable. Lastly, the sentences scoring above a threshold are kept as the final summary.

All of the data obtained from this pipeline is rendered as HTML with the help of Flask!
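The extraction and transcription steps can be sketched as follows. This is a minimal illustration, assuming moviepy, pydub, and SpeechRecognition are installed; the file names and 60-second chunk size are chosen for the example, not taken from the notebooks. The chunking helper is pure Python so the loop boundaries can be checked on their own.

```python
def chunk_spans(total_ms, chunk_ms):
    """(start, end) millisecond offsets that cover the audio in fixed-size chunks."""
    return [(s, min(s + chunk_ms, total_ms)) for s in range(0, total_ms, chunk_ms)]

def transcribe_video(video_path, wav_path="lecture.wav", chunk_ms=60_000):
    # Third-party imports are kept inside the sketch so chunk_spans stays standalone.
    import speech_recognition as sr
    from moviepy.editor import VideoFileClip
    from pydub import AudioSegment

    # 1. Convert the video's audio track to a .wav file (moviepy).
    VideoFileClip(video_path).audio.write_audiofile(wav_path)

    # 2. Loop over fixed-size chunks so each Google API request stays small.
    audio = AudioSegment.from_wav(wav_path)
    recognizer = sr.Recognizer()
    lines = []
    for start, end in chunk_spans(len(audio), chunk_ms):
        audio[start:end].export("chunk.wav", format="wav")
        with sr.AudioFile("chunk.wav") as source:
            try:
                lines.append(recognizer.recognize_google(recognizer.record(source)))
            except sr.UnknownValueError:
                pass  # skip silent or unintelligible chunks

    # 3. Save to a local .txt file rather than one huge in-memory string.
    with open("transcript.txt", "w") as f:
        f.write("\n".join(lines))
```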
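The sentence-scoring idea in the summarization step can be illustrated with a simplified, dependency-free sketch. The real pipeline uses TfidfVectorizer and NLTK's word_tokenize; here plain frequency counts, a regex tokenizer, and a tiny stopword list stand in for both.

```python
import re
from collections import Counter

# Tiny illustrative stopword list; the project uses NLTK's full list.
STOPWORDS = {"the", "a", "an", "is", "are", "to", "of", "and", "in", "it", "that"}

def summarize(text, threshold=1.0):
    # Split into sentences using the punctuation restored earlier.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

    # Word frequencies across the whole text, ignoring stopwords.
    words = [w for w in re.findall(r"[a-z']+", text.lower()) if w not in STOPWORDS]
    freq = Counter(words)

    # Score each sentence by the average frequency of its content words.
    def score(sentence):
        toks = [w for w in re.findall(r"[a-z']+", sentence.lower()) if w not in STOPWORDS]
        return sum(freq[w] for w in toks) / len(toks) if toks else 0.0

    # Keep sentences scoring at or above `threshold` times the mean score.
    avg = sum(score(s) for s in sentences) / len(sentences)
    return " ".join(s for s in sentences if score(s) >= threshold * avg)
```

Sentences that reuse the text's dominant vocabulary score above average and survive the cut, while off-topic ones are dropped, which is the same thresholding step the pipeline applies to its TF-IDF scores.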

Applications

This project has a wide range of applications, some of which are listed below:

  1. School/college students can get notes for any class they happened to miss.
  2. School/college students can revise for examinations using notes with highlighted keywords.
  3. Business professionals can get the minutes of a meeting.
  4. Online events and workshops can use it to prepare the reports required afterwards.
  5. Patients consulting a doctor over a video/audio call can review the names of medicines and a brief of the call.
  6. Finance-oriented startups (like Policybazaar) can store call data as transcript files for future evidence and reference.
  7. Hearing-impaired users who cannot follow the audio/video can read the transcription.

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'Add some AmazingFeature')
  4. Push to the Branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

License

Distributed under the MIT License. See LICENSE for more information.

Contact

Team - @just_a_folk - @MeeraliN-

Acknowledgements

  • [GitHub Emoji Cheat Sheet]
