Create Audio Instructions Based on Video?

Anudha Mittal
Sep 22, 2023

--

Can Gen-AI create voice instructions on how to do something?

It’s easier to record a video of a task: change a flat tire, juggle, demo yoga — without also speaking.

Mediapipe can detect the movement someone is carrying out. Can another model use this inference to generate audio instructions on movement?

Link to videos with movement detection:

Original video: https://www.youtube.com/shorts/DnDH4OshzB4

Another example with a juggling video:

Original video: https://www.youtube.com/shorts/czM2Tib4QAU

Code: https://github.com/Anudha/Yoga/blob/master/MediaPipe_ForYogaPose_GenAI_Public.ipynb

Sign up to discover human stories that deepen your understanding of the world.

Free

Distraction-free reading. No ads.

Organize your knowledge with lists and highlights.

Tell your story. Find your audience.

Membership

Read member-only stories

Support writers you read most

Earn money for your writing

Listen to audio narrations

Read offline with the Medium app

--

--

No responses yet

Write a response