Create Audio Instructions Based on Video?

Sep 22, 2023


Can Gen-AI create voice instructions on how to do something?

It’s easier to record a video of a task (changing a flat tire, juggling, demonstrating a yoga pose) without also narrating it.

MediaPipe can detect the movement someone is carrying out. Can another model use that pose-estimation output to generate audio instructions for the movement?
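One way to picture the hand-off between the two models is to turn landmark positions into joint angles, and then turn changes in those angles into short sentences that a text-to-speech step could read aloud. The snippet below is only a rough sketch of that idea, not the method used in the linked notebook. It assumes 2D `(x, y)` landmark coordinates like the ones a pose estimator reports; the landmark names, the `describe_change` helper, and the 15-degree threshold are all made up for illustration.

```python
import math

def joint_angle(a, b, c):
    """Angle in degrees at point b, formed by the segments b->a and b->c."""
    v1 = (a[0] - b[0], a[1] - b[1])
    v2 = (c[0] - b[0], c[1] - b[1])
    dot = v1[0] * v2[0] + v1[1] * v2[1]
    norm = math.hypot(*v1) * math.hypot(*v2)
    if norm == 0:
        return 0.0  # degenerate case: two landmarks coincide
    cos = max(-1.0, min(1.0, dot / norm))  # clamp against rounding error
    return math.degrees(math.acos(cos))

def describe_change(joint, before, after, threshold=15.0):
    """Hypothetical helper: turn an angle change into a spoken-style sentence."""
    delta = after - before
    if abs(delta) < threshold:
        return f"Keep your {joint} steady."
    verb = "Straighten" if delta > 0 else "Bend"
    return f"{verb} your {joint} by about {abs(delta):.0f} degrees."

# Hypothetical landmarks for two video frames (illustrative values only).
frame_a = {"shoulder": (0.0, 0.0), "elbow": (1.0, 0.0), "wrist": (2.0, 0.0)}  # arm straight
frame_b = {"shoulder": (0.0, 0.0), "elbow": (1.0, 0.0), "wrist": (1.0, 1.0)}  # elbow bent

angle_a = joint_angle(frame_a["shoulder"], frame_a["elbow"], frame_a["wrist"])
angle_b = joint_angle(frame_b["shoulder"], frame_b["elbow"], frame_b["wrist"])

instruction = describe_change("elbow", angle_a, angle_b)
print(instruction)
```

A rule-based helper like this is the simplest stand-in for "another model". In practice, the angle changes could be passed as a text prompt to a language model, and its output sent to a text-to-speech engine.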

Link to videos with movement detection:

Audition 6: MediaPipe pose estimation, movement detection, create audio instruction, Parsva Kakasana yoga (youtube.com)

Original video: https://www.youtube.com/shorts/DnDH4OshzB4

Another example with a juggling video:

Juggling Movement Detection: MediaPipe, juggling, pose estimation, Python, artificial intelligence (youtube.com)

Original video: https://www.youtube.com/shorts/czM2Tib4QAU

Code: https://github.com/Anudha/Yoga/blob/master/MediaPipe_ForYogaPose_GenAI_Public.ipynb


Pose Estimation

Written by Anudha Mittal
