Name: ml5.js: Pose Regression with PoseNet and ml5.neuralNetwork()
Uploaded: 2020-04-06T08:20:33.000Z
Duration: 18 min 25 s

Hello and welcome to another beginner's guide

to machine learning with ml5.js video on pose estimation.

So this is the third, the last one that I'll do in this series

First I looked at just what PoseNet is and how it works

and how you can get the key points of a human skeleton.

Then I took the output of the PoseNet model, all

those key points, and fed them into another neural network

to do pose classification, to recognize different poses

I will do exactly what I did in the previous video

So the final output instead of being a classifier,

am I making a Y, M, C, or A pose, I will make a regression.

So to review, the setup I have is as follows.

It sends that image into the pre-trained PoseNet

That model performs pose estimation and gives as its

Wrist, elbow, shoulder, shoulder, elbow, wrist,

And then I take all of those and feed them

into another neural network, an ml5 neural network, which

then classifies those key points as Y, M, C, or A.

So that's the process that I've built in the first two videos.
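The step of feeding the key points into the second network can be sketched roughly as follows. This is a minimal sketch, not code from the video: `flattenPose` and `fakePose` are hypothetical names, and it assumes each PoseNet key point is shaped like `{ position: { x, y } }` and that a pose has 17 key points.

```javascript
// Hypothetical helper: flatten one pose into [x0, y0, x1, y1, ...].
// Assumes each keypoint looks like { position: { x, y } }.
function flattenPose(pose) {
  const inputs = [];
  for (const keypoint of pose.keypoints) {
    inputs.push(keypoint.position.x);
    inputs.push(keypoint.position.y);
  }
  return inputs;
}

// Stand-in pose with 17 keypoints so the sketch runs without a webcam.
const fakePose = {
  keypoints: Array.from({ length: 17 }, (_, i) => ({
    position: { x: i * 10, y: i * 5 },
  })),
};

const flatInputs = flattenPose(fakePose);
console.log(flatInputs.length); // 34 (17 keypoints x 2 coordinates)
```

The flat array is what the second network would receive as its input, whatever the output task ends up being.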

I want the final output to no longer be categorical.

So you could think of it as the final output

And that slider is going to have some sort of range.

So what I did previously in other examples of regression

I used a neural network to output a frequency value

And I could actually have something that output like

Make a gesture- or pose-based musical instrument.

And this comes from a project that I referenced inspired

by a viewer, Darshawn, who made a project that does an output.

Because specifically what I want to demonstrate

here is that the regression output doesn't

In this case, I want to have three values.

And I'm going to think of those values as an R for red,

So I can say things like, and the training can be,

And then this pose is this other particular color.

between those colors by trying to guess the value according

Now I'm ready to start implementing this in code.

So I'm not going to write everything again.

I'm going to start from the pose classifier.

is adjust the configuration of the neural network.

The difference is that instead of four categorical outputs, Y, M, C,

or A, I just need three continuous outputs.

So I could actually just change this number to three.
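A rough sketch of what that configuration change might look like. The exact option values are assumptions, not taken from the video: `inputs: 34` assumes 17 key points with an x and y each, `debug: true` is optional, and the `task` is assumed to be switched to `'regression'` alongside the output count. The `ml5` guard is only there so the snippet also runs outside a browser.

```javascript
// Before (pose classifier): outputs: 4, task: 'classification'
// After (pose regression): three continuous outputs.
const options = {
  inputs: 34,         // assumed: 17 keypoints x 2 coordinates
  outputs: 3,         // one each for red, green, blue
  task: 'regression', // assumed: continuous values instead of labels
  debug: true,        // assumed: show the training visualization
};

// Only create the network when the ml5 library is actually loaded.
let brain = null;
if (typeof ml5 !== 'undefined') {
  brain = ml5.neuralNetwork(options);
}
```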

Because it's still a number of outputs but the task

is think about during the training process,

how am I going to create these target values?

So maybe this color scenario isn't the best one.

the best way would be for me to make these literal sliders.

And then when I actually deploy the model,

the model will control the sliders themselves

I don't have a target label, there's no categorical output.

So let's comment this out and say, three sliders for red,

They're all going to have a range between 0 and 255
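One way the slider idea could be wired up, as a hedged sketch rather than the code from the video. In a p5.js sketch the sliders would come from `createSlider(0, 255, 0)`; here `makeStubSlider` is a hypothetical stand-in that mimics a slider's `value()` getter/setter so the example runs on its own. `readTarget` and `applyPrediction` are also hypothetical names, and the shape of `results` (an array of objects with a `value` field) is an assumption about the regression output.

```javascript
// Training time: read the three slider positions as the target values.
function readTarget(rSlider, gSlider, bSlider) {
  return [rSlider.value(), gSlider.value(), bSlider.value()];
}

// Deployment time: let the model's outputs move the sliders.
// Assumes results is [{ value: number }, ...], one entry per output.
function applyPrediction(results, sliders) {
  results.forEach((result, i) => {
    // Keep predictions inside the slider range of 0 to 255.
    const clamped = Math.min(255, Math.max(0, result.value));
    sliders[i].value(clamped);
  });
}

// Hypothetical stub: value() returns the current value, value(v) sets it.
function makeStubSlider(initial) {
  let current = initial;
  return {
    value(v) {
      if (v !== undefined) current = v;
      return current;
    },
  };
}

const sliders = [makeStubSlider(255), makeStubSlider(0), makeStubSlider(0)];
const target = readTarget(...sliders);
console.log(target); // [255, 0, 0]

applyPrediction([{ value: 300 }, { value: 128 }, { value: -5 }], sliders);
console.log(readTarget(...sliders)); // [255, 128, 0]
```

The clamp is a design choice for the sketch: a regression output is not guaranteed to stay inside the range it was trained on.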