Ronny Votel and Na Li of Google Research announced on the TensorFlow blog the release of a new pose detection model called MoveNet. The model is compatible with TensorFlow.js, the JavaScript port of TensorFlow. It analyzes an image and returns accurate data on most of the poses a person can make. This information can be used in medical or sports settings, or simply to improve a person's everyday comfort.
An application to provide remote care
MoveNet was created in collaboration with IncludeHealth, a company specializing in digital health. The application lets a user analyze the position of 17 key points of the human body. This could make it possible to offer remote care to patients who cannot travel to a physiotherapist, for example. The application guides patients through a series of movements, called "routines", to be carried out almost daily. The routines are designed and prescribed digitally by physiotherapists to test a person's balance as well as the strength and range of the movements they perform. Meanwhile, MoveNet analyzes all the key points to check whether the patient is performing each movement correctly.
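To illustrate how detected key points could be turned into a check on a movement, here is a minimal sketch that measures the angle at a joint and compares it with a target. It is not IncludeHealth's method: the function names, the keypoint dictionary layout, the choice of the left knee, and the 15-degree tolerance are all assumptions made for the example.

```python
import math


def joint_angle(a, b, c):
    """Return the angle in degrees at point b between segments b->a and b->c.

    Each point is an (x, y) pair, for example in image coordinates.
    """
    v1 = (a[0] - b[0], a[1] - b[1])
    v2 = (c[0] - b[0], c[1] - b[1])
    n1 = math.hypot(*v1)
    n2 = math.hypot(*v2)
    if n1 == 0 or n2 == 0:
        raise ValueError("degenerate joint: two key points coincide")
    cos_angle = (v1[0] * v2[0] + v1[1] * v2[1]) / (n1 * n2)
    # Clamp to [-1, 1] to guard against floating-point drift before acos.
    cos_angle = max(-1.0, min(1.0, cos_angle))
    return math.degrees(math.acos(cos_angle))


def knee_within_target(keypoints, target_deg, tolerance_deg=15.0):
    """Hypothetical check: is the left knee angle close to the prescribed one?

    `keypoints` is assumed to map names such as "left_hip" to (x, y) pairs.
    """
    angle = joint_angle(
        keypoints["left_hip"], keypoints["left_knee"], keypoints["left_ankle"]
    )
    return abs(angle - target_deg) <= tolerance_deg
```

A real routine would likely track several joints over time rather than a single angle in a single frame; the sketch only shows the geometric step.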
Eventually, the tool could be used by hospitals, the military or insurance companies so that people who need special care can perform their exercises correctly and, potentially, remotely. Ryan Eder, founder and CEO of IncludeHealth, alludes to this possibility:
"The MoveNet model combines speed with accuracy, both of which are necessary to provide prescriptive care. While other models are either fast or accurate, MoveNet has that unique balance that will enable the next generation of care delivery. The Google team has been a fantastic collaborator in this quest."
The architecture and operation of MoveNet
The model, offered on TensorFlow Hub, comes in two variants depending on the use case:
- Lightning, for latency-critical applications.
- Thunder, for applications requiring greater accuracy.
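As a rough illustration of working with either variant, the sketch below picks a variant from a latency requirement and turns a raw result into named key points. It assumes the layout described in the published single-pose model documentation: a nested array of shape [1, 1, 17, 3] holding normalized (y, x, score) triples in COCO keypoint order, with input sizes of 192 pixels for Lightning and 256 pixels for Thunder. The helper names and the 0.3 score threshold are hypothetical, and the model call itself is left out so the example stays self-contained.

```python
# Keypoint order assumed to follow the COCO convention used by the model docs.
KEYPOINT_NAMES = [
    "nose", "left_eye", "right_eye", "left_ear", "right_ear",
    "left_shoulder", "right_shoulder", "left_elbow", "right_elbow",
    "left_wrist", "right_wrist", "left_hip", "right_hip",
    "left_knee", "right_knee", "left_ankle", "right_ankle",
]

# Square input resolution per variant (assumed from the model documentation).
VARIANTS = {
    "lightning": {"input_size": 192},
    "thunder": {"input_size": 256},
}


def choose_variant(latency_critical):
    """Hypothetical helper: favor Lightning when latency matters most."""
    return "lightning" if latency_critical else "thunder"


def parse_keypoints(output, min_score=0.3):
    """Convert a [1][1][17][3] nested list of (y, x, score) into a dict.

    Returns {name: (x, y, score)} with normalized coordinates, dropping
    key points whose confidence score is below `min_score`.
    """
    rows = output[0][0]
    if len(rows) != len(KEYPOINT_NAMES):
        raise ValueError("expected %d key points, got %d"
                         % (len(KEYPOINT_NAMES), len(rows)))
    result = {}
    for name, (y, x, score) in zip(KEYPOINT_NAMES, rows):
        if score >= min_score:
            result[name] = (x, y, score)
    return result
```

Multiplying the normalized coordinates by the width and height of the original image gives pixel positions.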
The model has four prediction heads. Each head drives one step in the sequence of operations that lets the model locate all 17 key points of a pose:
- Global heat map: predicts the geometric center of a person. The other prediction heads then build on this result.
- Key points regression field: Starting from that center, the model predicts an initial set of key points for the person according to their position at time T.
- Heat map of key points: Guided by the regression field, the system then predicts the location of every key point, taking into account only the person in the foreground.
- Two-dimensional offset field: The final set of key points is selected, and local 2D offset predictions are applied to refine the result.
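The four steps above can be sketched on toy data as follows. This is a simplified illustration on plain Python lists, not the model's actual decoding code: the tensor layouts, the function names, and the inverse-distance weighting used to favor the foreground person are assumptions chosen to keep the example short.

```python
import math


def argmax_2d(grid):
    """Return (row, col) of the largest value in a 2D list (first one on ties)."""
    best = (0, 0)
    for r, row in enumerate(grid):
        for c, value in enumerate(row):
            if value > grid[best[0]][best[1]]:
                best = (r, c)
    return best


def inverse_distance(r, c, r0, c0):
    """Weight that decays with distance from (r0, c0); the form is an assumption."""
    return 1.0 / (1.0 + math.hypot(r - r0, c - c0))


def decode_pose(center_heatmap, regression, keypoint_heatmaps, offsets):
    """Toy single-person decoder.

    center_heatmap:    H x W scores for the person center
    regression:        H x W x K (dy, dx) displacements from a center pixel
    keypoint_heatmaps: K x H x W scores, one grid per key point
    offsets:           K x H x W (dy, dx) local refinements
    Returns a list of K refined (row, col) positions.
    """
    height, width = len(center_heatmap), len(center_heatmap[0])
    # Step 1: locate the person center on the global heat map.
    center_r, center_c = argmax_2d(center_heatmap)
    keypoints = []
    for k, heatmap in enumerate(keypoint_heatmaps):
        # Step 2: read the regressed key point at the center pixel.
        dy, dx = regression[center_r][center_c][k]
        reg_r, reg_c = center_r + dy, center_c + dx
        # Step 3: down-weight heat map peaks far from the regressed position,
        # so peaks belonging to background people are not selected.
        weighted = [
            [heatmap[r][c] * inverse_distance(r, c, reg_r, reg_c)
             for c in range(width)]
            for r in range(height)
        ]
        peak_r, peak_c = argmax_2d(weighted)
        # Step 4: refine the peak with the local 2D shift (offset) prediction.
        off_y, off_x = offsets[k][peak_r][peak_c]
        keypoints.append((peak_r + off_y, peak_c + off_x))
    return keypoints
```

The weighting in step 3 is what keeps a strong peak from a person in the background from overriding a slightly weaker peak that sits close to the regressed position.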