AxGazeEstimation: A Machine Learning Model for Estimating Gaze

David Cochard
axinc-ai
Jun 29, 2021

This is an introduction to AxGazeEstimation, a machine learning model that can be used with ailia SDK. With ailia SDK you can easily use this model, as well as many other ready-to-use ailia MODELS, to create AI applications.

Overview

AxGazeEstimation is a machine learning model developed by ax Inc. that estimates the gaze direction of a person from an input image.

Source: https://pixabay.com/ja/photos/%E3%83%93%E3%83%B3%E3%83%86%E3%83%BC%E3%82%B8-%E5%A5%B3%E6%80%A7-%E5%B8%BD%E5%AD%90-635244/

Architecture

AxGazeEstimation uses BlazeFace to detect faces in an image, then estimates the gaze using each detected face as input. Two methods of gaze estimation are available: direct estimation from the face image, and estimation from the face image combined with the face orientation.
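The two-stage flow described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the actual AxGazeEstimation code: `crop_face`, `estimate_gazes`, the `(x, y, w, h)` box format, the crop margin, and the stand-in models are all hypothetical names and values chosen for this example. In a real application the detector would be BlazeFace and the estimators would be the trained networks.

```python
import numpy as np


def crop_face(image, box, margin=0.2):
    """Crop a face region from an H x W x C image.

    box is (x, y, w, h) in pixels (an assumed format); margin enlarges
    the box on every side and the result is clipped to the image bounds.
    """
    img_h, img_w = image.shape[:2]
    x, y, w, h = box
    dx, dy = int(round(w * margin)), int(round(h * margin))
    x0, y0 = max(0, x - dx), max(0, y - dy)
    x1, y1 = min(img_w, x + w + dx), min(img_h, y + h + dy)
    return image[y0:y1, x0:x1]


def estimate_gazes(image, detect_faces, estimate_gaze, estimate_head_pose=None):
    """Run face detection, then gaze estimation on each detected face.

    If estimate_head_pose is given, its output is passed to the gaze
    estimator together with the face crop (the second method in the text).
    """
    results = []
    for box in detect_faces(image):
        face = crop_face(image, box)
        if estimate_head_pose is None:
            gaze = estimate_gaze(face)
        else:
            gaze = estimate_gaze(face, estimate_head_pose(face))
        results.append((box, gaze))
    return results


# Stand-in models so the sketch runs without any weights (hypothetical).
def fake_detector(image):
    return [(40, 30, 50, 60)]


def fake_gaze(face, head_pose=None):
    return np.array([0.0, 0.0, -1.0])


def fake_head_pose(face):
    return np.array([0.0, 0.0, 0.0])


frame = np.zeros((240, 320, 3), dtype=np.uint8)
direct = estimate_gazes(frame, fake_detector, fake_gaze)
with_pose = estimate_gazes(frame, fake_detector, fake_gaze, fake_head_pose)
```

Passing the models in as plain callables keeps the pipeline logic independent of any particular inference runtime, so the same loop serves both estimation methods.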

The network backbone uses a reduced version of ResNet50 (stage 3).

Training was performed on our in-house dataset, which consists of 97,059 training images and 11,775 validation images.

Usage

Use the following command to run gaze estimation on a webcam video stream.

$ python3 ax_gaze_estimation.py -v 0

The following command additionally estimates the face orientation (head pose) of each detected face and uses it for gaze estimation.

$ python3 ax_gaze_estimation.py -v 0 --include-head-pose
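For readers who want to wire similar options into their own script, here is a minimal sketch of how the two flags used above could be parsed with `argparse`. It is not taken from `ax_gaze_estimation.py`: the helper names `build_parser` and `video_source` are hypothetical, and treating a non-numeric `-v` value as a video file path is an assumption made for this example.

```python
import argparse


def build_parser():
    """Build a parser mirroring the two flags shown in the commands above."""
    parser = argparse.ArgumentParser(
        description="Gaze estimation demo (illustrative sketch)"
    )
    parser.add_argument(
        "-v", dest="video", default=None,
        help="webcam device id such as 0 (a file path is an assumption here)",
    )
    parser.add_argument(
        "--include-head-pose", action="store_true",
        help="also estimate head pose and use it for gaze estimation",
    )
    return parser


def video_source(value):
    """Return an int device index for digit strings, else the value unchanged."""
    return int(value) if value.isdigit() else value


# Equivalent to: python3 script.py -v 0 --include-head-pose
args = build_parser().parse_args(["-v", "0", "--include-head-pose"])
source = video_source(args.video)
```

Converting `"0"` to an integer matters because video capture libraries typically distinguish a device index from a file name by type.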

Here is an example of AxGazeEstimation in action.

Related topics

ax Inc. has developed ailia SDK, which enables cross-platform, GPU-based rapid inference.

ax Inc. provides a wide range of services, from consulting and model creation to the development of AI-based applications and SDKs. Feel free to contact us with any inquiries.
