Efficiency in Real-time Webcam Gaze Tracking

Conference Paper (2020)
Author(s)

Amogh Gudi (Vicarious Perception Technologies, TU Delft - Electrical Engineering, Mathematics and Computer Science)

Xin li (Vicarious Perception Technologies, Student TU Delft)

Jan van Gemert (TU Delft - Electrical Engineering, Mathematics and Computer Science)

Research Group
Pattern Recognition and Bioinformatics
DOI related publication
https://doi.org/10.1007/978-3-030-66415-2_34 Final published version
More Info
expand_more
Publication Year
2020
Language
English
Related content
Research Group
Pattern Recognition and Bioinformatics
Pages (from-to)
529 - 543
Publisher
Springer
ISBN (print)
978-3-030-66414-5
ISBN (electronic)
978-3-030-66415-2
Event
European Conference on Computer Vision (ECCV) 2020 Workshop on Eye Gaze in AR, VR, and in the Wild (OpenEyes) (2020-08-23 - 2020-08-28), Glassgow, United Kingdom
Downloads counter
174
Collections
Institutional Repository
Reuse Rights

Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the author(s) and/or copyright holder(s), unless the work is under an open content license such as Creative Commons.

Abstract

Efficiency and ease of use are essential for practical applications of camera based eye/gaze-tracking. Gaze tracking involves estimating where a person is looking on a screen based on face images from a computer-facing camera. In this paper we investigate two complementary forms of efficiency in gaze tracking: 1. The computational efficiency of the system which is dominated by the inference speed of a CNN predicting gaze-vectors; 2. The usability efficiency which is determined by the tediousness of the mandatory calibration of the gaze-vector to a computer screen. To do so, we evaluate the computational speed/accuracy trade-off for the CNN and the calibration effort/accuracy trade-off for screen calibration. For the CNN, we evaluate the full face, two-eyes, and single eye input. For screen calibration, we measure the number of calibration points needed and evaluate three types of calibration: 1. pure geometry, 2. pure machine learning, and 3. hybrid geometric regression. Results suggest that a single eye input and geometric regression calibration achieve the best trade-off.

Files

2009.01270_1.pdf
(pdf | 1.16 Mb)
- Embargo expired in 10-01-2022
License info not available