GESCAM : A Dataset and Method on Gaze Estimation for Classroom Attention Measurement

Abstract

Human gaze provides crucial insights into individual attention during social or educational interactions. Attention systems often rely on head and facial features to predict gaze direction, but reliable gaze target detection (GTD) requires rich contextual cues. These cues inform the system about an individual's position within a scene and the surrounding objects they might be interacting with. Our paper proposes attention measurement using GTD in educational classrooms, leveraging a synthetic dataset called GESCAM (Gaze Estimation based Synthetic Classroom Attention Measurement). This dataset was meticulously generated using 3D modelling, animation, simulation, and rendering techniques comprising 60,000 images with 650,000 instances of individuals (students, teachers) engaged in various activities, including looking at blackboard, notebooks, mobile phones etc. Our novel network trained on GESCAM proficiently identifies gaze fixations within complex classroom scenes, offering insights into human attention in classrooms across diverse contexts.

Dataset

Get a Sample of the GESCAM Dataset

You can download a subset of the annotated GESCAM dataset here.

Looking for the Full Dataset?

For access to the complete GESCAM dataset, please contact us at:

BibTeX

If you find this work useful, please cite:

@InProceedings{Mathew_2024_CVPR,
    author    = {Mathew, Athul M. and Khan, Arshad Ali and Khalid, Thariq and Souissi, Riad},
    title     = {GESCAM : A Dataset and Method on Gaze Estimation for Classroom Attention Measurement},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
    month     = {June},
    year      = {2024},
    pages     = {636-645}
}

GESCAM : A Dataset and Method on Gaze Estimation for Classroom Attention Measurement

Abstract

RGB image of classroom

Depth map for each corresponding RGB image

Annotation of bounding boxes and labels for people and various objects within the scene

Masks highlighting individual objects for precise identification

Gaze lines indicating points of interests within the scene

Dataset

Get a Sample of the GESCAM Dataset

Looking for the Full Dataset?

BibTeX