Sign in

Class-attention Video Transformer for Engagement Intensity Prediction

By Xusheng Ai and others
In order to deal with variant-length long videos, prior works extract multi-modal features and fuse them to predict students' engagement intensity. In this paper, we present a new end-to-end method Class Attention in Video Transformer (CavT), which involves a single vector to process class embedding and to uniformly perform end-to-end... Show more
November 10, 2022
=
0
Loading PDF…
Loading full text...
Similar articles
Loading recommendations...
=
0
x1
Class-attention Video Transformer for Engagement Intensity Prediction
Click on play to start listening