SAILS Lunch Time Seminar: Hazel Doughty

Monday 8 January 2024
Online only

Towards an Automatic and Detailed Understanding of Videos

Thanks to revolutions in supervised learning and the emergence of large, labelled video datasets, tremendous progress has been made in automatically understanding what is happening in videos. However, current algorithms cannot capture finer details such as how an action is performed, which is often key to achieving the desired outcome. For instance, existing models can recognize someone performing CPR, but fail to identify that it needs to be done faster, firmer and further up the body to have a chance of resuscitation. This talk will cover what is currently possible in video understanding, as well as several recent works aiming for a more detailed understanding through (1) skill assessment, (2) identifying relevant adverbs and (3) reducing the supervision needed to distinguish subtly different actions.

