Universiteit Leiden

nl en

Data Science Research Programme

Webinars

On this page you will find a collection of presentations and videos of the Florence Nightingale Colloquia, seminars at the faculty and other event recordings hosted by the Data Science Research Programme.

26-11-21 Peter Flach: The highs and lows of performance evaluation: Towards a measurement theory for machine learning

Due to the selected cookie settings, we cannot show this video here.

Watch the video on the original website or

The highs and lows of performance evaluation: Towards a measurement theory for machine learning

Abstract:
Our understanding of performance evaluation measures for machine-learned classifiers has improved considerably over the last decades. However, there is a range of areas where this understanding is still lacking, leading to ill-advised practices in classifier evaluation. This is clearly problematic, since if machine learning researchers are unclear about what exactly their experiments are telling them about their machine learning algorithms, then how can end-users trust systems deploying those algorithms?  

 

I suggest that in order to make further progress we need to develop a proper measurement theory of machine learning. Measurement theory studies the concepts of measurement and scale. If one has a way to measure, say, the length of individual rods or planks, this should also allow one to then calculate the combined length of concatenated rods or planks. What relevant concatenation operations are there in data science and AI, and what does that mean for the underlying measurement scale?

 

I discuss by example what such a measurement theory might look like and what kinds of new results it would entail. I furthermore argue that key properties such as classification ability and data set difficulty are unlikely to be directly observable, suggesting the need for latent-variable models. Ultimately, machine learning experiments need to go beyond simple correlations and aim to make causal inferences of the form 'Algorithm A outperformed algorithm B because the classes were highly imbalanced', or counterfactually, 'if the classes were re-balanced, this performance difference between A and B would not have been observed'. 

 

Short CV:
Peter Flach has been Professor of Artificial Intelligence at the University of Bristol since 2003. An internationally leading scholar in the evaluation and improvement of machine learning models using ROC analysis and calibration, he has also published on mining highly structured data, and has an interest in human-centred AI. He is author of Simply Logical: Intelligent Reasoning by Example (John Wiley, 1994) and Machine Learning: the Art and Science of Algorithms that Make Sense of Data (Cambridge University Press, 2012).

 

Prof Flach stepped down last year as the Editor-in-Chief of the Machine Learning journal, after being in post for 10 years. He was Programme Co-Chair of the 1999 International Conference on Inductive Logic Programming, the 2001 European Conference on Machine Learning, the 2009 ACM Conference on Knowledge Discovery and Data Mining, and the 2012 European Conference on Machine Learning and Knowledge Discovery in Databases in Bristol. He is President of the European Association for Data Science, and a Fellow of the Alan Turing Institute for Data Science and Artificial Intelligence. 

22/10/21 Javier Alonso Mora TUD Motion Planning among Decision-Making Agents

Due to the selected cookie settings, we cannot show this video here.

Watch the video on the original website or

28-05-2021 Francesca Ieva - How to manage complexity in Healthcare: new methods and challenges for Health Analytics

Due to the selected cookie settings, we cannot show this video here.

Watch the video on the original website or

30-04-2021 Harrie Oosterhuis - Optimizing Search and Recommender Systems based on Position-Biased User Interactions

Due to the selected cookie settings, we cannot show this video here.

Watch the video on the original website or

25-03-2021 Ozan Öktem - Bayesian inversion for tomography through machine learning

Due to the selected cookie settings, we cannot show this video here.

Watch the video on the original website or

19-02-2021 Ronald Geskus - Competing risks, analysis and interpretation

Due to the selected cookie settings, we cannot show this video here.

Watch the video on the original website or

22-01-2021 Joaquin Vanschoren - Learning how to learn how to learn

Due to the selected cookie settings, we cannot show this video here.

Watch the video on the original website or

15-01-2021 Kristian Kersting - Making Deep Neural Networks Right for the Right Scientific Reasons

Due to the selected cookie settings, we cannot show this video here.

Watch the video on the original website or

26-06-2020 Max Welling - Neural Augmentation with Applications in MRI Image Reconstruction and Wireless Communication

Due to the selected cookie settings, we cannot show this video here.

Watch the video on the original website or

12-06-2020 Webinar Bernet Elzinga - Innovative approaches on parenting from a family perspective

Due to the selected cookie settings, we cannot show this video here.

Watch the video on the original website or

05-06-2020 Webinar Wouter van Loon - Selecting views in multi-view learning

Due to the selected cookie settings, we cannot show this video here.

Watch the video on the original website or

29-05-2020 Webinar Serge Rombouts - Imaging Brain Networks: pharmacological manipulation and individual prediction of cognitive decline

Due to the selected cookie settings, we cannot show this video here.

Watch the video on the original website or

Presentation files

26-06-2020 Webinar Max Welling - Neural Augmentation in Wireless Communication

15-05-2020: Webinar Peter Grünwald - Safe Testing

This website uses cookies.  More information.