news_side_Button.jpg (3776 bytes)

projects_side_Button.jpg (3894 bytes)

publications_side_Button.jpg (4042 bytes)

abourradar_side_Button.jpg (4035 bytes)

people_side_Button.jpg (3827 bytes)

links_side_Button.jpg (3757 bytes)

MSc Student project suggestions and notes

Updated: 12/12/2021

I am currently adopting MSc students who will be taking my Semester 2 module ECS7005P "Risk assessment for data science and AI". On this module students will learn how to use specialist software to build causal models (Bayesian networks) that enable risk assessment and decision support for a broad class of problems where there are insufficient data for pure machine learning algorithms to provide useful or accurate results. We call this the 'smart data' approach - in contrast to the 'big data' approach.

This 5 minute video explains why smart data, rather than big data, can achieve 'true AI": https://youtu.be/-7vSiWRasxY

For example, imagine you were trying to learn from data whether customers applying for loans are likely to default; the data available is about people previously given loans and whether or not they defaulted- but we have no data about people who were refused loans. Or imagine trying to learn from data whether patients with particular risk factors are likely to suffer from a particular disease; the data available are about previous patients known to have suffered from the disease, but we have no data about people who did not suffer from the disease and no data about the kind of interventions or treatments any of the patients had. In cases such as these we must use knowledge to build an underlying causal model and then a combination of data and knowledge to 'paramaterize' the model.

Each project will use a combination of data and knowledge to develop a Bayesian network model for a working decision support system for one of the following kind of problems (although I am open to other well motivated suggestions):
  • monitoring and diagnosis of a chronic medical condition
  • early detection and monitoring of risk (for example: will students drop out of a course, are children likely to be harmed at home, will subscribers in a particular service renew at end of contract, etc)
  • determining whether there is bias in a system that makes decisions about people (for example, automated systems that review job applicants, predictive policing systems etc.)
  • determining whether a film will be a box office hit (purely data driven regression models do not make accurate predictions - see this paper -, so the objective is to incorporate knowledge in addition to the data in websites such as Themoviedb.org and imdb.com
  • determining whether the evidence in a criminal investigation is likely to result in a successful prosecution
This 7 minute video provides an example of the kind of working Bayesian network model you would be expected to produce:
https://youtu.be/3KGYuLFMRSY

Two short 2-page overview articles of the smart data approach are:

Previous highly successful projects:


















 


home_side_Button.jpg (3716 bytes) Norman Fenton website last updated on 23 Oct, 2020.
Please direct all comments about this website to n.fenton@qmul.ac.uk