Clustering Multiple Long-Term Conditions

Welcome

This site provides an overview of my PhD research into clustering Multiple Long-Term Conditions (MLTC). Here, you will also find resources, including clinical code lists, analysis code and results.

Research aims

My research develops methods for clustering both diseases and people in the context of MLTC, and explores how these clusters can be applied. I apply these methods to a large dataset of over 10 million primary and secondary care electronic health records (EHRs), representative of the population in England. I combine methods from statistics and epidemiology with natural language processing, developing algorithms capable of accounting for the sequence of a person's diseases over time. My research aims are to:

Aim 1: Understand the limitations of using EHR data for research into MLTC and disease sequences.
Aim 2: Generate clusters of diseases, comparing algorithms using disease co-occurrence versus sequence.
Aim 3: Generate clusters of people with similar patterns of diseases.
Aim 4: Explore how clusters can inform clinical outcomes and service design.
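
To make the distinction in Aim 2 concrete, here is a minimal Python sketch of how co-occurrence and sequence can give different views of the same records. It is an illustration only, not the algorithm used in the thesis: the toy `patients` data, the disease labels and both function names are hypothetical, and it assumes each disease appears at most once per person, listed in order of first diagnosis.

```python
from collections import Counter
from itertools import combinations

# Hypothetical toy data: one list per person, diseases in order of first diagnosis.
patients = [
    ["hypertension", "diabetes", "ckd"],
    ["diabetes", "hypertension", "ckd"],
    ["hypertension", "ckd"],
]


def cooccurrence_counts(sequences):
    """Count unordered disease pairs found in the same person (order ignored)."""
    counts = Counter()
    for seq in sequences:
        # Sorting makes (a, b) and (b, a) map to the same key.
        for a, b in combinations(sorted(set(seq)), 2):
            counts[(a, b)] += 1
    return counts


def ordered_pair_counts(sequences):
    """Count ordered pairs (earlier, later) within each person's sequence."""
    counts = Counter()
    for seq in sequences:
        # combinations() preserves input order, so the first item is the earlier one.
        for earlier, later in combinations(seq, 2):
            counts[(earlier, later)] += 1
    return counts


cooc = cooccurrence_counts(patients)
seq_counts = ordered_pair_counts(patients)

print(cooc[("diabetes", "hypertension")])        # pair seen together, order ignored
print(seq_counts[("hypertension", "diabetes")])  # hypertension diagnosed first
print(seq_counts[("diabetes", "hypertension")])  # diabetes diagnosed first
```

In this toy example, co-occurrence treats the two people with both diabetes and hypertension identically, whereas the ordered counts separate them by which condition came first. Either set of counts could then feed a clustering step, which is left out of this sketch.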

About me

I work as a GP in London and a clinical researcher at Imperial College London. This research formed my PhD thesis, which was awarded in November 2024. You can find more information on my personal webpage.

Acknowledgements

My research was funded through a clinical PhD fellowship from the Wellcome Trust 4i programme at Imperial College London. Thanks to my supervisors Professor Paul Aylin, Professor Mauricio Barahona, Professor Azeem Majeed, Dr Tom Woodcock and Dr Jonathan Clarke. The work was also supported by a Patient and Public Advisory Group, including three people with lived experience of having, or caring for a person with, MLTC.

Contact

For further information, or if you want to discuss any aspect of the work, please get in touch:

Email: thomas.beaney@imperial.ac.uk