Research output per year
Research output per year
Yan is a statistician and applied statistician with PhD in statistical epidemiology, MSc in health data science and BSc degree in mathematics. Yan is also an experienced statistical programmer as he has worked in a data management company and has conducted extensive statistical analysis in his MSc and PhD project for 6 years. He has expertise using electronic health records (EHR) to conduct epidemiological studies. He is also experienced in conducting statistical analysis and fulfilling the Food and Drug Administration (FDA) requirements for clinical trials of new medicines as he conducted multiple projects in the analyses of trial data. His current research area focuses on assessing the generalisability of risk prediction models (including traditional risk prediction model and machine-learning (AI) models) using EHRs from UK databases.
Research
I am experienced using EHR data to assess model performance on both population level and individual level of risk prediction model. I am also familiar with using risk prediction model and other statistics to conduct epidemiology studies, e.g. to investigate on the drivers of antibiotics over prescription.
Programming
I am capable of combining the strength of different program languages (SAS, R, Python, C++ and Java), and quickly overcoming computational challenges of substantive software and hardware in the research. I am familiar with statistical programming with SAS procedures, model fitting with R packages and machine learning model fitting and validation with Python.
Data
As an experienced data scientist, I am very familiar with conducting analysis with large data. I have fitted and validated multiple risk prediction models including both of traditional statistical models and machine learning models in a 3.6 million cohort. I designed a workflow with the combination of advantages of different programming languages to conduct efficient statistical analysis using large data.
Currently I have three aspects of research interest:
I taught new employee SAS programing in clinical trials.
I assist teaching master students how to program with R
I supervise master students for their dissertation
I taught master students with an introduction of clinical risk prediction modelling with traditional statistical modelling and machine learning
I am also a part-time video editor and photo designer with an aim to make the educational course contents more interesting with meme, animation and comics.
I am also interested in independent game develop with Python. I wish to involve more educational video game into education.
In 2015, UN member states agreed to 17 global Sustainable Development Goals (SDGs) to end poverty, protect the planet and ensure prosperity for all. This person’s work contributes towards the following SDG(s):
Doctor of Philosophy, The University of Manchester
1 Sep 2017 → 1 Sep 2020
Award Date: 1 Sep 2020
Master in Science, Health data science, The University of Manchester
1 Sep 2016 → 1 Sep 2017
Award Date: 1 Sep 2017
Bachelor of Science, Mathematics and applied mathematics, Sichuan University
1 Sep 2009 → 1 Sep 2013
Award Date: 1 Sep 2013
Research output: Contribution to journal › Article › peer-review
Research output: Contribution to journal › Article › peer-review
Research output: Contribution to journal › Article › peer-review
Research output: Contribution to journal › Article › peer-review
Research output: Contribution to journal › Article › peer-review
Supervisor: Sperrin, M. (Supervisor) & Van Staa, T. (Supervisor)
Student thesis: Phd