Henry Wilde

Cardiff, Wales   henrydavidwilde@gmail.com
github.com/daffidwilde   References available upon request

Summary

I am a thoughtful, ethically minded scientist with a track record of pragmatism and efficient, impactful work. I have a breadth of projects under my belt from large-scale health data analysis with machine learning to productionising secure enclaves for record linkage. I find great joy in picking up new tools and techniques, and in putting those skills to use at pace.

Currently, I am leveraging LLMs to realise business efficiencies in the ONS, and I champion the increased use of privacy-enhancing technologies (PETs) across the Civil Service and Government.

Having successfully led numerous high-impact projects in academia and government, I am now looking to apply my expertise as a data scientist and software engineer in a new venture.

Employment

May 2022 - present Data scientistData Science Campus, Office for National Statistics

Python (data science stack, BeautifulSoup) | Version control (Git, GitLab, GitHub) | Google Cloud Platform | Docker | Automated testing (pytest, hypothesis, GitHub Actions) | Publishing (Quarto, Streamlit, GitHub Pages, Markdown, LaTeX) | LLMs (Gemini, OpenAI, LangChain) | Distributed computation (Dask, PySpark, Google BigQuery)

Feb 2021 - May 2022 Research associateWater Research Institute, Cardiff University

R (tidyverse, mgcv, Shiny, RStan, RMarkdown) | Version control (Git, GitHub) | LIMS

2019-2020 Volunteer consultantSchool of Biosciences, Cardiff University

Python | Version control (Git, GitHub) | Jupyter | Microsoft Excel

Dissertation supervisorSchool of Mathematics, Cardiff University

Python | Version control (Git, GitHub) | SQL | LaTeX

2017-2021 PhD studentship teachingSchool of Mathematics, Cardiff University

Python (data science stack, SymPy, Dask) | Version control (Git, GitHub) | Testing (pytest, hypothesis, Travis CI) | Writing (LaTeX, Markdown, reStructuredText, Sphinx)

Education

2017-2021 PhD Applied Statistics, Operational Research and Data AnalyticsSchool of Mathematics, Cardiff University

2014-2017 BSc Mathematics (First Class Honours)School of Mathematics, Cardiff University

Awards

2022-2024 Reward and RecognitionOffice for National Statistics

2022 PETs HackathonUnited Nations PET Lab

2018 Support for NATCOR BursaryAssociation of European Operational Research Societies

Publications

A list is also available online.

Thesis

2021 Wilde, H. New methods for algorithm evaluation and cluster initialisation with applications to healthcare. Cardiff University. PDF. GitHub repository.

Journals

2022 Wilde, H., et al. Accounting for dilution of SARS-CoV-2 in wastewater samples using physico-chemical markers. Water, 14(18):2885. DOI:10.3390/w14182885

2020 Wilde, H., Knight, V. and Gillard, J. Evolutionary dataset optimisation: learning algorithm quality through evolution. Applied Intelligence, 50:1172-1191. DOI:10.1007/s10489-019-01592-4

Wilde, H., Knight, V. and Gillard, J. Matching: a Python library for solving matching games. Journal of Open Source Software, 5(48):2169. DOI:10.21105/joss.02169

Pre-prints

2024 Jones, O., et al. Estimating wastewater dilution using chemical markers and incomplete flow measurements: application to normalisation of SARS-CoV-2 measurements. DOI:10.20944/preprints202402.1109.v1

2022 Houssiau, F., et al. A framework for auditable synthetic data generation. arXiv:2211.11540

Interests


Cooking

I taught myself to cook as a child, and then worked as a chef while at sixth form, including at a former Michelin star restaurant. Cooking for friends and family is now one of my dearest pastimes.

Cycling

During the height of the COVID-19 pandemic, I desperately needed something to occupy myself outside of writing my thesis. So, I taught myself bike mechanics and renovated a vintage steel-frame touring bike.

D & D

I adore fantasy in all its forms. Now, after years of listening to Dungeons & Dragons podcasts, I serve as the game master in a homebrew campaign for my three brothers.