Sponsors

Largest ever genomic dataset of children and their parents

For the first time, genomic data for over 37,000 children and parents collected over multiple decades from across the UK is now available to researchers worldwide.

High-resolution DNA sequencing has created a unique resource to explore the relationship between genetic and environmental factors in child health and development.

The data release is led by:

  • Wellcome Sanger Institute
  • Children of the 90s (also known as ALSPAC)
  • Millennium Cohort Study (MCS)
  • Born in Bradford (BiB)

The data release is supported by the Economic and Social Research Council (ESRC) and Medical Research Council (MRC). Previously, large DNA sequence datasets typically focused on children with rare conditions or adult population cohorts. This new data release is the first time the focus has been on sequencing birth cohorts, which are population-based studies of people followed from birth through to adolescence or early adulthood.

To produce this data release, researchers at the Sanger Institute sequenced all 20,000 genes in the human genome, known as exome sequencing, in samples from:

  • 8,436 children and 3,215 parents from the Children of the 90s study
  • 7,667 children and 6,925 parents from the MCS
  • 8,784 children and 2,875 parents from BiB

These combined data resources offer the scientific community the opportunity to make valuable insights in areas ranging from population genetics to the social sciences. In the coming months, this DNA sequence data resource will be expanded to encompass all participants in these cohorts as well as additional cohorts. The value of these data will be enhanced by harmonising the data across the different cohorts, providing a more powerful resource than could be achieved by one study in isolation.

Dr Richard Evans, Interim Head of Population Health Sciences at MRC, said: “The UK’s cohorts and longitudinal population studies are an extraordinary national asset, made possible by the participation of a diverse range of people. The rich data and samples from these studies, when combined with whole exome sequencing, can unlock new research questions and insights into human society, development, health and aging. MRC’s funding is part of our overall investment in understanding the drivers of disease to enable precision prevention and personalised treatments, and maximising existing infrastructure to ensure real value for money. This work aligns perfectly with a new exciting national resource that is supported by MRC and ESRC, Population Research UK, which is all about coordinating and leveraging UK cohorts.”

Now available on the European Genome-phenome Archive, these high-quality genomic data can be used in combination with the existing longitudinal health and survey information provided by participating families. The team has made the anonymised data as accessible as possible to approved researchers. This includes drafting a data note (available on Wellcome Open Research) and other materials to help support its use by those who are less familiar with large-scale sequencing data.

Read more about this project on the Wellcome Sanger Institute website.

 

Latest Issues

UKHSA Conference 2025

Manchester Central
25-26 March, 2025

Microbiology Society Annual Conference 2025

Liverpool Arena and Convention Centre
31 March - 3 April, 2025

2nd Global Summit on Pathology

Rome, Italy
10-11 April, 2025

Clinical Innovations EXPO - Breaking Through the Adoption Barriers

Jubilee Hotel and Conference Centre, Nottingham, UK
15 May, 2025

BSMT 40th Anniversary Microbiology Conference

RAF Museum, Hendon, London NW9 5LL
15 May, 2025