Other biological data

The bulk of the data in FinnGen consists of the genotypes and the health register data which is available from all FinnGen study subjects. However, during FinnGen 2 and further now in FinnGen 3 the project is expanding to generate other data types of subset of its participants.

During FinnGen 2, as part of Expansion Area 5, aiming to better understand the functional implications of a number of Finnish specific disease-associated genetic variants the following data types were included:

  • Metabolomics data from approximately 1000 plasma samples (Metabolon)
  • Proteomics data from 2200 plasma samples using the Olink 3K platform, and from approximately 1000 plasma samples using the SomaLogic platform.
  • Clinical chemistry of 2200 serum samples and full blood cell counts.
  • Single cell painting of 400 PBMC samples
  • Single-nucleus RNA and ATAC sequencing, as well as immunophenotyping of 360 samples


The aim is to expand some of these activities during FinnGen 3, where we aim to generate proteomics data on additional ~10 000 plasma samples and single nuclei RNA and ATAC sequencing on ~2000 PBMC samples. 

Also as a part of FinnGen 3 activities, FinnGen also acquired exome sequence variants for 25,000 and genome sequencing variants for ca 2,500 FinnGen study subjects’ samples. Additional NMR metabolomics data is available for 37,000 FinnGen study subjects. These datasets further enrich the genetic resources available for the FinnGen study and have been primarily acquired through the THL biobank.