Other biological data
The bulk of the data in FinnGen consists of genotypes and health register data, which are available for all FinnGen study subjects. However, the project is gradually expanding to generate other data types for a subset of its participants.
To better understand the functional implications of a number of Finnish-specific disease-associated genetic variants, the following data types have been included:
- Metabolomics data from approximately 1000 plasma samples (Metabolon)
- Proteomics data from 6350 plasma samples using the Olink 3K or 5K platform, and from 880 plasma samples using the SomaLogic platform
- Clinical chemistry of 2200 serum samples and full blood cell counts
- Single cell painting of 400 PBMC samples
- Single-nucleus RNA and ATAC sequencing, as well as immunophenotyping of 1100 samples
These activities are ongoing, and we plan to generate proteomics data on an additional ~5000 plasma samples.
As part of FinnGen 3 activities, FinnGen also acquired exome sequence variants for 25,000 and genome sequence variants for approximately 2,500 FinnGen study subjects’ samples. Additional NMR metabolomics data is available for 37,000 FinnGen study subjects. These datasets further enrich the genetic resources available for the FinnGen study and have been primarily acquired through the THL biobank.