Access results

According to the FinnGen consortium agreement, FinnGen results are subjected to one year embargo and, after that, can be made available to the larger scientific community. Data releases are done twice a year. The results can be browsed with the web browser and summary statistics data can be downloaded from Google cloud storage free of charge. The latest release is from December 2023. Furthermore, we have recently released the gene-based burden test results of loss of function variants for DF8. Please follow the instructions below to access the results.

Latest release:

DF10 - December 18 2023

Total sample size: 412,181 (230,310 females and 181,871 males)

Total number of variants analyzed: 21,311,942 variants

Number of disease endpoints (phenotypes) available: ​2,408 endpoints

Data released to the partners: Q3 2022

Public release: Dec 18, 2023

Browser: r10.finngen.fi

Thorough documentation is available here.

 

Please note that the allele frequency column provides the human reference genome alternative allele (not minor allele) frequency. All effect sizes and allele frequencies are reported in terms of alternate allele.

 

How to access the summary statistics:

  1. Fill this online form to gain access to Finngen GWAS summary statistics. 
  2. Instructions on how to download summary statistics are sent to you by e-mail.
  3. Download the data 

 

Acknowledgements:

Please use the following description when referring to our project:


The FinnGen study is a large-scale genomics initiative that has analyzed over 500,000 Finnish biobank samples and correlated genetic variation with health data to understand disease mechanisms and predispositions. The project is a collaboration between research organisations and biobanks within Finland and international industry partners.

 

When using these results in publications, please remember to:

  1. Acknowledge the FinnGen study. You can use the following text:

“We want to acknowledge the participants and investigators of the FinnGen study”

  1. Cite our latest publication:

Kurki, M.I., Karjalainen, J., Palta, P. et al. FinnGen provides genetic insights from a well-phenotyped isolated population. Nature 613, 508–518 (2023). https://doi.org/10.1038/s41586-022-05473-8

Furthermore, if possible, include "FinnGen" as a keyword for your publication.

 

Browsing the results: 

The web browser r10.finngen.fi contains all FinnGen GWAS results from the latest public release and provides you with three options:

  1. Search for the GWAS result of a variant, phenotype or gene.

  2. Explore the loss-of-function burden (LoF) for gene-phenotypes combinations.

  3. Find a particular phenotype/endpoint.

 

Risteys

Risteys tool (Risteys = intersection in Finnish) allows browsing of the FinnGen data at the phenotype level, including endpoint definitions, statistics about number of individuals, gender distribution, and longitudinal relationships. For this data release, please visit: r10.risteys.finngen.fi.

 

Individual level data

In case you are interested in accessing individual level biobank samples and data or recontacting Finnish biobank participants, please visit the Fingenious portal hosted by the Finnish Biobank Cooperative FINBB for more information.

 

Earlier releases:

DF9 - May 11 2023

Total sample size: 377,277 (210,870 females and 166,407 males)

Total number of variants analyzed: 20,175,454 variants

Number of disease endpoints (phenotypes) available: ​2,272 endpoints

Data released to the partners: Q1 2022

Browser: r9.finngen.fi

Thorough documentation is available here.

FinnGen + pan-UKBB meta-analysis results for 679 phenotypes are available for browsing here: https://public-metaresults-fg-ukbb.finngen.fi/

Read more about the endpoint matching from this page

Note: DF9 HLA region GWAS analysis results also available for download (as of 11.10.2023) 

 

DF8 - Dec 1 2022

Total sample size: 342,499 (190,879 females and 151,620 males)

Total number of variants analyzed: 20,175,454 variants

Number of disease endpoints (phenotypes) available: ​2,202 endpoints*

Data released to the partners: Q4 2021

Public release: Dec 1, 2022

Browser: r8.finngen.fi

Thorough documentation is available here.

*For DF8 the number of endpoints was reduced. This was done for a number of reasons - partly to reduce computational costs and also because nearly redundant endpoints could be confusing.
Endpoints were reduced through careful calculation of the case overlaps and consultation with the FinnGen clinical groups.

DF8 - May 26, 2023

Regenie gene-based burden test results of loss of function variants (LoFs) of 544 genes.

 

DF7 - June 1 2022

Total sample size: 309,154 (173,746 females and 135,408 males)

Total number of variants analyzed: 16,962,023 variants

Number of disease endpoints (phenotypes) available: ​3,095 endpoints

Data released to the partners: Q2 2021

Public release: June 1, 2022

Browser: r7.finngen.fi

Thorough documentation is available here.

 

DF6 - January 24 2022

Total sample size: ​260,405 (147,061 females and 113,344 males)

Total number of variants analyzed: 16,962,023 variants

Number of disease endpoints (phenotypes) available: ​2,861 endpoints

Data released to the partners: Q3 2020

Public release: January 24 2022

Browser: r6.finngen.fi

Thorough documentation is available here.

 

DF5 - May 11 2021

Total sample size: ​218,792

Total number of variants analyzed: 16,962,023 variants

Number of disease endpoints (phenotypes) available: ​2,803 endpoints

Data released to the partners: Q2 2020

Public release: May 11 2021

Browser: r5.finngen.fi

Thorough documentation is available here

 

DF4 - November 30 2020:

Total sample size: ​176,899

Total number of variants analyzed: 16,962,023 variants

Number of disease endpoints (phenotypes) available: ​2,444 endpoints

Data released to the partners: Q4 2019

Public release: Nov 30, 2020

Thorough documentation is available here

 

DF3 - June 16 2020:

Total sample size: ​135,638

Total number of variants analyzed: 16,962,023 variants

Number of disease endpoints (phenotypes) available: ​1,801 endpoints

Data released to the partners: Q2 2019 (May)

Public release: June 16 2020

Thorough documentation is available here

New feature: LD estimation from SISu v3 reference panel

 

DF2 - January 14 2020:

Total sample size: ​96,499​​

Total number of variants analyzed: 16,152,119 variants

Number of disease endpoints (phenotypes) available: ​1,122 endpoints

Data released to the partners: Q4 2018 (Nov 27)

Public release: Jan 14 2020

Thorough documentation is available here