-
GDI-LU-D-T0001
Colorectal cancer dataset -
GDI synthetic dataset (Population 11 Finland, Subgroup 2)
This dataset contains the pheno-clinical and genomic information of 42046 individuals from COVID Population 11 Finland, Subgroup 2. 2010 are affected by Phenotype 1, 2010 are... -
CRG_AFBeacon_GoE_synthetic_dataset
This is a subset of the GDI MS8 synthetic dataset (COVID pop13_1) containing aggregated allele frequency (AF) statistics for variants on chr21 across 42,312 samples. It has been... -
GDI PT Pop12 sub1 (ITA)
Synthetic dataset containig CSV and VCF files -
GDI PT INSA Beacon dataset 1
GDI Beacon dataset from INSA -
COVID-19 GWAS and Allele Frequency Lookup Dataset for GDI MS 8
This is the first part of the split 11 of the synthetic data dedicated for the GDI MS8 -
CINECA Synthetic Cohort EUROPE UK1 referencing fake samples
Please note: This synthetic data set (with cohort “participants” / ”subjects” marked with FAKE) has no identifiable data and cannot be used to make any inference about cohort... -
Exome Sequencing
All variants detected by whole exome sequencing of 2628 Dutch healthy elderly individuals -
Genome of Europe Dutch Dummy dataset
The first dummy GoE examplar AF-browser dataset and on 2025-11-27 we decided that also the description how we arrived at this dataset will be part of the description -
Tiny GoE synthetic data
Example GoE synthetic data from https://raw.githubusercontent.com/GenomicDataInfrastructure/starter-kit-synthetic-... -
CRG AFBeacon GoE synthetic dataset
CRG_AFBeacon_GoE_synthetic_dataset -
1+MG COVID dataset
The 1+MG COVID dataset is located at CSC's Allas service and it is available via Dylan Spalding (dylan.spalding@csc.fi) by request for the GDI project. In the future, the... -
GoE Small sample data
Slovenian small size dataset with real GoE data -
Czech GoE Synthetic Aggregated Dataset
Czech GoE Synthetic Aggregated Dataset -
Example GoE data provided by GDI
Example GoE data provided by GDI -
GDI Synthetic Genomics Dataset
Synthetic genomic dataset for testing the GDI starter kit. Contains simulated VCF data with allele frequency information for colorectal cancer cases. -
Genome of Europe Bulgaria Dummy dataset
GoE/GDI - Bulgaria MMC Dataset 1 - This is an example synthetic dataset for the staging environment of GDI. -
1+MG COVID dataset
The 1+MG COVID dataset is located at CSC's Allas service and it is available via Dylan Spalding (dylan.spalding@csc.fi) by request for the GDI project. In the future, the... -
EOSC4Cancer Longitudinal Synthetic Colorectal Cancer Genomic data developed at BSC
The synthetic genomes have been created trying to mimic real cancer data of 4 patients (Named 185, 186, 187, and 188). Mutations are based on real CRC patients from the PCAWG... -
Exome Sequencing
All variants detected by whole exome sequencing of 2628 Dutch healthy elderly individuals