-
GDI-LU-D-T0001
Colorectal cancer dataset -
GDI synthetic dataset (Population 11 Finland, Subgroup 2)
This dataset contains the pheno-clinical and genomic information of 42046 individuals from COVID Population 11 Finland, Subgroup 2. 2010 are affected by Phenotype 1, 2010 are... -
CRG_AFBeacon_GoE_synthetic_dataset
This is a subset of the GDI MS8 synthetic dataset (COVID pop13_1) containing aggregated allele frequency (AF) statistics for variants on chr21 across 42,312 samples. It has been... -
COVID-19 GWAS and Allele Frequency Lookup Dataset for GDI MS 8
This is the first part of the split 11 of the synthetic data dedicated for the GDI MS8 -
EOSC4Cancer Longitudinal Synthetic Colorectal Cancer Genomic data developed at BSC
The synthetic genomes have been created trying to mimic real cancer data of 4 patients (Named 185, 186, 187, and 188). Mutations are based on real CRC patients from the PCAWG... -
Exome Sequencing
All variants detected by whole exome sequencing of 2628 Dutch healthy elderly individuals -
1+MG COVID dataset
The 1+MG COVID dataset is located at CSC's Allas service and it is available via Dylan Spalding (dylan.spalding@csc.fi) by request for the GDI project. In the future, the... -
CINECA Synthetic Cohort EUROPE UK1 referencing fake samples
Please note: This synthetic data set (with cohort “participants” / ”subjects” marked with FAKE) has no identifiable data and cannot be used to make any inference about cohort... -
Tiny GoE synthetic data
Example GoE synthetic data from https://raw.githubusercontent.com/GenomicDataInfrastructure/starter-kit-synthetic-... -
GoE Estonian dataset, 578 samples, pgx_pilot AF
Dataset containing 578 estonian samples of 579 after pgx_pilot AF pipeline -
GoE Estonian dataset, 578 samples.
Dataset containing 578 estonian samples of 579 after AF_bcftools pipeline -
CRG AFBeacon GoE synthetic dataset
CRG_AFBeacon_GoE_synthetic_dataset -
GoE Small sample data
Slovenian small size dataset with real GoE data -
1+MG COVID dataset
The 1+MG COVID dataset is located at CSC's Allas service and it is available via Dylan Spalding (dylan.spalding@csc.fi) by request for the GDI project. In the future, the... -
Czech GoE Synthetic Aggregated Dataset
Czech GoE Synthetic Aggregated Dataset -
GDI Synthetic Genomics Dataset
Synthetic genomic dataset for testing the GDI starter kit. Contains simulated VCF data with allele frequency information for colorectal cancer cases. -
Genome of Europe Bulgaria Dummy dataset
GoE/GDI - Bulgaria MMC Dataset 1 - This is an example synthetic dataset for the staging environment of GDI. -
Example GoE data provided by GDI
Example GoE data provided by GDI -
GDI PT INSA Beacon dataset 1
GDI Beacon dataset from INSA -
GDI PT Pop12 sub1 (ITA)
Synthetic dataset containig CSV and VCF files