Kyushu University Definitive Haplotype Database
(D-HaploDB)

Home

Haplotype Browser
(D1 and D2 on NCBI Build 35)

Haplotype Browser
(D3 on NCBI Build 36)

Data Download

Terms and Conditions

Update History

Hayashi Lab Homepage


Genotype Data of D2 SNPs

When "mole_info_DhaploD2.zip" file is expanded, a single file containing SNP data on all chromosomes is produced. The file gives genotype results of 581235 SNPs for 74 complete hydatidiform mole (CHM) samples. This dataset is a merge of the genotypes presented in D1 and a part of the genotypes determined using Affymetrix 500K Arrays followed by qc (Higasa K, Kukita Y, Kato K, Wake N, Tahira T, Hayashi K, "Evaluation of haplotype inference using definitive haplotype data obtained from complete hydatidiform moles, and its significance for the analyses of positively selected regions". PLoS Genetics 5:e1000468, 2009). The file is a tab-delimited plain text table, with the following columns:

Column NameDescription
rsrs number
chrchromosome: 1-22, X
posposition in chromosome (NCBI Build 35)
allele1allele 1 nucleotide
allele2allele 2 nucleotide
gtype genotype results for 74 CHM samples

Download : mole_info_DhaploD2.zip (13.7 MB)


Genotype Data of D1 SNPs

We prepared two compressed files for data downloading. When "gen_data_all.zip" file is expanded, a single file containing SNP data on all chromosomes is produced. When "gen_data.zip" file is expanded, 23 files, each containing SNP data on each chromosome (22 autosomes and X chromosome) are produced. These files give dbSNP ID, NCBI Build 35 coordinates, and genotype results for 74 complete hydatidiform mole (CHM) samples. Alleles are given for the (+) strand on the specified NCBI sequence. Each file is a tab-delimited plain text table, with the following columns:

Column NameDescription
Refsnp_IDrefSNP rs number
Perlegen_IDPerlegen unique identifier for this SNP
ChromosomeChromosome: 01-22, X
Accession_IDNCBI Build 35 sequence accession number
Contig_positionPosition within the specified Build 35 sequence
AllelesThe SNP alleles, in arbitrary order
CHM001 ~ Haploid genotypes for the specified Japanese CHM sample identifier

Download : gen_data_all.zip (9.5 MB)
gen_data.zip (9.5 MB)


Annotation Data

Following two files are detail information of haplotype block and LD bin.
Each file includes four tab-delimited text files, which are...

File NameDescription
*_info.txtSummary of block/LD_bin
(Chromosome, block/bin ID, #SNP, #tagSNP, #Unambiguous Haplotype)
*_rs_info.txtSNP ID in each block/LD_bin
(Chromosome, block/bin ID, RS ID, Perlegen ID, position)
*_hap_info.txtFrequency of haplotype in each block/LD_bin
(Chromosome, block/bin ID, haplotype ID, frequency, haplotype)
*_tag_info.txttagSNPs in each block/LD_bin
(Chromosome, block/bin ID, name, Perlegen ID)

Download : LDbin.zip (7.9 MB)
block.zip (8.0 MB)

PML converted Data

We begin to provide our data in PML (Polymorphism Markup Language)
which is based on XML, to facilitate portability of our data on SNPs and other sequence variations.
The files are available as dhaplo_pml.tar (49.5 MB) which is devided into dhaplo_pml_snp.tar (32.2 MB),
dhaplo_pml_bin.tar (17.3 MB), header and readme files.