Whole-genome sequencing data of Kazakh individuals

Kairov, Ulykbek; Molkenov, Askhat; Rakhimova, Saule; Kozhamkulov, Ulan; Sharip, Aigul; Karabayev, Daniyar; Daniyarov, Asset; H.Lee, Joseph; D.Terwilliger, Joseph; Akilzhanova, Ainur; Zhumadilov, Zhaxybay

Kazakhstan is a Central Asian crossroad of European and Asian populations situated along the way of the Great Silk Way. The territory of Kazakhstan has historically been inhabited by nomadic tribes and today is the multi-ethnic country with the dominant Kazakh ethnic group. We sequenced and analyzed the whole-genomes of five ethnic healthy Kazakh individuals with high coverage using next-generation sequencing platform. This whole-genome sequence data of healthy Kazakh individuals can be a valuable reference for biomedical studies investigating disease associations and population-wide genomic studies of ethnically diverse Central Asian region.

Data description
Blood samples have been collected from five ethnic healthy Kazakh individuals living in Kazakhstan. The genomic DNA was extracted from blood and sequenced. Sequencing was performed on Illumina HiSeq2000 next-generation sequencing platform. We sequenced and analyzed the whole-genomes of ethnic Kazakh individuals with the coverage ranging from 26 to 32X. Ranging from 98.85 to 99.58% base pairs were totally mapped and aligned on the human reference genome GRCh37 hg19. Het/Hom and Ts/Tv ratios for each whole genome ranged from 1.35 to 1.49 and from 2.07 to 2.08, respectively. Sequencing data are available in the National Center for Biotechnology Information SRA database under the accession number PRJNA374772.

Geographic Areas


  • thumnail for 13104_2021_Article_5464.pdf 13104_2021_Article_5464.pdf application/pdf 141 KB Download File

Also Published In

BMC Research Notes

More About This Work

Published Here
September 22, 2023


Whole genome, Next-generation sequencing, Kazakh ethnicity, Bioinformatics analysis, Population genomics, Genome annotation, Biomedical research, Kazakhstan