Academic Commons

Articles

Bos taurus genome assembly

Liu, Yue; Qin, Xiang; Song, Xing-Zhi; Jiang, Huaiyang; Shen, Yufeng; Durbin, K James; Lien, Sigbjørn; Kent, Matthew; Sodeland, Marte; Ren, Yanru; Zhang, Lan; Sodergren, Erica; Havlak, Paul; Worley, Kim; Weinstock, George; Gibbs, Richard

We present here the assembly of the bovine genome. The assembly method combines the BAC plus WGS local assembly used for the rat and sea urchin with the whole genome shotgun (WGS) only assembly used for many other animal genomes including the rhesus macaque. The assembly process consisted of multiple phases: First, BACs were assembled with BAC generated sequence, then subsequently in combination with the individual overlapping WGS reads. Different assembly parameters were tested to separately optimize the performance for each BAC assembly of the BAC and WGS reads. In parallel, a second assembly was produced using only the WGS sequences and a global whole genome assembly method. The two assemblies were combined to create a more complete genome representation that retained the high quality BAC-based local assembly information, but with gaps between BACs filled in with the WGS-only assembly. Finally, the entire assembly was placed on chromosomes using the available map information. Over 90% of the assembly is now placed on chromosomes. The estimated genome size is 2.87 Gb which represents a high degree of completeness, with 95% of the available EST sequences found in assembled contigs. The quality of the assembly was evaluated by comparison to 73 finished BACs, where the draft assembly covers between 92.5 and 100% (average 98.5%) of the finished BACs. The assembly contigs and scaffolds align linearly to the finished BACs, suggesting that misassemblies are rare. Genotyping and genetic mapping of 17,482 SNPs revealed that more than 99.2% were correctly positioned within the Btau_4.0 assembly, confirming the accuracy of the assembly. The biological analysis of this bovine genome assembly is being published, and the sequence data is available to support future bovine research.

Files

  • thumnail for ef030017e182d602f74844d6520435f5.zip ef030017e182d602f74844d6520435f5.zip binary/octet-stream 475 KB Download File
  • thumnail for 1471-2164-10-180-S1.DOC 1471-2164-10-180-S1.DOC binary/octet-stream 64.5 KB Download File
  • thumnail for 1471-2164-10-180-S3.DOC 1471-2164-10-180-S3.DOC binary/octet-stream 90 KB Download File
  • thumnail for 1471-2164-10-180-S2.DOC 1471-2164-10-180-S2.DOC binary/octet-stream 55 KB Download File
  • thumnail for 1471-2164-10-180-S5.DOC 1471-2164-10-180-S5.DOC binary/octet-stream 50.5 KB Download File
  • thumnail for 1471-2164-10-180-S4.DOC 1471-2164-10-180-S4.DOC binary/octet-stream 305 KB Download File
  • thumnail for 1471-2164-10-180-S6.DOC 1471-2164-10-180-S6.DOC binary/octet-stream 975 KB Download File

Also Published In

More About This Work

Academic Units
Biomedical Informatics
Publisher
BioMed Central
Published Here
September 8, 2014
Academic Commons provides global access to research and scholarship produced at Columbia University, Barnard College, Teachers College, Union Theological Seminary and Jewish Theological Seminary. Academic Commons is managed by the Columbia University Libraries.