Goat is an important source of milk, meat, and fiber, especially in developing countries. An advantage of goats as livestock is the low maintenance requirements and high adaptability compared to other milk producers. The global population of domestic goats exceeds 800 million. In Africa, goat production is characterized by low productivity levels, and attempts to introduce more productive breeds have met with poor success due in part to nutritional constraints. It has been suggested that incorporation of selective breeding within the herds adapted for survival could represent one approach to improving food security across Africa. A recently produced genome assembly of a Chinese Yunnan breed goat, based on 192 Gb of short reads across a range of insert sizes from 180 bp to 20 kb, reported a contig N50 of 18.7 kb. The scaffold N50 was improved from 2.2 Mb to 3.1 Mb by addition of fosmid end sequence, with an estimated 140 million Ns in gaps and 91% coverage. The assembly has proven somewhat problematic for pursuing genome-wide association analysis with SNP arrays, apparently due in part to errors in ordering of markers using the draft genome. In order to provide a higher quality assembly, we sequenced a highly inbred, San Clemente breed goat genome using 458 SMRT cells on the Pacific Biosciences platform. These cells generated 193.5 Gbases of sequence after processing into subreads, with mean 5110 bases and max subread length of 40.5 kb. This sequence data generated an assembly using the recently reported MHAP error correction approach and Celera Assembler v8.2. The contig N50 was 2.5 Mb, with the largest contig spanning 19.5 Mb. Additional characteristics of the assembly will be presented.
Goats are specialized in dairy, meat and fiber production, being adapted to a wide range of environmental conditions and having a large economic impact in developing countries. In the last years, there have been dramatic advances in the knowledge of the structure and diversity of the goat genome/transcriptome and in the development of genomic tools, rapidly narrowing the gap between goat and related species such as cattle and sheep. Major advances are: 1) publication of a de novo goat genome reference sequence; 2) Development of whole genome high density RH maps, and; 3) Design of a commercial 50K SNP array. Moreover, there are currently several projects aiming at improving current genomic tools and resources. An improved assembly of the goat genome using PacBio reads is being produced, and the design of new SNP arrays is being studied to accommodate the specific needs of this species in the context of very large scale genotyping projects (i.e. breed characterization at an international scale and genomic selection) and parentage analysis. As in other species, the focus has now turned to the identification of causative mutations underlying the phenotypic variation of traits. In addition, since 2014, the ADAPTmap project (www.goatadaptmap.org) has gathered data to explore the diversity of caprine populations at a worldwide scale by using a wide variety of approaches and data.
From Sequencing to Chromosomes: New de novo assembly and scaffolding methods improve the goat reference genome
Single-molecule sequencing is now routinely used to assemble complete, high-quality microbial genomes, but these assembly methods have not scaled well to large genomes. To address this problem, we previously introduced the MinHash Alignment Process (MHAP) for overlapping single-molecule reads using probabilistic, locality-sensitive hashing. Integrating MHAP with Celera Assembler (CA) has enabled reference-grade assemblies of model organisms, revealing novel heterochromatic sequences and filling low-complexity gap sequences in the GRCh38 human reference genome. We have applied our methods to assemble the San Clemente goat genome. Combining single-molecule sequencing from Pacific Biosciences and BioNano Genomics generates and assembly that is over 150-fold more contiguous than the latest Capra hircus reference. In combination with Hi-C sequencing, the assembly surpasses reference assemblies, de novo, with minimal manual intervention. The autosomes are each assembled into a single scaffold. Our assembly provides a more complete gene reconstruction, better alignments with Goat 52k chip, and improved allosome reconstruction. In addition to providing increased continuity of sequence, our assembly achieves a higher BUSCO completion score (84%) than the existing goat reference assembly suggesting better quality annotation of gene models. Our results demonstrate that single-molecule sequencing can produce near-complete eukaryotic genomes at modest cost and minimal manual effort.
Reference quality de novo genome assemblies were once solely the domain of large, well-funded genome projects. While next-generation short read technology removed some of the cost barriers, accurate chromosome-scale assembly remains a real challenge. Here we present efforts to de novo assemble the goat (Capra hircus) genome. Through the combination of single-molecule technologies from Pacific Biosciences (sequencing) and BioNano Genomics (optical mapping) coupled with high-throughput chromosome conformation capture sequencing (Hi-C), an inbred San Clemente goat genome has been sequenced and assembled to a high degree of completeness at a relatively modest cost. Starting with 38 million PacBio reads, we integrated the MinHash Alignment Process (MHAP) with the Celera Assembler (CA) to produce an assembly composed of 3110 contigs with a contig N50 size of 4.7 Mb. This assembly was scaffolded with BioNano genome maps derived from a single IrysChip into 333 scaffolds with an N50 of 23.1 Mb including the complete scaffolding of chromosome 20. Finally, cis-chromosome associations were determined by Hi-C, yielding complete reconstruction of all autosomes into single scaffolds with a final N50 of 91.7 Mb. We hope to demonstrate that our methods are not only cost effective, but improve our ability to annotate challenging genomic regions such as highly repetitive immune gene clusters.
PacBio Sequencing is characterized by very long sequence reads (averaging > 10,000 bases), lack of GC-bias, and high consensus accuracy. These features have allowed the method to provide a new…
Background Assemblies of diploid genomes are generally unphased, pseudo-haploid representations that do not correctly reconstruct the two parental haplotypes present in the individual sequenced. Instead, the assembly alternates between parental haplotypes and may contain duplications in regions where the parental haplotypes are sufficiently different. Trio binning is an approach to genome assembly that uses short reads from both parents to classify long reads from the offspring according to maternal or paternal haplotype origin, and is thus helped rather than impeded by heterozygosity. Using this approach, it is possible to derive two assemblies from an individual, accurately representing both parental contributions in their entirety with higher continuity and accuracy than is possible with other methods.Results We used trio binning to assemble reference genomes for two species from a single individual using an interspecies cross of yak (Bos grunniens) and cattle (Bos taurus). The high heterozygosity inherent to interspecies hybrids allowed us to confidently assign >99% of long reads from the F1 offspring to parental bins using unique k-mers from parental short reads. Both the maternal (yak) and paternal (cattle) assemblies contain over one third of the acrocentric chromosomes, including the two largest chromosomes, in single haplotigs.Conclusions These haplotigs are the first vertebrate chromosome arms to be assembled gap-free and fully phased, and the first time assemblies for two species have been created from a single individual. Both assemblies are the most continuous currently available for non-model vertebrates.MbmegabaseskbkilobasesMYAmillions of years agoMHCmajor histocompatibility complexSMRTsingle molecule real time
Complete genome sequence of Bacillus velezensis JT3-1, a microbial germicide isolated from yak feces
Bacillus velezensis JT3-1 is a probiotic strain isolated from feces of the domestic yak (Bos grunniens) in the Gansu province of China. It has strong antagonistic activity against Listeria monocytogenes, Staphylococcus aureus, Escherichia coli, Salmonella Typhimurium, Mannheimia haemolytica, Staphylococcus hominis, Clostridium perfringens, and Mycoplasma bovis. These properties have made the JT3-1 strain the focus of commercial interest. In this study, we describe the complete genome sequence of JT3-1, with a genome size of 3,929,799 bp, 3761 encoded genes and an average GC content of 46.50%. Whole genome sequencing of Bacillus velezensis JT3-1 will lay a good foundation for elucidation of the mechanisms of its antimicrobial activity, and for its future application.
The ruminants are one of the most successful mammalian lineages, exhibiting morphological and habitat diversity and containing several key livestock species. To better understand their evolution, we generated and analyzed de novo assembled genomes of 44 ruminant species, representing all six Ruminantia families. We used these genomes to create a time-calibrated phylogeny to resolve topological controversies, overcoming the challenges of incomplete lineage sorting. Population dynamic analyses show that population declines commenced between 100,000 and 50,000 years ago, which is concomitant with expansion in human populations. We also reveal genes and regulatory elements that possibly contribute to the evolution of the digestive system, cranial appendages, immune system, metabolism, body size, cursorial locomotion, and dentition of the ruminants. Copyright © 2019 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.
Whole-Genome Sequencing of a Brucella melitensis Strain (BMWS93) Isolated from a Bank Clerk and Exhibiting Complete Resistance to Rifampin.
Human brucellosis has become the most severe public health problem in the Ulanqab region of Inner Mongolia, China. Brucella melitensis BMWS93 was obtained from a blood sample taken from a bank clerk in the Ulanqab region of Inner Mongolia, China, and antimicrobial susceptibility testing in vitro showed no zone of inhibition, which confirmed resistance to rifampin. Therefore, whole-genome sequencing of this isolate was performed to better understand the mechanism of this resistance.Copyright © 2019 Liu et al.
Using bacteria to transform reactive corrosion products into stable compounds represents an alternative to traditional methods employed in iron conservation. Two environmental Aeromonas strains (CA23 and CU5) were used to transform ferric iron corrosion products (goethite and lepidocrocite) into stable ferrous iron-bearing minerals (vivianite and siderite). A genomic and transcriptomic approach was used to analyze the metabolic traits of these strains and to evaluate their pathogenic potential. Although genes involved in solid-phase iron reduction were identified, key genes present in other environmental iron-reducing species are missing from the genome of CU5. Several pathogenicity factors were identified in the genomes of both strains, but none of these was expressed under iron reduction conditions. Additional in vivo tests showed hemolytic and cytotoxic activities for strain CA23 but not for strain CU5. Both strains were easily inactivated using ethanol and heat. Nonetheless, given a lesser potential for a pathogenic lifestyle, CU5 is the most promising candidate for the development of a bio-based iron conservation method stabilizing iron corrosion. Based on all the results, a prototype treatment was established using archaeological items. On those, the conversion of reactive corrosion products and the formation of a homogenous layer of biogenic iron minerals were achieved. This study shows how naturally occurring microorganisms and their metabolic capabilities can be used to develop bio-inspired solutions to the problem of metal corrosion.IMPORTANCE Microbiology can greatly help in the quest for a sustainable solution to the problem of iron corrosion, which causes important economic losses in a wide range of fields, including the protection of cultural heritage and building materials. Using bacteria to transform reactive and unstable corrosion products into more-stable compounds represents a promising approach. The overall aim of this study was to develop a method for the conservation and restoration of corroded iron items, starting from the isolation of iron-reducing bacteria from natural environments. This resulted in the identification of a suitable candidate (Aeromonas sp. strain CU5) that mediates the formation of desirable minerals at the surfaces of the objects. This led to the proof of concept of an application method on real objects.Copyright © 2019 Kooli et al.
Intercellular communication is required for trap formation in the nematode-trapping fungus Duddingtonia flagrans.
Nematode-trapping fungi (NTF) are a large and diverse group of fungi, which may switch from a saprotrophic to a predatory lifestyle if nematodes are present. Different fungi have developed different trapping devices, ranging from adhesive cells to constricting rings. After trapping, fungal hyphae penetrate the worm, secrete lytic enzymes and form a hyphal network inside the body. We sequenced the genome of Duddingtonia flagrans, a biotechnologically important NTF used to control nematode populations in fields. The 36.64 Mb genome encodes 9,927 putative proteins, among which are more than 638 predicted secreted proteins. Most secreted proteins are lytic enzymes, but more than 200 were classified as small secreted proteins (< 300 amino acids). 117 putative effector proteins were predicted, suggesting interkingdom communication during the colonization. As a first step to analyze the function of such proteins or other phenomena at the molecular level, we developed a transformation system, established the fluorescent proteins GFP and mCherry, adapted an assay to monitor protein secretion, and established gene-deletion protocols using homologous recombination or CRISPR/Cas9. One putative virulence effector protein, PefB, was transcriptionally induced during the interaction. We show that the mature protein is able to be imported into nuclei in Caenorhabditis elegans cells. In addition, we studied trap formation and show that cell-to-cell communication is required for ring closure. The availability of the genome sequence and the establishment of many molecular tools will open new avenues to studying this biotechnologically relevant nematode-trapping fungus.
Complete Genome Sequence of a Sequence Type 4846 Streptococcus pneumoniae Serotype 12F Strain Isolated from a Meningitis Case in Japan
Streptococcus pneumoniae serotype 12F rarely colonizes the nasopharynx but commonly causes invasive pneumococcal disease. Here, we report the complete genome sequence of a sequence type 4846 (ST4846) S. pneumoniae serotype 12F strain isolated from a cluster of invasive pneumococcal disease patients in Japan.
Complete Genome Sequence of the Telford Type S Strain of Mycobacterium avium subsp. paratuberculosis
Mycobacterium avium subsp. paratuberculosis is the causative agent of Johnetextquoterights disease (JD). Here, we report the complete genome sequence of Telford 9.2, a well-characterized representative strain of the M. avium subsp. paratuberculosis S subtype that is endemic in New Zealand and Australian sheep.
Streptococcus periodonticum sp. nov., Isolated from Human Subgingival Dental Plaque of Periodontitis Lesion.
A novel facultative anaerobic and Gram-stain-positive coccus, designated strain ChDC F135T, was isolated from human subgingival dental plaque of periodontitis lesion and was characterized by polyphasic taxonomic analysis. The 16S rRNA gene (16S rDNA) sequence of strain ChDC F135T was closest to that of Streptococcus sinensis HKU4T (98.2%), followed by Streptococcus intermedia SK54T (97.0%), Streptococcus constellatus NCTC11325T (96.0%), and Streptococcus anginosus NCTC 10713T (95.7%). In contrast, phylogenetic analysis based on the superoxide dismutase gene (sodA) and the RNA polymerase beta-subunit gene (rpoB) showed that the nucleotide sequence similarities of strain ChDC F135T were highly similar to the corresponding genes of S. anginosus NCTC 10713T (99.2% and 97.6%, respectively), S. constellatus NCTC11325T (87.8% and 91.4%, respectively), and S. intermedia SK54T (85.8% and 91.2%, respectively) rather than those of S. sinensis HKU4T (80.5% and 82.6%). The complete genome of strain ChDC F135T consisted of 1,901,251 bp and the G+C content was 38.9 mol %. Average nucleotide identity value between strain ChDC F135T and S. sinensis HKU4T or S. anginosus NCTC 10713T were 75.7% and 95.6%, respectively. The C14:0 composition of the cellular fatty acids of strain ChDC F135T (32.8%) was different from that of S. intermedia (6-8%), S. constellatus (6-13%), and S. anginosus (13-20%). Based on the results of phylogenetic and phenotypic analysis, strain ChDC F135T (=?KCOM 2412T?=?JCM 33300T) was classified as a type strain of a novel species of the genus Streptococcus, for which we proposed the name Streptococcus periodonticum sp. nov.
A novel facultative anaerobic, Gram-stain-negative coccus, designated strain ChDC B345T, was isolated from human pericoronitis lesion and was characterized by polyphasic taxonomic analysis. The 16S ribosomal RNA gene (16S rDNA) sequence revealed that the strain belonged to the genus Streptococcus. The 16S rDNA sequence of strain ChDC B345T was most closely related to those of Streptococcus mitis NCTC 12261T (99.5%) and Streptococcus pseudopneumoniae ATCC BAA-960T (99.5%). Complete genome of strain ChDC B345T was 1,972,471 bp in length and the G?+?C content was 40.2 mol%. Average nucleotide identity values between strain ChDC B345T and S. pseudopneumoniae ATCC BAA-960T or S. mitis NCTC 12261T were 92.17% and 93.63%, respectively. Genome-to-genome distance values between strain ChDC B345T and S. pseudopneumoniae ATCC BAA-960T or S. mitis NCTC 12261T were 47.8% (45.2-50.4%) and 53.0% (51.0-56.4%), respectively. Based on these results, strain ChDC B345T (=?KCOM 1679T?=?JCM 33299T) should be classified as a novel species of genus Streptococcus, for which we propose the name Streptococcus gwangjuense sp. nov.