Goat is an important source of milk, meat, and fiber, especially in developing countries. An advantage of goats as livestock is the low maintenance requirements and high adaptability compared to other milk producers. The global population of domestic goats exceeds 800 million. In Africa, goat production is characterized by low productivity levels, and attempts to introduce more productive breeds have met with poor success due in part to nutritional constraints. It has been suggested that incorporation of selective breeding within the herds adapted for survival could represent one approach to improving food security across Africa. A recently produced genome assembly of a Chinese Yunnan breed goat, based on 192 Gb of short reads across a range of insert sizes from 180 bp to 20 kb, reported a contig N50 of 18.7 kb. The scaffold N50 was improved from 2.2 Mb to 3.1 Mb by addition of fosmid end sequence, with an estimated 140 million Ns in gaps and 91% coverage. The assembly has proven somewhat problematic for pursuing genome-wide association analysis with SNP arrays, apparently due in part to errors in ordering of markers using the draft genome. In order to provide a higher quality assembly, we sequenced a highly inbred, San Clemente breed goat genome using 458 SMRT cells on the Pacific Biosciences platform. These cells generated 193.5 Gbases of sequence after processing into subreads, with mean 5110 bases and max subread length of 40.5 kb. This sequence data generated an assembly using the recently reported MHAP error correction approach and Celera Assembler v8.2. The contig N50 was 2.5 Mb, with the largest contig spanning 19.5 Mb. Additional characteristics of the assembly will be presented.
Many applications of high throughput sequencing rely on the availability of an accurate reference genome. Errors in the reference genome assembly increase the number of false-positives in downstream analyses. Recently, we have shown that over 33% of the current pig reference genome, Sscrofa10.2, is either misassembled or otherwise unreliable for genomic analyses. Additionally, ~10% of the bases in the assembly are Ns in gaps of an arbitrary size. Thousands of highly fragmented contigs remain unplaced and many genes are known to be missing from the assembly. Here we present a new assembly of the pig genome, Sscrofa11, assembled using 65X PacBio sequencing from T.J. Tabasco, the same Duroc sow used in the assembly of Sscrofa10.2. The PacBio reads were assembled using the Falcon assembly pipeline resulting in 3,206 contigs with an initial contig N50 of 14.5Mb. We used Sscrofa10.2 as a template to scaffold the PacBio contigs, under the assumption that its gross structure is correct, and used PBJelly to fill gaps. Additional gaps were filled using large, sequenced BACs from the original assembly. Following gap filling, the assembly has substantially improved contiguity and contains more sequence than the Sscrofa10.2 assembly. Arrow and Pilon were used to polish the assembly. The contig N50 is now 58.5Mb with 103 gaps remaining. By comparing regions of the two assemblies we show that regions with structural abnormalities we identified in Sscrofa10.2 are resolved in the new PacBio assembly.
The PacBio Iso-Seq method produces high-quality, full-length transcripts of up to 10 kb and longer and has been used to annotate many important plant and animal genomes. We describe here the full Iso-Seq ecosystem that enables researchers to achieve high-quality genome annotations. The Iso-Seq Express workflow is a 1-day protocol that requires only 60-300 ng of total RNA and supports multiplexing of different tissues. Sequencing on a single SMRT Cell 8M on the Sequel II System produces up to 4 million full-length reads, sufficient to exhaustively characterize a whole transcriptome on the order of 15,000-17,000 genes with 100,000 or more transcripts. Most importantly, the method is supported by a maturing suite of official and community-developed tools. The SMRT Link Iso-Seq application outputs high-quality (>99% accurate), full-length transcript sequences that can optionally be mapped to a reference genome for a single SMRT Cell worth of data in 6-9 hours. For example, the SQANTI2 tool classifies Iso-Seq transcripts against a reference annotation, filters potential library artifacts, and processes information from both long read-only and short read-based quantification. IsoPhase is a tool for identifying allele-specific isoform expression. Cogent has been used to process Iso-Seq transcripts in a genome-independent manner to assess genome assemblies. Finally, IsoAnnot is an up-and-coming tool for identifying differential isoform expression across different samples. We describe how these tools complement each other and provide guidelines to make the best use out of Iso-Seq data for understanding transcriptomes.
PacBio Sequencing is characterized by very long sequence reads (averaging > 10,000 bases), lack of GC-bias, and high consensus accuracy. These features have allowed the method to provide a new…
This video provides an overview of the techniques and steps of preparing samples, DNA, and libraries for PacBio Single Molecule, Real-Time (SMRT) Sequencing to be used in de novo assembly…
The Iso-Seq method enables the sequencing of transcript isoforms from the 5’ end to their poly-A tails, eliminating the need for transcript reconstruction and inference. This webinar provides a comprehensive…
The domestic pig (Sus scrofa) is important both as a food source and as a biomedical model with high anatomical and immunological similarity to humans. The draft reference genome (Sscrofa10.2) represented a purebred female pig from a commercial pork production breed (Duroc), and was established using older clone-based sequencing methods. The Sscrofa10.2 assembly was incomplete and unresolved redundancies, short range order and orientation errors and associated misassembled genes limited its utility. We present two highly contiguous chromosome-level genome assemblies created with more recent long read technologies and a whole genome shotgun strategy, one for the same Duroc female (Sscrofa11.1) and one for an outbred, composite breed male animal commonly used for commercial pork production (USMARCv1.0). Both assemblies are of substantially higher (>90-fold) continuity and accuracy compared to the earlier reference, and the availability of two independent assemblies provided an opportunity to identify large-scale variants and to error-check the accuracy of representation of the genome. We propose that the improved Duroc breed assembly (Sscrofa11.1) become the reference genome for genomic research in pigs.
Tigecycline is one of the last-resort antibiotics to treat complicated infections caused by both multidrug-resistant Gram-negative and Gram-positive bacteria1. Tigecycline resistance has sporadically occurred in recent years, primarily due to chromosome-encoding mechanisms, such as overexpression of efflux pumps and ribosome protection2,3. Here, we report the emergence of the plasmid-mediated mobile tigecycline resistance mechanism Tet(X4) in Escherichia coli isolates from China, which is capable of degrading all tetracyclines, including tigecycline and the US FDA newly approved eravacycline. The tet(X4)-harbouring IncQ1 plasmid is highly transferable, and can be successfully mobilized and stabilized in recipient clinical and laboratory strains of Enterobacteriaceae bacteria. It is noteworthy that tet(X4)-positive E.?coli strains, including isolates co-harbouring mcr-1, have been widely detected in pigs, chickens, soil and dust samples in China. In vivo murine models demonstrated that the presence of Tet(X4) led to tigecycline treatment failure. Consequently, the emergence of plasmid-mediated Tet(X4) challenges the clinical efficacy of the entire family of tetracycline antibiotics. Importantly, our study raises concern that the plasmid-mediated tigecycline resistance may further spread into various ecological niches and into clinical high-risk pathogens. Collective efforts are in urgent need to preserve the potency of these essential antibiotics.
Complete genome sequence of Bacillus velezensis JT3-1, a microbial germicide isolated from yak feces
Bacillus velezensis JT3-1 is a probiotic strain isolated from feces of the domestic yak (Bos grunniens) in the Gansu province of China. It has strong antagonistic activity against Listeria monocytogenes, Staphylococcus aureus, Escherichia coli, Salmonella Typhimurium, Mannheimia haemolytica, Staphylococcus hominis, Clostridium perfringens, and Mycoplasma bovis. These properties have made the JT3-1 strain the focus of commercial interest. In this study, we describe the complete genome sequence of JT3-1, with a genome size of 3,929,799 bp, 3761 encoded genes and an average GC content of 46.50%. Whole genome sequencing of Bacillus velezensis JT3-1 will lay a good foundation for elucidation of the mechanisms of its antimicrobial activity, and for its future application.
The ruminants are one of the most successful mammalian lineages, exhibiting morphological and habitat diversity and containing several key livestock species. To better understand their evolution, we generated and analyzed de novo assembled genomes of 44 ruminant species, representing all six Ruminantia families. We used these genomes to create a time-calibrated phylogeny to resolve topological controversies, overcoming the challenges of incomplete lineage sorting. Population dynamic analyses show that population declines commenced between 100,000 and 50,000 years ago, which is concomitant with expansion in human populations. We also reveal genes and regulatory elements that possibly contribute to the evolution of the digestive system, cranial appendages, immune system, metabolism, body size, cursorial locomotion, and dentition of the ruminants. Copyright © 2019 The Authors, some rights reserved; exclusive licensee American Association for the Advancement of Science. No claim to original U.S. Government Works.
Increased prevalence of Escherichia coli strains from food carrying blaNDM and mcr-1-bearing plasmids that structurally resemble those of clinical strains, China, 2015 to 2017.
Introduction: Emergence of resistance determinants of blaNDM and mcr-1 has undermined the antimicrobial effectiveness of the last line drugs carbapenems and colistin. Aim: This work aimed to assess the prevalence of blaNDM and mcr-1 in E. coli strains collected from food in Shenzhen, China, during the period 2015 to 2017. Methods: Multidrug-resistant E. coli strains were isolated from food samples. Plasmids encoding mcr-1 or blaNDM genes were characterised and compared with plasmids found in clinical isolates.ResultsAmong 1,166 non-repeated cephalosporin-resistant E. coli strains isolated from 2,147 food samples, 390 and 42, respectively, were resistant to colistin and meropenem, with five strains being resistant to both agents. The rate of resistance to colistin increased significantly (p?0.01) from 26% in 2015 to 46% in 2017, and that of meropenem resistance also increased sharply from 0.3% in 2015 to 17% in 2017 (p?0.01). All meropenem-resistant strains carried a plasmid-borne blaNDM gene. Among the colistin-resistant strains, three types of mcr-1-bearing plasmids were determined. Plasmid sequencing indicated that these mcr-1 and blaNDM-bearing plasmids were structurally similar to those commonly recovered from clinical isolates. Interestingly, both mcr-1-bearing and blaNDM-bearing plasmids were transferrable to E. coli strain J53 under selection by meropenem, yet only mcr-1-bearing plasmids were transferrable under colistin selection. Conclusion: These findings might suggest that mobile elements harbouring mcr-1 and blaNDM have been acquired by animal strains and transmitted to our food products, highlighting a need to prevent a spike in the rate of drug resistant food-borne infections.
Conjugal Transfer, Whole-Genome Sequencing, and Plasmid Analysis of Four mcr-1-Bearing Isolates from U.S. Patients.
Four Enterobacteriaceae clinical isolates bearing mcr-1 gene-harboring plasmids were characterized. All isolates demonstrated the ability to transfer colistin resistance to Escherichia coli; plasmids were stable in conjugants after multiple passages on nonselective media. mcr-1 was located on an IncX4 (n?=?3) or IncN (n?=?1) plasmid. The IncN plasmid harbored 13 additional antimicrobial resistance genes. Results indicate that the mcr-1-bearing plasmids in this study were highly transferable in vitro and stable in the recipients.This is a work of the U.S. Government and is not subject to copyright protection in the United States. Foreign copyrights may apply.
Detection of VIM-1-Producing Enterobacter cloacae and Salmonella enterica Serovars Infantis and Goldcoast at a Breeding Pig Farm in Germany in 2017 and Their Molecular Relationship to Former VIM-1-Producing S. Infantis Isolates in German Livestock Production.
In 2011, VIM-1-producing Salmonella enterica serovar Infantis and Escherichia coli were isolated for the first time in four German livestock farms. In 2015/2016, highly related isolates were identified in German pig production. This raised the issue of potential reservoirs for these isolates, the relation of their mobile genetic elements, and potential links between the different affected farms/facilities. In a piglet-producing farm suspicious for being linked to some blaVIM-1 findings in Germany, fecal and environmental samples were examined for the presence of carbapenemase-producing Enterobacteriaceae and Salmonella spp. Newly discovered isolates were subjected to Illumina whole-genome sequencing (WGS) and S1 pulsed-field gel electrophoresis (PFGE) hybridization experiments. WGS data of these isolates were compared with those for the previously isolated VIM-1-producing Salmonella Infantis isolates from pigs and poultry. Among 103 samples, one Salmonella Goldcoast isolate, one Salmonella Infantis isolate, and one Enterobacter cloacae isolate carrying the blaVIM-1 gene were detected. Comparative WGS analysis revealed that the blaVIM-1 gene was part of a particular Tn21-like transposable element in all isolates. It was located on IncHI2 (ST1) plasmids of ~290 to 300?kb with a backbone highly similar (98 to 100%) to that of reference pSE15-SA01028. SNP analysis revealed a close relationship of all VIM-1-positive S Infantis isolates described since 2011. The findings of this study demonstrate that the occurrence of the blaVIM-1 gene in German livestock is restricted neither to a certain bacterial species nor to a certain Salmonella serovar but is linked to a particular Tn21-like transposable element located on transferable pSE15-SA01028-like IncHI2 (ST1) plasmids, being present in all of the investigated isolates from 2011 to 2017.IMPORTANCE Carbapenems are considered one of few remaining treatment options against multidrug-resistant Gram-negative pathogens in human clinical settings. The occurrence of carbapenemase-producing Enterobacteriaceae in livestock and food is a major public health concern. Particularly the occurrence of VIM-1-producing Salmonella Infantis in livestock farms is worrisome, as this zoonotic pathogen is one of the main causes for human salmonellosis in Europe. Investigations on the epidemiology of those carbapenemase-producing isolates and associated mobile genetic elements through an in-depth molecular characterization are indispensable to understand the transmission of carbapenemase-producing Enterobacteriaceae along the food chain and between different populations to develop strategies to prevent their further spread.Copyright © 2019 Roschanski et al.
The pig is a well-studied model animal of biomedical and agricultural importance. Genes of this species, Sus scrofa, are known from experiments and predictions, and collected at the NCBI reference sequence database section. Gene reconstruction from transcribed gene evidence of RNA-seq now can accurately and completely reproduce the biological gene sets of animals and plants. Such a gene set for the pig is reported here, including human orthologs missing from current NCBI and Ensembl reference pig gene sets, additional alternate transcripts, and other improvements. Methodology for accurate and complete gene set reconstruction from RNA is used: the automated SRA2Genes pipeline of EvidentialGene project.
A whole genome scan of SNP data suggests a lack of abundant hard selective sweeps in the genome of the broad host range plant pathogenic fungus Sclerotinia sclerotiorum.
The pathogenic fungus Sclerotinia sclerotiorum infects over 600 species of plant. It is present in numerous environments throughout the world and causes significant damage to many agricultural crops. Fragmentation and lack of gene flow between populations may lead to population sub-structure. Within discrete recombining populations, positive selection may lead to a ‘selective sweep’. This is characterised by an increase in frequency of a favourable allele leading to reduction in genotypic diversity in a localised genomic region due to the phenomenon of genetic hitchhiking. We aimed to assess whether isolates of S. sclerotiorum from around the world formed genotypic clusters associated with geographical origin and to determine whether signatures of population-specific positive selection could be detected. To do this, we sequenced the genomes of 25 isolates of S. sclerotiorum collected from four different continents-Australia, Africa (north and south), Europe and North America (Canada and the northen United States) and conducted SNP based analyses of population structure and selective sweeps. Among the 25 isolates, there was evidence for two major population clusters. One of these consisted of 11 isolates from Canada, the USA and France (population 1), and the other consisted of nine isolates from Australia and one from Morocco (population 2). The rest of the isolates were genotypic outliers. We found that there was evidence of outcrossing in these two populations based on linkage disequilibrium decay. However, only a single candidate selective sweep was observed, and it was present in population 2. This sweep was close to a Major Facilitator Superfamily transporter gene, and we speculate that this gene may have a role in nutrient uptake from the host. The low abundance of selective sweeps in the S. sclerotiorum genome contrasts the numerous examples in the genomes of other fungal pathogens. This may be a result of its slow rate of evolution and low effective recombination rate due to self-fertilisation and vegetative reproduction.