Menu
September 22, 2019

PacBio sequencing and its applications.

Single-molecule, real-time sequencing developed by Pacific BioSciences offers longer read lengths than the second-generation sequencing (SGS) technologies, making it well-suited for unsolved problems in genome, transcriptome, and epigenetics research. The highly-contiguous de novo assemblies using PacBio sequencing can close gaps in current reference assemblies and characterize structural variation (SV) in personal genomes. With longer reads, we can sequence through extended repetitive regions and detect mutations, many of which are associated with diseases. Moreover, PacBio transcriptome sequencing is advantageous for the identification of gene isoforms and facilitates reliable discoveries of novel genes and novel isoforms of annotated genes, due to its ability to sequence full-length transcripts or fragments with significant lengths. Additionally, PacBio’s sequencing technique provides information that is useful for the direct detection of base modifications, such as methylation. In addition to using PacBio sequencing alone, many hybrid sequencing strategies have been developed to make use of more accurate short reads in conjunction with PacBio long reads. In general, hybrid sequencing strategies are more affordable and scalable especially for small-size laboratories than using PacBio Sequencing alone. The advent of PacBio sequencing has made available much information that could not be obtained via SGS alone. Copyright © 2015 The Authors. Production and hosting by Elsevier Ltd.. All rights reserved.


September 22, 2019

Sequence of the sugar pine megagenome.

Until very recently, complete characterization of the megagenomes of conifers has remained elusive. The diploid genome of sugar pine (Pinus lambertiana Dougl.) has a highly repetitive, 31 billion bp genome. It is the largest genome sequenced and assembled to date, and the first from the subgenus Strobus, or white pines, a group that is notable for having the largest genomes among the pines. The genome represents a unique opportunity to investigate genome “obesity” in conifers and white pines. Comparative analysis of P. lambertiana and P. taeda L. reveals new insights on the conservation, age, and diversity of the highly abundant transposable elements, the primary factor determining genome size. Like most North American white pines, the principal pathogen of P. lambertiana is white pine blister rust (Cronartium ribicola J.C. Fischer ex Raben.). Identification of candidate genes for resistance to this pathogen is of great ecological importance. The genome sequence afforded us the opportunity to make substantial progress on locating the major dominant gene for simple resistance hypersensitive response, Cr1 We describe new markers and gene annotation that are both tightly linked to Cr1 in a mapping population, and associated with Cr1 in unrelated sugar pine individuals sampled throughout the species’ range, creating a solid foundation for future mapping. This genomic variation and annotated candidate genes characterized in our study of the Cr1 region are resources for future marker-assisted breeding efforts as well as for investigations of fundamental mechanisms of invasive disease and evolutionary response. Copyright © 2016 by the Genetics Society of America.


September 22, 2019

Transcriptome profiling using Illumina- and SMRT-based RNA-seq of hot pepper for in-depth understanding of genes involved in CMV infection.

Hot pepper (Capsicum annuum L.) is becoming an increasingly important vegetable crop in the world. Cucumber mosaic virus (CMV) is a destructive virus that can cause leaf distortion and fruit lesions, affecting pepper production. However, studies on the response to CMV infection in pepper at the transcriptional level are limited. In this study, the transcript profiles of pepper leaves after CMV infection were investigated using Illumina and single-molecule real-time (SMRT) RNA-sequencing (RNA-seq). A total of 2143 differentially expressed genes (DEGs) were identified at five different stages. Gene ontology (GO) and KEGG analysis revealed that these DEGs were involved in the response to stress, defense response and plant-pathogen interaction pathways. Among these DEGs, several key genes that consistently appeared in studies of plant-pathogen interactions had increased transcript abundance after inoculation, including chitinase, pathogenesis-related (PR) protein, TMV resistance protein, WRKY transcription factor and jasmonate ZIM-domain protein. Four of these DEGs were further validated by quantitative real-time RT-PCR (qRT-PCR). Furthermore, a total of 73, 597 alternative splicing (AS) events were identified in the pepper leaves after CMV infection, distributed in 12, 615 genes. The intron retention of WRKY33 (Capana09g001251) might be involved in the regulation of CMV infection. Taken together, our study provides a transcriptome-wide insight into the molecular basis of resistance to CMV infection in pepper leaves and potential candidate genes for improving resistance cultivars. Copyright © 2018 Elsevier B.V. All rights reserved.


September 22, 2019

Genome-wide analysis of complex wheat gliadins, the dominant carriers of celiac disease epitopes.

Gliadins, specified by six compound chromosomal loci (Gli-A1/B1/D1 and Gli-A2/B2/D2) in hexaploid bread wheat, are the dominant carriers of celiac disease (CD) epitopes. Because of their complexity, genome-wide characterization of gliadins is a strong challenge. Here, we approached this challenge by combining transcriptomic, proteomic and bioinformatic investigations. Through third-generation RNA sequencing, full-length transcripts were identified for 52 gliadin genes in the bread wheat cultivar Xiaoyan 81. Of them, 42 were active and predicted to encode 25 a-, 11 ?-, one d- and five ?-gliadins. Comparative proteomic analysis between Xiaoyan 81 and six newly-developed mutants each lacking one Gli locus indicated the accumulation of 38 gliadins in the mature grains. A novel group of a-gliadins (the CSTT group) was recognized to contain very few or no CD epitopes. The d-gliadins identified here or previously did not carry CD epitopes. Finally, the mutant lacking Gli-D2 showed significant reductions in the most celiac-toxic a-gliadins and derivative CD epitopes. The insights and resources generated here should aid further studies on gliadin functions in CD and the breeding of healthier wheat.


September 22, 2019

Characteristics of ARG-carrying plasmidome in the cultivable microbial community from wastewater treatment system under high oxytetracycline concentration.

Studies on antibiotic production wastewater have shown that even a single antibiotic can select for multidrug resistant bacteria in aquatic environments. It is speculated that plasmids are an important mechanism of multidrug resistance (MDR) under high concentrations of antibiotics. Herein, two metagenomic libraries were constructed with plasmid DNA extracted from cultivable microbial communities in a biological wastewater treatment reactor supplemented with 0 (CONTROL) or 25 mg/L of oxytetracycline (OTC-25). The OTC-25 plasmidome reads were assigned to 72 antibiotic resistance genes (ARGs) conferring resistance to 13 types of antibiotics. Dominant ARGs, encoding resistance to tetracycline, aminoglycoside, sulfonamide, and multidrug resistance genes, were enriched in the plasmidome under 25 mg/L of oxytetracycline. Furthermore, 17 contiguous multiple-ARG carrying contigs (carrying =?2 ARGs) were discovered in the OTC-25 plasmidome, whereas only nine were found in the CONTROL. Mapping of the OTC-25 plasmidome reads to completely sequenced plasmids revealed that the conjugative IncU resistance plasmid pFBAOT6 of Aeromonas caviae, carrying multidrug resistance transporter (pecM), tetracycline resistance genes (tetA, tetR), and transposase genes, might be a potential prevalent resistant plasmid in the OTC-25 plasmidome. Additionally, two novel resistant plasmids (containing contig C301682 carrying multidrug resistant operon mexCD-oprJ and contig C301632 carrying the tet36 and transposases genes) might also be potential prevalent resistant plasmids in the OTC-25 plasmidome. This study will be helpful to better understand the role of plasmids in the development of MDR in water environments under high antibiotic concentrations.


September 22, 2019

Diverse antibiotic resistance genes in dairy cow manure.

Application of manure from antibiotic-treated animals to crops facilitates the dissemination of antibiotic resistance determinants into the environment. However, our knowledge of the identity, diversity, and patterns of distribution of these antibiotic resistance determinants remains limited. We used a new combination of methods to examine the resistome of dairy cow manure, a common soil amendment. Metagenomic libraries constructed with DNA extracted from manure were screened for resistance to beta-lactams, phenicols, aminoglycosides, and tetracyclines. Functional screening of fosmid and small-insert libraries identified 80 different antibiotic resistance genes whose deduced protein sequences were on average 50 to 60% identical to sequences deposited in GenBank. The resistance genes were frequently found in clusters and originated from a taxonomically diverse set of species, suggesting that some microorganisms in manure harbor multiple resistance genes. Furthermore, amid the great genetic diversity in manure, we discovered a novel clade of chloramphenicol acetyltransferases. Our study combined functional metagenomics with third-generation PacBio sequencing to significantly extend the roster of functional antibiotic resistance genes found in animal gut bacteria, providing a particularly broad resource for understanding the origins and dispersal of antibiotic resistance genes in agriculture and clinical settings. IMPORTANCE The increasing prevalence of antibiotic resistance among bacteria is one of the most intractable challenges in 21st-century public health. The origins of resistance are complex, and a better understanding of the impacts of antibiotics used on farms would produce a more robust platform for public policy. Microbiomes of farm animals are reservoirs of antibiotic resistance genes, which may affect distribution of antibiotic resistance genes in human pathogens. Previous studies have focused on antibiotic resistance genes in manures of animals subjected to intensive antibiotic use, such as pigs and chickens. Cow manure has received less attention, although it is commonly used in crop production. Here, we report the discovery of novel and diverse antibiotic resistance genes in the cow microbiome, demonstrating that it is a significant reservoir of antibiotic resistance genes. The genomic resource presented here lays the groundwork for understanding the dispersal of antibiotic resistance from the agroecosystem to other settings.


September 22, 2019

Contemporary evolution of a Lepidopteran species, Heliothis virescens, in response to modern agricultural practices.

Adaptation to human-induced environmental change has the potential to profoundly influence the genomic architecture of affected species. This is particularly true in agricultural ecosystems, where anthropogenic selection pressure is strong. Heliothis virescens primarily feeds on cotton in its larval stages, and US populations have been declining since the widespread planting of transgenic cotton, which endogenously expresses proteins derived from Bacillus thuringiensis (Bt). No physiological adaptation to Bt toxin has been found in the field, so adaptation in this altered environment could involve (i) shifts in host plant selection mechanisms to avoid cotton, (ii) changes in detoxification mechanisms required for cotton-feeding vs. feeding on other hosts or (iii) loss of resistance to previously used management practices including insecticides. Here, we begin to address whether such changes occurred in H. virescens populations between 1997 and 2012, as Bt-cotton cultivation spread through the agricultural landscape. For our study, we produced an H. virescens genome assembly and used this in concert with a ddRAD-seq-enabled genome scan to identify loci with significant allele frequency changes over the 15-year period. Genetic changes at a previously described H. virescens insecticide target of selection were detectable in our genome scan and increased our confidence in this methodology. Additional loci were also detected as being under selection, and we quantified the selection strength required to elicit observed allele frequency changes at each locus. Potential contributions of genes near loci under selection to adaptive phenotypes in the H. virescens cotton system are discussed.© 2017 John Wiley & Sons Ltd.


September 22, 2019

Pacbio sequencing of copper-tolerant Xanthomonas citri reveals presence of a chimeric plasmid structure and provides insights into reassortment and shuffling of transcription activator-like effectors among X. citri strains.

Xanthomonas citri, a causal agent of citrus canker, has been a well-studied model system due to recent availability of whole genome sequences of multiple strains from different geographical regions. Major limitations in our understanding of the evolution of pathogenicity factors in X. citri strains sequenced by short-read sequencing methods have been tracking plasmid reshuffling among strains due to inability to accurately assign reads to plasmids, and analyzing repeat regions among strains. X. citri harbors major pathogenicity determinants, including variable DNA-binding repeat region containing Transcription Activator-like Effectors (TALEs) on plasmids. The long-read sequencing method, PacBio, has allowed the ability to obtain complete and accurate sequences of TALEs in xanthomonads. We recently sequenced Xanthomonas citri str. Xc-03-1638-1-1, a copper tolerant A group strain isolated from grapefruit in 2003 from Argentina using PacBio RS II chemistry. We analyzed plasmid profiles, copy number and location of TALEs in complete genome sequences of X. citri strains.We utilized the power of long reads obtained by PacBio sequencing to enable assembly of a complete genome sequence of strain Xc-03-1638-1-1, including sequences of two plasmids, 249 kb (plasmid harboring copper resistance genes) and 99 kb (pathogenicity plasmid containing TALEs). The pathogenicity plasmid in this strain is a hybrid plasmid containing four TALEs. Due to the intriguing nature of this pathogenicity plasmid with Tn3-like transposon association, repetitive elements and multiple putative sites for origins of replication, we might expect alternative structures of this plasmid in nature, illustrating the strong adaptive potential of X. citri strains. Analysis of the pathogenicity plasmid among completely sequenced X. citri strains, coupled with Southern hybridization of the pathogenicity plasmids, revealed clues to rearrangements of plasmids and resulting reshuffling of TALEs among strains.We demonstrate in this study the importance of long-read sequencing for obtaining intact sequences of TALEs and plasmids, as well as for identifying rearrangement events including plasmid reshuffling. Rearrangement events, such as the hybrid plasmid in this case, could be a frequent phenomenon in the evolution of X. citri strains, although so far it is undetected due to the inability to obtain complete plasmid sequences with short-read sequencing methods.


September 22, 2019

Complete genome sequencing of the luminescent bacterium, Vibrio qinghaiensis sp. Q67 using PacBio technology.

Vibrio qinghaiensis sp.-Q67 (Vqin-Q67) is a freshwater luminescent bacterium that continuously emits blue-green light (485?nm). The bacterium has been widely used for detecting toxic contaminants. Here, we report the complete genome sequence of Vqin-Q67, obtained using third-generation PacBio sequencing technology. Continuous long reads were attained from three PacBio sequencing runs and reads >500?bp with a quality value of >0.75 were merged together into a single dataset. This resultant highly-contiguous de novo assembly has no genome gaps, and comprises two chromosomes with substantial genetic information, including protein-coding genes, non-coding RNA, transposon and gene islands. Our dataset can be useful as a comparative genome for evolution and speciation studies, as well as for the analysis of protein-coding gene families, the pathogenicity of different Vibrio species in fish, the evolution of non-coding RNA and transposon, and the regulation of gene expression in relation to the bioluminescence of Vqin-Q67.


September 22, 2019

Translating genomics into practice for real-time surveillance and response to carbapenemase-producing Enterobacteriaceae: evidence from a complex multi-institutional KPC outbreak.

Until recently, Klebsiella pneumoniae carbapenemase (KPC)-producing Enterobacteriaceae were rarely identified in Australia. Following an increase in the number of incident cases across the state of Victoria, we undertook a real-time combined genomic and epidemiological investigation. The scope of this study included identifying risk factors and routes of transmission, and investigating the utility of genomics to enhance traditional field epidemiology for informing management of established widespread outbreaks.All KPC-producing Enterobacteriaceae isolates referred to the state reference laboratory from 2012 onwards were included. Whole-genome sequencing was performed in parallel with a detailed descriptive epidemiological investigation of each case, using Illumina sequencing on each isolate. This was complemented with PacBio long-read sequencing on selected isolates to establish high-quality reference sequences and interrogate characteristics of KPC-encoding plasmids.Initial investigations indicated that the outbreak was widespread, with 86 KPC-producing Enterobacteriaceae isolates (K. pneumoniae 92%) identified from 35 different locations across metropolitan and rural Victoria between 2012 and 2015. Initial combined analyses of the epidemiological and genomic data resolved the outbreak into distinct nosocomial transmission networks, and identified healthcare facilities at the epicentre of KPC transmission. New cases were assigned to transmission networks in real-time, allowing focussed infection control efforts. PacBio sequencing confirmed a secondary transmission network arising from inter-species plasmid transmission. Insights from Bayesian transmission inference and analyses of within-host diversity informed the development of state-wide public health and infection control guidelines, including interventions such as an intensive approach to screening contacts following new case detection to minimise unrecognised colonisation.A real-time combined epidemiological and genomic investigation proved critical to identifying and defining multiple transmission networks of KPC Enterobacteriaceae, while data from either investigation alone were inconclusive. The investigation was fundamental to informing infection control measures in real-time and the development of state-wide public health guidelines on carbapenemase-producing Enterobacteriaceae surveillance and management.


September 22, 2019

Adaptive strategies of Bacillus thuringiensis isolated from acid mine drainage site in Sabah, Malaysia.

The adaptive process in bacteria is driven by specific genetic elements which regulate phenotypic characteristics such as tolerance to high metal ion concentrations and the secretion of protective biofilms. Extreme environments such as those associated with heavy metal pollution and extremes of acidity offer opportunities to study the adaptive mechanisms of microorganisms. This study focused on the genome analysis of Bacillus thuringiensis (Bt MCMY1), a gram positive rod shaped bacterium isolated from an acid mine drainage site in Sabah, Malaysia by using a combination of Single Molecule Real Time DNA Sequencing, Scanning Electron Microscopy (SEM) and Fourier Transform Infrared Spectroscopy (FTIR). The genome size of Bt MCMY1 was determined to be 5,458,152 bases which was encoded on a single chromosome. Analysis of the genome revealed genes associated with resistance to Copper, Mercury, Arsenic, Cobalt, Zinc, Cadmium and Aluminum. Evidence from SEM and FTIR indicated that the bacterial colonies form distinct films which bear the signature of polyhydroxyalkanoates (PHA) and this finding was supported by the genome data indicating the presence of a genetic pathway associated with the biosynthesis of PHAs. This is the first report of a Bacillus sp. isolated from an acid mine drainage site in Sabah, Malaysia and the genome sequence will provide insights into the manner in which B. thuringiensis adapts to acid mine drainage.


September 22, 2019

Functional metagenomics reveals a novel carbapenem-hydrolyzing mobile beta-lactamase from Indian river sediments contaminated with antibiotic production waste.

Evolution has provided environmental bacteria with a plethora of genes that give resistance to antibiotic compounds. Under anthropogenic selection pressures, some of these genes are believed to be recruited over time into pathogens by horizontal gene transfer. River sediment polluted with fluoroquinolones and other drugs discharged from bulk drug production in India constitute an environment with unprecedented, long-term antibiotic selection pressures. It is therefore plausible that previously unknown resistance genes have evolved and/or are promoted here. In order to search for novel resistance genes, we therefore analyzed such river sediments by a functional metagenomics approach. DNA fragments providing resistance to different antibiotics in E. coli were sequenced using Sanger and PacBio RSII platforms. We recaptured the majority of known antibiotic resistance genes previously identified by open shot-gun metagenomics sequencing of the same samples. In addition, seven novel resistance gene candidates (six beta-lactamases and one amikacin resistance gene) were identified. Two class A beta-lactamases, blaRSA1 and blaRSA2, were phylogenetically close to clinically important ESBLs like blaGES, blaBEL and blaL2, and were further characterized for their substrate spectra. The blaRSA1 protein, encoded as an integron gene cassette, efficiently hydrolysed penicillins, first generation cephalosporins and cefotaxime, while blaRSA2 was an inducible class A beta-lactamase, capable of hydrolyzing carbapenems albeit with limited efficiency, similar to the L2 beta-lactamase from Stenotrophomonas maltophilia. All detected novel genes were associated with plasmid mobilization proteins, integrons, and/or other resistance genes, suggesting a potential for mobility. This study provides insight into a resistome shaped by an exceptionally strong and long-term antibiotic selection pressure. An improved knowledge of mobilized resistance factors in the external environment may make us better prepared for the resistance challenges that we may face in clinics in the future. Copyright © 2017 Elsevier Ltd. All rights reserved.


September 22, 2019

An ancient integration in a plant NLR is maintained as a trans-species polymorphism

Plant immune receptors are under constant selective pressure to maintain resistance to plant pathogens. Nucleotide-binding leucine-rich repeat (NLR) proteins are one class of cytoplasmic immune receptors whose genes commonly show signatures of adaptive evolution. While it is known that balancing selection contributes to maintaining high intraspecific allelic diversity, the evolutionary mechanism that influences the transmission of alleles during speciation remains unclear. The barley Mla locus has over 30 described alleles conferring isolate-specific resistance to barley powdery mildew and contains three NLR families (RGH1, RGH2, and RGH3). We discovered (using sequence capture and RNAseq) the presence of a novel integrated Exo70 domain in RGH2 in the Mla3 haplotype. Allelic variation across barley accessions includes presence/absence of the integrated domain in RGH2. Expanding our search to several Poaceae species, we found shared interspecific conservation in the RGH2-Exo70 integration. We hypothesise that balancing selection has maintained allelic variation at Mla as a trans-species polymorphism over 24 My, thus contributing to and preserving interspecific allelic diversity during speciation.


September 22, 2019

Pangenome analyses of the wheat pathogen Zymoseptoria tritici reveal the structural basis of a highly plastic eukaryotic genome.

Structural variation contributes substantially to polymorphism within species. Chromosomal rearrangements that impact genes can lead to functional variation among individuals and influence the expression of phenotypic traits. Genomes of fungal pathogens show substantial chromosomal polymorphism that can drive virulence evolution on host plants. Assessing the adaptive significance of structural variation is challenging, because most studies rely on inferences based on a single reference genome sequence.We constructed and analyzed the pangenome of Zymoseptoria tritici, a major pathogen of wheat that evolved host specialization by chromosomal rearrangements and gene deletions. We used single-molecule real-time sequencing and high-density genetic maps to assemble multiple genomes. We annotated the gene space based on transcriptomics data that covered the infection life cycle of each strain. Based on a total of five telomere-to-telomere genomes, we constructed a pangenome for the species and identified a core set of 9149 genes. However, an additional 6600 genes were exclusive to a subset of the isolates. The substantial accessory genome encoded on average fewer expressed genes but a larger fraction of the candidate effector genes that may interact with the host during infection. We expanded our analyses of the pangenome to a worldwide collection of 123 isolates of the same species. We confirmed that accessory genes were indeed more likely to show deletion polymorphisms and loss-of-function mutations compared to core genes.The pangenome construction of a highly polymorphic eukaryotic pathogen showed that a single reference genome significantly underestimates the gene space of a species. The substantial accessory genome provides a cradle for adaptive evolution.


September 22, 2019

Magic pools: Parallel assessment of transposon delivery vectors in bacteria

Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach for discovering the functions of bacterial genes. However, the development of a suitable TnSeq strategy for a given bacterium can be costly and time-consuming. To meet this challenge, we describe a part-based strategy for constructing libraries of hundreds of transposon delivery vectors, which we term “magic pools.” Within a magic pool, each transposon vector has a different combination of upstream sequences (promoters and ribosome binding sites) and antibiotic resistance markers as well as a random DNA barcode sequence, which allows the tracking of each vector during mutagenesis experiments. To identify an efficient vector for a given bacterium, we mutagenize it with a magic pool and sequence the resulting insertions; we then use this efficient vector to generate a large mutant library. We used the magic pool strategy to construct transposon mutant libraries in five genera of bacteria, including three genera of the phylum Bacteroidetes. IMPORTANCE Molecular genetics is indispensable for interrogating the physiology of bacteria. However, the development of a functional genetic system for any given bacterium can be time-consuming. Here, we present a streamlined approach for identifying an effective transposon mutagenesis system for a new bacterium. Our strategy first involves the construction of hundreds of different transposon vector variants, which we term a “magic pool.” The efficacy of each vector in a magic pool is monitored in parallel using a unique DNA barcode that is introduced into each vector design. Using archived DNA “parts,” we next reassemble an effective vector for making a whole-genome transposon mutant library that is suitable for large-scale interrogation of gene function using competitive growth assays. Here, we demonstrate the utility of the magic pool system to make mutant libraries in five genera of bacteria.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.