Menu
July 7, 2019

LoRTE: Detecting transposon-induced genomic variants using low coverage PacBio long read sequences.

Population genomic analysis of transposable elements has greatly benefited from recent advances of sequencing technologies. However, the short size of the reads and the propensity of transposable elements to nest in highly repeated regions of genomes limits the efficiency of bioinformatic tools when Illumina or 454 technologies are used. Fortunately, long read sequencing technologies generating read length that may span the entire length of full transposons are now available. However, existing TE population genomic softwares were not designed to handle long reads and the development of new dedicated tools is needed.LoRTE is the first tool able to use PacBio long read sequences to identify transposon deletions and insertions between a reference genome and genomes of different strains or populations. Tested against simulated and genuine Drosophila melanogaster PacBio datasets, LoRTE appears to be a reliable and broadly applicable tool to study the dynamic and evolutionary impact of transposable elements using low coverage, long read sequences.LoRTE is an efficient and accurate tool to identify structural genomic variants caused by TE insertion or deletion. LoRTE is available for download at http://www.egce.cnrs-gif.fr/?p=6422.


July 7, 2019

The unique genomic landscape surrounding the EPSPS gene in glyphosate resistant Amaranthus palmeri: a repetitive path to resistance.

The expanding number and global distributions of herbicide resistant weedy species threaten food, fuel, fiber and bioproduct sustainability and agroecosystem longevity. Amongst the most competitive weeds, Amaranthus palmeri S. Wats has rapidly evolved resistance to glyphosate primarily through massive amplification and insertion of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene across the genome. Increased EPSPS gene copy numbers results in higher titers of the EPSPS enzyme, the target of glyphosate, and confers resistance to glyphosate treatment. To understand the genomic unit and mechanism of EPSPS gene copy number proliferation, we developed and used a bacterial artificial chromosome (BAC) library from a highly resistant biotype to sequence the local genomic landscape flanking the EPSPS gene.By sequencing overlapping BACs, a 297 kb sequence was generated, hereafter referred to as the “EPSPS cassette.” This region included several putative genes, dense clusters of tandem and inverted repeats, putative helitron and autonomous replication sequences, and regulatory elements. Whole genome shotgun sequencing (WGS) of two biotypes exhibiting high and no resistance to glyphosate was performed to compare genomic representation across the EPSPS cassette. Mapping of sequences for both biotypes to the reference EPSPS cassette revealed significant differences in upstream and downstream sequences relative to EPSPS with regard to both repetitive units and coding content between these biotypes. The differences in sequence may have resulted from a compounded-building mechanism such as repetitive transpositional events. The association of putative helitron sequences with the cassette suggests a possible amplification and distribution mechanism. Flow cytometry revealed that the EPSPS cassette added measurable genomic content.The adoption of glyphosate resistant cropping systems in major crops such as corn, soybean, cotton and canola coupled with excessive use of glyphosate herbicide has led to evolved glyphosate resistance in several important weeds. In Amaranthus palmeri, the amplification of the EPSPS cassette, characterized by a complex array of repetitive elements and putative helitron sequences, suggests an adaptive structural genomic mechanism that drives amplification and distribution around the genome. The added genomic content not found in glyphosate sensitive plants may be driving evolution through genome expansion.


July 7, 2019

Analysis of serial isolates of mcr-1-positive Escherichia coli reveals a highly active ISApl1 transposon.

The emergence of a transferable colistin resistance gene (mcr-1) is of global concern. The insertion sequence ISApl1 is a key component in the mobilization of this gene, but its role remains poorly understood. Six Escherichia coli isolates were cultured from the same patient over the course of 1 month in Germany and the United States after a brief hospitalization in Bahrain for an unconnected illness. Four carried mcr-1 as determined by real-time PCR, but two were negative. Two additional mcr-1-negative E. coli isolates were collected during follow-up surveillance 9 months later. All isolates were analyzed by whole-genome sequencing (WGS). WGS revealed that the six initial isolates were composed of two distinct strains: an initial ST-617 E. coli strain harboring mcr-1 and a second, unrelated, mcr-1-negative ST-32 E. coli strain that emerged 2 weeks after hospitalization. Follow-up swabs taken 9 months later were negative for the ST-617 strain, but the mcr-1-negative ST-32 strain was still present. mcr-1 was associated with a single copy of ISApl1, located on a 64.5-kb IncI2 plasmid that shared >95% homology with other mcr-1 IncI2 plasmids. ISApl1 copy numbers ranged from 2 for the first isolate to 6 for the final isolate, but ISApl1 movement was independent of mcr-1 Some movement was accompanied by gene disruption, including the loss of genes encoding proteins involved in stress responses, arginine catabolism, and l-arabinose utilization. These data represent the first comprehensive analysis of ISApl1 movement in serial clinical isolates and reveal that, under certain conditions, ISApl1 is a highly active IS element whose movement may be detrimental to the host cell. Copyright © 2017 Snesrud et al.


July 7, 2019

Hybrid assembly of the large and highly repetitive genome of Aegilops tauschii, a progenitor of bread wheat, with the MaSuRCA mega-reads algorithm.

Long sequencing reads generated by single-molecule sequencing technology offer the possibility of dramatically improving the contiguity of genome assemblies. The biggest challenge today is that long reads have relatively high error rates, currently around 15%. The high error rates make it difficult to use this data alone, particularly with highly repetitive plant genomes. Errors in the raw data can lead to insertion or deletion errors (indels) in the consensus genome sequence, which in turn create significant problems for downstream analysis; for example, a single indel may shift the reading frame and incorrectly truncate a protein sequence. Here, we describe an algorithm that solves the high error rate problem by combining long, high-error reads with shorter but much more accurate Illumina sequencing reads, whose error rates average <1%. Our hybrid assembly algorithm combines these two types of reads to construct mega-reads, which are both long and accurate, and then assembles the mega-reads using the CABOG assembler, which was designed for long reads. We apply this technique to a large data set of Illumina and PacBio sequences from the species Aegilops tauschii, a large and extremely repetitive plant genome that has resisted previous attempts at assembly. We show that the resulting assembled contigs are far larger than in any previous assembly, with an N50 contig size of 486,807 nucleotides. We compare the contigs to independently produced optical maps to evaluate their large-scale accuracy, and to a set of high-quality bacterial artificial chromosome (BAC)-based assemblies to evaluate base-level accuracy. © 2017 Zimin et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019

Population and clinical genetics of human transposable elements in the (post) genomic era.

Recent technological developments-in genomics, bioinformatics and high-throughput experimental techniques-are providing opportunities to study ongoing human transposable element (TE) activity at an unprecedented level of detail. It is now possible to characterize genome-wide collections of TE insertion sites for multiple human individuals, within and between populations, and for a variety of tissue types. Comparison of TE insertion site profiles between individuals captures the germline activity of TEs and reveals insertion site variants that segregate as polymorphisms among human populations, whereas comparison among tissue types ascertains somatic TE activity that generates cellular heterogeneity. In this review, we provide an overview of these new technologies and explore their implications for population and clinical genetic studies of human TEs. We cover both recent published results on human TE insertion activity as well as the prospects for future TE studies related to human evolution and health.


July 7, 2019

Transcriptome Remodeling of Acinetobacter baumannii during Infection and Treatment.

Acinetobacter baumannii is an increasingly common multidrug-resistant pathogen in health care settings. Although the genetic basis of antibiotic resistance mechanisms has been extensively studied, much less is known about how genetic variation contributes to other aspects of successful infections. Genetic changes that occur during host infection and treatment have the potential to remodel gene expression patterns related to resistance and pathogenesis. Longitudinal sets of multidrug-resistant A. baumannii isolates from eight patients were analyzed by RNA sequencing (RNA-seq) to identify differentially expressed genes and link them to genetic changes contributing to transcriptional variation at both within-patient and population levels. The number of differentially expressed genes among isolates from the same patient ranged from 26 (patient 588) to 145 (patient 475). Multiple patients had isolates with differential gene expression patterns related to mutations in the pmrAB and adeRS two-component regulatory system genes, as well as significant differences in genes related to antibiotic resistance, iron acquisition, amino acid metabolism, and surface-associated proteins. Population level analysis revealed 39 genetic regions with clade-specific differentially expressed genes, for which 19, 8, and 3 of these could be explained by insertion sequence mobilization, recombination-driven sequence variation, and intergenic mutations, respectively. Multiple types of mutations that arise during infection can significantly remodel the expression of genes that are known to be important in pathogenesis. IMPORTANCE Health care-associated multidrug-resistant Acinetobacter baumannii can cause persistent infections in patients, but bacterial cells must overcome host defenses and antibiotic therapies to do so. Genetic variation arises during host infection, and new mutations are often enriched in genes encoding transcriptional regulators, iron acquisition systems, and surface-associated structures. In this study, genetic variation was shown to result in transcriptome remodeling at the level of individual patients and across phylogenetic groups. Differentially expressed genes include those related to capsule modification, iron acquisition, type I pili, and antibiotic resistance. Population level transcriptional variation reflects genome dynamics over longer evolutionary time periods, and convergent transcriptional changes support the adaptive significance of these regions. Transcriptional changes can be attributed to multiple types of genomic change, but insertion sequence mobilization had a predominant effect. The transcriptional effects of mutations that arise during infection highlight the rapid adaptation of A. baumannii during host exposure. Copyright © 2017 Wright et al.


July 7, 2019

Benchmarking computational tools for polymorphic transposable element detection.

Transposable elements (TEs) are an important source of human genetic variation with demonstrable effects on phenotype. Recently, a number of computational methods for the detection of polymorphic TE (polyTE) insertion sites from next-generation sequence data have been developed. The use of such tools will become increasingly important as the pace of human genome sequencing accelerates. For this report, we performed a comparative benchmarking and validation analysis of polyTE detection tools in an effort to inform their selection and use by the TE research community. We analyzed a core set of seven tools with respect to ease of use and accessibility, polyTE detection performance and runtime parameters. An experimentally validated set of 893 human polyTE insertions was used for this purpose, along with a series of simulated data sets that allowed us to assess the impact of sequence coverage on tool performance. The recently developed tool MELT showed the best overall performance followed by Mobster and then RetroSeq. PolyTE detection tools can best detect Alu insertion events in the human genome with reduced reliability for L1 insertions and substantially lowered performance for SVA insertions. We also show evidence that different polyTE detection tools are complementary with respect to their ability to detect a complete set of insertion events. Accordingly, a combined approach, coupled with manual inspection of individual results, may yield the best overall performance. In addition to the benchmarking results, we also provide notes on tool installation and usage as well as suggestions for future polyTE detection algorithm development. Published by Oxford University Press 2016. This work is written by US Government employees and is in the public domain in the US.


July 7, 2019

IncFII conjugative plasmid-mediated transmission of blaNDM-1 elements among animal-borne Escherichia coli strains.

This study aims to investigate the prevalence and transmission dynamics of the blaNDM-1 gene in animal Escherichia coli strains. Two IncFII blaNDM-1-encoding plasmids with only minor structural variation in the MDR region, pHNEC46-NDM and pHNEC55-NDM, were found to be responsible for the transmission of blaNDM-1 in these strains. The blaNDM-1 gene can be incorporated into plasmids and stably inherited in animal-borne E. coli strains that can be maintained in animal gut microflora even without carbapenem selection pressure. Copyright © 2016 American Society for Microbiology.


July 7, 2019

Evolution of the wheat blast fungus through functional losses in a host specificity determinant.

Wheat blast first emerged in Brazil in the mid-1980s and has recently caused heavy crop losses in Asia. Here we show how this devastating pathogen evolved in Brazil. Genetic analysis of host species determinants in the blast fungus resulted in the cloning of avirulence genes PWT3 and PWT4, whose gene products elicit defense in wheat cultivars containing the corresponding resistance genes Rwt3 and Rwt4 Studies on avirulence and resistance gene distributions, together with historical data on wheat cultivation in Brazil, suggest that wheat blast emerged due to widespread deployment of rwt3 wheat (susceptible to Lolium isolates), followed by the loss of function of PWT3 This implies that the rwt3 wheat served as a springboard for the host jump to common wheat. Copyright © 2017, American Association for the Advancement of Science.


July 7, 2019

Rare Pyrenophora teres hybridization events revealed by development of sequence-specific PCR markers.

Pyrenophora teres f. teres and P. teres f. maculata cause net form and spot form, respectively, of net blotch on barley (Hordeum vulgare). The two forms reproduce sexually, producing hybrids with genetic and pathogenic variability. Phenotypic identification of hybrids is challenging because lesions induced by hybrids on host plants resemble lesions induced by either P. teres f. teres or P. teres f. maculata. In this study, 12 sequence-specific polymerase chain reaction markers were developed based on expressed regions spread across the genome. The primers were validated using 210 P. teres isolates, 2 putative field hybrids (WAC10721 and SNB172), 50 laboratory-produced hybrids, and 7 isolates collected from barley grass (H. leporinum). The sequence-specific markers confirmed isolate WAC10721 as a hybrid. Only four P. teres f. teres markers amplified on DNA of barley grass isolates. Amplified fragment length polymorphism markers suggested that P. teres barley grass isolates are genetically different from P. teres barley isolates and that the second putative hybrid (SNB172) is a barley grass isolate. We developed a suite of markers which clearly distinguish the two forms of P. teres and enable unambiguous identification of hybrids.


July 7, 2019

The blaOXA-23-associated transposons in the genome of Acinetobacter spp. represent an epidemiological situation of the species encountering carbapenems.

High rates of carbapenem resistance in the human pathogen Acinetobacter baumannii threaten public health and need to be scrutinized.A total of 356 A. baumannii and 50 non-baumannii Acinetobacter spp. (NBA) strains collected in 2013 throughout South Korea were studied. The type of blaOXA-23 transposon was determined by PCR mapping and molecular epidemiology was assessed by MLST. Twelve representative strains and two comparative A. baumannii were entirely sequenced by single-molecule real-time sequencing.The carbapenem resistance rate was 88% in A. baumannii, mainly due to blaOXA-23, with five exceptional cases associated with ISAba1-blaOXA-51-like. The blaOXA-23 gene in A. baumannii was carried either by Tn2006 (44%) or Tn2009 (54%), with a few exceptions carried by Tn2008 (1.6%). Of the NBA strains, 14% were resistant to carbapenems, two with blaOXA-58 and five with blaOXA-23 associated with Tn2006. The Tn2006-possessing strains belonged to various STs, whereas Tn2008- and Tn2009-possessing strains were limited to ST208 and ST191, respectively. The three transposons were often multiplied in the chromosome, and the gene copy number and the carbapenem MICs presented linear relationships either very strongly for Tn2008 or moderately for Tn2006 and Tn2009.The dissemination of Tn2006 was facilitated by its capability for intercellular transfer and that of Tn2009 was attributable to successful dissemination of the ST191 bacterial host carrying the transposon. Tn2008 was infrequent because of its insufficient ability to undergo intercellular transfer and the scarce bacterial host A. baumannii ST208. Gene amplification is an adaptive mechanism for bacteria that encounter antimicrobial drugs.© The Author 2017. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please email: journals.permissions@oup.com.


July 7, 2019

Hidden genetic variation shapes the structure of functional elements in Drosophila.

Mutations that add, subtract, rearrange, or otherwise refashion genome structure often affect phenotypes, although the fragmented nature of most contemporary assemblies obscures them. To discover such mutations, we assembled the first new reference-quality genome of Drosophila melanogaster since its initial sequencing. By comparing this new genome to the existing D. melanogaster assembly, we created a structural variant map of unprecedented resolution and identified extensive genetic variation that has remained hidden until now. Many of these variants constitute candidates underlying phenotypic variation, including tandem duplications and a transposable element insertion that amplifies the expression of detoxification-related genes associated with nicotine resistance. The abundance of important genetic variation that still evades discovery highlights how crucial high-quality reference genomes are to deciphering phenotypes.


July 7, 2019

Microbial bioinformatics for food safety and production.

In the production of fermented foods, microbes play an important role. Optimization of fermentation processes or starter culture production traditionally was a trial-and-error approach inspired by expert knowledge of the fermentation process. Current developments in high-throughput ‘omics’ technologies allow developing more rational approaches to improve fermentation processes both from the food functionality as well as from the food safety perspective. Here, the authors thematically review typical bioinformatics techniques and approaches to improve various aspects of the microbial production of fermented food products and food safety. © The Author 2015. Published by Oxford University Press.


July 7, 2019

Effects of genome structure variation, homeologous genes and repetitive DNA on polyploid crop research in the age of genomics.

Compared to diploid species, allopolyploid crop species possess more complex genomes, higher productivity, and greater adaptability to changing environments. Next generation sequencing techniques have produced high-density genetic maps, whole genome sequences, transcriptomes and epigenomes for important polyploid crops. However, several problems interfere with the full application of next generation sequencing techniques to these crops. Firstly, different types of genomic variation affect sequence assembly and QTL mapping. Secondly, duplicated or homoeologous genes can diverge in function and then lead to emergence of many minor QTL, which increases difficulties in fine mapping, cloning and marker assisted selection. Thirdly, repetitive DNA sequences arising in polyploid crop genomes also impact sequence assembly, and are increasingly being shown to produce small RNAs to regulate gene expression and hence phenotypic traits. We propose that these three key features should be considered together when analyzing polyploid crop genomes. It is apparent that dissection of genomic structural variation, elucidation of the function and mechanism of interaction of homoeologous genes, and investigation of the de novo roles of repeat sequences in agronomic traits are necessary for genomics-based crop breeding in polyploids. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.


July 7, 2019

Long read and single molecule DNA sequencing simplifies genome assembly and TAL effector gene analysis of Xanthomonas translucens.

The species Xanthomonas translucens encompasses a complex of bacterial strains that cause diseases and yield loss on grass species including important cereal crops. Three pathovars, X. translucens pv. undulosa, X. translucens pv. translucens and X. translucens pv.cerealis, have been described as pathogens of wheat, barley, and oats. However, no complete genome sequence for a strain of this complex is currently available.A complete genome sequence of X. translucens pv. undulosa strain XT4699 was obtained by using PacBio long read, single molecule, real time (SMRT) DNA sequences and Illumina sequences. Draft genome sequences of nineteen additional X. translucens strains, which were collected from wheat or barley in different regions and at different times, were generated by Illumina sequencing. Phylogenetic relationships among different Xanthomonas strains indicates that X. translucens are members of a distinct clade from so-called group 2 xanthomonads and three pathovars of this species, undulosa, translucens and cerealis, represent distinct subclades in the group 1 clade. Knockout mutation of type III secretion system of XT4699 eliminated the ability to cause water-soaking symptoms on wheat and barley and resulted in a reduction in populations on wheat in comparison to the wild type strain. Sequence comparison of X. translucens strains revealed the genetic variation on type III effector repertories among different pathovars or within one pathovar. The full genome sequence of XT4699 reveals the presence of eight members of the Transcription-Activator Like (TAL) effector genes, which are phylogenetically distant from previous known TAL effector genes of group 2 xanthomonads. Microarray and qRT-PCR analyses revealed TAL effector-specific wheat gene expression modulation.PacBio long read sequencing facilitates the assembly of Xanthomonas genomes and the multiple TAL effector genes, which are difficult to assemble from short read platforms. The complete genome sequence of X. translucens pv. undulosa strain XT4699 and draft genome sequences of nineteen additional X. translucens strains provides a resource for further genetic analyses of pathogenic diversity and host range of the X. translucens species complex. TAL effectors of XT4699 strain play roles in modulating wheat host gene expressions.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.