July 7, 2019

Patterns of polymorphism at the self-incompatibility locus in 1,083 Arabidopsis thaliana genomes.

Although the transition to selfing in the model plant Arabidopsis thaliana involved the loss of the self-incompatibility (SI) system, it clearly did not occur due to the fixation of a single inactivating mutation at the locus determining the specificities of SI (the S-locus). At least three groups of divergent haplotypes (haplogroups), corresponding to ancient functional S-alleles, have been maintained at this locus, and extensive functional studies have shown that all three carry distinct inactivating mutations. However, the historical process of loss of SI is not well understood, in particular its relation with the last glaciation. Here, we took advantage of recently published genomic resequencing data in 1,083 Arabidopsis thaliana accessions that we combined with BAC sequencing to obtain polymorphism information for the whole S-locus region at a species-wide scale. The accessions differed by several major rearrangements including large deletions and interhaplogroup recombinations, forming a set of haplogroups that are widely distributed throughout the native range and largely overlap geographically. “Relict” A. thaliana accessions that directly derive from glacial refugia are polymorphic at the S-locus, suggesting that the three haplogroups were already present when glacial refugia from the last Ice Age became isolated. Interhaplogroup recombinant haplotypes were highly frequent, and detailed analysis of recombination breakpoints suggested multiple independent origins. These findings suggest that the complete loss of SI in A. thaliana involved independent self-compatible mutants that arose prior to the last Ice Age, and experienced further rearrangements during postglacial colonization. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

The unique genomic landscape surrounding the EPSPS gene in glyphosate resistant Amaranthus palmeri: a repetitive path to resistance.

The expanding number and global distributions of herbicide resistant weedy species threaten food, fuel, fiber and bioproduct sustainability and agroecosystem longevity. Amongst the most competitive weeds, Amaranthus palmeri S. Wats has rapidly evolved resistance to glyphosate primarily through massive amplification and insertion of the 5-enolpyruvylshikimate-3-phosphate synthase (EPSPS) gene across the genome. Increased EPSPS gene copy numbers result in higher titers of the EPSPS enzyme, the target of glyphosate, and confer resistance to glyphosate treatment. To understand the genomic unit and mechanism of EPSPS gene copy number proliferation, we developed and used a bacterial artificial chromosome (BAC) library from a highly resistant biotype to sequence the local genomic landscape flanking the EPSPS gene. By sequencing overlapping BACs, a 297 kb sequence was generated, hereafter referred to as the “EPSPS cassette.” This region included several putative genes, dense clusters of tandem and inverted repeats, putative helitron and autonomous replication sequences, and regulatory elements. Whole genome shotgun sequencing (WGS) of two biotypes exhibiting high and no resistance to glyphosate was performed to compare genomic representation across the EPSPS cassette. Mapping of sequences for both biotypes to the reference EPSPS cassette revealed significant differences in upstream and downstream sequences relative to EPSPS with regard to both repetitive units and coding content between these biotypes. The differences in sequence may have resulted from a compounded-building mechanism such as repetitive transpositional events. The association of putative helitron sequences with the cassette suggests a possible amplification and distribution mechanism.
Flow cytometry revealed that the EPSPS cassette added measurable genomic content. The adoption of glyphosate resistant cropping systems in major crops such as corn, soybean, cotton and canola coupled with excessive use of glyphosate herbicide has led to evolved glyphosate resistance in several important weeds. In Amaranthus palmeri, the amplification of the EPSPS cassette, characterized by a complex array of repetitive elements and putative helitron sequences, suggests an adaptive structural genomic mechanism that drives amplification and distribution around the genome. The added genomic content not found in glyphosate sensitive plants may be driving evolution through genome expansion.


July 7, 2019

Complete gene sequence of spider attachment silk protein (PySp1) reveals novel linker regions and extreme repeat homogenization.

Spiders use a myriad of silk types for daily survival, and each silk type has a unique suite of task-specific mechanical properties. Of all spider silk types, pyriform silk is distinct because it is a combination of a dry protein fiber and wet glue. Pyriform silk fibers are coated with wet cement and extruded into “attachment discs” that adhere silks to each other and to substrates. The mechanical properties of spider silk types are linked to the primary and higher-level structures of spider silk proteins (spidroins). Spidroins are often enormous molecules (>250 kDa) and have a lengthy repetitive region that is flanked by relatively short (~100 amino acids), non-repetitive amino- and carboxyl-terminal regions. The amino acid sequence motifs in the repetitive region vary greatly between spidroin type, while motif length and number underlie the remarkable mechanical properties of spider silk fibers. Existing knowledge of pyriform spidroins is fragmented, making it difficult to define links between the structure and function of pyriform spidroins. Here, we present the full-length sequence of the gene encoding pyriform spidroin 1 (PySp1) from the silver garden spider Argiope argentata. The predicted protein is similar to previously reported PySp1 sequences but the A. argentata PySp1 has a uniquely long and repetitive “linker”, which bridges the amino-terminal and repetitive regions. Predictions of the hydrophobicity and secondary structure of A. argentata PySp1 identify regions important to protein self-assembly. Analysis of the full complement of A. argentata PySp1 repeats reveals extreme intragenic homogenization, and comparison of A. argentata PySp1 repeats with other PySp1 sequences identifies variability in two sub-repetitive expansion regions. Overall, the full-length A. argentata PySp1 sequence provides new evidence for understanding how pyriform spidroins contribute to the properties of pyriform silk fibers. Copyright © 2017 The Authors. 
Published by Elsevier Ltd. All rights reserved.


July 7, 2019

ThermoAlign: a genome-aware primer design tool for tiled amplicon resequencing.

Isolating and sequencing specific regions in a genome is a cornerstone of molecular biology. This has been facilitated by computationally encoding the thermodynamics of DNA hybridization for automated design of hybridization and priming oligonucleotides. However, the repetitive composition of genomes challenges the identification of target-specific oligonucleotides, which limits genetics and genomics research on many species. Here, a tool called ThermoAlign was developed that ensures the design of target-specific primer pairs for DNA amplification. This is achieved by evaluating the thermodynamics of hybridization for full-length oligonucleotide-template alignments – thermoalignments – across the genome to identify primers predicted to bind specifically to the target site. For amplification-based resequencing of regions that cannot be amplified by a single primer pair, a directed graph analysis method is used to identify minimum amplicon tiling paths. Laboratory validation by standard and long-range polymerase chain reaction and amplicon resequencing with maize, one of the most repetitive genomes sequenced to date (~85% repeat content), demonstrated the specificity-by-design functionality of ThermoAlign. ThermoAlign is released under an open source license and bundled in a dependency-free container for wide distribution. It is anticipated that this tool will facilitate multiple applications in genetics and genomics and be useful in the workflow of high-throughput targeted resequencing studies.
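
The idea of a minimum amplicon tiling path can be illustrated with a small sketch. This is not ThermoAlign's actual code or API: the function name `min_tiling_path`, the `min_overlap` parameter, and the (start, end) coordinate representation are assumptions made for illustration. Candidate amplicons are treated as nodes of a directed graph, an edge joins two amplicons when the second extends the first to the right with enough overlap, and a breadth-first search returns a path with the fewest amplicons that spans the target.

```python
from collections import deque


def min_tiling_path(amplicons, target_start, target_end, min_overlap=0):
    """Return the fewest (start, end) amplicons tiling the target, or None.

    Hypothetical helper for illustration; coordinates are half-open
    intervals on a single reference sequence.
    """
    amps = sorted(set(amplicons))
    parent = {}      # amplicon -> previous amplicon on the path
    queue = deque()

    # Start nodes: amplicons that cover the left edge of the target.
    for amp in amps:
        if amp[0] <= target_start < amp[1]:
            parent[amp] = None
            queue.append(amp)

    while queue:
        cur = queue.popleft()
        if cur[1] >= target_end:
            # Reached the right edge: walk back to rebuild the path.
            path = []
            while cur is not None:
                path.append(cur)
                cur = parent[cur]
            return path[::-1]
        for nxt in amps:
            # Directed edge: nxt starts inside cur, overlaps it by at
            # least min_overlap, and extends further to the right.
            extends = nxt[1] > cur[1]
            overlaps = cur[0] < nxt[0] <= cur[1] - min_overlap
            if nxt not in parent and extends and overlaps:
                parent[nxt] = cur
                queue.append(nxt)
    return None


candidates = [(0, 500), (400, 900), (450, 1200), (800, 1500), (1100, 1600)]
path = min_tiling_path(candidates, 0, 1600, min_overlap=50)
```

Because breadth-first search explores paths in order of length, the first path that reaches the right edge of the target uses the smallest number of amplicons among the candidates supplied.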


July 7, 2019

Efficient CNV breakpoint analysis reveals unexpected structural complexity and correlation of dosage-sensitive genes with clinical severity in genomic disorders.

Genomic disorders are the clinical conditions manifested by submicroscopic genomic rearrangements including copy number variants (CNVs). The CNVs can be identified by array-based comparative genomic hybridization (aCGH), the most commonly used technology for molecular diagnostics of genomic disorders. However, clinical aCGH only informs CNVs in the probe-interrogated regions. Neither orientational information nor the resulting genomic rearrangement structure is provided, which is a key to uncovering mutational and pathogenic mechanisms underlying genomic disorders. Long-range polymerase chain reaction (PCR) is a traditional approach to obtain CNV breakpoint junction, but this method is inefficient when challenged by structural complexity such as often found at the PLP1 locus in association with Pelizaeus-Merzbacher disease (PMD). Here we introduced ‘capture and single-molecule real-time sequencing’ (cap-SMRT-seq) and newly developed ‘asymmetry linker-mediated nested PCR walking’ (ALN-walking) for CNV breakpoint sequencing in 49 subjects with PMD-associated CNVs. Remarkably, 29 (94%) of the 31 CNV breakpoint junctions unobtainable by conventional long-range PCR were resolved by cap-SMRT-seq and ALN-walking. Notably, unexpected CNV complexities, including inter-chromosomal rearrangements that cannot be resolved by aCGH, were revealed by efficient breakpoint sequencing. These sequence-based structures of PMD-associated CNVs further support the role of DNA replicative mechanisms in CNV mutagenesis, and facilitate genotype-phenotype correlation studies. Intriguingly, the lengths of gained segments by CNVs are strongly correlated with clinical severity in PMD, potentially reflecting the functional contribution of other dosage-sensitive genes besides PLP1. 
Our study provides new efficient experimental approaches (especially ALN-walking) for CNV breakpoint sequencing and highlights their importance in uncovering CNV mutagenesis and pathogenesis in genomic disorders. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.


July 7, 2019

Resequencing array for gene variant detection in malignant hyperthermia and butyrylcholinestherase deficiency.

Malignant hyperthermia (MH) and butyrylcholinestherase (BCHE) deficiency are two relevant pharmacogenetic disorders in anesthetic practice linked with sequence variants, the former in the RyR1 and CACNA1S genes, the latter in the BCHE gene. Genotyping for known pathogenic variants in these genes is useful to help identify susceptible individuals, and others may exist but remain unknown, because full-length sequence of these genes is, in general, not investigated. To facilitate this task, we developed a resequencing DNA array, the perioperative patient safety (POPS) array, to be able to screen the entire coding sequences of the RyR1, CACNA1S and BCHE genes. MH-susceptible individuals (n = 121) identified with the in vitro contracture test, the standard diagnostic tool for MH susceptibility, were genotyped with the arrays. Compared with capillary sequencing, call rates with the arrays could achieve 100% at maximal sensitivity, although to reduce false positive rates, sensitivity was adjusted to 0.85, 0.87 and 0.66 for RyR1, CACNA1S and BCHE respectively, with overall base call specificity exceeding 99%. Detection of 29 predetermined RyR1 variants in 44 individuals was successful in 97% of the cases, among them all 16 variants of established diagnostic value. In a trial application of the arrays, 21 MH-susceptible subjects with no known RyR1 or CACNA1S variants were screened, resulting in the discovery of new variants, all confirmed by capillary sequencing. In conclusion, arrays offer an efficient high-throughput alternative for diagnostic genotyping of candidate genes affecting MH susceptibility, BCHE deficiency and other neuromuscular disorders, simultaneously enabling a comprehensive search for rare variants in these genes. Copyright © 2017 Elsevier B.V. All rights reserved.


July 7, 2019

Mistranslation can enhance fitness through purging of deleterious mutations.

Phenotypic mutations are amino acid changes caused by mistranslation. How phenotypic mutations affect the adaptive evolution of new protein functions is unknown. Here we evolve the antibiotic resistance protein TEM-1 towards resistance on the antibiotic cefotaxime in an Escherichia coli strain with a high mistranslation rate. TEM-1 populations evolved in such strains endow host cells with a general growth advantage, not only on cefotaxime but also on several other antibiotics that ancestral TEM-1 had been unable to deactivate. High-throughput sequencing of TEM-1 populations shows that this advantage is associated with a lower incidence of weakly deleterious genotypic mutations. Our observations show that mistranslation is not just a source of noise that delays adaptive evolution. It could even facilitate adaptive evolution by exacerbating the effects of deleterious mutations and leading to their more efficient purging. The ubiquity of mistranslation and its effects render mistranslation an important factor in adaptive protein evolution.


July 7, 2019

The complete genome sequence of Exiguobacterium arabatum W-01 reveals potential probiotic functions.

Shrimp is extensively cultured worldwide. Shrimp farming is suffering from a variety of diseases. Probiotics are considered to be one of the effective methods to prevent and cure shrimp diseases. Exiguobacterium arabatum W-01, a gram-positive and orange-pigmented bacterium, was isolated from the intestine of a healthy Penaeus vannamei specimen. Whole-genome sequencing revealed a genome of 2,914,854 bp, with 48.02% GC content. In total, 3,083 open reading frames (ORFs) were identified, with an average length of 843.98 bp and a mean GC content of 48.11%, accounting for 89.27% of the genome. Among these ORFs, 2,884 (93.5%) genes were classified into Clusters of Orthologous Groups (COG) families comprising 21 functional categories, and 1,650 ORFs were classified into 83 functional Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways. A total of 27 rRNA operons and 68 tRNAs were identified, with all 20 amino acids represented. In addition, 91 genomic islands, 68 potential prophages, and 33 tandem repeats, but no clustered regularly interspaced short palindromic repeats (CRISPRs), were found. No resistance genes and only one virulence gene were identified. Among the 150 secreted proteins of E. arabatum W-01, a variety of transport system substrate-binding proteins, enzymes, and biosynthetic proteins, which play important roles in the uptake and metabolism of nutrients, were found. Two adherence-related protein genes and 31 flagellum-related protein genes were also identified. Taken together, these results indicate potential probiotic functions for E. arabatum W-01. © 2017 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.
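
The summary statistics in this abstract are mutually consistent, which a few lines of arithmetic can confirm. The variable names below are illustrative only; the numbers are taken directly from the abstract, and the calculation assumes that the coding fraction is simply ORF count times mean ORF length divided by genome size.

```python
# Figures quoted in the abstract.
genome_bp = 2_914_854      # genome size in base pairs
orf_count = 3_083          # number of open reading frames
mean_orf_bp = 843.98       # average ORF length in base pairs
cog_classified = 2_884     # ORFs assigned to COG families

# Fraction of the genome covered by ORFs (assumes non-overlapping ORFs).
coding_fraction = orf_count * mean_orf_bp / genome_bp

# Fraction of ORFs that received a COG assignment.
cog_fraction = cog_classified / orf_count

print(f"coding fraction: {coding_fraction:.2%}")
print(f"COG-classified ORFs: {cog_fraction:.1%}")
```

Both values round to the percentages reported in the abstract (89.27% and 93.5%).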


July 7, 2019

High-quality draft genome sequences of four lignocellulose-degrading bacteria isolated from Puerto Rican forest soil: Gordonia sp., Paenibacillus sp., Variovorax sp., and Vogesella sp.

Here, we report the high-quality draft genome sequences of four phylogenetically diverse lignocellulose-degrading bacteria isolated from tropical soil (Gordonia sp., Paenibacillus sp., Variovorax sp., and Vogesella sp.) to elucidate the genetic basis of their ability to degrade lignocellulose. These isolates may provide novel enzymes for biofuel production. Copyright © 2017 Woo et al.


July 7, 2019

MHC class I diversity in chimpanzees and bonobos.

Major histocompatibility complex (MHC) class I genes are critically involved in the defense against intracellular pathogens. MHC diversity comparisons among samples of closely related taxa may reveal traces of past or ongoing selective processes. The bonobo and chimpanzee are the closest living evolutionary relatives of humans and last shared a common ancestor some 1 mya. However, little is known concerning MHC class I diversity in bonobos or in central chimpanzees, the most numerous and genetically diverse chimpanzee subspecies. Here, we used a long-read sequencing technology (PacBio) to sequence the classical MHC class I genes A, B, C, and A-like in 20 and 30 wild-born bonobos and chimpanzees, respectively, with a main focus on central chimpanzees to assess and compare diversity in those two species. We describe in total 21 and 42 novel coding region sequences for the two species, respectively. In addition, we found evidence for a reduced MHC class I diversity in bonobos as compared to central chimpanzees as well as to western chimpanzees and humans. The reduced bonobo MHC class I diversity may be the result of a selective process in their evolutionary past since their split from chimpanzees.


July 7, 2019

Identification of a gene cluster for telomestatin biosynthesis and heterologous expression using a specific promoter in a clean host.

Telomestatin, a strong telomerase inhibitor with G-quadruplex stabilizing activity, is a potential therapeutic agent for treating cancers. Difficulties in isolating telomestatin from microbial cultures and in chemical synthesis are bottlenecks impeding the wider use. Therefore, improvement in telomestatin production and structural diversification are required for further utilization and application. Here, we discovered the gene cluster responsible for telomestatin biosynthesis, and achieved production of telomestatin by heterologous expression of this cluster in the engineered Streptomyces avermitilis SUKA strain. Utilization of an optimal promoter was essential for successful production. Gene disruption studies revealed that the tlsB, tlsC, and tlsO-T genes play key roles in telomestatin biosynthesis. Moreover, exchanging TlsC core peptide sequences resulted in the production of novel telomestatin derivatives. This study sheds light on the expansion of chemical diversity of natural peptide products for drug development.


July 7, 2019

Designing robust watermark barcodes for multiplex long-read sequencing.

To attain acceptable sample misassignment rates, current approaches to multiplex single-molecule real-time sequencing require upstream quality improvement, which is obtained from multiple passes over the sequenced insert and significantly reduces the effective read length. In order to fully exploit the raw read length on multiplex applications, robust barcodes capable of dealing with the full single-pass error rates are needed. We present a method for designing sequencing barcodes that can withstand a large number of insertion, deletion and substitution errors and are suitable for use in multiplex single-molecule real-time sequencing. The manuscript focuses on the design of barcodes for full-length single-pass reads, impaired by challenging error rates in the order of 11%. The proposed barcodes can multiplex hundreds or thousands of samples while achieving sample misassignment probabilities as low as 10^-7 under the above conditions, and are designed to be compatible with chemical constraints imposed by the sequencing process. Software tools for constructing watermark barcode sets and demultiplexing barcoded reads, together with example sets of barcodes and synthetic barcoded reads, are freely available at www.cifasis-conicet.gov.ar/ezpeleta/NS-watermark. Contact: ezpeleta@cifasis-conicet.gov.ar.
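
As a rough illustration of why barcodes must tolerate insertions, deletions and substitutions, the sketch below assigns a read to the nearest barcode by Levenshtein (edit) distance. This is a generic nearest-barcode scheme swapped in for illustration, not the watermark construction or the decoder described in the abstract; the function names, the `max_dist` threshold, and the example barcodes are all hypothetical.

```python
def edit_distance(a, b):
    """Levenshtein distance: minimum insertions, deletions, substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                # delete ca
                           cur[j - 1] + 1,             # insert cb
                           prev[j - 1] + (ca != cb)))  # match or substitute
        prev = cur
    return prev[-1]


def assign_sample(read_prefix, barcodes, max_dist=3):
    """Return the sample whose barcode is closest to the read prefix.

    Returns None when the best match is too distant or tied, so that
    uncertain reads are discarded rather than misassigned.
    """
    scored = sorted((edit_distance(read_prefix, seq), name)
                    for name, seq in barcodes.items())
    best_dist, best_name = scored[0]
    tied = len(scored) > 1 and scored[1][0] == best_dist
    if best_dist > max_dist or tied:
        return None
    return best_name


# Hypothetical barcode set and a read prefix carrying one deletion.
example_barcodes = {"s1": "ACGTACGTAC", "s2": "TTGGCCAATT"}
sample = assign_sample("ACGTCGTAC", example_barcodes)
```

Rejecting distant or tied matches trades some yield for a lower misassignment rate. The abstract's design goal is to make that trade-off favourable even at single-pass error rates.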


July 7, 2019

Loss of pollen-specific phospholipase NOT LIKE DAD triggers gynogenesis in maize.

Gynogenesis is an asexual mode of reproduction common to animals and plants, in which stimuli from the sperm cell trigger the development of the unfertilized egg cell into a haploid embryo. Fine mapping restricted a major maize QTL (quantitative trait locus) responsible for the aptitude of inducer lines to trigger gynogenesis to a zone containing a single gene NOT LIKE DAD (NLD) coding for a patatin-like phospholipase A. In all surveyed inducer lines, NLD carries a 4-bp insertion leading to a predicted truncated protein. This frameshift mutation is responsible for haploid induction because complementation with wild-type NLD abolishes the haploid induction capacity. Activity of the NLD promoter is restricted to mature pollen and pollen tube. The translational NLD::citrine fusion protein likely localizes to the sperm cell plasma membrane. In Arabidopsis roots, the truncated protein is no longer localized to the plasma membrane, contrary to the wild-type NLD protein. In conclusion, an intact pollen-specific phospholipase is required for successful sexual reproduction and its targeted disruption may allow establishing powerful haploid breeding tools in numerous crops. © 2017 The Authors.


July 7, 2019

Evolution and comparative genomics of pAQU-like conjugative plasmids in Vibrio species.

To investigate a set of MDR conjugative plasmids found in Vibrio species and characterize the underlying evolution process. pAQU-type plasmids from Vibrio species were sequenced using both Illumina and PacBio platforms. Bioinformatics tools were utilized to analyse the typical MDR regions and core genes in the plasmids. The nine pAQU-type plasmids ranged from ~160 to 206 kb in size and were found to harbour as many as 111 core genes encoding conjugative, replication and maintenance functions. Eight plasmids were found to carry a typical MDR region, which contained various accessory and resistance genes, including ISCR1-blaPER-1-bearing complex class 1 integrons, ISCR2-floR, ISCR2-tet(D)-tetR-ISCR2, qnrVC6, a Tn10-like structure and others associated with mobile elements. Comparison between a plasmid without resistance genes and different MDR plasmids showed that integration of different mobile elements, such as IS26, ISCR1, ISCR2, IS10 and IS6100, into the plasmid backbone was the key mechanism by which foreign resistance genes were acquired during the evolution process. This study identified pAQU-type plasmids as emerging MDR conjugative plasmids among important pathogens from different origins in Asia. These findings suggest that aquatic bacteria constitute a major reservoir of resistance genes, which may be transmissible to other human pathogens during food production and processing. © The Author 2017. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

