Menu
July 7, 2019

Post genomics era for orchid research.

Among 300,000 species in angiosperms, Orchidaceae containing 30,000 species is one of the largest families. Almost every habitats on earth have orchid plants successfully colonized, and it indicates that orchids are among the plants with significant ecological and evolutionary importance. So far, four orchid genomes have been sequenced, including Phalaenopsis equestris, Dendrobium catenatum, Dendrobium officinale, and Apostaceae shengen. Here, we review the current progress and the direction of orchid research in the post genomics era. These include the orchid genome evolution, genome mapping (genome-wide association analysis, genetic map, physical map), comparative genomics (especially receptor-like kinase and terpene synthase), secondary metabolomics, and genome editing.


July 7, 2019

Diversity in grain amaranths and relatives distinguished by genotyping by sequencing (GBS).

The genotyping by sequencing (GBS) method has become a molecular marker technology of choice for many crop plants because of its simultaneous discovery and evaluation of a large number of single nucleotide polymorphisms (SNPs) and utility for germplasm characterization. Genome representation and complexity reduction are the basis for GBS fingerprinting and can vary by species based on genome size and other sequence characteristics. Grain amaranths are a set of three species that were domesticated in the New World to be high protein, pseudo-cereal grain crops. The goal of this research was to employ the GBS technique for diversity evaluation in grain amaranth accessions and close relatives from sixAmaranthusspecies and determine genetic differences and similarities between groupings. A total of 10,668 SNPs were discovered in 94 amaranth accessions withApeKI complexity reduction and 10X genome coverage Illumina sequencing. The majority of the SNPs were species specific with 4,568 and 3,082 for the two grain amaranths originating in Central AmericaAmaranthus cruentus and A. hypochondriacusand 3,284 found amongst bothA. caudatus, originally domesticated in South America, and its close relative,A. quitensis. The distance matrix based on shared alleles provided information on the close relationships of the two cultivated Central American species with each other and of the wild and cultivated South American species with each other, as distinguished from the outgroup with two wild species,A. powelliiandA. retroflexus. The GBS data also distinguished admixture between each pair of species and the geographical origins and seed colors of the accessions. The SNPs we discovered here can be used for marker development for future amaranth study.


July 7, 2019

Genome sequence-based marker development and genotyping in potato

Potato (Solanum tuberosum L.) is one of the world’s most economically important food crops and holds major significance for future food security. Despite its importance, the study of potato genetics and breeding has lagged behind mainly due to its polyploid genome and high levels of heterozygosity. Conventional marker and genotyping approaches have been helpful in progressing potato genetic research but have also had limitations in exploiting the outcome from these studies for gene discovery and applied research applications. The sequencing of the potato genome, followed by advancements in marker and genotyping technologies, has brought a step change in the way potato genetic studies are conducted. Potato is now amenable to modern sequence-based marker and genotyping methods with their increased ability to put thousands of markers on any population of interest without a priori knowledge. This has increased the precision and resolution of genetic studies previously not feasible in potato. A diverse range of fixed and flexible genotyping platforms, for a wide variety of research and breeding applications, are now available. Concerted research efforts are now needed to screen the available genetic diversity for this important crop to identify novel and beneficial trait alleles in order to enable efficient and precise introgression breeding permitting breeding of climate smart, and resilient, potato cultivars. This chapter provides an overview of sequence-based marker development and genotyping methods along with their implications for potato research and breeding in the post-genomics era.


July 7, 2019

Fragmentation of surface adsorbed and aligned DNA molecules using soft lithography for next-generation sequencing

In this study, the enzymatic in situ cutting of linearized DNA molecules at approximately 11 kbp intervals is demonstrated using a soft lithography technique. The ultimate goal is to provide a general ordered cutting method to greatly simplify the assembly process. DNA was stretched onto PMMA (Poly methyl methacrylate) coated silicon by withdrawing the substrate from a DNA solution (a process termed “combing”). The stretched lambda DNA could be linearly cut with a soft lithography stamp used to selectively apply DNase I. After cutting the DNA on the substrate, the DNA fragments are removed from the surface by incubating PMMA in the commercial NEBuffer 3.1 at 75°C. The recovered fragments desorbed into the buffer and were sequenced using the PacBio RS II sequencer without an amplification step. The mean coverage was 2870X for the approximately 11 kbp fragmented sample and 100% of the lambda genome was sequenced. Methods to extend of the technique to ordered fragmentation are discussed.


July 7, 2019

The complete chloroplast genome of Gentiana straminea (Gentianaceae), an endemic species to the Sino-Himalayan subregion.

Endemic to the Sino-Himalayan subregion, the medicinal alpine plant Gentiana straminea is a threatened species. The genetic and molecular data about it is deficient. Here we report the complete chloroplast (cp) genome sequence of G. straminea, as the first sequenced member of the family Gentianaceae. The cp genome is 148,991bp in length, including a large single copy (LSC) region of 81,240bp, a small single copy (SSC) region of 17,085bp and a pair of inverted repeats (IRs) of 25,333bp. It contains 112 unique genes, including 78 protein-coding genes, 30 tRNAs and 4 rRNAs. The rps16 gene lacks exon2 between trnK-UUU and trnQ-UUG, which is the first rps16 pseudogene found in the nonparasitic plants of Asterids clade. Sequence analysis revealed the presence of 13 forward repeats, 13 palindrome repeats and 39 simple sequence repeats (SSRs). An entire cp genome comparison study of G. straminea and four other species in Gentianales was carried out. Phylogenetic analyses using maximum likelihood (ML) and maximum parsimony (MP) were performed based on 69 protein-coding genes from 36 species of Asterids. The results strongly supported the position of Gentianaceae as one member of the order Gentianales. The complete chloroplast genome sequence will provide intragenic information for its conservation and contribute to research on the genetic and phylogenetic analyses of Gentianales and Asterids. Copyright © 2015 Elsevier B.V. All rights reserved.


July 7, 2019

Environmental changes bridge evolutionary valleys.

In the basic fitness landscape metaphor for molecular evolution, evolutionary pathways are presumed to follow uphill steps of increasing fitness. How evolution can cross fitness valleys is an open question. One possibility is that environmental changes alter the fitness landscape such that low-fitness sequences reside on a hill in alternate environments. We experimentally test this hypothesis on the antibiotic resistance gene TEM-15 ß-lactamase by comparing four evolutionary strategies shaped by environmental changes. The strategy that included initial steps of selecting for low antibiotic resistance (negative selection) produced superior alleles compared with the other three strategies. We comprehensively examined possible evolutionary pathways leading to one such high-fitness allele and found that an initially deleterious mutation is key to the allele’s evolutionary history. This mutation is an initial gateway to an otherwise relatively inaccessible area of sequence space and participates in higher-order, positive epistasis with a number of neutral to slightly beneficial mutations. The ability of negative selection and environmental changes to provide access to novel fitness peaks has important implications for natural evolutionary mechanisms and applied directed evolution.


July 7, 2019

Large-scale mitogenomics enables insights into Schizophora (Diptera) radiation and population diversity.

True flies are insects of the order Diptera and encompass one of the most diverse groups of animals on Earth. Within dipterans, Schizophora represents a recent radiation of insects that was used as a model to develop a pipeline for generating complete mitogenomes using various sequencing platforms and strategies. 91 mitogenomes from 32 different species were sequenced and assembled with high fidelity, using amplicon, whole genome shotgun or single molecule sequencing approaches. Based on the novel mitogenomes, we estimate the origin of Schizophora within the Cretaceous-Paleogene (K-Pg) boundary, about 68.3?Ma. Detailed analyses of the blowfly family (Calliphoridae) place its origin at 22?Ma, concomitant with the radiation of grazing mammals. The emergence of ectoparasitism within calliphorids was dated 6.95?Ma for the screwworm fly and 2.3?Ma for the Australian sheep blowfly. Varying population histories were observed for the blowfly Chrysomya megacephala and the housefly Musca domestica samples in our dataset. Whereas blowflies (n?=?50) appear to have undergone selective sweeps and/or severe bottlenecks in the New World, houseflies (n?=?14) display variation among populations from different zoogeographical zones and low levels of gene flow. The reported high-throughput mitogenomics approach for insects enables new insights into schizophoran diversity and population history of flies.


July 7, 2019

BAC-pool sequencing and assembly of 19 Mb of the complex sugarcane genome.

Sequencing plant genomes are often challenging because of their complex architecture and high content of repetitive sequences. Sugarcane has one of the most complex genomes. It is highly polyploid, preserves intact homeologous chromosomes from its parental species and contains >55% repetitive sequences. Although bacterial artificial chromosome (BAC) libraries have emerged as an alternative for accessing the sugarcane genome, sequencing individual clones is laborious and expensive. Here, we present a strategy for sequencing and assembly reads produced from the DNA of pooled BAC clones. A set of 178 BAC clones, randomly sampled from the SP80-3280 sugarcane BAC library, was pooled and sequenced using the Illumina HiSeq2000 and PacBio platforms. A hybrid assembly strategy was used to generate 2,451 scaffolds comprising 19.2 MB of assembled genome sequence. Scaffolds of =20 Kb corresponded to 80% of the assembled sequences, and the full sequences of forty BACs were recovered in one or two contigs. Alignment of the BAC scaffolds with the chromosome sequences of sorghum showed a high degree of collinearity and gene order. The alignment of the BAC scaffolds to the 10 sorghum chromosomes suggests that the genome of the SP80-3280 sugarcane variety is ~19% contracted in relation to the sorghum genome. In conclusion, our data show that sequencing pools composed of high numbers of BAC clones may help to construct a reference scaffold map of the sugarcane genome.


July 7, 2019

ABO allele-level frequency estimation based on population-scale genotyping by next generation sequencing.

The characterization of the ABO blood group status is vital for blood transfusion and solid organ transplantation. Several methods for the molecular characterization of the ABO gene, which encodes the alleles that give rise to the different ABO blood groups, have been described. However, the application of those methods has so far been restricted to selected samples and not been applied to population-scale analysis.We describe a cost-effective method for high-throughput genotyping of the ABO system by next generation sequencing. Sample specific barcodes and sequencing adaptors are introduced during PCR, rendering the products suitable for direct sequencing on Illumina MiSeq or HiSeq instruments. Complete sequence coverage of exons 6 and 7 enables molecular discrimination of the ABO subgroups and many alleles. The workflow was applied to ABO genotype more than a million samples. We report the allele group frequencies calculated on a subset of more than 110,000 sampled individuals of German origin. Further we discuss the potential of the workflow for high resolution genotyping taking the observed allele group frequencies into account. Finally, sequence analysis revealed 287 distinct so far not described alleles of which the most abundant one was identified in 174 samples.The described workflow delivers high resolution ABO genotyping at low cost enabling population-scale molecular ABO characterization.


July 7, 2019

A hot L1 retrotransposon evades somatic repression and initiates human colorectal cancer.

Although human LINE-1 (L1) elements are actively mobilized in many cancers, a role for somatic L1 retrotransposition in tumor initiation has not been conclusively demonstrated. Here, we identify a novel somatic L1 insertion in the APC tumor suppressor gene that provided us with a unique opportunity to determine whether such insertions can actually initiate colorectal cancer (CRC), and if so, how this might occur. Our data support a model whereby a hot L1 source element on Chromosome 17 of the patient’s genome evaded somatic repression in normal colon tissues and thereby initiated CRC by mutating the APC gene. This insertion worked together with a point mutation in the second APC allele to initiate tumorigenesis through the classic two-hit CRC pathway. We also show that L1 source profiles vary considerably depending on the ancestry of an individual, and that population-specific hot L1 elements represent a novel form of cancer risk. © 2016 Scott et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019

Accelerated dysbiosis of gut microbiota during aggravation of DSS-induced colitis by a butyrate-producing bacterium.

Butyrate-producing bacteria (BPB) are potential probiotic candidates for inflammatory bowel diseases as they are often depleted in the diseased gut microbiota. However, here we found that augmentation of a human-derived butyrate-producing strain, Anaerostipes hadrus BPB5, significantly aggravated colitis in dextran sulphate sodium (DSS)-treated mice while exerted no detrimental effect in healthy mice. We explored how the interaction between BPB5 and gut microbiota may contribute to this differential impact on the hosts. Butyrate production and severity of colitis were assessed in both healthy and DSS-treated mice, and gut microbiota structural changes were analysed using high-throughput sequencing. BPB5-inoculated healthy mice showed no signs of colitis, but increased butyrate content in the gut. In DSS-treated mice, BPB5 augmentation did not increase butyrate content, but induced significantly more severe disease activity index and much higher mortality. BPB5 didn’t induce significant changes of gut microbiota in healthy hosts, but expedited the structural shifts 3 days earlier toward the disease phase in BPB5-augmented than DSS-treated animals. The differential response of gut microbiota in healthy and DSS-treated mice to the same potentially beneficial bacterium with drastically different health consequences suggest that animals with dysbiotic gut microbiota should also be employed for the safety assessment of probiotic candidates.


July 7, 2019

Multiplex enhancer-reporter assays uncover unsophisticated TP53 enhancer logic.

Transcription factors regulate their target genes by binding to regulatory regions in the genome. Although the binding preferences of TP53 are known, it remains unclear what distinguishes functional enhancers from nonfunctional binding. In addition, the genome is scattered with recognition sequences that remain unoccupied. Using two complementary techniques of multiplex enhancer-reporter assays, we discovered that functional enhancers could be discriminated from nonfunctional binding events by the occurrence of a single TP53 canonical motif. By combining machine learning with a meta-analysis of TP53 ChIP-seq data sets, we identified a core set of more than 1000 responsive enhancers in the human genome. This TP53 cistrome is invariably used between cell types and experimental conditions, whereas differences among experiments can be attributed to indirect nonfunctional binding events. Our data suggest that TP53 enhancers represent a class of unsophisticated cell-autonomous enhancers containing a single TP53 binding site, distinct from complex developmental enhancers that integrate signals from multiple transcription factors. © 2016 Verfaillie et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019

First report of blaIMP-14 on a plasmid harboring multiple drug resistance genes in Escherichia coli ST131.

The blaIMP-14 carbapenem resistance gene has largely previously been observed in Pseudomonas aeruginosa and Acinetobacter spp. As part of global surveillance and sequencing of carbapenem-resistant E. coli, we identified an ST131 strain harboring blaIMP-14 within a class 1 integron, itself nested within a ~54kb multi-drug resistance region on an epidemic IncA/C2 plasmid. The emergence of blaIMP-14 in this context in the ST131 lineage is of potential clinical concern. Copyright © 2016 Stoesser et al.


July 7, 2019

Microsatellite length scoring by Single Molecule Real Time Sequencing – Effects of sequence structure and PCR regime.

Microsatellites are DNA sequences consisting of repeated, short (1-6 bp) sequence motifs that are highly mutable by enzymatic slippage during replication. Due to their high intrinsic variability, microsatellites have important applications in population genetics, forensics, genome mapping, as well as cancer diagnostics and prognosis. The current analytical standard for microsatellites is based on length scoring by high precision electrophoresis, but due to increasing efficiency next-generation sequencing techniques may provide a viable alternative. Here, we evaluated single molecule real time (SMRT) sequencing, implemented in the PacBio series of sequencing apparatuses, as a means of microsatellite length scoring. To this end we carried out multiplexed SMRT sequencing of plasmid-carried artificial microsatellites of varying structure under different pre-sequencing PCR regimes. For each repeat structure, reads corresponding to the target length dominated. We found that pre-sequencing amplification had large effects on scoring accuracy and error distribution relative to controls, but that the effects of the number of amplification cycles were generally weak. In line with expectations enzymatic slippage decreased proportionally with microsatellite repeat unit length and increased with repetition number. Finally, we determined directional mutation trends, showing that PCR and SMRT sequencing introduced consistent but opposing error patterns in contraction and expansion of the microsatellites on the repeat motif and single nucleotide level.


July 7, 2019

Structural basis for recombinatorial permissiveness in the generation of Anaplasma marginale Msp2 antigenic variants.

Sequential expression of outer membrane protein antigenic variants is an evolutionarily convergent mechanism used by bacterial pathogens to escape host immune clearance and establish persistent infection. Variants must be sufficiently structurally distinct to escape existing immune effectors yet retain core structural elements required for localization and function within the outer membrane. We examined this balance using Anaplasma marginale, which generates antigenic variants in the outer membrane protein Msp2 using gene conversion. The overwhelming majority of Msp2 variants expressed during long-term persistent infection are mosaics, derived by recombination of oligonucleotide segments from multiple alleles to form unique hypervariable regions (HVR). As a result, the mosaics are not under long-term selective pressure to encode a functional protein; consequently, we hypothesized that the Msp2 HVR is structurally permissive for mosaic expression. Using an integrated approach of predictive modeling with determination of native Msp2 protein structure and function, we demonstrate that structured elements, most notably ß-sheets, are significantly concentrated in the highly conserved N- and C-terminal domains. In contrast the HVR is overwhelmingly random coil with the structured a-helices and ß-sheets confined to the genomically defined “structural tethers” that separate the antigenically variable microdomains. This structure is supported by the surface exposure of the HVR microdomains and the slow diffusion type porin function in native Msp2. Importantly, the predominance of random coil provides plasticity for formation of functional HVR mosaics and realization of the full potential of segmental gene conversion to dramatically expand the variant repertoire. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.