Menu
July 7, 2019

Complete mitogenome of Indian mottled eel, Anguilla bengalensis bengalensis (Gray, 1831) through PacBio RSII sequencing.

Complete mitogenome sequence for Anguilla bengalensis bengalensis (family Anguillidae) was generated through third-generation sequencing platform. The 16?714 bp mitgenome sequence contained 13 protein-coding genes, 22 transfer RNAs, 2 ribosomal RNAs, and a non-coding (control) region. The gene order was identical to that observed in most of the other vertebrates. The comparison of complete mitogenome sequence of Indian mottled eel generated during this study with two other subspecies did not agree with the taxonomic status of the three subspecies and considered as one species.


July 7, 2019

Next-generation polyploid phylogenetics: rapid resolution of hybrid polyploid complexes using PacBio single-molecule sequencing.

Difficulties in generating nuclear data for polyploids have impeded phylogenetic study of these groups. We describe a high-throughput protocol and an associated bioinformatics pipeline (Pipeline for Untangling Reticulate Complexes (Purc)) that is able to generate these data quickly and conveniently, and demonstrate its efficacy on accessions from the fern family Cystopteridaceae. We conclude with a demonstration of the downstream utility of these data by inferring a multi-labeled species tree for a subset of our accessions. We amplified four c. 1-kb-long nuclear loci and sequenced them in a parallel-tagged amplicon sequencing approach using the PacBio platform. Purc infers the final sequences from the raw reads via an iterative approach that corrects PCR and sequencing errors and removes PCR-mediated recombinant sequences (chimeras). We generated data for all gene copies (homeologs, paralogs, and segregating alleles) present in each of three sets of 50 mostly polyploid accessions, for four loci, in three PacBio runs (one run per set). From the raw sequencing reads, Purc was able to accurately infer the underlying sequences. This approach makes it easy and economical to study the phylogenetics of polyploids, and, in conjunction with recent analytical advances, facilitates investigation of broad patterns of polyploid evolution.© 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.


July 7, 2019

Implementation and data analysis of Tn-seq, whole genome resequencing, and single-molecule real time sequencing for bacterial genetics.

Few discoveries have been more transformative to the biological sciences than the development of DNA sequencing technologies. The rapid advancement of sequencing and bioinformatics tools has revolutionized bacterial genetics, deepening our understanding of model and clinically relevant organisms. Although application of newer sequencing technologies to studies in bacterial genetics is increasing, the implementation of DNA sequencing technologies and development of the bioinformatics tools required for analyzing the large data sets generated remains a challenge for many. In this minireview, we have chosen to summarize three sequencing approaches that are particularly useful for bacterial genetics. We provide resources for scientists new to and interested in their application. Herein, we discuss the analysis of Tn-seq data to determine gene disruptions differentially represented in a mutant population, Illumina sequencing for identification of suppressor or other mutations, and we summarize single-molecule real time (SMRT) sequencing for de novo genome assembly and the use of the output data for detection of DNA base modifications. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Genomic analysis of the multi-drug-resistant clinical isolate Myroides odoratimimus PR63039.

Myroides odoratimimus (M. odoratimimus) has been gradually implicated as an important nosocomial pathogen that poses a serious health threat to immunocompromised patients owing to its multi-drug resistance. However, the resistance mechanism is currently unclear. To clarify the antibiotic resistance and infectivity mechanisms of M. odoratimimus, whole genome sequencing was performed on the multi-drug-resistant M. odoratimimus strain PR63039. The genome sequence was completed with single molecule real-time (SMRT) technologies. Then, annotation was performed using RAST and IMG-ER. A number of databases and software programs were used to analyze the genomic characteristics, including GC-Profile, ISfinder, CG viewer, ARDB, CARD, ResFinder, the VFDB database, PHAST and Progressive Mauve. The M. odoratimimus PR63039 genome consisted of a chromosome and a plasmid. The genome contained a large number of resistance genes and virulence factors. The distribution of the resistance genes was distinctive, and a resistance region named MY63039-RR was found. The subsystem features generated by RAST indicated that the annotated genome had 108 genes that were potentially involved in virulence, disease and defense, all of which had strong associations with resistance and pathogenicity. The prophage analysis showed two incomplete prophages in the genome. The genomic analysis of M. odoratimimus PR63039 partially clarified its antibiotic resistance mechanisms and virulence factors. Obtaining a clear understanding of its genomic characteristics will be conducive to the management of multidrug-resistant M. odoratimimus.


July 7, 2019

Organelle_PBA, a pipeline for assembling chloroplast and mitochondrial genomes from PacBio DNA sequencing data.

The development of long-read sequencing technologies, such as single-molecule real-time (SMRT) sequencing by PacBio, has produced a revolution in the sequencing of small genomes. Sequencing organelle genomes using PacBio long-read data is a cost effective, straightforward approach. Nevertheless, the availability of simple-to-use software to perform the assembly from raw reads is limited at present.We present Organelle-PBA, a Perl program designed specifically for the assembly of chloroplast and mitochondrial genomes. For chloroplast genomes, the program selects the chloroplast reads from a whole genome sequencing pool, maps the reads to a reference sequence from a closely related species, and then performs read correction and de novo assembly using Sprai. Organelle-PBA completes the assembly process with the additional step of scaffolding by SSPACE-LongRead. The program then detects the chloroplast inverted repeats and reassembles and re-orients the assembly based on the organelle origin of the reference. We have evaluated the performance of the software using PacBio reads from different species, read coverage, and reference genomes. Finally, we present the assembly of two novel chloroplast genomes from the species Picea glauca (Pinaceae) and Sinningia speciosa (Gesneriaceae).Organelle-PBA is an easy-to-use Perl-based software pipeline that was written specifically to assemble mitochondrial and chloroplast genomes from whole genome PacBio reads. The program is available at https://github.com/aubombarely/Organelle_PBA .


July 7, 2019

Structure and evolution of the filaggrin gene repeated region in primates

The evolutionary dynamics of repeat sequences is quite complex, with some duplicates never having differentiated from each other. Two models can explain the complex evolutionary process for repeated genes—concerted and birth-and-death, of which the latter is driven by duplications maintained by selection. Copy number variations caused by random duplications and losses in repeat regions may modulate molecular pathways and therefore affect phenotypic characteristics in a population, resulting in individuals that are able to adapt to new environments. In this study, we investigated the filaggrin gene (FLG), which codes for filaggrin—an important component of the outer layers of mammalian skin—and contains tandem repeats that exhibit copy number variation between and within species. To examine which model best fits the evolutionary pathway for the complete tandem repeats within a single exon of FLG, we determined the repeat sequences in crab-eating macaque (Macaca fascicularis), orangutan (Pongo abelii), gorilla (Gorilla gorilla), and chimpanzee (Pan troglodytes) and compared these with the sequence in human (Homo sapiens).


July 7, 2019

Evolutionary genomics of the cold-adapted diatom Fragilariopsis cylindrus.

The Southern Ocean houses a diverse and productive community of organisms. Unicellular eukaryotic diatoms are the main primary producers in this environment, where photosynthesis is limited by low concentrations of dissolved iron and large seasonal fluctuations in light, temperature and the extent of sea ice. How diatoms have adapted to this extreme environment is largely unknown. Here we present insights into the genome evolution of a cold-adapted diatom from the Southern Ocean, Fragilariopsis cylindrus, based on a comparison with temperate diatoms. We find that approximately 24.7 per cent of the diploid F. cylindrus genome consists of genetic loci with alleles that are highly divergent (15.1 megabases of the total genome size of 61.1 megabases). These divergent alleles were differentially expressed across environmental conditions, including darkness, low iron, freezing, elevated temperature and increased CO2. Alleles with the largest ratio of non-synonymous to synonymous nucleotide substitutions also show the most pronounced condition-dependent expression, suggesting a correlation between diversifying selection and allelic differentiation. Divergent alleles may be involved in adaptation to environmental fluctuations in the Southern Ocean.


July 7, 2019

Two stable variants of Burkholderia pseudomallei strain MSHR5848 express broadly divergent in vitro phenotypes associated with their virulence differences.

Burkholderia pseudomallei (Bp), the agent of melioidosis, causes disease ranging from acute and rapidly fatal to protracted and chronic. Bp is highly infectious by aerosol, can cause severe disease with nonspecific symptoms, and is naturally resistant to multiple antibiotics. However, no vaccine exists. Unlike many Bp strains, which exhibit random variability in traits such as colony morphology, Bp strain MSHR5848 exhibited two distinct and relatively stable colony morphologies on sheep blood agar plates: a smooth, glossy, pale yellow colony and a flat, rough, white colony. Passage of the two variants, designated “Smooth” and “Rough”, under standard laboratory conditions produced cultures composed of > 99.9% of the single corresponding type; however, both could switch to the other type at different frequencies when incubated in certain nutritionally stringent or stressful growth conditions. These MSHR5848 derivatives were extensively characterized to identify variant-associated differences. Microscopic and colony morphology differences on six differential media were observed and only the Rough variant metabolized sugars in selective agar. Antimicrobial susceptibilities and lipopolysaccharide (LPS) features were characterized and phenotype microarray profiles revealed distinct metabolic and susceptibility disparities between the variants. Results using the phenotype microarray system narrowed the 1,920 substrates to a subset which differentiated the two variants. Smooth grew more rapidly in vitro than Rough, yet the latter exhibited a nearly 10-fold lower lethal dose for mice than Smooth. Finally, the Smooth variant was phagocytosed and replicated to a greater extent and was more cytotoxic than Rough in macrophages. In contrast, multiple locus sequence type (MLST) analysis, ribotyping, and whole genome sequence analysis demonstrated the variants’ genetic conservation; only a single consistent genetic difference between the two was identified for further study. These distinct differences shown by two variants of a Bp strain will be leveraged to better understand the mechanism of Bp phenotypic variability and to possibly identify in vitro markers of infection.


July 7, 2019

De novo hybrid assembly of the rubber tree genome reveals evidence of paleotetraploidy in Hevea species.

Para rubber tree (Hevea brasiliensis) is an important economic species as it is the sole commercial producer of high-quality natural rubber. Here, we report a de novo hybrid assembly of BPM24 accession, which exhibits resistance to major fungal pathogens in Southeast Asia. Deep-coverage 454/Illumina short-read and Pacific Biosciences (PacBio) long-read sequence data were acquired to generate a preliminary draft, which was subsequently scaffolded using a long-range “Chicago” technique to obtain a final assembly of 1.26?Gb (N50?=?96.8?kb). The assembled genome contains 69.2% repetitive sequences and has a GC content of 34.31%. Using a high-density SNP-based genetic map, we were able to anchor 28.9% of the genome assembly (363?Mb) associated with over two thirds of the predicted protein-coding genes into rubber tree’s 18 linkage groups. These genetically anchored sequences allowed comparative analyses of the intragenomic homeologous synteny, providing the first concrete evidence to demonstrate the presence of paleotetraploidy in Hevea species. Additionally, the degree of macrosynteny conservation observed between rubber tree and cassava strongly supports the hypothesis that the paleotetraploidization event took place prior to the divergence of the Hevea and Manihot species.


July 7, 2019

The evolution and population diversity of human-specific segmental duplications

Segmental duplications contribute to human evolution, adaptation and genomic instability but are often poorly characterized. We investigate the evolution, genetic variation and coding potential of human-specific segmental duplications (HSDs). We identify 218 HSDs based on analysis of 322 deeply sequenced archaic and contemporary hominid genomes. We sequence 550 human and nonhuman primate genomic clones to reconstruct the evolution of the largest, most complex regions with protein-coding potential (N?=?80 genes from 33 gene families). We show that HSDs are non-randomly organized, associate preferentially with ancestral ape duplications termed ‘core duplicons’ and evolved primarily in an interspersed inverted orientation. In addition to Homo sapiens-specific gene expansions (such as TCAF1/TCAF2), we highlight ten gene families (for example, ARHGAP11B and SRGAP2C) where copy number never returns to the ancestral state, there is evidence of mRNA splicing and no common gene-disruptive mutations are observed in the general population. Such duplicates are candidates for the evolution of human-specific adaptive traits.


July 7, 2019

First complete genome sequence of Marinilactibacillus piezotolerans strain 15R, a marine lactobacillus isolated from coal-bearing sediment 2.0 kilometers below the seafloor, determined by PacBio single-molecule real-time technology.

Marinilactibacillus piezotolerans strain 15R is a facultatively anaerobic heterotrophic lactobacillus isolated from deep marine subsurface sediment nearly 2 km below the seafloor in the northwestern Pacific. We report here the first whole-genome sequence of strain 15R. The identified genome sequence has 2,767,908 bp, 35.4% G+C content, and predicted 2,552 candidate protein-coding sequences, with no identified plasmids. Copyright © 2017 Wei et al.


July 7, 2019

Genome sequence of a unique Magnaporthe oryzae RMg-Dl isolate from India that causes blast disease in diverse cereal crops, obtained using PacBio single-molecule and Illumina HiSeq2500 sequencing.

The whole-genome assembly of a unique rice isolate from India, Magnaporthe oryzae RMg-Dl that causes blast disease in diverse cereal crops is presented. Analysis of the 34.82 Mb genome sequence will aid in better understanding the genetic determinants of host range, host jump, survival, pathogenicity, and virulence factors of M. oryzae. Copyright © 2017 Kumar et al.


July 7, 2019

Fallacy of the unique genome: sequence diversity within single Helicobacter pylori strains.

Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged Helicobacter pylori strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these H. pylori isolates, including movement of the transposable element IS607, large and small inversions, multiple single nucleotide polymorphisms, and variation in cagA copy number. The cagA gene was found as 1 to 4 tandem copies located off the cag island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in cagY, which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB a-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains.IMPORTANCE Although it is well known that many bacterial genomes are highly variable, it is nonetheless traditional to refer to, analyze, and publish “the genome” of a bacterial strain. Variability is usually reduced (“only sequence from a single colony”), ignored (“just publish the consensus”), or placed in the “too-hard” basket (“analysis of raw read data is more robust”). Now that whole-genome sequences are regularly used to assess virulence and track outbreaks, a better understanding of the baseline genomic variation present within single strains is needed. Here, we describe the variability seen in typical working stocks and colonies of pathogen Helicobacter pylori model strains SS1 and PMSS1 as revealed by use of high-coverage mate pair next-generation sequencing (NGS) and confirmed by traditional laboratory techniques. This work demonstrates that reliance on a consensus assembly as “the genome” of a bacterial strain may be misleading. Copyright © 2017 Draper et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.