September 22, 2019

Single-molecule real-time transcript sequencing facilitates common wheat genome annotation and grain transcriptome research.

The large and complex hexaploid genome has greatly hindered genomics studies of common wheat (Triticum aestivum, AABBDD). Here, we investigated transcripts in common wheat developing caryopses using the emerging single-molecule real-time (SMRT) sequencing technology PacBio RSII, and assessed the resultant data for improving common wheat genome annotation and grain transcriptome research. We obtained 197,709 full-length non-chimeric (FLNC) reads, 74.6 % of which were estimated to carry a complete open reading frame. A total of 91,881 high-quality FLNC reads were identified and mapped to 16,188 chromosomal loci, corresponding to 13,162 known genes and 3026 new genes not annotated previously. Although some FLNC reads could not be unambiguously mapped to the current draft genome sequence, many of them are likely useful for studying highly similar homoeologous or paralogous loci or for improving chromosomal contig assembly in further research. The 91,881 high-quality FLNC reads represented 22,768 unique transcripts, 9591 of which were newly discovered. We found 180 transcripts each spanning two or three previously annotated adjacent loci, suggesting that they should be merged to form correct gene models. Finally, our data facilitated the identification of 6030 genes differentially regulated during caryopsis development, and full-length transcripts for 72 transcribed gluten gene members that are important for the end-use quality control of common wheat. Our work demonstrated the value of PacBio transcript sequencing for improving common wheat genome annotation through uncovering the loci and full-length transcripts not discovered previously. The resource obtained may aid further structural genomics and grain transcriptome studies of common wheat.


September 22, 2019

Accurate characterization of the IFITM locus using MiSeq and PacBio sequencing shows genetic variation in Galliformes.

Interferon inducible transmembrane (IFITM) proteins are effectors of the immune system widely characterized for their role in restricting infection by diverse enveloped and non-enveloped viruses. The chicken IFITM (chIFITM) genes are clustered on chromosome 5 and to date four genes have been annotated, namely chIFITM1, chIFITM3, chIFITM5 and chIFITM10. However, due to poor assembly of this locus in the Gallus gallus v4 genome, accurate characterization has so far proven problematic. Recently, a new chicken reference genome assembly, Gallus gallus v5, was generated using Sanger, 454, Illumina and PacBio sequencing technologies, identifying considerable differences in the chIFITM locus over the previous genome releases. We re-sequenced the locus using both Illumina MiSeq and PacBio RS II sequencing technologies and we mapped RNA-seq data from the European Nucleotide Archive (ENA) to this finalized chIFITM locus. Using SureSelect capture probes designed to the finalized chIFITM locus, we sequenced the locus of a different chicken breed, namely a White Leghorn, and a turkey. We confirmed the Gallus gallus v5 consensus except for two insertions of 5 and 1 base pair within the chIFITM3 and B4GALNT4 genes, respectively, and a single base pair deletion within the B4GALNT4 gene. The pull down revealed a single amino acid substitution of A63V in the CIL domain of IFITM2 compared to Red Jungle fowl and 13, 13 and 11 differences between IFITM1, 2 and 3 of chickens and turkeys, respectively. RNA-seq shows chIFITM2 and chIFITM3 expression in numerous tissue types of different chicken breeds and avian cell lines, while the expression of the putative chIFITM1 is limited to the testis, caecum and ileum tissues. Locus resequencing using these capture probes and RNA-seq based expression analysis will allow the further characterization of genetic diversity within Galliformes.


September 22, 2019

The full transcription map of mouse papillomavirus type 1 (MmuPV1) in mouse wart tissues.

Mouse papillomavirus type 1 (MmuPV1) provides, for the first time, the opportunity to study infection and pathogenesis of papillomaviruses in the context of laboratory mice. In this report, we define the transcriptome of MmuPV1 genome present in papillomas arising in experimentally infected mice using a combination of RNA-seq, PacBio Iso-seq, 5′ RACE, 3′ RACE, primer-walking RT-PCR, RNase protection, Northern blot and in situ hybridization analyses. We demonstrate that the MmuPV1 genome is transcribed unidirectionally from five major promoters (P) or transcription start sites (TSS) and polyadenylates its transcripts at two major polyadenylation (pA) sites. We designate the P7503, P360 and P859 as “early” promoters because they give rise to transcripts mostly utilizing the polyadenylation signal at nt 3844 and therefore can only encode early genes, and P7107 and P533 as “late” promoters because they give rise to transcripts utilizing polyadenylation signals at either nt 3844 or nt 7047, the latter being able to encode late, capsid proteins. MmuPV1 genome contains five splice donor sites and three acceptor sites that produce thirty-six RNA isoforms deduced to express seven predicted early gene products (E6, E7, E1, E1^M1, E1^M2, E2 and E8^E2) and three predicted late gene products (E1^E4, L2 and L1). The majority of the viral early transcripts are spliced once from nt 757 to 3139, while viral late transcripts, which are predicted to encode L1, are spliced twice, first from nt 7243 to either nt 3139 (P7107) or nt 757 to 3139 (P533) and second from nt 3431 to nt 5372. Thirteen of these viral transcripts were detectable by Northern blot analysis, with the P533-derived late E1^E4 transcripts being the most abundant. The late transcripts could be detected in highly differentiated keratinocytes of MmuPV1-infected tissues as early as ten days after MmuPV1 inoculation and correlated with detection of L1 protein and viral DNA amplification. 
In mature warts, detection of L1 was also found in more poorly differentiated cells, as previously reported. Subclinical infections were also observed. The comprehensive transcription map of MmuPV1 generated in this study provides further evidence that MmuPV1 is similar to high-risk cutaneous beta human papillomaviruses. The knowledge revealed will facilitate the use of MmuPV1 as an animal virus model for understanding of human papillomavirus gene expression, pathogenesis and immunology.


September 22, 2019

Whole genome sequencing of “Faecalibaculum rodentium” ALO17, isolated from C57BL/6J laboratory mouse feces.

Intestinal microorganisms affect host physiology, including ageing. Given the difficulty of controlling human studies of the gut microbiome, mouse models provide an alternative avenue to study such relationships. In this study, we report on the complete genome of “Faecalibaculum rodentium” ALO17, a bacterium that was isolated from the faeces of a 9-month-old female C57BL/6J mouse. This strain will be utilized in future in vivo studies detailing the relationships between the gut microbiome and ageing. The whole genome sequence of “F. rodentium” ALO17 was obtained using the single-molecule, real-time (SMRT) technique on a PacBio instrument. The assembled genome consisted of 2,542,486 base pairs of double-stranded DNA with a GC content of 54.0 % and no plasmids. The genome was predicted to contain 2794 open reading frames, 55 tRNA genes, and 38 rRNA genes. The 16S rRNA gene of ALO17 was 86.9 % similar to that of Allobaculum stercoricanis DSM 13633(T), and the average overall nucleotide identity between strains ALO17 and DSM 13633(T) was 66.8 %. After confirming the phylogenetic relationship between “F. rodentium” ALO17 and A. stercoricanis DSM 13633(T), their whole genome sequences were compared, revealing that “F. rodentium” ALO17 contains more fermentation-related genes than A. stercoricanis DSM 13633(T). Furthermore, “F. rodentium” ALO17 produces higher levels of lactic acid than A. stercoricanis DSM 13633(T) as determined by high-performance liquid chromatography. The availability of the “F. rodentium” ALO17 whole genome sequence will enhance studies concerning the gut microbiota and host physiology, especially when investigating the molecular relationships between gut microbiota and ageing.


September 22, 2019

Analysis of the gut microbial diversity of dairy cows during peak lactation by PacBio Single-Molecule Real-Time (SMRT) Sequencing.

The gut microbes of dairy cows are strongly associated with their health, but the relationship between milk production and the intestinal microbiota has seldom been studied. Thus, we explored the diversity of the intestinal microbiota during peak lactation of dairy cows. The intestinal microbiota of nine dairy cows at peak lactation was evaluated using the Pacific Biosciences single-molecule real-time (PacBio SMRT) sequencing approach. A total of 32,670 high-quality 16S rRNA gene sequences were obtained, belonging to 12 phyla, 59 families, 107 genera, and 162 species. Firmicutes (83%) were the dominant phylum, while Bacteroides (6.16%) was the dominant genus. All samples showed a high microbial diversity, with numerous genera of short chain fatty acid (SCFA)-producers. The proportion of SCFA producers was relatively high in relation to the identified core intestinal microbiota. Moreover, the predicted functional metagenome was heavily involved in energy metabolism. This study provided novel insights into the link between the dairy cow gut microbiota and milk production.


September 22, 2019

High-resolution phylogenetic microbial community profiling.

Over the past decade, high-throughput short-read 16S rRNA gene amplicon sequencing has eclipsed clone-dependent long-read Sanger sequencing for microbial community profiling. The transition to new technologies has provided more quantitative information at the expense of taxonomic resolution with implications for inferring metabolic traits in various ecosystems. We applied single-molecule real-time sequencing for microbial community profiling, generating full-length 16S rRNA gene sequences at high throughput, which we propose to name PhyloTags. We benchmarked and validated this approach using a defined microbial community. When further applied to samples from the water column of meromictic Sakinaw Lake, we show that while community structures at the phylum level are comparable between PhyloTags and Illumina V4 16S rRNA gene sequences (iTags), variance increases with community complexity at greater water depths. PhyloTags moreover allowed less ambiguous classification. Last, a platform-independent comparison of PhyloTags and in silico generated partial 16S rRNA gene sequences demonstrated significant differences in community structure and phylogenetic resolution across multiple taxonomic levels, including a severe underestimation in the abundance of specific microbial genera involved in nitrogen and methane cycling across the Lake’s water column. Thus, PhyloTags provide a reliable adjunct or alternative to cost-effective iTags, enabling more accurate phylogenetic resolution of microbial communities and predictions on their metabolic potential.


September 22, 2019

Comparative genomic analysis of Sulfurospirillum cavolei MES reconstructed from the metagenome of an electrosynthetic microbiome.

Sulfurospirillum spp. play an important role in sulfur and nitrogen cycling, and contain metabolic versatility that enables reduction of a wide range of electron acceptors, including thiosulfate, tetrathionate, polysulfide, nitrate, and nitrite. Here we describe the assembly of a Sulfurospirillum genome obtained from the metagenome of an electrosynthetic microbiome. The ubiquity and persistence of this organism in microbial electrosynthesis systems suggest it plays an important role in reactor stability and performance. Understanding why this organism is present and elucidating its genetic repertoire provide a genomic and ecological foundation for future studies where Sulfurospirillum are found, especially in electrode-associated communities. Metabolic comparisons and in-depth analysis of unique genes revealed potential ecological niche-specific capabilities within the Sulfurospirillum genus. The functional similarities common to all genomes, i.e., core genome, and unique gene clusters found only in a single genome were identified. Based upon 16S rRNA gene phylogenetic analysis and average nucleotide identity, the Sulfurospirillum draft genome was found to be most closely related to Sulfurospirillum cavolei. Characterization of the draft genome described herein provides pathway-specific details of the metabolic significance of the newly described Sulfurospirillum cavolei MES and, importantly, yields insight to the ecology of the genus as a whole. Comparison of eleven sequenced Sulfurospirillum genomes revealed a total of 6246 gene clusters in the pan-genome. Of the total gene clusters, 18.5% were shared among all eleven genomes and 50% were unique to a single genome. While most Sulfurospirillum spp. reduce nitrate to ammonium, five of the eleven Sulfurospirillum strains encode for a nitrous oxide reductase (nos) cluster with an atypical nitrous-oxide reductase, suggesting a utility for this genus in reduction of the nitrous oxide, and as a potential sink for this potent greenhouse gas.
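The core and unique percentages quoted above come from tallying, for each gene cluster, how many genomes contain it. A minimal Python sketch of that tally follows. It assumes the clustering has already been done and is available as a mapping from cluster ID to the set of genomes carrying it; the function name, the input layout, and the toy data are all hypothetical and are not taken from the study.

```python
def summarize_pangenome(cluster_to_genomes, n_genomes):
    """Count core and single-genome clusters in a pan-genome.

    cluster_to_genomes: dict mapping a cluster ID to the set of genome
    names that contain at least one member of that cluster (assumed input).
    n_genomes: total number of genomes compared.
    """
    total = len(cluster_to_genomes)
    # Core clusters are present in every genome.
    core = sum(1 for g in cluster_to_genomes.values() if len(g) == n_genomes)
    # Unique clusters are present in exactly one genome.
    unique = sum(1 for g in cluster_to_genomes.values() if len(g) == 1)
    return {
        "total": total,
        "core": core,
        "unique": unique,
        "core_pct": 100.0 * core / total,
        "unique_pct": 100.0 * unique / total,
    }


# Toy example with three genomes and four clusters (illustrative only).
toy = {
    "c1": {"A", "B", "C"},  # core
    "c2": {"A"},            # unique to A
    "c3": {"B", "C"},       # accessory
    "c4": {"C"},            # unique to C
}
summary = summarize_pangenome(toy, n_genomes=3)
```

With real data, the same function would be called with `n_genomes=11` and the cluster table produced by whichever clustering tool was used.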


September 22, 2019

Root endophytes and invasiveness: no difference between native and non-native Phragmites in the Great Lakes Region

Microbial interactions could play an important role in plant invasions. If invasive plants associate with relatively more mutualists or fewer pathogens than their native counterparts, then microbial communities could foster plant invasiveness. Studies examining the effects of microbes on invasive plants commonly focus on a single microbial group (e.g., bacteria) or measure only plant response to microbes, not documenting the specific taxa associating with invaders. We surveyed root microbial communities associated with co-occurring native and non-native lineages of Phragmites australis, across Michigan, USA. Our aim was to determine whether (1) plant lineage was a stronger predictor of root microbial community composition than environmental variables and (2) the non-native lineage associated with more mutualistic and/or fewer pathogenic microbes than the native lineage. We used microscopy and culture-independent molecular methods to examine fungal colonization rate and community composition in three major microbial groups (bacteria, fungi, and oomycetes) within roots. We also used microbial functional databases to assess putative functions of the observed microbial taxa. While fungal colonization of roots was significantly higher in non-native Phragmites than the native lineage, we found no differences in root microbial community composition or potential function between the two Phragmites lineages. Community composition did differ significantly by site, with soil saturation playing a significant role in structuring communities in all three microbial groups. The relative abundance of some specific bacterial taxa did differ between Phragmites lineages at the phylum and genus level (e.g., Proteobacteria, Firmicutes). Purported function of root fungi and respiratory mode of root bacteria also did not differ between native and non-native Phragmites. 
We found no evidence that native and non-native Phragmites harbored distinct root microbial communities; nor did those communities differ functionally. Therefore, if the trends revealed at our sites are widespread, it is unlikely that total root microbial communities are driving invasion by non-native Phragmites plants.


September 22, 2019

Characteristics of ARG-carrying plasmidome in the cultivable microbial community from wastewater treatment system under high oxytetracycline concentration.

Studies on antibiotic production wastewater have shown that even a single antibiotic can select for multidrug resistant bacteria in aquatic environments. It is speculated that plasmids are an important mechanism of multidrug resistance (MDR) under high concentrations of antibiotics. Herein, two metagenomic libraries were constructed with plasmid DNA extracted from cultivable microbial communities in a biological wastewater treatment reactor supplemented with 0 (CONTROL) or 25 mg/L of oxytetracycline (OTC-25). The OTC-25 plasmidome reads were assigned to 72 antibiotic resistance genes (ARGs) conferring resistance to 13 types of antibiotics. Dominant ARGs, encoding resistance to tetracycline, aminoglycoside, sulfonamide, and multidrug resistance genes, were enriched in the plasmidome under 25 mg/L of oxytetracycline. Furthermore, 17 contiguous multiple-ARG carrying contigs (carrying ≥2 ARGs) were discovered in the OTC-25 plasmidome, whereas only nine were found in the CONTROL. Mapping of the OTC-25 plasmidome reads to completely sequenced plasmids revealed that the conjugative IncU resistance plasmid pFBAOT6 of Aeromonas caviae, carrying multidrug resistance transporter (pecM), tetracycline resistance genes (tetA, tetR), and transposase genes, might be a potential prevalent resistant plasmid in the OTC-25 plasmidome. Additionally, two novel resistant plasmids (containing contig C301682 carrying multidrug resistant operon mexCD-oprJ and contig C301632 carrying the tet36 and transposase genes) might also be potential prevalent resistant plasmids in the OTC-25 plasmidome. This study will be helpful to better understand the role of plasmids in the development of MDR in water environments under high antibiotic concentrations.


September 22, 2019

Different next generation sequencing platforms produce different microbial profiles and diversity in cystic fibrosis sputum.

Cystic fibrosis (CF) is an autosomal recessive disease characterized by recurrent lung infections. Studies of the lung microbiome have shown an association between decreasing diversity and progressive disease. 454 pyrosequencing has frequently been used to study the lung microbiome in CF, but will no longer be supported. We sought to identify the benefits and drawbacks of using two state-of-the-art next generation sequencing (NGS) platforms, MiSeq and PacBio RSII, to characterize the CF lung microbiome. Each has its advantages and limitations. Twelve samples of extracted bacterial DNA were sequenced on both MiSeq and PacBio NGS platforms. DNA was amplified for the V4 region of the 16S rRNA gene and libraries were sequenced on the MiSeq sequencing platform, while the full 16S rRNA gene was sequenced on the PacBio RSII sequencing platform. Raw FASTQ files generated by the MiSeq and PacBio platforms were processed in mothur v1.35.1. There was extreme discordance in alpha-diversity of the CF lung microbiome when using the two platforms. Because of its depth of coverage, sequencing of the 16S rRNA V4 gene region using MiSeq allowed for the observation of many more operational taxonomic units (OTUs) and higher Chao1 and Shannon indices than the PacBio RSII. Interestingly, several patients in our cohort had Escherichia, an unusual pathogen in CF. Also, likely because of its coverage of the complete 16S rRNA gene, only PacBio RSII was able to identify Burkholderia, an important CF pathogen. When comparing microbiome diversity in clinical samples from CF patients using 16S sequences, MiSeq and PacBio NGS platforms may generate different results in microbial community composition and structure. It may be necessary to use different platforms when trying to correctly identify dominant pathogens versus measuring alpha-diversity estimates, and it would be important to use the same platform for comparisons to minimize errors in interpretation. Copyright © 2016 Elsevier B.V. All rights reserved.
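For readers unfamiliar with the two alpha-diversity measures named in this abstract, the following Python sketch shows how the Shannon index and a bias-corrected form of the Chao1 richness estimator can be computed from a list of per-OTU read counts. It is an illustration under stated assumptions only: the function names and the sample counts are hypothetical, and the study itself used mothur, whose exact calculator settings are not given here.

```python
import math


def shannon_index(counts):
    """Shannon diversity H' = -sum(p_i * ln p_i) over OTUs with reads."""
    total = sum(counts)
    return -sum((c / total) * math.log(c / total) for c in counts if c > 0)


def chao1(counts):
    """Bias-corrected Chao1 richness estimate.

    S_obs + F1 * (F1 - 1) / (2 * (F2 + 1)), where F1 and F2 are the
    numbers of OTUs seen exactly once and exactly twice.
    """
    s_obs = sum(1 for c in counts if c > 0)
    f1 = sum(1 for c in counts if c == 1)
    f2 = sum(1 for c in counts if c == 2)
    return s_obs + f1 * (f1 - 1) / (2 * (f2 + 1))


# Hypothetical OTU read counts for one sample (not data from the study).
sample_counts = [10, 5, 1, 1, 2]
h = shannon_index(sample_counts)
richness = chao1(sample_counts)
```

Because Chao1 depends on singleton and doubleton counts, deeper sequencing of the same sample tends to change the estimate, which is one reason platforms with different read depths can disagree on alpha-diversity.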


September 22, 2019

Soil bacterial communities are shaped by temporal and environmental filtering: evidence from a long-term chronosequence.

Soil microbial communities are abundant, hyper-diverse and mediate global biogeochemical cycles, but we do not yet understand the processes mediating their assembly. Current hypothetical frameworks suggest temporal (e.g. dispersal limitation) and environmental (e.g. soil pH) filters shape microbial community composition; however, there is limited empirical evidence supporting this framework in the hyper-diverse soil environment, particularly at large spatial (i.e. regional to continental) and temporal (i.e. 100 to 1000 years) scales. Here, we present evidence from a long-term chronosequence (4000 years) that temporal and environmental filters do indeed shape soil bacterial community composition. Furthermore, nearly 20 years of environmental monitoring allowed us to control for potentially confounding environmental variation. Soil bacterial communities were phylogenetically distinct across the chronosequence. We determined that temporal and environmental factors accounted for significant portions of bacterial phylogenetic structure using distance-based linear models. Environmental factors together accounted for the majority of phylogenetic structure, namely, soil temperature (19%), pH (17%) and litter carbon:nitrogen (C:N; 17%). However, of all individual factors, time since deglaciation accounted for the greatest proportion of bacterial phylogenetic structure (20%). Taken together, our results provide empirical evidence that temporal and environmental filters act together to structure soil bacterial communities across large spatial and long-term temporal scales. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.


September 22, 2019

Improved full-length killer cell immunoglobulin-like receptor transcript discovery in Mauritian cynomolgus macaques.

Killer cell immunoglobulin-like receptors (KIRs) modulate disease progression of pathogens including HIV, malaria, and hepatitis C. Cynomolgus and rhesus macaques are widely used as nonhuman primate models to study human pathogens, and so considerable effort has been put into characterizing their KIR genetics. However, previous studies have relied on cDNA cloning and Sanger sequencing, which lack the throughput of current sequencing platforms. In this study, we present a high throughput, full-length allele discovery method utilizing Pacific Biosciences circular consensus sequencing (CCS). We also describe a new approach to Macaque Exome Sequencing (MES) and the development of the Rhexome1.0, an adapted target capture reagent that includes macaque-specific capture probe sets. By using sequence reads generated by whole genome sequencing (WGS) and MES to inform primer design, we were able to increase the sensitivity of KIR allele discovery. We demonstrate this increased sensitivity by defining nine novel alleles within a cohort of Mauritian cynomolgus macaques (MCM), a geographically isolated population with restricted KIR genetics that was thought to be completely characterized. Finally, we describe an approach to genotyping KIRs directly from sequence reads generated using WGS/MES. The findings presented here expand our understanding of KIR genetics in MCM by associating new genes with all eight KIR haplotypes and demonstrating the existence of at least one KIR3DS gene associated with every haplotype.


September 22, 2019

Genome sequence determination and metagenomic characterization of a Dehalococcoides mixed culture grown on cis-1,2-dichloroethene.

A Dehalococcoides-containing bacterial consortium that performed dechlorination of 0.20 mM cis-1,2-dichloroethene to ethene in 14 days was obtained from the sediment mud of a lotus field. To obtain detailed information on the consortium, the metagenome was analyzed using the short-read next-generation sequencer SOLiD 3. Matching the obtained sequence tags with the reference genome sequences indicated that the Dehalococcoides sp. in the consortium was highly homologous to Dehalococcoides mccartyi CBDB1 and BAV1. Sequence comparison with the reference sequence constructed from 16S rRNA gene sequences in a public database showed the presence of Sedimentibacter, Sulfurospirillum, Clostridium, Desulfovibrio, Parabacteroides, Alistipes, Eubacterium, Peptostreptococcus and Proteocatella in addition to Dehalococcoides sp. After further enrichment, the members of the consortium were narrowed down to almost three species. Finally, the full-length circular genome sequence of the Dehalococcoides sp. in the consortium, D. mccartyi IBARAKI, was determined by analyzing the metagenome with the single-molecule DNA sequencer PacBio RS. The accuracy of the sequence was confirmed by matching it to the tag sequences obtained by SOLiD 3. The genome is 1,451,062 nt and the number of CDS is 1566, which includes 3 rRNA genes and 47 tRNA genes. There exist twenty-eight RDase genes that are accompanied by the genes for anchor proteins. The genome exhibits significant sequence identity with other Dehalococcoides spp. throughout the genome, but there exists a significant difference in the distribution of RDase genes. The combination of a short-read next-generation DNA sequencer and a long-read single-molecule DNA sequencer gives detailed information on a bacterial consortium. Copyright © 2014 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.


September 22, 2019

Complete genome sequence of Endomicrobium proavitum, a free-living relative of the intracellular symbionts of termite gut flagellates (phylum Elusimicrobia).

We sequenced the complete genome of Endomicrobium proavitum strain Rsa215, the first isolate of the class Endomicrobia (phylum Elusimicrobia). It is the closest free-living relative of the endosymbionts of termite gut flagellates and thereby provides an excellent model for studying the evolutionary processes during the establishment of an intracellular symbiosis. Copyright © 2015 Zheng and Brune.

