De novo assembly Archives - Page 52 of 324

September 22, 2019

De novo assembly of a Chinese soybean genome.

Soybean was domesticated in China and has become one of the most important oilseed crops. Due to bottlenecks in their introduction and dissemination, soybeans from different geographic areas exhibit extensive genetic diversity. Asia is the largest soybean market; therefore, a high-quality soybean reference genome from this area is critical for soybean research and breeding. Here, we report the de novo assembly and sequence analysis of a Chinese soybean genome for “Zhonghuang 13” by a combination of SMRT, Hi-C and optical mapping data. The assembled genome size is 1.025 Gb with a contig N50 of 3.46 Mb and a scaffold N50 of 51.87 Mb. Comparisons between this genome and the previously reported reference genome (cv. Williams 82) uncovered more than 250,000 structure variations. A total of 52,051 protein coding genes and 36,429 transposable elements were annotated for this genome, and a gene co-expression network including 39,967 genes was also established. This high quality Chinese soybean genome and its sequence analysis will provide valuable information for soybean improvement in the future.

September 22, 2019

Genome sequence determination and metagenomic characterization of a Dehalococcoides mixed culture grown on cis-1,2-dichloroethene.

A Dehalococcoides-containing bacterial consortium that performed dechlorination of 0.20 mM cis-1,2-dichloroethene to ethene in 14 days was obtained from the sediment mud of the lotus field. To obtain detailed information of the consortium, the metagenome was analyzed using the short-read next-generation sequencer SOLiD 3. Matching the obtained sequence tags with the reference genome sequences indicated that the Dehalococcoides sp. in the consortium was highly homologous to Dehalococcoides mccartyi CBDB1 and BAV1. Sequence comparison with the reference sequence constructed from 16S rRNA gene sequences in a public database showed the presence of Sedimentibacter, Sulfurospirillum, Clostridium, Desulfovibrio, Parabacteroides, Alistipes, Eubacterium, Peptostreptococcus and Proteocatella in addition to Dehalococcoides sp. After further enrichment, the members of the consortium were narrowed down to almost three species. Finally, the full-length circular genome sequence of the Dehalococcoides sp. in the consortium, D. mccartyi IBARAKI, was determined by analyzing the metagenome with the single-molecule DNA sequencer PacBio RS. The accuracy of the sequence was confirmed by matching it to the tag sequences obtained by SOLiD 3. The genome is 1,451,062 nt and the number of CDS is 1566, which includes 3 rRNA genes and 47 tRNA genes. There exist twenty-eight RDase genes that are accompanied by the genes for anchor proteins. The genome exhibits significant sequence identity with other Dehalococcoides spp. throughout the genome, but there exists significant difference in the distribution RDase genes. The combination of a short-read next-generation DNA sequencer and a long-read single-molecule DNA sequencer gives detailed information of a bacterial consortium. Copyright © 2014 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.

September 22, 2019

Complete genome sequence of Endomicrobium proavitum, a free-living relative of the intracellular symbionts of termite gut flagellates (phylum Elusimicrobia).

We sequenced the complete genome of Endomicrobium proavitum strain Rsa215, the first isolate of the class Endomicrobia (phylum Elusimicrobia). It is the closest free-living relative of the endosymbionts of termite gut flagellates and thereby provides an excellent model for studying the evolutionary processes during the establishment of an intracellular symbiosis. Copyright © 2015 Zheng and Brune.

September 22, 2019

Complete genome sequence of Bacillus kochii Oregon-R-modENCODE strain BDGP4, isolated from Drosophila melanogaster gut.

Bacillus kochii Oregon-R-modENCODE strain BDGP4 was isolated from the gut of Drosophila melanogaster for functional host microbial interaction studies. The complete genome comprised a single chromosomal circle of 4,557,232 bp with a G+C content of 37% and a single plasmid of 137,143 bp. Copyright © 2017 Wan et al.

September 22, 2019

Precise fecal microbiome of the herbivorous Tibetan antelope inhabiting high-altitude alpine plateau

The metataxonomic approach combining 16S rRNA gene amplicon sequencing using the PacBio technology with the application of the operational phylogenetic unit (OPU) approach, has been used to analyze the fecal microbial composition of the high-altitude and herbivorous Tibetan antelopes. The fecal samples of the antelope were collected in Hoh Xil National Nature Reserve, at an altitude over 4500 m, the largest depopulated zone in Qinghai-Tibetan Plateau, China, where non-native animals or humans may experience life-threatening acute mountain sickness. In total, 104 antelope fecal samples were enrolled in this study, and were clustered into 61,258 operational taxonomic units (OTUs) at an identity of 98.7% and affiliated with 757 OPUs, including 144 known species, 256 potentially new species, 103 potentially higher taxa within known lineages. In addition, 254 comprised sequences not affiliating with any known family, and the closest relatives were unclassified lineages of existing orders or classes. A total of 42 out of 757 OPUs conformed to the core fecal microbiome, of which four major lineages, namely, un-cultured Ruminococcaceae, Lachnospiraceae, Akkermansia and Christensenellaceae were associated with human health or longevity. The current study reveals that the fecal core microbiome of antelope is mainly composited of uncultured bacteria. The most abundant core taxa, namely, uncultured Ruminococcaceae, uncultured Akkermansia, uncultured Bacteroides, uncultured Christensenellaceae, uncultured Mollicutes, and uncultured Lachnospiraceae, may represent new bacterial candidates at high taxa levels, and several may have beneficial roles in health promotion or anti-intestinal dysbiosis. These organisms should be further isolated and evaluated for potential effect on human health and longevity.

September 22, 2019

Long reads: their purpose and place.

In recent years long-read technologies have moved from being a niche and specialist field to a point of relative maturity likely to feature frequently in the genomic landscape. Analogous to next generation sequencing, the cost of sequencing using long-read technologies has materially dropped whilst the instrument throughput continues to increase. Together these changes present the prospect of sequencing large numbers of individuals with the aim of fully characterizing genomes at high resolution. In this article, we will endeavour to present an introduction to long-read technologies showing: what long reads are; how they are distinct from short reads; why long reads are useful and how they are being used. We will highlight the recent developments in this field, and the applications and potential of these technologies in medical research, and clinical diagnostics and therapeutics.

September 22, 2019

Full-length RNA sequencing reveals unique transcriptome composition in bermudagrass.

Bermudagrass [Cynodon dactylon (L.) Pers.] is an important perennial warm-season turfgrass species with great economic value. However, the reference genome and transcriptome information are still deficient in bermudagrass, which severely impedes functional and molecular breeding studies. In this study, through analyzing a mixture sample of leaves, stolons, shoots, roots and flowers with single-molecule long-read sequencing technology from Pacific Biosciences (PacBio), we reported the first full-length transcriptome dataset of bermudagrass (C. dactylon cultivar Yangjiang) comprising 78,192 unigenes. Among the unigenes, 66,409 were functionally annotated, whereas 27,946 were found to have two or more isoforms. The annotated full-length unigenes provided many new insights into gene sequence characteristics and systematic phylogeny of bermudagrass. By comparison with transcriptome dataset in nine grass species, KEGG pathway analyses further revealed that C4 photosynthesis-related genes, notably the phosphoenolpyruvate carboxylase and pyruvate, phosphate dikinase genes, are specifically enriched in bermudagrass. These results not only explained the possible reason why bermudagrass flourishes in warm areas but also provided a solid basis for future studies in this important turfgrass species. Copyright © 2018 Elsevier Masson SAS. All rights reserved.

September 22, 2019

Identification of a novel fusion transcript between human relaxin-1 (RLN1) and human relaxin-2 (RLN2) in prostate cancer.

Simultaneous expression of highly homologous RLN1 and RLN2 genes in prostate impairs their accurate delineation. We used PacBio SMRT sequencing and RNA-Seq in LNCaP cells in order to dissect the expression of RLN1 and RLN2 variants. We identified a novel fusion transcript comprising the RLN1 and RLN2 genes and found evidence of its expression in the normal and prostate cancer tissues. The RLN1-RLN2 fusion putatively encodes RLN2 isoform with the deleted secretory signal peptide. The identification of the fusion transcript provided information to determine unique RLN1-RLN2 fusion and RLN1 regions. The RLN1-RLN2 fusion was co-expressed with RLN1 in LNCaP cells, but the two gene products were inversely regulated by androgens. We showed that RLN1 is underrepresented in common PCa cell lines in comparison to normal and PCa tissue. The current study brings a highly relevant update to the relaxin field, and will encourage further studies of RLN1 and RLN2 in PCa and broader. Copyright © 2015 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.

September 22, 2019

The gut commensal microbiome of Drosophila melanogaster is modified by the endosymbiont Wolbachia.

Endosymbiotic Wolbachia bacteria and the gut microbiome have independently been shown to affect several aspects of insect biology, including reproduction, development, life span, stem cell activity, and resistance to human pathogens, in insect vectors. This work shows that Wolbachia bacteria, which reside mainly in the fly germline, affect the microbial species present in the fly gut in a lab-reared strain. Drosophila melanogaster hosts two main genera of commensal bacteria-Acetobacter and Lactobacillus. Wolbachia-infected flies have significantly reduced titers of Acetobacter. Sampling of the microbiome of axenic flies fed with equal proportions of both bacteria shows that the presence of Wolbachia bacteria is a significant determinant of the composition of the microbiome throughout fly development. However, this effect is host genotype dependent. To investigate the mechanism of microbiome modulation, the effect of Wolbachia bacteria on Imd and reactive oxygen species pathways, the main regulators of immune response in the fly gut, was measured. The presence of Wolbachia bacteria does not induce significant changes in the expression of the genes for the effector molecules in either pathway. Furthermore, microbiome modulation is not due to direct interaction between Wolbachia bacteria and gut microbes. Confocal analysis shows that Wolbachia bacteria are absent from the gut lumen. These results indicate that the mechanistic basis of the modulation of composition of the microbiome by Wolbachia bacteria is more complex than a direct bacterial interaction or the effect of Wolbachia bacteria on fly immunity. The findings reported here highlight the importance of considering the composition of the gut microbiome and host genetic background during Wolbachia-induced phenotypic studies and when formulating microbe-based disease vector control strategies. IMPORTANCE Wolbachia bacteria are intracellular bacteria present in the microbiome of a large fraction of insects and parasitic nematodes. They can block mosquitos’ ability to transmit several infectious disease-causing pathogens, including Zika, dengue, chikungunya, and West Nile viruses and malaria parasites. Certain extracellular bacteria present in the gut lumen of these insects can also block pathogen transmission. However, our understanding of interactions between Wolbachia and gut bacteria and how they influence each other is limited. Here we show that the presence of Wolbachia strain wMel changes the composition of gut commensal bacteria in the fruit fly. Our findings implicate interactions between bacterial species as a key factor in determining the overall composition of the microbiome and thus reveal new paradigms to consider in the development of disease control strategies.

September 22, 2019

Improved metagenome assemblies and taxonomic binning using long-read circular consensus sequence data.

DNA assembly is a core methodological step in metagenomic pipelines used to study the structure and function within microbial communities. Here we investigate the utility of Pacific Biosciences long and high accuracy circular consensus sequencing (CCS) reads for metagenomic projects. We compared the application and performance of both PacBio CCS and Illumina HiSeq data with assembly and taxonomic binning algorithms using metagenomic samples representing a complex microbial community. Eight SMRT cells produced approximately 94 Mb of CCS reads from a biogas reactor microbiome sample that averaged 1319 nt in length and 99.7% accuracy. CCS data assembly generated a comparative number of large contigs greater than 1?kb, to those assembled from a ~190x larger HiSeq dataset (~18 Gb) produced from the same sample (i.e approximately 62% of total contigs). Hybrid assemblies using PacBio CCS and HiSeq contigs produced improvements in assembly statistics, including an increase in the average contig length and number of large contigs. The incorporation of CCS data produced significant enhancements in taxonomic binning and genome reconstruction of two dominant phylotypes, which assembled and binned poorly using HiSeq data alone. Collectively these results illustrate the value of PacBio CCS reads in certain metagenomics applications.

September 22, 2019

Multiple regulatory networks are activated during cold stress in Medicago sativa L.

Cultivated alfalfa (Medicago sativa L.) is one of the most important perennial legume forages in the world, and it has considerable potential as a valuable forage crop for livestock. However, the molecular mechanisms underlying alfalfa responses to cold stress are largely unknown. In this study, the transcriptome changes in alfalfa under cold stress at 4 °C for 2, 6, 24, and 48 h (three replicates for each time point) were analyzed using the high-throughput sequencing platform, BGISEQ-500, resulting in the identification of 50,809 annotated unigenes and 5283 differentially expressed genes (DEGs). Metabolic pathway enrichment analysis demonstrated that the DEGs were involved in carbohydrate metabolism, photosynthesis, plant hormone signal transduction, and the biosynthesis of amino acids. Moreover, the physiological changes of glutathione and proline content, catalase, and peroxidase activity were in accordance with dynamic transcript profiles of the relevant genes. Additionally, some transcription factors might play important roles in the alfalfa response to cold stress, as determined by the expression pattern of the related genes during 48 h of cold stress treatment. These findings provide valuable information for identifying and characterizing important components in the cold signaling network in alfalfa and enhancing the understanding of the molecular mechanisms underlying alfalfa responses to cold stress.

September 22, 2019

ISOdb: A comprehensive database of full-length isoforms generated by Iso-Seq.

The accurate landscape of transcript isoforms plays an important role in the understanding of gene function and gene regulation. However, building complete transcripts is very challenging for short reads generated using next-generation sequencing. Fortunately, isoform sequencing (Iso-Seq) using single-molecule sequencing technologies, such as PacBio SMRT, provides long reads spanning entire transcript isoforms which do not require assembly. Therefore, we have developed ISOdb, a comprehensive resource database for hosting and carrying out an in-depth analysis of Iso-Seq datasets and visualising the full-length transcript isoforms. The current version of ISOdb has collected 93 publicly available Iso-Seq samples from eight species and presents the samples in two levels: (1) sample level, including metainformation, long read distribution, isoform numbers, and alternative splicing (AS) events of each sample; (2) gene level, including the total isoforms, novel isoform number, novel AS number, and isoform visualisation of each gene. In addition, ISOdb provides a user interface in the website for uploading sample information to facilitate the collection and analysis of researchers’ datasets. Currently, ISOdb is the first repository that offers comprehensive resources and convenient public access for hosting, analysing, and visualising Iso-Seq data, which is freely available.

September 22, 2019

Diverse antibiotic resistance genes in dairy cow manure.

Application of manure from antibiotic-treated animals to crops facilitates the dissemination of antibiotic resistance determinants into the environment. However, our knowledge of the identity, diversity, and patterns of distribution of these antibiotic resistance determinants remains limited. We used a new combination of methods to examine the resistome of dairy cow manure, a common soil amendment. Metagenomic libraries constructed with DNA extracted from manure were screened for resistance to beta-lactams, phenicols, aminoglycosides, and tetracyclines. Functional screening of fosmid and small-insert libraries identified 80 different antibiotic resistance genes whose deduced protein sequences were on average 50 to 60% identical to sequences deposited in GenBank. The resistance genes were frequently found in clusters and originated from a taxonomically diverse set of species, suggesting that some microorganisms in manure harbor multiple resistance genes. Furthermore, amid the great genetic diversity in manure, we discovered a novel clade of chloramphenicol acetyltransferases. Our study combined functional metagenomics with third-generation PacBio sequencing to significantly extend the roster of functional antibiotic resistance genes found in animal gut bacteria, providing a particularly broad resource for understanding the origins and dispersal of antibiotic resistance genes in agriculture and clinical settings. IMPORTANCE The increasing prevalence of antibiotic resistance among bacteria is one of the most intractable challenges in 21st-century public health. The origins of resistance are complex, and a better understanding of the impacts of antibiotics used on farms would produce a more robust platform for public policy. Microbiomes of farm animals are reservoirs of antibiotic resistance genes, which may affect distribution of antibiotic resistance genes in human pathogens. Previous studies have focused on antibiotic resistance genes in manures of animals subjected to intensive antibiotic use, such as pigs and chickens. Cow manure has received less attention, although it is commonly used in crop production. Here, we report the discovery of novel and diverse antibiotic resistance genes in the cow microbiome, demonstrating that it is a significant reservoir of antibiotic resistance genes. The genomic resource presented here lays the groundwork for understanding the dispersal of antibiotic resistance from the agroecosystem to other settings.

September 22, 2019

The genomic and functional landscapes of developmental plasticity in the American cockroach.

Many cockroach species have adapted to urban environments, and some have been serious pests of public health in the tropics and subtropics. Here, we present the 3.38-Gb genome and a consensus gene set of the American cockroach, Periplaneta americana. We report insights from both genomic and functional investigations into the underlying basis of its adaptation to urban environments and developmental plasticity. In comparison with other insects, expansions of gene families in P. americana exist for most core gene families likely associated with environmental adaptation, such as chemoreception and detoxification. Multiple pathways regulating metamorphic development are well conserved, and RNAi experiments inform on key roles of 20-hydroxyecdysone, juvenile hormone, insulin, and decapentaplegic signals in regulating plasticity. Our analyses reveal a high level of sequence identity in genes between the American cockroach and two termite species, advancing it as a valuable model to study the evolutionary relationships between cockroaches and termites.

September 22, 2019

Genome sequence of a potential probiotic strain, Lactobacillus fermentum HFB3, isolated from a human gut.

A draft genome sequence of 2.04 Mb is reported for Lactobacillus fermentum HFB3, which is a lactic acid bacterium with probiotic properties. The gene-coding clusters also predicted the presence of genes responsible for probiotic characteristics. Copyright © 2015 Kumari et al.

Auto Tag: De novo assembly

De novo assembly of a Chinese soybean genome.

Genome sequence determination and metagenomic characterization of a Dehalococcoides mixed culture grown on cis-1,2-dichloroethene.

Complete genome sequence of Endomicrobium proavitum, a free-living relative of the intracellular symbionts of termite gut flagellates (phylum Elusimicrobia).

Complete genome sequence of Bacillus kochii Oregon-R-modENCODE strain BDGP4, isolated from Drosophila melanogaster gut.

Precise fecal microbiome of the herbivorous Tibetan antelope inhabiting high-altitude alpine plateau

Long reads: their purpose and place.

Full-length RNA sequencing reveals unique transcriptome composition in bermudagrass.

Identification of a novel fusion transcript between human relaxin-1 (RLN1) and human relaxin-2 (RLN2) in prostate cancer.

The gut commensal microbiome of Drosophila melanogaster is modified by the endosymbiont Wolbachia.

Improved metagenome assemblies and taxonomic binning using long-read circular consensus sequence data.

Multiple regulatory networks are activated during cold stress in Medicago sativa L.

ISOdb: A comprehensive database of full-length isoforms generated by Iso-Seq.

Diverse antibiotic resistance genes in dairy cow manure.

The genomic and functional landscapes of developmental plasticity in the American cockroach.

Genome sequence of a potential probiotic strain, Lactobacillus fermentum HFB3, isolated from a human gut.

Subscribe for blog updates:

Filter by topic

Talk with an expert

Antimicrobial resistance research

Subscribe for blog updates:

Filter by topic

Talk with an expert