Menu
April 21, 2020

Assembly of long, error-prone reads using repeat graphs.

Accurate genome assembly is hampered by repetitive regions. Although long single molecule sequencing reads are better able to resolve genomic repeats than short-read data, most long-read assembly algorithms do not provide the repeat characterization necessary for producing optimal assemblies. Here, we present Flye, a long-read assembly algorithm that generates arbitrary paths in an unknown repeat graph, called disjointigs, and constructs an accurate repeat graph from these error-riddled disjointigs. We benchmark Flye against five state-of-the-art assemblers and show that it generates better or comparable assemblies, while being an order of magnitude faster. Flye nearly doubled the contiguity of the human genome assembly (as measured by the NGA50 assembly quality metric) compared with existing assemblers.


April 21, 2020

Immunogenetic factors driving formation of ultralong VH CDR3 in Bos taurus antibodies.

The antibody repertoire of Bos taurus is characterized by a subset of variable heavy (VH) chain regions with ultralong third complementarity determining regions (CDR3) which, compared to other species, can provide a potent response to challenging antigens like HIV env. These unusual CDR3 can range to over seventy highly diverse amino acids in length and form unique ß-ribbon ‘stalk’ and disulfide bonded ‘knob’ structures, far from the typical antigen binding site. The genetic components and processes for forming these unusual cattle antibody VH CDR3 are not well understood. Here we analyze sequences of Bos taurus antibody VH domains and find that the subset with ultralong CDR3 exclusively uses a single variable gene, IGHV1-7 (VHBUL) rearranged to the longest diversity gene, IGHD8-2. An eight nucleotide duplication at the 3′ end of IGHV1-7 encodes a longer V-region producing an extended F ß-strand that contributes to the stalk in a rearranged CDR3. A low amino acid variability was observed in CDR1 and CDR2, suggesting that antigen binding for this subset most likely only depends on the CDR3. Importantly a novel, potentially AID mediated, deletional diversification mechanism of the B. taurus VH ultralong CDR3 knob was discovered, in which interior codons of the IGHD8-2 region are removed while maintaining integral structural components of the knob and descending strand of the stalk in place. These deletions serve to further diversify cysteine positions, and thus disulfide bonded loops. Hence, both germline and somatic genetic factors and processes appear to be involved in diversification of this structurally unusual cattle VH ultralong CDR3 repertoire.


April 21, 2020

Full-length mRNA sequencing in Saccharina japonica and identification of carbonic anhydrase genes

The carbonic anhydrases (CAs) are a group of enzymes that play an important role in the absorption and transportation of CO2 in Saccharina japonica. They are encoded by a superfamily of genes with seven subtypes that are unrelated in sequence but share conserved function in catalyzing the reversible conversion of CO2 and HCO3-. Here we have characterized the CA members in the transcriptome of S. japonica using Single-molecule real-time (SMRT) sequencing technology. Approximately 9830.4 megabases from 5,028,003 quality subreads were generated, and they were assembled into 326,512 full-length non-chimeric (FLNC) reads, with an average flnc read length of 2181 bp. After removing redundant sequences, 79,010 unique transcripts were obtained of which 38,039 transcripts were successfully annotated. From the full-length transcriptome, we have identified 7 full-length cDNA sequences for CA genes (4 a-CAs, 1 ß-CAs and 2 ?-CAs) and assessed for their potential functions based on phylogenetic analysis. Characterizations of CAs will provide the ground for future studies to determine the involvement of CAs in inorganic carbon absorption and transportation in S. japonica.


April 21, 2020

Complete genome sequence of an IMP-8, CTX-M-14, CTX-M-3 and QnrS1 co-producing Enterobacter asburiae isolate from a patient with wound infection.

The aim of this study was to investigate the characteristics and complete genome sequence of an IMP-8, CTX-M-14, CTX-M-3 and QnrS1 co-producing multidrug-resistant Enterobacter asburiae isolate (EN3600) from a patient with wound infection.Species identification was confirmed by matrix-assisted laser desorption/ionisation time-of-flight mass spectrometry (MALDI-TOF/MS). Carbapenemase genes were identified by PCR and Sanger sequencing. The complete genome sequence of E. asburiae EN3600 was obtained using a PacBio RS II platform. Genome annotation was done by Rapid Annotation using Subsystem Technology (RAST) server. Acquired antimicrobial resistance genes (ARGs) and plasmid replicons were detected using ResFinder 2.1 and PlasmidFinder 1.3, respectively.The genome of E. asburiae EN3600 consists of a 4.8-Mbp chromosome and five plasmids. The annotated genome contains various ARGs conferring resistance to aminoglycosides, ß-lactams, fluoroquinolones, fosfomycin, macrolides, phenicols, rifampicin and sulfonamides. In addition, plasmids of incompatibility (Inc) groups IncHI2A, IncFIB(pECLA), IncFIB(pQil) and IncP1 were identified. The genes blaIMP-8, blaCTX-M-14 and blaCTX-M-3 were located on different plasmids. The blaIMP-8 gene was carried by an 86-kb IncFIB(pQil) plasmid. The blaCTX-M-3 and qnrS1 genes were co-harboured by an IncP1 plasmid. In addition, blaCTX-M-14 was associated with blaTEM-1B, blaOXA-1, catB3 and sul1 genes in a 116-kb non-typeable plasmid.To our knowledge, this is the first complete genome sequence of an E. asburiae isolate co-producing IMP-8, CTX-M-14, CTX-M-3 and QnrS1. This genome may facilitate the understanding of the resistome, pathogenesis and genomic features of Enterobacter cloacae complex (ECC) and will provide valuable information for accurate identification of ECC.Copyright © 2019 International Society for Antimicrobial Chemotherapy. Published by Elsevier Ltd. All rights reserved.


April 21, 2020

A high-quality draft genome assembly of Sinella curviseta: A soil model organism (Collembola).

Sinella curviseta, among the most widespread springtails (Collembola) in Northern Hemisphere, has often been treated as a model organism in soil ecology and environmental toxicology. However, little information on its genetic knowledge severely hinders our understanding of its adaptations to the soil habitat. We present the largest genome assembly within Collembola using ~44.86?Gb (118X) of single-molecule real-time Pacific Bioscience Sequel sequencing. The final assembly of 599 scaffolds was ~381.46?Mb with a N50 length of 3.28?Mb, which captured 95.3% complete and 1.5% partial arthropod Benchmarking Universal Single-Copy Orthologs (n?=?1066). Transcripts and circularized mitochondrial genome were also assembled. We predicted 23,943 protein-coding genes, of which 83.88% were supported by transcriptome-based evidence and 82.49% matched protein records in UniProt. In addition, we also identified 222,501 repeats and 881 noncoding RNAs. Phylogenetic reconstructions for Collembola support Tomoceridae sistered to the remaining Entomobryomorpha with the position of Symphypleona not fully resolved. Gene family evolution analyses identified 9,898 gene families, of which 156 experienced significant expansions or contractions. Our high-quality reference genome of S. curviseta provides the genetic basis for future investigations in evolutionary biology, soil ecology, and ecotoxicology. © The Author(s) 2019. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


April 21, 2020

Genome sequence of Jatropha curcas L., a non-edible biodiesel plant, provides a resource to improve seed-related traits.

Jatropha curcas (physic nut), a non-edible oilseed crop, represents one of the most promising alternative energy sources due to its high seed oil content, rapid growth and adaptability to various environments. We report ~339 Mbp draft whole genome sequence of J. curcas var. Chai Nat using both the PacBio and Illumina sequencing platforms. We identified and categorized differentially expressed genes related to biosynthesis of lipid and toxic compound among four stages of seed development. Triacylglycerol (TAG), the major component of seed storage oil, is mainly synthesized by phospholipid:diacylglycerol acyltransferase in Jatropha, and continuous high expression of homologs of oleosin over seed development contributes to accumulation of high level of oil in kernels by preventing the breakdown of TAG. A physical cluster of genes for diterpenoid biosynthetic enzymes, including casbene synthases highly responsible for a toxic compound, phorbol ester, in seed cake, was syntenically highly conserved between Jatropha and castor bean. Transcriptomic analysis of female and male flowers revealed the up-regulation of a dozen family of TFs in female flower. Additionally, we constructed a robust species tree enabling estimation of divergence times among nine Jatropha species and five commercial crops in Malpighiales order. Our results will help researchers and breeders increase energy efficiency of this important oil seed crop by improving yield and oil content, and eliminating toxic compound in seed cake for animal feed. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.


April 21, 2020

Analysis of Chromosomal Numbers, Mitochondrial Genome, and Full-Length Transcriptome of Onychostoma brevibarba.

Onychostoma brevibarba is a new discovered species which is distributed in Xiang Jiang River of the middle Chang Jiang basin in Hunan Province, South China. In this study, the ploidy levels of O. brevibarba were confirmed by counting chromosomal numbers and analyzing karyotype. The complete mitochondrial genome of O. brevibarba was determined and analyzed. Besides, we firstly performed the full-length transcriptome of O. brevibarba derived from 5 different tissues using the PacBio SMRT sequencing. The result shows that O. brevibarba was a diploid with 48 chromosomes. The complete mitogenome of O. brevibarba was 16,602 bp in size and very similar (89.1-91.3%) to that of other Onychostoma species but was distinct from all congeners. The full-length transcriptome dataset of O. brevibarba comprised 120,239 unigenes. Among the unigenes, 91,542 were functionally annotated, whereas 26,794 were found to have two or more isoforms. This study could provide many new insights into cytology and molecular characteristics of O. brevibarba; it laid the foundation for further exploration of the genomic signatures of species of Onychostoma.


April 21, 2020

a-Difluoromethylornithine reduces gastric carcinogenesis by causing mutations in Helicobacter pylori cagY.

Infection by Helicobacter pylori is the primary cause of gastric adenocarcinoma. The most potent H. pylori virulence factor is cytotoxin-associated gene A (CagA), which is translocated by a type 4 secretion system (T4SS) into gastric epithelial cells and activates oncogenic signaling pathways. The gene cagY encodes for a key component of the T4SS and can undergo gene rearrangements. We have shown that the cancer chemopreventive agent a-difluoromethylornithine (DFMO), known to inhibit the enzyme ornithine decarboxylase, reduces H. pylori-mediated gastric cancer incidence in Mongolian gerbils. In the present study, we questioned whether DFMO might directly affect H. pylori pathogenicity. We show that H. pylori output strains isolated from gerbils treated with DFMO exhibit reduced ability to translocate CagA in gastric epithelial cells. Further, we frequently detected genomic modifications in the middle repeat region of the cagY gene of output strains from DFMO-treated animals, which were associated with alterations in the CagY protein. Gerbils did not develop carcinoma when infected with a DFMO output strain containing rearranged cagY or the parental strain in which the wild-type cagY was replaced by cagY with DFMO-induced rearrangements. Lastly, we demonstrate that in vitro treatment of H. pylori by DFMO induces oxidative DNA damage, expression of the DNA repair enzyme MutS2, and mutations in cagY, demonstrating that DFMO directly affects genomic stability. Deletion of mutS2 abrogated the ability of DFMO to induce cagY rearrangements directly. In conclusion, DFMO-induced oxidative stress in H. pylori leads to genomic alterations and attenuates virulence.


April 21, 2020

The Genome of Cucurbita argyrosperma (Silver-Seed Gourd) Reveals Faster Rates of Protein-Coding Gene and Long Noncoding RNA Turnover and Neofunctionalization within Cucurbita.

Whole-genome duplications are an important source of evolutionary novelties that change the mode and tempo at which genetic elements evolve within a genome. The Cucurbita genus experienced a whole-genome duplication around 30 million years ago, although the evolutionary dynamics of the coding and noncoding genes in this genus have not yet been scrutinized. Here, we analyzed the genomes of four Cucurbita species, including a newly assembled genome of Cucurbita argyrosperma, and compared the gene contents of these species with those of five other members of the Cucurbitaceae family to assess the evolutionary dynamics of protein-coding and long intergenic noncoding RNA (lincRNA) genes after the genome duplication. We report that Cucurbita genomes have a higher protein-coding gene birth-death rate compared with the genomes of the other members of the Cucurbitaceae family. C. argyrosperma gene families associated with pollination and transmembrane transport had significantly faster evolutionary rates. lincRNA families showed high levels of gene turnover throughout the phylogeny, and 67.7% of the lincRNA families in Cucurbita showed evidence of birth from the neofunctionalization of previously existing protein-coding genes. Collectively, our results suggest that the whole-genome duplication in Cucurbita resulted in faster rates of gene family evolution through the neofunctionalization of duplicated genes. Copyright © 2019 The Author. Published by Elsevier Inc. All rights reserved.


April 21, 2020

Whole genome sequence of Auricularia heimuer (Basidiomycota, Fungi), the third most important cultivated mushroom worldwide.

Heimuer, Auricularia heimuer, is one of the most famous traditional Chinese foods and medicines, and it is the third most important cultivated mushroom worldwide. The aim of this study is to develop genomic resources for A. heimuer to furnish tools that can be used to study its secondary metabolite production capability, wood degradation ability and biosynthesis of polysaccharides. The genome was obtained from single spore mycelia of the strain Dai 13782 by using combined high-throughput Illumina HiSeq 4000 system with the PacBio RSII long-read sequencing platform. Functional annotation was accomplished by blasting protein sequences with different public available databases to obtain their corresponding annotations. It is 49.76Mb in size with a N50 scaffold size of 1,350,668bp and encodes 16,244 putative predicted genes. This is the first genome-scale assembly and annotation for A. heimuer, which is the third sequenced species in Auricularia. Copyright © 2018 Elsevier Inc. All rights reserved.


April 21, 2020

Complete genome sequence of the novel agarolytic Catenovulum-like strain CCB-QB4

Members of the genus Catenovulum are recognized for their ability to degrade algal biomass. Here we report the complete genome of Cantenovulum–like strain CCB-QB4, an agarolytic bacterium isolated from the coastal area of Penang, Malaysia. The sequenced genome is composed of a 5,663,044?bp circular chromosome and a 208,085?bp circular plasmid. It contained 4409 protein coding and 83 RNA genes, including 62 tRNAs and 21 rRNAs. The genome of CCB-QB4 contains many agarases, which correlate with the high capacity of the strain to degrade agar. Genome sequencing of CCB-QB4 reveals gene candidates of potential interest in enzymatic industries or applications in the field of polysaccharides degradation.


April 21, 2020

The high prevalence of antibiotic heteroresistance in pathogenic bacteria is mainly caused by gene amplification.

When choosing antibiotics to treat bacterial infections, it is assumed that the susceptibility of the target bacteria to an antibiotic is reflected by laboratory estimates of the minimum inhibitory concentration (MIC) needed to prevent bacterial growth. A caveat of using MIC data for this purpose is heteroresistance, the presence of a resistant subpopulation in a main population of susceptible cells. We investigated the prevalence and mechanisms of heteroresistance in 41 clinical isolates of the pathogens Escherichia coli, Salmonella enterica, Klebsiella pneumoniae and Acinetobacter baumannii against 28 different antibiotics. For the 766 bacteria-antibiotic combinations tested, as much as 27.4% of the total was heteroresistant. Genetic analysis demonstrated that a majority of heteroresistance cases were unstable, with an increased resistance of the subpopulations resulting from spontaneous tandem amplifications, typically including known resistance genes. Using mathematical modelling, we show how heteroresistance in the parameter range estimated in this study can result in the failure of antibiotic treatment of infections with bacteria that are classified as antibiotic susceptible. The high prevalence of heteroresistance with the potential for treatment failure highlights the limitations of MIC as the sole criterion for susceptibility determinations. These results call for the development of facile and rapid protocols to identify heteroresistance in pathogens.


April 21, 2020

Gut pathobionts underlie intestinal barrier dysfunction and liver T helper 17 cell immune response in primary sclerosing cholangitis.

Primary sclerosing cholangitis (PSC) is a chronic inflammatory liver disease and its frequent complication with ulcerative colitis highlights the pathogenic role of epithelial barrier dysfunction. Intestinal barrier dysfunction has been implicated in the pathogenesis of PSC, yet its underlying mechanism remains unknown. Here, we identify Klebsiella pneumonia in the microbiota of patients with PSC and demonstrate that K.?pneumoniae disrupts the epithelial barrier to initiate bacterial translocation and liver inflammatory responses. Gnotobiotic mice inoculated with PSC-derived microbiota exhibited T helper 17 (TH17) cell responses in the liver and increased susceptibility to hepatobiliary injuries. Bacterial culture of mesenteric lymph nodes in these mice isolated K.?pneumoniae, Proteus mirabilis and Enterococcus gallinarum, which were prevalently detected in patients with PSC. A bacterial-organoid co-culture system visualized the epithelial-damaging effect of PSC-derived K.?pneumoniae that was associated with bacterial translocation and susceptibility to TH17-mediated hepatobiliary injuries. We also show that antibiotic treatment ameliorated the TH17 immune response induced by PSC-derived microbiota. These results highlight the role of pathobionts in intestinal barrier dysfunction and liver inflammation, providing insights into therapeutic strategies for PSC.


April 21, 2020

Complete genome sequence of Pseudomonas frederiksbergensis ERDD5:01 revealed genetic bases for survivability at high altitude ecosystem and bioprospection potential.

Pseudomonas frederiksbergensis ERDD5:01 is a psychrotrophic bacteria isolated from the glacial stream flowing from East Rathong glacier in Sikkim Himalaya. The strain showed survivability at high altitude stress conditions like freezing, frequent freeze-thaw cycles, and UV-C radiations. The complete genome of 5,746,824?bp circular chromosome and a plasmid of 371,027?bp was sequenced to understand the genetic basis of its survival strategy. Multiple copies of cold-associated genes encoding cold active chaperons, general stress response, osmotic stress, oxidative stress, membrane/cell wall alteration, carbon storage/starvation and, DNA repair mechanisms supported its survivability at extreme cold and radiations corroborating with the bacterial physiological findings. The molecular cold adaptation analysis in comparison with the genome of 15 mesophilic Pseudomonas species revealed functional insight into the strategies of cold adaptation. The genomic data also revealed the presence of industrially important enzymes.Copyright © 2018 Elsevier Inc. All rights reserved.


April 21, 2020

Phylogenetic relationships and regional spread of meningococcal strains in the meningitis belt, 2011-2016.

Historically, the major cause of meningococcal epidemics in the meningitis belt of sub-Saharan Africa has been Neisseria meningitidis serogroup A (NmA), but the incidence has been substantially reduced since the introduction of a serogroup A conjugate vaccine starting in 2010. We performed whole-genome sequencing on isolates collected post-2010 to assess their phylogenetic relationships and inter-country transmission.A total of 716 invasive meningococcal isolates collected between 2011 and 2016 from 11 meningitis belt countries were whole-genome sequenced for molecular characterization by the three WHO Collaborating Centers for Meningitis.We identified three previously-reported clonal complexes (CC): CC11 (n?=?434), CC181 (n?=?62) and CC5 (n?=?90) primarily associated with NmW, NmX, and NmA, respectively, and an emerging CC10217 (n?=?126) associated with NmC. CC11 expanded throughout the meningitis belt independent of the 2000 Hajj outbreak strain, with isolates from Central African countries forming a distinct sub-lineage within this expansion. Two major sub-lineages were identified for CC181 isolates, one mainly expanding in West African countries and the other found in Chad. CC10217 isolates from the large outbreaks in Nigeria and Niger were more closely related than those from the few cases in Mali and Burkina Faso.Whole-genome based phylogenies revealed geographically distinct strain circulation as well as inter-country transmission events. Our results stress the importance of continued meningococcal molecular surveillance in the region, as well as the development of an affordable vaccine targeting these strains. FUND: Meningitis Research Foundation; CDC’s Office of Advanced Molecular Detection; GAVI, the Vaccine Alliance. Copyright © 2019. Published by Elsevier B.V.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.