Menu
April 21, 2020  |  

Long-read sequencing for rare human genetic diseases.

During the past decade, the search for pathogenic mutations in rare human genetic diseases has involved huge efforts to sequence coding regions, or the entire genome, using massively parallel short-read sequencers. However, the approximate current diagnostic rate is <50% using these approaches, and there remain many rare genetic diseases with unknown cause. There may be many reasons for this, but one plausible explanation is that the responsible mutations are in regions of the genome that are difficult to sequence using conventional technologies (e.g., tandem-repeat expansion or complex chromosomal structural aberrations). Despite the drawbacks of high cost and a shortage of standard analytical methods, several studies have analyzed pathogenic changes in the genome using long-read sequencers. The results of these studies provide hope that further application of long-read sequencers to identify the causative mutations in unsolved genetic diseases may expand our understanding of the human genome and diseases. Such approaches may also be applied to molecular diagnosis and therapeutic strategies for patients with genetic diseases in the future.


April 21, 2020  |  

Tandem repeats lead to sequence assembly errors and impose multi-level challenges for genome and protein databases.

The widespread occurrence of repetitive stretches of DNA in genomes of organisms across the tree of life imposes fundamental challenges for sequencing, genome assembly, and automated annotation of genes and proteins. This multi-level problem can lead to errors in genome and protein databases that are often not recognized or acknowledged. As a consequence, end users working with sequences with repetitive regions are faced with ‘ready-to-use’ deposited data whose trustworthiness is difficult to determine, let alone to quantify. Here, we provide a review of the problems associated with tandem repeat sequences that originate from different stages during the sequencing-assembly-annotation-deposition workflow, and that may proliferate in public database repositories affecting all downstream analyses. As a case study, we provide examples of the Atlantic cod genome, whose sequencing and assembly were hindered by a particularly high prevalence of tandem repeats. We complement this case study with examples from other species, where mis-annotations and sequencing errors have propagated into protein databases. With this review, we aim to raise the awareness level within the community of database users, and alert scientists working in the underlying workflow of database creation that the data they omit or improperly assemble may well contain important biological information valuable to others. © The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.


April 21, 2020  |  

The bracteatus pineapple genome and domestication of clonally propagated crops.

Domestication of clonally propagated crops such as pineapple from South America was hypothesized to be a ‘one-step operation’. We sequenced the genome of Ananas comosus var. bracteatus CB5 and assembled 513?Mb into 25 chromosomes with 29,412 genes. Comparison of the genomes of CB5, F153 and MD2 elucidated the genomic basis of fiber production, color formation, sugar accumulation and fruit maturation. We also resequenced 89 Ananas genomes. Cultivars ‘Smooth Cayenne’ and ‘Queen’ exhibited ancient and recent admixture, while ‘Singapore Spanish’ supported a one-step operation of domestication. We identified 25 selective sweeps, including a strong sweep containing a pair of tandemly duplicated bromelain inhibitors. Four candidate genes for self-incompatibility were linked in F153, but were not functional in self-compatible CB5. Our findings support the coexistence of sexual recombination and a one-step operation in the domestication of clonally propagated crops. This work guides the exploration of sexual and asexual domestication trajectories in other clonally propagated crops.


April 21, 2020  |  

Chryseobacterium mulctrae sp. nov., isolated from raw cow’s milk.

A Gram-stain-negative bacterial strain, designated CA10T, was isolated from bovine raw milk sampled in Anseong, Republic of Korea. Cells were yellow-pigmented, aerobic, non-motile bacilli and grew optimally at 30?°C and pH 7.0 on tryptic soy agar without supplementation of NaCl. Phylogenetic analysis based on the 16S rRNA gene sequences revealed that strain CA10T belonged to the genus Chryseobacterium, family Flavobacteriaceae, and was most closely related to Chryseobacterium indoltheticum ATCC 27950T (98.75?% similarity). The average nucleotide identity and digital DNA-DNA hybridization values of strain CA10T were 94.4 and 56.9?%, respectively, relative to Chryseobacterium scophthalmum DSM 16779T, being lower than the cut-off values of 95-96?and 70?%, respectively. The predominant respiratory quinone was menaquinone-6; major polar lipid, phosphatidylethanolamine; major fatty acids, iso-C15?:?0, summed feature 9 (iso-C17?:?1?9c and/or C16?:?0 10-methyl), summed feature 3 (iso-C15?:?0 2-OH and/or C16?:?1?7c) and iso-C17?:?0 3-OH. The results of physiological, chemotaxonomic and biochemical analyses suggested that strain CA10T is a novel species of genus Chryseobacterium, for which the name Chryseobacterium mulctrae sp. nov. is proposed. The type strain is CA10T (=KACC 21234T=JCM 33443T).


April 21, 2020  |  

Allopseudarcicella aquatilis gen. nov., sp. nov., isolated from freshwater.

A Gram-stain-negative, rod-shaped and red-pigmented strain, HME7025T, was isolated from freshwater sampled in the Republic of Korea. Phylogenetic analysis based on its 16S rRNA gene sequence revealed that strain HME7025T formed a lineage within the family Cytophagaceae of the phylum Bacteroidetes. Strain HME7025T was closely related to the genera Pseudarcicella, Arcicella and Flectobacillus. The 16S rRNA gene sequence similarity values of strain HME7025T were under 94.5?% to its closest phylogenetic neighbours. The major fatty acids of strain HME7025T were iso-C15?:?0 (41.9?%), summed feature 3 (comprising C16?:?1?7c and/or C16?:?1?6c; 12.2?%) and anteiso-C15?:?0 (10.8?%). The major respiratory quinone was menaquinone-7. The major polar lipids were phosphatidylethanolamine, two unidentified aminophospholipids and one unidentified polar lipid. The DNA G+C content of strain HME7025T was 37.9?mol%. On the basis of the evidence presented in this study, strain HME7025T represents a novel species of a novel genus within the family Cytophagaceae, for which the name Allopseudarcicella aquatilis gen. nov., sp. nov. is proposed. The type strain is HME7025T (=KCTC 23617T=CECT 7957T).


April 21, 2020  |  

The complete genome sequence and comparative genome analysis of the multi-drug resistant food-borne pathogen Bacillus cereus.

Bacillus cereus is an opportunistic human pathogen causing food-borne gastrointestinal infections and non-gastrointestinal infections worldwide. The strain B. cereus FORC_013 was isolated from fried eel. Its genome was completely sequenced by PacBio technology, analyzed and compared with other complete genome sequences of Bacillus to elucidate the distinct pathogenic features of the strain isolated in South Korea. Genomic analysis revealed pathogenesis and host immune evasion-associated genes encoding tissue-destructive exoenzymes, and pore-forming toxins. In particular, tissue-destructive (hemolysin BL, nonhaemolytic enterotoxins) and cytolytic proteins (cytolysin) were observed in the genome, which damage the plasma membrane of the epithelial cells of the small intestine causing diarrhea in humans. Capsule biosynthesis gene found in both chromosome and plasmid, which might be responsible for protecting the pathogen from the host cell immune defense system after host cell invasion. Additionally, multidrug resistance operon and efflux pumps were identified in the genome, which play a prominent role in multi-antibiotic resistance. Comparative phylogenetic tree analysis of the strain FORC_013 and other B. cereus strains revealed that the closest strains are ATCC 14579 and B4264. This genome data can be used to identify virulence factors that can be applied for the development of novel biomarkers for the rapid detection of this pathogen in foods.Copyright © 2018. Published by Elsevier Inc.


April 21, 2020  |  

Chlorella vulgaris genome assembly and annotation reveals the molecular basis for metabolic acclimation to high light conditions.

Chlorella vulgaris is a fast-growing fresh-water microalga cultivated at the industrial scale for applications ranging from food to biofuel production. To advance our understanding of its biology and to establish genetics tools for biotechnological manipulation, we sequenced the nuclear and organelle genomes of Chlorella vulgaris 211/11P by combining next generation sequencing and optical mapping of isolated DNA molecules. This hybrid approach allowed to assemble the nuclear genome in 14 pseudo-molecules with an N50 of 2.8 Mb and 98.9% of scaffolded genome. The integration of RNA-seq data obtained at two different irradiances of growth (high light-HL versus low light -LL) enabled to identify 10,724 nuclear genes, coding for 11,082 transcripts. Moreover 121 and 48 genes were respectively found in the chloroplast and mitochondrial genome. Functional annotation and expression analysis of nuclear, chloroplast and mitochondrial genome sequences revealed peculiar features of Chlorella vulgaris. Evidence of horizontal gene transfers from chloroplast to mitochondrial genome was observed. Furthermore, comparative transcriptomic analyses of LL vs HL provide insights into the molecular basis for metabolic rearrangement in HL vs. LL conditions leading to enhanced de novo fatty acid biosynthesis and triacylglycerol accumulation. The occurrence of a cytosolic fatty acid biosynthetic pathway can be predicted and its upregulation upon HL exposure is observed, consistent with increased lipid amount under HL. These data provide a rich genetic resource for future genome editing studies, and potential targets for biotechnological manipulation of Chlorella vulgaris or other microalgae species to improve biomass and lipid productivity.This article is protected by copyright. All rights reserved.


April 21, 2020  |  

A robust benchmark for germline structural variant detection

New technologies and analysis methods are enabling genomic structural variants (SVs) to be detected with ever-increasing accuracy, resolution, and comprehensiveness. Translating these methods to routine research and clinical practice requires robust benchmark sets. We developed the first benchmark set for identification of both false negative and false positive germline SVs, which complements recent efforts emphasizing increasingly comprehensive characterization of SVs. To create this benchmark for a broadly consented son in a Personal Genome Project trio with broadly available cells and DNA, the Genome in a Bottle (GIAB) Consortium integrated 19 sequence-resolved variant calling methods, both alignment- and de novo assembly-based, from short-, linked-, and long-read sequencing, as well as optical and electronic mapping. The final benchmark set contains 12745 isolated, sequence-resolved insertion and deletion calls =50 base pairs (bp) discovered by at least 2 technologies or 5 callsets, genotyped as heterozygous or homozygous variants by long reads. The Tier 1 benchmark regions, for which any extra calls are putative false positives, cover 2.66 Gbp and 9641 SVs supported by at least one diploid assembly. Support for SVs was assessed using svviz with short-, linked-, and long-read sequence data. In general, there was strong support from multiple technologies for the benchmark SVs, with 90 % of the Tier 1 SVs having support in reads from more than one technology. The Mendelian genotype error rate was 0.3 %, and genotype concordance with manual curation was >98.7 %. We demonstrate the utility of the benchmark set by showing it reliably identifies both false negatives and false positives in high-quality SV callsets from short-, linked-, and long-read sequencing and optical mapping.


April 21, 2020  |  

Comparative genomics reveals unique wood-decay strategies and fruiting body development in the Schizophyllaceae.

Agaricomycetes are fruiting body-forming fungi that produce some of the most efficient enzyme systems to degrade wood. Despite decades-long interest in their biology, the evolution and functional diversity of both wood-decay and fruiting body formation are incompletely known. We performed comparative genomic and transcriptomic analyses of wood-decay and fruiting body development in Auriculariopsis ampla and Schizophyllum commune (Schizophyllaceae), species with secondarily simplified morphologies, an enigmatic wood-decay strategy and weak pathogenicity to woody plants. The plant cell wall-degrading enzyme repertoires of Schizophyllaceae are transitional between those of white rot species and less efficient wood-degraders such as brown rot or mycorrhizal fungi. Rich repertoires of suberinase and tannase genes were found in both species, with tannases restricted to Agaricomycetes that preferentially colonize bark-covered wood, suggesting potential complementation of their weaker wood-decaying abilities and adaptations to wood colonization through the bark. Fruiting body transcriptomes revealed a high rate of divergence in developmental gene expression, but also several genes with conserved expression patterns, including novel transcription factors and small-secreted proteins, some of the latter which might represent fruiting body effectors. Taken together, our analyses highlighted novel aspects of wood-decay and fruiting body development in an important family of mushroom-forming fungi. © 2019 The Authors. New Phytologist © 2019 New Phytologist Trust.


April 21, 2020  |  

Transcriptional initiation of a small RNA, not R-loop stability, dictates the frequency of pilin antigenic variation in Neisseria gonorrhoeae.

Neisseria gonorrhoeae, the sole causative agent of gonorrhea, constitutively undergoes diversification of the Type IV pilus. Gene conversion occurs between one of the several donor silent copies located in distinct loci and the recipient pilE gene, encoding the major pilin subunit of the pilus. A guanine quadruplex (G4) DNA structure and a cis-acting sRNA (G4-sRNA) are located upstream of the pilE gene and both are required for pilin antigenic variation (Av). We show that the reduced sRNA transcription lowers pilin Av frequencies. Extended transcriptional elongation is not required for Av, since limiting the transcript to 32 nt allows for normal Av frequencies. Using chromatin immunoprecipitation (ChIP) assays, we show that cellular G4s are less abundant when sRNA transcription is lower. In addition, using ChIP, we demonstrate that the G4-sRNA forms a stable RNA:DNA hybrid (R-loop) with its template strand. However, modulating R-loop levels by controlling RNase HI expression does not alter G4 abundance quantified through ChIP. Since pilin Av frequencies were not altered when modulating R-loop levels by controlling RNase HI expression, we conclude that transcription of the sRNA is necessary, but stable R-loops are not required to promote pilin Av. © 2019 John Wiley & Sons Ltd.


April 21, 2020  |  

Convergent horizontal gene transfer and cross-talk of mobile nucleic acids in parasitic plants.

Horizontal gene transfer (HGT), the movement and genomic integration of DNA across species boundaries, is commonly associated with bacteria and other microorganisms, but functional HGT (fHGT) is increasingly being recognized in heterotrophic parasitic plants that obtain their nutrients and water from their host plants through direct haustorial feeding. Here, in the holoparasitic stem parasite Cuscuta, we identify 108?transcribed and probably functional HGT events in Cuscuta campestris and related species, plus 42?additional regions with host-derived transposon, pseudogene and non-coding sequences. Surprisingly, 18?Cuscuta fHGTs were acquired from the same gene families by independent HGT events in Orobanchaceae parasites, and the majority are highly expressed in the haustorial feeding structures in both lineages. Convergent retention and expression of HGT sequences suggests an adaptive role for specific additional genes in parasite biology. Between 16 and 20 of the transcribed HGT events are inferred as ancestral in Cuscuta based on transcriptome sequences from species across the phylogenetic range of the genus, implicating fHGT in the successful radiation of Cuscuta parasites. Genome sequencing of C. campestris supports transfer of genomic DNA-rather than retroprocessed RNA-as the mechanism of fHGT. Many of the C. campestris genes horizontally acquired are also frequent sources of 24-nucleotide small RNAs that are typically associated with RNA-directed DNA methylation. One HGT encoding a leucine-rich repeat protein kinase overlaps with a microRNA that has been shown to regulate host gene expression, suggesting that HGT-derived parasite small RNAs may function in the parasite-host interaction. This study enriches our understanding of HGT by describing a parasite-host system with unprecedented gene exchange that points to convergent evolution of HGT events and the functional importance of horizontally transferred coding and non-coding sequences.


April 21, 2020  |  

Antibiotic susceptibility of plant-derived lactic acid bacteria conferring health benefits to human.

Lactic acid bacteria (LAB) confer health benefits to human when administered orally. We have recently isolated several species of LAB strains from plant sources, such as fruits, vegetables, flowers, and medicinal plants. Since antibiotics used to treat bacterial infection diseases induce the emergence of drug-resistant bacteria in intestinal microflora, it is important to evaluate the susceptibility of LAB strains to antibiotics to ensure the safety and security of processed foods. The aim of the present study is to determine the minimum inhibitory concentration (MIC) of antibiotics against several plant-derived LAB strains. When aminoglycoside antibiotics, such as streptomycin (SM), kanamycin (KM), and gentamicin (GM), were evaluated using LAB susceptibility test medium (LSM), the MIC was higher than when using Mueller-Hinton (MH) medium. Etest, which is an antibiotic susceptibility assay method consisting of a predefined gradient of antibiotic concentrations on a plastic strip, is used to determine the MIC of antibiotics world-wide. In the present study, we demonstrated that Etest was particularly valuable while testing LAB strains. We also show that the low susceptibility of the plant-derived LAB strains against each antibiotic tested is due to intrinsic resistance and not acquired resistance. This finding is based on the whole-genome sequence information reflecting the horizontal spread of the drug-resistance genes in the LAB strains.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.