Menu
July 7, 2019

Susan Celniker: Foundational resources to study a dynamic genome.

The Genetics Society of America’s George W. Beadle Award honors individuals who have made outstanding contributions to the community of genetics researchers and who exemplify the qualities of its namesake. The 2016 recipient, Susan E. Celniker, played a key role in the sequencing, annotation, and characterization of the Drosophila genome. She participated in early sequencing efforts at the Lawrence Berkeley National Laboratory and led the modENCODE Fly Transcriptome Consortium. Her efforts were critical to ensuring that the Drosophila genome was well-annotated, making it one of the best curated animal genomes available. As the Principal Investigator for the BDGP, Celniker has enabled the study of proteomes by creating a collection of over 13,000 clones that match annotated genes for protein expression in cells or transgenic flies, and she has established the most comprehensive spatial gene expression atlas in any organism, with in situ imaging of more than 80% of the Drosophila protein-coding transcriptome through embryogenesis. In addition to providing the research community with these invaluable resources and reagents, she continues to develop new tools and datasets for genetics researchers to explore the spatial and temporal control of gene expression.


July 7, 2019

Microbial sequence typing in the genomic era.

Next-generation sequencing (NGS), also known as high-throughput sequencing, is changing the field of microbial genomics research. NGS allows for a more comprehensive analysis of the diversity, structure and composition of microbial genes and genomes compared to the traditional automated Sanger capillary sequencing at a lower cost. NGS strategies have expanded the versatility of standard and widely used typing approaches based on nucleotide variation in several hundred DNA sequences and a few gene fragments (MLST, MLVA, rMLST and cgMLST). NGS can now accommodate variation in thousands or millions of sequences from selected amplicons to full genomes (WGS, NGMLST and HiMLST). To extract signals from high-dimensional NGS data and make valid statistical inferences, novel analytic and statistical techniques are needed. In this review, we describe standard and new approaches for microbial sequence typing at gene and genome levels and guidelines for subsequent analysis, including methods and computational frameworks. We also present several applications of these approaches to some disciplines, namely genotyping, phylogenetics and molecular epidemiology. Copyright © 2017 Elsevier B.V. All rights reserved.


July 7, 2019

Enhancing the accuracy of next-generation sequencing for detecting rare and subclonal mutations.

Mutations, the fuel of evolution, are first manifested as rare DNA changes within a population of cells. Although next-generation sequencing (NGS) technologies have revolutionized the study of genomic variation between species and individual organisms, most have limited ability to accurately detect and quantify rare variants among the different genome copies in heterogeneous mixtures of cells or molecules. We describe the technical challenges in characterizing subclonal variants using conventional NGS protocols and the recent development of error correction strategies, both computational and experimental, including consensus sequencing of single DNA molecules. We also highlight major applications for low-frequency mutation detection in science and medicine, describe emerging methodologies and provide our vision for the future of DNA sequencing.


July 7, 2019

Ten steps to get started in Genome Assembly and Annotation.

As a part of the ELIXIR-EXCELERATE efforts in capacity building, we present here 10 steps to facilitate researchers getting started in genome assembly and genome annotation. The guidelines given are broadly applicable, intended to be stable over time, and cover all aspects from start to finish of a general assembly and annotation project. Intrinsic properties of genomes are discussed, as is the importance of using high quality DNA. Different sequencing technologies and generally applicable workflows for genome assembly are also detailed. We cover structural and functional annotation and encourage readers to also annotate transposable elements, something that is often omitted from annotation workflows. The importance of data management is stressed, and we give advice on where to submit data and how to make your results Findable, Accessible, Interoperable, and Reusable (FAIR).


July 7, 2019

Complete genome sequence of the halophile bacterium Kushneria marisflavi KCCM 80003T, isolated from seawater in Korea

We present the genome sequence of Kushneria marisflavi KCCM 80003T isolated from Yellow Sea in Korea. The complete genome of KCCM 80003T consisted of a single, circular chromosome of 3,667,185bp, with an average G+C content of 59.05%, and 3287 coding sequences, 12 rRNAs, and 66 tRNAs. Kushneria marisflavi KCCM 80003T, belonging to the family Halomonadaceae, exhibited resistance to high salt concentrations and possessed potassium metabolism- or osmotic stress-related coding sequences, including potassium homeostasis, ectoine biosynthesis and regulation, choline and betaine uptake, and betaine biosynthesis features in the genome. These results provide a basis for understanding resistance strategies to osmotic stress at the genetic level and accordingly have implications for genetic engineering and biotechnology.


July 7, 2019

Complete genome sequence of the sesame pathogen Ralstonia solanacearum strain SEPPX 05.

Ralstonia solanacearum is a soil-borne phytopathogen associated with bacterial wilt disease of sesame. R. solanacearum is the predominant agent causing damping-off from tropical to temperate regions. Because bacterial wilt has decreased the sesame industry yield, we sequenced the SEPPX05 genome using PacBio and Illumina HiSeq 2500 systems and revealed that R. solanacearum strain SEPPX05 carries a bipartite genome consisting of a 3,930,849 bp chromosome and a 2,066,085 bp megaplasmid with 66.84% G+C content that harbors 5,427 coding sequences. Based on the whole genome, phylogenetic analysis showed that strain SEPPX05 is grouped with two phylotype I strains (EP1 and GMI1000). Pan-genomic analysis shows that R. solanacearum is a complex species with high biological diversity and was able to colonize various environments during evolution. Despite deletions, insertions, and inversions, most genes of strain SEPPX05 have relatively high levels of synteny compared with strain GMI1000. We identified 104 genes involved in virulence-related factors in the SEPPX05 genome and eight absent genes encoding T3Es of GMI1000. Comparing SEPPX05 with other species, we found highly conserved secretion systems central to modulating interactions of host bacteria. These data may provide important clues for understanding underlying pathogenic mechanisms of R. solanacearum and help in the control of sesame bacterial wilt.


July 7, 2019

Host genetic variation strongly influences the microbiome structure and function in fungal fruiting-bodies.

Despite increasing knowledge on host-associated microbiomes, little is known about mechanisms underlying fungus-microbiome interactions. This study aimed to examine the relative importance of host genetic, geographic and environmental variations in structuring fungus-associated microbiomes. We analyzed the taxonomic composition and function of microbiomes inhabiting fungal fruiting-bodies in relation to host genetic variation, soil pH and geographic distance between samples. For this, we sequenced the metagenomes of 40 fruiting-bodies collected from six fairy rings (i.e., genets) of a saprotrophic fungus Marasmius oreades. Our analyses revealed that fine genetic variations between host fungi could strongly affect their associated microbiome, explaining, respectively, 25% and 37% of the variation in microbiome structure and function, whereas geographic distance and soil pH remained of secondary importance. These results, together with the smaller genome size of fungi compared to other eukaryotes, suggest that fruiting-bodies are suitable for further genome-centric studies on host-microbiome interactions.© 2018 Society for Applied Microbiology and John Wiley & Sons Ltd.


July 7, 2019

Draft genome sequence of the phytopathogenic fungus Ganoderma boninense, the causal agent of basal stem rot disease on oil palm.

Ganoderma boninense is the dominant fungal pathogen of basal stem rot (BSR) disease on Elaeis guineensis We sequenced the nuclear genome of mycelia using both Illumina and Pacific Biosciences platforms for assembly of scaffolds. The draft genome comprised 79.24?Mb, 495 scaffolds, and 26,226 predicted coding sequences. Copyright © 2018 Utomo et al.


July 7, 2019

A fast approximate algorithm for mapping long reads to large reference databases.

Emerging single-molecule sequencing technologies from Pacific Biosciences and Oxford Nanopore have revived interest in long-read mapping algorithms. Alignment-based seed-and-extend methods demonstrate good accuracy, but face limited scalability, while faster alignment-free methods typically trade decreased precision for efficiency. In this article, we combine a fast approximate read mapping algorithm based on minimizers with a novel MinHash identity estimation technique to achieve both scalability and precision. In contrast to prior methods, we develop a mathematical framework that defines the types of mapping targets we uncover, establish probabilistic estimates of p-value and sensitivity, and demonstrate tolerance for alignment error rates up to 20%. With this framework, our algorithm automatically adapts to different minimum length and identity requirements and provides both positional and identity estimates for each mapping reported. For mapping human PacBio reads to the hg38 reference, our method is 290?×?faster than Burrows-Wheeler Aligner-MEM with a lower memory footprint and recall rate of 96%. We further demonstrate the scalability of our method by mapping noisy PacBio reads (each =5?kbp in length) to the complete NCBI RefSeq database containing 838 Gbp of sequence and >60,000 genomes.


July 7, 2019

Complete genome sequence of the environmental Burkholderia pseudomallei sequence type 131 isolate MSHR1435, associated with a chronic melioidosis infection.

The Burkholderia pseudomallei isolate MSHR1435 is a fully virulent environmental sequence type 131 (ST131) isolate that is epidemiologically associated with a 17.5-year chronic melioidosis infection. The completed genome will serve as a reference for studies of environmental ecology, virulence, and chronic B. pseudomallei infections. Copyright © 2018 Sahl et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.