July 7, 2019

Population scale mapping of transposable element diversity reveals links to gene regulation and epigenomic variation.

Variation in the presence or absence of transposable elements (TEs) is a major source of genetic variation between individuals. Here, we identified 23,095 TE presence/absence variants between 216 Arabidopsis accessions. Most TE variants were rare, and we find these rare variants associated with local extremes of gene expression and DNA methylation levels within the population. Of the common alleles identified, two thirds were not in linkage disequilibrium with nearby SNPs, implicating these variants as a source of novel genetic diversity. Many common TE variants were associated with significantly altered expression of nearby genes, and a major fraction of inter-accession DNA methylation differences were associated with nearby TE insertions. Overall, this demonstrates that TE variants are a rich source of genetic diversity that likely plays an important role in facilitating epigenomic and transcriptional differences between individuals, and indicates a strong genetic basis for epigenetic variation.


July 7, 2019

Complete genome sequence of Enterococcus faecium ATCC 700221.

We report the complete genome sequence of a vancomycin-resistant isolate of Enterococcus faecium derived from human feces. The genome comprises one chromosome of 2.9 Mb and three plasmids. The strain harbors a plasmid-borne vanA-type vancomycin resistance locus and is a member of multilocus sequencing type (MLST) cluster ST-17. Copyright © 2016 McKenney et al.


July 7, 2019

Microevolution of monophasic Salmonella Typhimurium during epidemic, United Kingdom, 2005-2010.

Microevolution associated with emergence and expansion of new epidemic clones of bacterial pathogens holds the key to epidemiologic success. To determine microevolution associated with monophasic Salmonella Typhimurium during an epidemic, we performed comparative whole-genome sequencing and phylogenomic analysis of isolates from the United Kingdom and Italy during 2005-2012. These isolates formed a single clade distinct from recent monophasic epidemic clones previously described from North America and Spain. The UK monophasic epidemic clones showed a novel genomic island encoding resistance to heavy metals and a composite transposon encoding antimicrobial drug resistance genes not present in other Salmonella Typhimurium isolates, which may have contributed to epidemiologic success. A remarkable amount of genotypic variation accumulated during clonal expansion that occurred during the epidemic, including multiple independent acquisitions of a novel prophage carrying the sopE gene and multiple deletion events affecting the phase II flagellin locus. This high level of microevolution may affect antigenicity, pathogenicity, and transmission.


July 7, 2019

ABO allele-level frequency estimation based on population-scale genotyping by next generation sequencing.

The characterization of the ABO blood group status is vital for blood transfusion and solid organ transplantation. Several methods for the molecular characterization of the ABO gene, which encodes the alleles that give rise to the different ABO blood groups, have been described. However, the application of those methods has so far been restricted to selected samples and not been applied to population-scale analysis. We describe a cost-effective method for high-throughput genotyping of the ABO system by next generation sequencing. Sample specific barcodes and sequencing adaptors are introduced during PCR, rendering the products suitable for direct sequencing on Illumina MiSeq or HiSeq instruments. Complete sequence coverage of exons 6 and 7 enables molecular discrimination of the ABO subgroups and many alleles. The workflow was applied to ABO genotype more than a million samples. We report the allele group frequencies calculated on a subset of more than 110,000 sampled individuals of German origin. Further we discuss the potential of the workflow for high resolution genotyping taking the observed allele group frequencies into account. Finally, sequence analysis revealed 287 distinct so far not described alleles of which the most abundant one was identified in 174 samples. The described workflow delivers high resolution ABO genotyping at low cost enabling population-scale molecular ABO characterization.


July 7, 2019

A hot L1 retrotransposon evades somatic repression and initiates human colorectal cancer.

Although human LINE-1 (L1) elements are actively mobilized in many cancers, a role for somatic L1 retrotransposition in tumor initiation has not been conclusively demonstrated. Here, we identify a novel somatic L1 insertion in the APC tumor suppressor gene that provided us with a unique opportunity to determine whether such insertions can actually initiate colorectal cancer (CRC), and if so, how this might occur. Our data support a model whereby a hot L1 source element on Chromosome 17 of the patient’s genome evaded somatic repression in normal colon tissues and thereby initiated CRC by mutating the APC gene. This insertion worked together with a point mutation in the second APC allele to initiate tumorigenesis through the classic two-hit CRC pathway. We also show that L1 source profiles vary considerably depending on the ancestry of an individual, and that population-specific hot L1 elements represent a novel form of cancer risk. © 2016 Scott et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019

Extensive sequencing of seven human genomes to characterize benchmark reference materials.

The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST), is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry. The data come from 12 technologies: BioNano Genomics, Complete Genomics paired-end and LFR, Ion Proton exome, Oxford Nanopore, Pacific Biosciences, SOLiD, 10X Genomics GemCode WGS, and Illumina exome and WGS paired-end, mate-pair, and synthetic long reads. Cell lines, DNA, and data from these individuals are publicly available. Therefore, we expect these data to be useful for revealing novel information about the human genome and improving sequencing technologies, SNP, indel, and structural variant calling, and de novo assembly.


July 7, 2019

Structural and functional analysis of the finished genome of the recently isolated toxic Anabaena sp. WA102.

Very few closed genomes of the cyanobacteria that commonly produce toxic blooms in lakes and reservoirs are available, limiting our understanding of the properties of these organisms. A new anatoxin-a-producing member of the Nostocaceae, Anabaena sp. WA102, was isolated from a freshwater lake in Washington State, USA, in 2013 and maintained in non-axenic culture. The Anabaena sp. WA102 5.7 Mbp genome assembly has been closed with long-read, single-molecule sequencing and separately a draft genome assembly has been produced with short-read sequencing technology. The closed and draft genome assemblies are compared, showing a correlation between long repeats in the genome and the many gaps in the short-read assembly. Anabaena sp. WA102 encodes anatoxin-a biosynthetic genes, as does its close relative Anabaena sp. AL93 (also introduced in this study). These strains are distinguished by differences in the genes for light-harvesting phycobilins, with Anabaena sp. AL93 possessing a phycoerythrocyanin operon. Biologically relevant structural variants in the Anabaena sp. WA102 genome were detected only by long-read sequencing: a tandem triplication of the anaBCD promoter region in the anatoxin-a synthase gene cluster (not triplicated in Anabaena sp. AL93) and a 5-kbp deletion variant present in two-thirds of the population. The genome has a large number of mobile elements (160). Strikingly, there was no synteny with the genome of its nearest fully assembled relative, Anabaena sp. 90. Structural and functional genome analyses indicate that Anabaena sp. WA102 has a flexible genome. Genome closure, which can be readily achieved with long-read sequencing, reveals large scale (e.g., gene order) and local structural features that should be considered in understanding genome evolution and function.


July 7, 2019

Ploidy influences the functional attributes of de novo lager yeast hybrids.

The genomes of hybrid organisms, such as lager yeast (Saccharomyces cerevisiae × Saccharomyces eubayanus), contain orthologous genes, the functionality and effect of which may differ depending on their origin and copy number. How the parental subgenomes in lager yeast contribute to important phenotypic traits such as fermentation performance, aroma production, and stress tolerance remains poorly understood. Here, three de novo lager yeast hybrids with different ploidy levels (allodiploid, allotriploid, and allotetraploid) were generated through hybridization techniques without genetic modification. The hybrids were characterized in fermentations of both high gravity wort (15 °P) and very high gravity wort (25 °P), which were monitored for aroma compound and sugar concentrations. The hybrid strains with higher DNA content performed better during fermentation and produced higher concentrations of flavor-active esters in both worts. The hybrid strains also outperformed both the parent strains. Genome sequencing revealed that several genes related to the formation of flavor-active esters (ATF1, ATF2, EHT1, EEB1, and BAT1) were present in higher copy numbers in the higher ploidy hybrid strains. A direct relationship between gene copy number and transcript level was also observed. The measured ester concentrations and transcript levels also suggest that the functionality of the S. cerevisiae- and S. eubayanus-derived gene products differs. The results contribute to our understanding of the complex molecular mechanisms that determine phenotypes in lager yeast hybrids and are expected to facilitate targeted strain development through interspecific hybridization.


July 7, 2019

A detailed analysis of the recombination landscape of the button mushroom Agaricus bisporus var. bisporus.

The button mushroom (Agaricus bisporus) is one of the world’s most cultivated mushroom species, but in spite of its economic importance, generation of new cultivars by outbreeding is exceptional. Previous genetic analyses of the white bisporus variety, including all cultivars and most wild isolates, revealed that crossing over frequencies are low, which might explain the lack of introducing novel traits into existing cultivars. By generating two high quality whole genome sequence assemblies (one de novo and the other by improving the existing reference genome) of the first commercial white hybrid Horst U1, a detailed study of the crossover (CO) landscape was initiated. Using a set of 626 SNPs in a haploid offspring of 139 single spore isolates and whole genome sequencing on a limited number of homo- and heterokaryotic single spore isolates, we precisely mapped all COs showing that they are almost exclusively restricted to regions of about 100 kb at the chromosome ends. Most basidia of A. bisporus var. bisporus produce two spores and pair preferentially via non-sister nuclei. Combined with the COs restricted to the chromosome ends, these spores retain most of the heterozygosity of the parent, thus explaining how present-day white cultivars are genetically so close to the first hybrid marketed in 1980. To our knowledge this is the first example of an organism which displays such a specific CO landscape. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.


July 7, 2019

1,135 genomes reveal the global pattern of polymorphism in Arabidopsis thaliana.

Arabidopsis thaliana serves as a model organism for the study of fundamental physiological, cellular, and molecular processes. It has also greatly advanced our understanding of intraspecific genome variation. We present a detailed map of variation in 1,135 high-quality re-sequenced natural inbred lines representing the native Eurasian and North African range and recently colonized North America. We identify relict populations that continue to inhabit ancestral habitats, primarily in the Iberian Peninsula. They have mixed with a lineage that has spread to northern latitudes from an unknown glacial refugium and is now found in a much broader spectrum of habitats. Insights into the history of the species and the fine-scale distribution of genetic diversity provide the basis for full exploitation of A. thaliana natural variation through integration of genomes and epigenomes with molecular and non-molecular phenotypes. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.


July 7, 2019

Structural variation detection using next-generation sequencing data: A comparative technical review.

Structural variations (SVs) are mutations in the genome of size at least fifty nucleotides. They contribute to the phenotypic differences among healthy individuals, cause severe diseases and even cancers by breaking or linking genes. Thus, it is crucial to systematically profile SVs in the genome. In the past decade, many next-generation sequencing (NGS)-based SV detection methods have been proposed due to the significant cost reduction of NGS experiments and their ability to unbiasedly detect SVs to the base-pair resolution. These SV detection methods vary in both sensitivity and specificity, since they use different SV-property-dependent and library-property-dependent features. As a result, predictions from different SV callers are often inconsistent. Besides, noise in the data (both platform-specific sequencing error and artificial chimeric reads) impedes the specificity of SV detection. Poorly characterized regions in the human genome (e.g., repeat regions) greatly impact read mapping and in turn affect the SV calling accuracy. Calling of complex SVs requires specialized SV callers. Apart from accuracy, the processing speed of an SV caller is another factor determining its usability. Knowing the pros and cons of different SV calling techniques and the objectives of the biological study are essential for biologists and bioinformaticians to make informed decisions. This paper describes different components in the SV calling pipeline and reviews the techniques used by existing SV callers. Through a simulation study, we also demonstrate that library properties, especially insert size, greatly impact the sensitivity of different SV callers. We hope the community can benefit from this work both in designing new SV calling methods and in selecting the appropriate SV caller for specific biological studies. Copyright © 2016 Elsevier Inc. All rights reserved.


July 7, 2019

Use of multiple sequencing technologies to produce a high-quality genome of the fungus Pseudogymnoascus destructans, the causative agent of bat white-nose syndrome.

White-nose syndrome has recently emerged as one of the most devastating wildlife diseases recorded, causing widespread mortality in numerous bat species throughout eastern North America. Here, we present an improved reference genome of the fungal pathogen Pseudogymnoascus destructans for use in comparative genomic studies. Copyright © 2016 Drees et al.


July 7, 2019

Plasmid dynamics in KPC-positive Klebsiella pneumoniae during long-term patient colonization.

Carbapenem-resistant Klebsiella pneumoniae strains are formidable hospital pathogens that pose a serious threat to patients around the globe due to a rising incidence in health care facilities, high mortality rates associated with infection, and potential to spread antibiotic resistance to other bacterial species, such as Escherichia coli. Over 6 months in 2011, 17 patients at the National Institutes of Health (NIH) Clinical Center became colonized with a highly virulent, transmissible carbapenem-resistant strain of K. pneumoniae. Our real-time genomic sequencing tracked patient-to-patient routes of transmission and informed epidemiologists’ actions to monitor and control this outbreak. Two of these patients remained colonized with carbapenemase-producing organisms for at least 2 to 4 years, providing the opportunity to undertake a focused genomic study of long-term colonization with antibiotic-resistant bacteria. Whole-genome sequencing studies shed light on the underlying complex microbial colonization, including mixed or evolving bacterial populations and gain or loss of plasmids. Isolates from NIH patient 15 showed complex plasmid rearrangements, leaving the chromosome and the blaKPC-carrying plasmid intact but rearranging the two other plasmids of this outbreak strain. NIH patient 16 has shown continuous colonization with blaKPC-positive organisms across multiple time points spanning 2011 to 2015. Genomic studies defined a complex pattern of succession and plasmid transmission across two different K. pneumoniae sequence types and an E. coli isolate. These findings demonstrate the utility of genomic methods for understanding strain succession, genome plasticity, and long-term carriage of antibiotic-resistant organisms. In 2011, the NIH Clinical Center had a nosocomial outbreak involving 19 patients who became colonized or infected with blaKPC-positive Klebsiella pneumoniae. Patients who have intestinal colonization with blaKPC-positive K. pneumoniae are at risk for developing infections that are difficult or nearly impossible to treat with existing antibiotic options. Two of those patients remained colonized with blaKPC-positive Klebsiella pneumoniae for over a year, leading to the initiation of a detailed genomic analysis exploring mixed colonization, plasmid recombination, and plasmid diversification. Whole-genome sequence analysis identified a variety of changes, both subtle and large, in the blaKPC-positive organisms. Long-term colonization of patients with blaKPC-positive Klebsiella pneumoniae creates new opportunities for horizontal gene transfer of plasmids encoding antibiotic resistance genes and poses complications for the delivery of health care. Copyright © 2016 Conlan et al.


July 7, 2019

Cloche is a bHLH-PAS transcription factor that drives haemato-vascular specification.

Vascular and haematopoietic cells organize into specialized tissues during early embryogenesis to supply essential nutrients to all organs and thus play critical roles in development and disease. At the top of the haemato-vascular specification cascade lies cloche, a gene that when mutated in zebrafish leads to the striking phenotype of loss of most endothelial and haematopoietic cells and a significant increase in cardiomyocyte numbers. Although this mutant has been analysed extensively to investigate mesoderm diversification and differentiation and continues to be broadly used as a unique avascular model, the isolation of the cloche gene has been challenging due to its telomeric location. Here we used a deletion allele of cloche to identify several new cloche candidate genes within this genomic region, and systematically genome-edited each candidate. Through this comprehensive interrogation, we succeeded in isolating the cloche gene and discovered that it encodes a PAS-domain-containing bHLH transcription factor, and that it is expressed in a highly specific spatiotemporal pattern starting during late gastrulation. Gain-of-function experiments show that it can potently induce endothelial gene expression. Epistasis experiments reveal that it functions upstream of etv2 and tal1, the earliest expressed endothelial and haematopoietic transcription factor genes identified to date. A mammalian cloche orthologue can also rescue blood vessel formation in zebrafish cloche mutants, indicating a highly conserved role in vertebrate vasculogenesis and haematopoiesis. The identification of this master regulator of endothelial and haematopoietic fate enhances our understanding of early mesoderm diversification and may lead to improved protocols for the generation of endothelial and haematopoietic cells in vivo and in vitro.

