Menu
July 7, 2019  |  

Automated structural variant verification in human genomesw using single-molecule electronic DNA mapping.

The importance of structural variation in human disease and the difficulty of detecting structural variants larger than 50 base pairs has led to the development of several long-read sequencing technologies and optical mapping platforms. Frequently, multiple technologies and ad hoc methods are required to obtain a consensus regarding the location, size and nature of a structural variant, with no approach able to reliably bridge the gap of variant sizes between the domain of short-read approaches and the largest rearrangements observed with optical mapping. To address this unmet need, we have developed a new software package, SV-VerifyTM, which utilizes data collected with the Nabsys High Definition Mapping (HD-MappingTM) system, to perform hypothesis-based verification of putative deletions. We demonstrate that whole genome maps, constructed from electronic detection of tagged DNA, hundreds of kilobases in length, can be used effectively to facilitate calling of structural variants ranging in size from 300 base pairs to hundreds of kilobase pairs. SV-Verify implements hypothesis-based verification of putative structural variants using a set of support vector machines and is capable of concurrently testing several thousand independent hypotheses. We describe support vector machine training, utilizing a well-characterized human genome, and application of the resulting classifiers to another human genome, demonstrating high sensitivity and specificity for deletions >= 300 base pairs.


July 7, 2019  |  

Comparative genomic analysis of Mycobacterium tuberculosis Beijing-like strains revealed specific genetic variations associated with virulence and drug resistance.

Isolates of the Mycobacterium tuberculosis lineage 2/East-Asian are considered one of the most successful strains due to their increased pathogenicity, hyper-virulence associated with drug resistance, and high transmission. Recent studies in Colombia have shown that the Beijing-like genotype is associated with multidrug-resistance and high prevalence in the southwest of the country, but the genetic basis of its success in dissemination is unknown. In contribution to this matter, we obtained the whole sequences of six genomes of clinical isolates assigned to the Beijing-like genotype. The genomes were compared with the reference genome of M. tuberculosis H37Rv and 53 previously published M. tuberculosis genomes. We found that the six Beijing-like isolates belong to a modern Beijing sub-lineage and share specific genomic variants: i.e. deletion in the PPE8 gene, in Rv3806c (ubiA) responsible of high ethambutol resistance and in Rv3862c (whiB6) which is involved in granuloma formation and virulence, are some of them. Moreover, each isolated has exclusively single nucleotide polymorphisms (SNPs) in genes related with cell wall processes and cell metabolism. We identified polymorphisms in genes related to drug resistance that could explain the drug-resistant phenotypes found in the six isolates from Colombia. We hypothesize that changes due to these genetic variations contribute to the success of these strains. Finally, we analyzed the IS6110 insertion sequences finding very low variance between them, suggesting that SNPs is the major cause of variability found in Beijing-like strains circulating in Colombia. Copyright © 2017 Elsevier B.V. All rights reserved.


July 7, 2019  |  

Emergence and genomic diversification of a virulent serogroup W: ST-2881 (CC175) Neisseria meningitidis clone in the African meningitis belt

Countries of the African ‘meningitis belt’ are susceptible to meningococcal meningitis outbreaks. While in the past major epidemics have been primarily caused by serogroup A meningococci, W strains are currently responsible for most of the cases. After an epidemic in Mecca in 2000, W:ST-11 strains have caused many outbreaks worldwide. An unrelated W:ST-2881 clone was described for the first time in 2002, with the first meningitis cases caused by these bacteria reported in 2003. Here we describe results of a comparative whole-genome analysis of 74 W:ST-2881 strains isolated within the framework of two longitudinal colonization and disease studies conducted in Ghana and Burkina Faso. Genomic data indicate that the W:ST-2881 clone has emerged from Y:ST-175(CC175) bacteria by capsule switching. The circulating W:ST-2881 populations were composed of a variety of closely related but distinct genomic variants with no systematic differences between colonization and disease isolates. Two distinct and geographically clustered phylogenetic clonal variants were identified in Burkina Faso and a third in Ghana. On the basis of the presence or absence of 17 recombination fragments, the Ghanaian variant could be differentiated into five clusters. All 25 Ghanaian disease isolates clustered together with 23 out of 40 Ghanaian isolates associated with carriage within one cluster, indicating that W:ST-2881 clusters differ in virulence. More than half of the genes affected by horizontal gene transfer encoded proteins of the ‘cell envelope’ and the ‘transport/binding protein’ categories, which indicates that exchange of non-capsular antigens plays an important role in immune evasion.


July 7, 2019  |  

Pearl millet genome sequence provides a resource to improve agronomic traits in arid environments.

Pearl millet [Cenchrus americanus (L.) Morrone] is a staple food for more than 90 million farmers in arid and semi-arid regions of sub-Saharan Africa, India and South Asia. We report the ~1.79 Gb draft whole genome sequence of reference genotype Tift 23D2B1-P1-P5, which contains an estimated 38,579 genes. We highlight the substantial enrichment for wax biosynthesis genes, which may contribute to heat and drought tolerance in this crop. We resequenced and analyzed 994 pearl millet lines, enabling insights into population structure, genetic diversity and domestication. We use these resequencing data to establish marker trait associations for genomic selection, to define heterotic pools, and to predict hybrid performance. We believe that these resources should empower researchers and breeders to improve this important staple crop.


July 7, 2019  |  

Genomic and functional analysis of Romboutsia ilealis CRIBT reveals adaptation to the small intestine.

The microbiota in the small intestine relies on their capacity to rapidly import and ferment available carbohydrates to survive in a complex and highly competitive ecosystem. Understanding how these communities function requires elucidating the role of its key players, the interactions among them and with their environment/host.The genome of the gut bacterium Romboutsia ilealis CRIBT was sequenced with multiple technologies (Illumina paired-end, mate-pair and PacBio). The transcriptome was sequenced (Illumina HiSeq) after growth on three different carbohydrate sources, and short chain fatty acids were measured via HPLC.We present the complete genome of Romboutsia ilealis CRIBT, a natural inhabitant and key player of the small intestine of rats. R. ilealis CRIBT possesses a circular chromosome of 2,581,778 bp and a plasmid of 6,145 bp, carrying 2,351 and eight predicted protein coding sequences, respectively. Analysis of the genome revealed limited capacity to synthesize amino acids and vitamins, whereas multiple and partially redundant pathways for the utilization of different relatively simple carbohydrates are present. Transcriptome analysis allowed identification of the key components in the degradation of glucose, L-fucose and fructo-oligosaccharides.This revealed that R. ilealis CRIBT is adapted to a nutrient-rich environment where carbohydrates, amino acids and vitamins are abundantly available.


July 7, 2019  |  

Parallel evolution of group B Streptococcus hypervirulent clonal complex 17 unveils new pathoadaptive mutations.

Group B Streptococcus (GBS) is a commensal of the gastrointestinal and genitourinary tracts, while a prevailing cause of neonatal disease worldwide. Of the various clonal complexes (CCs), CC17 is overrepresented in GBS-infected newborns for reasons that are still largely unknown. Here, we report a comprehensive genomic analysis of 626 CC17 isolates collected worldwide, identifying the genetic traits behind their successful adaptation to humans and the underlying differences between carriage and clinical strains. Comparative analysis with 923 GBS genomes belonging to CC1, CC19, and CC23 revealed that the evolution of CC17 is distinct from that of other human-adapted lineages and recurrently targets functions related to nucleotide and amino acid metabolism, cell adhesion, regulation, and immune evasion. We show that the most distinctive features of disease-specific CC17 isolates were frequent mutations in the virulence-associated CovS and Stk1 kinases, underscoring the crucial role of the entire CovRS regulatory pathway in modulating the pathogenicity of GBS. Importantly, parallel and convergent evolution of major components of the bacterial cell envelope, such as the capsule biosynthesis operon, the pilus, and Rib, reflects adaptation to host immune pressures and should be taken into account in the ongoing development of a GBS vaccine. The presence of recurrent targets of evolution not previously implicated in virulence also opens the way for uncovering new functions involved in host colonization and GBS pathogenesis. IMPORTANCE The incidence of group B Streptococcus (GBS) neonatal disease continues to be a significant cause of concern worldwide. Strains belonging to clonal complex 17 (CC17) are the most frequently responsible for GBS infections in neonates, especially among late-onset disease cases. Therefore, we undertook the largest genomic study of GBS CC17 strains to date to decipher the genetic bases of their remarkable colonization and infection ability. We show that crucial functions involved in different steps of the colonization or infection process of GBS are distinctly mutated during the adaptation of CC17 to the human host. In particular, our results implicate the CovRS two-component regulator of virulence in the differentiation between carriage- and disease-associated isolates. Not only does this work raise important implications for the ongoing development of a vaccine against GBS but might also drive the discovery of key functions for GBS adaptation and pathogenesis that have been overlooked until now. Author Video: An author video summary of this article is available.


July 7, 2019  |  

SVachra: a tool to identify genomic structural variation in mate pair sequencing data containing inward and outward facing reads.

Characterization of genomic structural variation (SV) is essential to expanding the research and clinical applications of genome sequencing. Reliance upon short DNA fragment paired end sequencing has yielded a wealth of single nucleotide variants and internal sequencing read insertions-deletions, at the cost of limited SV detection. Multi-kilobase DNA fragment mate pair sequencing has supplemented the void in SV detection, but introduced new analytic challenges requiring SV detection tools specifically designed for mate pair sequencing data. Here, we introduce SVachra – Structural Variation Assessment of CHRomosomal Aberrations, a breakpoint calling program that identifies large insertions-deletions, inversions, inter- and intra-chromosomal translocations utilizing both inward and outward facing read types generated by mate pair sequencing.We demonstrate SVachra’s utility by executing the program on large-insert (Illumina Nextera) mate pair sequencing data from the personal genome of a single subject (HS1011). An additional data set of long-read (Pacific BioSciences RSII) was also generated to validate SV calls from SVachra and other comparison SV calling programs. SVachra exhibited the highest validation rate and reported the widest distribution of SV types and size ranges when compared to other SV callers.SVachra is a highly specific breakpoint calling program that exhibits a more unbiased SV detection methodology than other callers.


July 7, 2019  |  

The Mobile Element Locator Tool (MELT): population-scale mobile element discovery and biology.

Mobile element insertions (MEIs) represent ~25% of all structural variants in human genomes. Moreover, when they disrupt genes, MEIs can influence human traits and diseases. Therefore, MEIs should be fully discovered along with other forms of genetic variation in whole genome sequencing (WGS) projects involving population genetics, human diseases, and clinical genomics. Here, we describe the Mobile Element Locator Tool (MELT), which was developed as part of the 1000 Genomes Project to perform MEI discovery on a population scale. Using both Illumina WGS data and simulations, we demonstrate that MELT outperforms existing MEI discovery tools in terms of speed, scalability, specificity, and sensitivity, while also detecting a broader spectrum of MEI-associated features. Several run modes were developed to perform MEI discovery on local and cloud systems. In addition to using MELT to discover MEIs in modern humans as part of the 1000 Genomes Project, we also used it to discover MEIs in chimpanzees and ancient (Neanderthal and Denisovan) hominids. We detected diverse patterns of MEI stratification across these populations that likely were caused by (1) diverse rates of MEI production from source elements, (2) diverse patterns of MEI inheritance, and (3) the introgression of ancient MEIs into modern human genomes. Overall, our study provides the most comprehensive map of MEIs to date spanning chimpanzees, ancient hominids, and modern humans and reveals new aspects of MEI biology in these lineages. We also demonstrate that MELT is a robust platform for MEI discovery and analysis in a variety of experimental settings.© 2017 Gardner et al.; Published by Cold Spring Harbor Laboratory Press.


July 7, 2019  |  

Archetype JC polyomavirus prevails in a rare case of JC polyomavirus nephropathy and in stable renal transplant recipients with JC polyomavirus viruria.

JC polyomavirus (JCPyV) is reactivated in approximately 20% of renal transplant recipients and it may rarely cause JCPyV-associated nephropathy (JCPyVAN). Whereas progressive multifocal leukoencephalopathy of the brain is caused by rearranged neurotropic JCPyV, little is known about viral sequence variation in JCPyVAN due to the rarity of this condition.Using single-molecule real-time sequencing, characterization of full-length JCPyV genomes from urine and plasma of one JCPyVAN patient and twenty stable renal transplant recipients with JCPyV viruria was attempted. Sequence analysis of JCPyV strains was performed with the emphasis on the NCCR region, the major capsid protein gene VP1 and the large T antigen (LTag) gene.Exclusively archetype strains were identified in urine of the JCPyVAN patient. Full-length JCPyV sequences were not retrieved from plasma. Archetype strains were found in urine of nineteen stable renal transplant recipients, with JCPyV quasispecies detected in five samples. In a patient with minor graft dysfunction, a strain with archetype-like NCCR region was discovered. Individual point mutations were detected in both VP1 and LTag genes.Archetype JCPyV was dominant in the JCPyVAN patient and in stable renal transplant recipients. Archetype rather than rearranged JCPyV seems to drive the pathogenesis of JCPyVAN.


July 7, 2019  |  

Structural variation offers new home for disease associations and gene discovery

Following completion of the Human Genome Project, most studies of human genetic variation have centered on single nucleotide polymorphisms (SNPs). SNPs are numerous in individual genomes and serve as useful genetic markers in association studies across a population. These markers have been leveraged to identify genetic loci for disease risk and draw associations with numerous traits of interest. Despite their usefulness, SNPs do not tell the whole story. For example, most SNPs are associated with only a small increased risk of disease, and they usually cannot identify on their own which genes are causal. This has resulted in what many researchers have referred to as missing or hidden heritability.


July 7, 2019  |  

Rapid and affordable size-selected PacBio single-molecule real-time sequencing template library construction using the bead-beating DNA extraction method

This study demonstrated that bead-beating method facilitates a simple and rapid protocol for genomic DNA isolation for Pacific BioSciences (PacBio) sequencing with library construction of sufficient length. The protocol may also be beneficial for inactivating pathogens by simultaneous and instant DNA fragmentation, with no special equipment required to obtain large DNA fragments. This protocol was comparable in terms of quality to the standard protocol suggested by PacBioand represents an alternative, rapid shortcut for performing accurate PacBio sequencing.


July 7, 2019  |  

Dense and accurate whole-chromosome haplotyping of individual genomes.

The diploid nature of the human genome is neglected in many analyses done today, where a genome is perceived as a set of unphased variants with respect to a reference genome. This lack of haplotype-level analyses can be explained by a lack of methods that can produce dense and accurate chromosome-length haplotypes at reasonable costs. Here we introduce an integrative phasing strategy that combines global, but sparse haplotypes obtained from strand-specific single-cell sequencing (Strand-seq) with dense, yet local, haplotype information available through long-read or linked-read sequencing. We provide comprehensive guidance on the required sequencing depths and reliably assign more than 95% of alleles (NA12878) to their parental haplotypes using as few as 10 Strand-seq libraries in combination with 10-fold coverage PacBio data or, alternatively, 10X Genomics linked-read sequencing data. We conclude that the combination of Strand-seq with different technologies represents an attractive solution to chart the genetic variation of diploid genomes.


July 7, 2019  |  

Variant review with the Integrative Genomics Viewer.

Manual review of aligned reads for confirmation and interpretation of variant calls is an important step in many variant calling pipelines for next-generation sequencing (NGS) data. Visual inspection can greatly increase the confidence in calls, reduce the risk of false positives, and help characterize complex events. The Integrative Genomics Viewer (IGV) was one of the first tools to provide NGS data visualization, and it currently provides a rich set of tools for inspection, validation, and interpretation of NGS datasets, as well as other types of genomic data. Here, we present a short overview of IGV’s variant review features for both single-nucleotide variants and structural variants, with examples from both cancer and germline datasets. IGV is freely available at https://www.igv.org Cancer Res; 77(21); e31-34. ©2017 AACR.©2017 American Association for Cancer Research.


July 7, 2019  |  

Tools for annotation and comparison of structural variation.

The impact of structural variants (SVs) on a variety of organisms and diseases like cancer has become increasingly evident. Methods for SV detection when studying genomic differences across cells, individuals or populations are being actively developed. Currently, just a few methods are available to compare different SVs callsets, and no specialized methods are available to annotate SVs that account for the unique characteristics of these variant types. Here, we introduce SURVIVOR_ant, a tool that compares types and breakpoints for candidate SVs from different callsets and enables fast comparison of SVs to genomic features such as genes and repetitive regions, as well as to previously established SV datasets such as from the 1000 Genomes Project. As proof of concept we compared 16 SV callsets generated by different SV calling methods on a single genome, the Genome in a Bottle sample HG002 (Ashkenazi son), and annotated the SVs with gene annotations, 1000 Genomes Project SV calls, and four different types of repetitive regions. Computation time to annotate 134,528 SVs with 33,954 of annotations was 22 seconds on a laptop.


July 7, 2019  |  

Hunting structural variants: Population by population

Until recently, most population-scale genome sequencing studies have focused on identifying single nucleotide variants (SNVs) to explore genetic differences between individuals. Like so many SNV-based genome-wide association studies, however, these efforts have had difficulty identifying causative genetic mechanisms underlying most complex functions. More and more, the genomics community has realised that structural variation is likely responsible for many of the traits and phenotypes that scientists have not been able to attribute to SNVs. This class of variants, defined as genetic differences of 50 bp or larger, accounts for most of the DNA sequence differences between any two people. Structural variants (SVs) are also already known to cause many common and rare diseases including ALS, schizophrenia, leukemia, Carney complex, and Huntington’s disease. Despite the importance of SVs, these larger variants have been understudied and underreported compared to their single-nucleotide counterparts. One reason is that they remain difficult to detect. Their length often means they cannot be fully spanned using short sequencing reads. They also often occur in highly repetitive or GC-rich regions of the genome, making them challenging targets. As such, this class of human genetic variation has remained vastly under-explored in global populations and is now ripe for discovery.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.