Menu
September 22, 2019

Genomic variation among and within six Juglans species.

Genomic analysis in Juglans (walnuts) is expected to transform the breeding and agricultural production of both nuts and lumber. To that end, we report here the determination of reference sequences for six additional relatives of Juglans regia: Juglans sigillata (also from section Dioscaryon), Juglans nigra, Juglans microcarpa, Juglans hindsii (from section Rhysocaryon), Juglans cathayensis (from section Cardiocaryon), and the closely related Pterocarya stenoptera While these are ‘draft’ genomes, ranging in size between 640Mbp and 990Mbp, their contiguities and accuracies can support powerful annotations of genomic variation that are often the foundation of new avenues of research and breeding. We annotated nucleotide divergence and synteny by creating complete pairwise alignments of each reference genome to the remaining six. In addition, we have re-sequenced a sample of accessions from four Juglans species (including regia). The variation discovered in these surveys comprises a critical resource for experimentation and breeding, as well as a solid complementary annotation. To demonstrate the potential of these resources the structural and sequence variation in and around the polyphenol oxidase loci, PPO1 and PPO2 were investigated. As reported for other seed crops variation in this gene is implicated in the domestication of walnuts. The apparently Juglandaceae specific PPO1 duplicate shows accelerated divergence and an excess of amino acid replacement on the lineage leading to accessions of the domesticated nut crop species, Juglans regia and sigillata. Copyright © 2018 Stevens et al.


September 22, 2019

Using XCAVATOR and EXCAVATOR2 to Identify CNVs from WGS, WES, and TS Data.

Copy Number Variants (CNVs) are structural rearrangements contributing to phenotypic variation but also associated with many disease states. In recent years, the identification of CNVs from high-throughput sequencing experiments has become a common practice for both research and clinical purposes. Several computational methods have been developed so far. In this unit, we describe and give instructions on how to run two read count-based tools, XCAVATOR and EXCAVATOR2, which are tailored for the detection of both germline and somatic CNVs from different sequencing experiments (whole-genome, whole-exome, and targeted) in various disease contexts and population genetic studies. © 2018 by John Wiley & Sons, Inc.© 2018 John Wiley & Sons, Inc.


September 22, 2019

Integrating long-range connectivity information into de Bruijn graphs.

The de Bruijn graph is a simple and efficient data structure that is used in many areas of sequence analysis including genome assembly, read error correction and variant calling. The data structure has a single parameter k, is straightforward to implement and is tractable for large genomes with high sequencing depth. It also enables representation of multiple samples simultaneously to facilitate comparison. However, unlike the string graph, a de Bruijn graph does not retain long range information that is inherent in the read data. For this reason, applications that rely on de Bruijn graphs can produce sub-optimal results given their input data.We present a novel assembly graph data structure: the Linked de Bruijn Graph (LdBG). Constructed by adding annotations on top of a de Bruijn graph, it stores long range connectivity information through the graph. We show that with error-free data it is possible to losslessly store and recover sequence from a Linked de Bruijn graph. With assembly simulations we demonstrate that the LdBG data structure outperforms both our de Bruijn graph and the String Graph Assembler (SGA). Finally we apply the LdBG to Klebsiella pneumoniae short read data to make large (12 kbp) variant calls, which we validate using PacBio sequencing data, and to characterize the genomic context of drug-resistance genes.Linked de Bruijn Graphs and associated algorithms are implemented as part of McCortex, which is available under the MIT license at https://github.com/mcveanlab/mccortex.Supplementary data are available at Bioinformatics online.


September 22, 2019

Tracing genomic divergence of Vibrio bacteria in the Harveyi clade.

The mechanism of bacterial speciation remains a topic of tremendous interest. To understand the ecological and evolutionary mechanisms of speciation in Vibrio bacteria, we analyzed the genomic dissimilarities between three closely related species in the so-called Harveyi clade of the genus Vibrio, V. campbellii, V. jasicida, and V. hyugaensis The analysis focused on strains isolated from diverse geographic locations over a long period of time. The results of phylogenetic analyses and calculations of average nucleotide identity (ANI) supported the classification of V. jasicida and V. hyugaensis into two species. These analyses also identified two well-supported clades in V. campbellii; however, strains from both clades were classified as members of the same species. Comparative analyses of the complete genome sequences of representative strains from the three species identified higher syntenic coverage between genomes of V. jasicida and V. hyugaensis than that between the genomes from the two V. campbellii clades. The results from comparative analyses of gene content between bacteria from the three species did not support the hypothesis that gene gain and/or loss contributed to their speciation. We also did not find support for the hypothesis that ecological diversification toward associations with marine animals contributed to the speciation of V. jasicida and V. hyugaensis Overall, based on the results obtained in this study, we propose that speciation in Harveyi clade species is a result of stochastic diversification of local populations, which was influenced by multiple evolutionary processes, followed by extinction events.IMPORTANCE To investigate the mechanisms underlying speciation in the genus Vibrio, we provided a well-assembled reference of genomes and performed systematic genomic comparisons among three evolutionarily closely related species. We resolved taxonomic ambiguities and identified genomic features separating the three species. Based on the study results, we propose a hypothesis explaining how species in the Harveyi clade of Vibrio bacteria diversified. Copyright © 2018 American Society for Microbiology.


September 22, 2019

Whole genome sequencing, de novo assembly and phenotypic profiling for the new budding yeast species Saccharomyces jurei.

Saccharomyces sensu stricto complex consist of yeast species, which are not only important in the fermentation industry but are also model systems for genomic and ecological analysis. Here, we present the complete genome assemblies of Saccharomyces jurei, a newly discovered Saccharomyces sensu stricto species from high altitude oaks. Phylogenetic and phenotypic analysis revealed that S. jurei is more closely related to S. mikatae, than S. cerevisiae, and S. paradoxus The karyotype of S. jurei presents two reciprocal chromosomal translocations between chromosome VI/VII and I/XIII when compared to the S. cerevisiae genome. Interestingly, while the rearrangement I/XIII is unique to S. jurei, the other is in common with S. mikatae strain IFO1815, suggesting shared evolutionary history of this species after the split between S. cerevisiae and S. mikatae The number of Ty elements differed in the new species, with a higher number of Ty elements present in S. jurei than in S. cerevisiae Phenotypically, the S. jurei strain NCYC 3962 has relatively higher fitness than the other strain NCYC 3947T under most of the environmental stress conditions tested and showed remarkably increased fitness in higher concentration of acetic acid compared to the other sensu stricto species. Both strains were found to be better adapted to lower temperatures compared to S. cerevisiae. Copyright © 2018 Naseeb et al.


September 22, 2019

Conservation genomics of the declining North American bumblebee Bombus terricola reveals inbreeding and selection on immune genes.

The yellow-banded bumblebee Bombus terricola was common in North America but has recently declined and is now on the IUCN Red List of threatened species. The causes of B. terricola’s decline are not well understood. Our objectives were to create a partial genome and then use this to estimate population data of conservation interest, and to determine whether genes showing signs of recent selection suggest a specific cause of decline. First, we generated a draft partial genome (contig set) for B. terricola, sequenced using Pacific Biosciences RS II at an average depth of 35×. Second, we sequenced the individual genomes of 22 bumblebee gynes from Ontario and Quebec using Illumina HiSeq 2500, each at an average depth of 20×, which were used to improve the PacBio genome calls and for population genetic analyses. The latter revealed that several samples had long runs of homozygosity, and individuals had high inbreeding coefficient F, consistent with low effective population size. Our data suggest that B. terricola’s effective population size has decreased orders of magnitude from pre-Holocene levels. We carried out tests of selection to identify genes that may have played a role in ameliorating environmental stressors underlying B. terricola’s decline. Several immune-related genes have signatures of recent positive selection, which is consistent with the pathogen-spillover hypothesis for B. terricola’s decline. The new B. terricola contig set can help solve the mystery of bumblebee decline by enabling functional genomics research to directly assess the health of pollinators and identify the stressors causing declines.


September 22, 2019

Whole-genome resequencing and pan-transcriptome reconstruction highlight the impact of genomic structural Variation on secondary metabolite gene clusters in the grapevine Esca pathogen Phaeoacremonium minimum.

The Ascomycete fungus Phaeoacremonium minimum is one of the primary causal agents of Esca, a widespread and damaging grapevine trunk disease. Variation in virulence among Pm. minimum isolates has been reported, but the underlying genetic basis of the phenotypic variability remains unknown. The goal of this study was to characterize intraspecific genetic diversity and explore its potential impact on virulence functions associated with secondary metabolism, cellular transport, and cell wall decomposition. We generated a chromosome-scale genome assembly, using single molecule real-time sequencing, and resequenced the genomes and transcriptomes of multiple isolates to identify sequence and structural polymorphisms. Numerous insertion and deletion events were found for a total of about 1 Mbp in each isolate. Structural variation in this extremely gene dense genome frequently caused presence/absence polymorphisms of multiple adjacent genes, mostly belonging to biosynthetic clusters associated with secondary metabolism. Because of the observed intraspecific diversity in gene content due to structural variation we concluded that a transcriptome reference developed from a single isolate is insufficient to represent the virulence factor repertoire of the species. We therefore compiled a pan-transcriptome reference of Pm. minimum comprising a non-redundant set of 15,245 protein-coding sequences. Using naturally infected field samples expressing Esca symptoms, we demonstrated that mapping of meta-transcriptomics data on a multi-species reference that included the Pm. minimum pan-transcriptome allows the profiling of an expanded set of virulence factors, including variable genes associated with secondary metabolism and cellular transport.


September 22, 2019

Evolution of the U.S. biological select agent Rathayibacter toxicus.

Rathayibacter toxicus is a species of Gram-positive, corynetoxin-producing bacteria that causes annual ryegrass toxicity, a disease often fatal to grazing animals. A phylogenomic approach was employed to model the evolution of R. toxicus to explain the low genetic diversity observed among isolates collected during a 30-year period of sampling in three regions of Australia, gain insight into the taxonomy of Rathayibacter, and provide a framework for studying these bacteria. Analyses of a data set of more than 100 sequenced Rathayibacter genomes indicated that Rathayibacter forms nine species-level groups. R. toxicus is the most genetically distant, and evidence suggested that this species experienced a dramatic event in its evolution. Its genome is significantly reduced in size but is colinear to those of sister species. Moreover, R. toxicus has low intergroup genomic diversity and almost no intragroup genomic diversity between ecologically separated isolates. R. toxicus is the only species of the genus that encodes a clustered regularly interspaced short palindromic repeat (CRISPR) locus and that is known to host a bacteriophage parasite. The spacers, which represent a chronological history of infections, were characterized for information on past events. We propose a three-stage process that emphasizes the importance of the bacteriophage and CRISPR in the genome reduction and low genetic diversity of the R. toxicus species.IMPORTANCERathayibacter toxicus is a toxin-producing species found in Australia and is often fatal to grazing animals. The threat of introduction of the species into the United States led to its inclusion in the Federal Select Agent Program, which makes R. toxicus a highly regulated species. This work provides novel insights into the evolution of R. toxicusR. toxicus is the only species in the genus to have acquired a CRISPR adaptive immune system to protect against bacteriophages. Results suggest that coexistence with the bacteriophage NCPPB3778 led to the massive shrinkage of the R. toxicus genome, species divergence, and the maintenance of low genetic diversity in extant bacterial groups. This work contributes to an understanding of the evolution and ecology of an agriculturally important species of bacteria. Copyright © 2018 Davis et al.


September 22, 2019

Population genomics of Culiseta melanura, the principal vector of Eastern equine encephalitis virus in the United States.

Eastern Equine Encephalitis (EEE) (Togaviridae, Alphavirus) is a highly pathogenic mosquito-borne arbovirus that circulates in an enzootic cycle involving Culiseta melanura mosquitoes and wild Passeriformes birds in freshwater swamp habitats. Recently, the northeastern United States has experienced an intensification of virus activity with increased human involvement and northward expansion into new regions. In addition to its principal role in enzootic transmission of EEE virus among avian hosts, recent studies on the blood-feeding behavior of Cs. melanura throughout its geographic range suggest that this mosquito may also be involved in epizootic / epidemic transmission to equines and humans in certain locales. Variations in blood feeding behavior may be a function of host availability, environmental factors, and/or underlying genetic differences among regional populations. Despite the importance of Cs. melanura in transmission and maintenance of EEE virus, the genetics of this species remains largely unexplored.To investigate the occurrence of genetic variation in Cs. melanura, the genome of this mosquito vector was sequenced resulting in a draft genome assembly of 1.28 gigabases with a contig N50 of 93.36 kilobases. Populations of Cs. melanura from 10 EEE virus foci in the eastern North America were genotyped with double-digest RAD-seq. Following alignment of reads to the reference genome, variant calling, and filtering, 40,384 SNPs were retained for downstream analyses. Subsequent analyses revealed genetic differentiation between northern and southern populations of this mosquito species. Moreover, limited fine-scale population structure was detected throughout northeastern North America, suggesting local differentiation of populations but also a history of ancestral polymorphism or contemporary gene flow. Additionally, a genetically distinct cluster was identified predominantly at two northern sites.This study elucidates the first evidence of fine-scale population structure in Cs. melanura throughout its eastern range and detects evidence of gene flow between populations in northeastern North America. This investigation provides the groundwork for examining the consequences of genetic variations in the populations of this mosquito species that could influence vector-host interactions and the risk of human and equine infection with EEE virus.


September 22, 2019

Comparison of highly and weakly virulent Dickeya solani strains, with a view on the pangenome and panregulon of this species.

Bacteria belonging to the genera Dickeya and Pectobacterium are responsible for significant economic losses in a wide variety of crops and ornamentals. During last years, increasing losses in potato production have been attributed to the appearance of Dickeya solani. The D. solani strains investigated so far share genetic homogeneity, although different virulence levels were observed among strains of various origins. The purpose of this study was to investigate the genetic traits possibly related to the diverse virulence levels by means of comparative genomics. First, we developed a new genome assembly pipeline which allowed us to complete the D. solani genomes. Four de novo sequenced and ten publicly available genomes were used to identify the structure of the D. solani pangenome, in which 74.8 and 25.2% of genes were grouped into the core and dispensable genome, respectively. For D. solani panregulon analysis, we performed a binding site prediction for four transcription factors, namely CRP, KdgR, PecS and Fur, to detect the regulons of these virulence regulators. Most of the D. solani potential virulence factors were predicted to belong to the accessory regulons of CRP, KdgR, and PecS. Thus, some differences in gene expression could exist between D. solani strains. The comparison between a highly and a low virulent strain, IFB0099 and IFB0223, respectively, disclosed only small differences between their genomes but significant differences in the production of virulence factors like pectinases, cellulases and proteases, and in their mobility. The D. solani strains also diverge in the number and size of prophages present in their genomes. Another relevant difference is the disruption of the adhesin gene fhaB2 in the highly virulent strain. Strain IFB0223, which has a complete adhesin gene, is less mobile and less aggressive than IFB0099. This suggests that in this case, mobility rather than adherence is needed in order to trigger disease symptoms. This study highlights the utility of comparative genomics in predicting D. solani traits involved in the aggressiveness of this emerging plant pathogen.


September 22, 2019

Variant O89 O-antigen of E. coli is associated with group 1 capsule loci and multidrug resistance.

Bacterial surface polysaccharides play significant roles in fitness and virulence. In Gram-negative bacteria such as Escherichia coli, major surface polysaccharides are lipopolysaccharide (LPS) and capsule, representing O- and K-antigens, respectively. There are multiple combinations of O:K types, many of which are well-characterized and can be related to ecotype or pathotype. In this investigation, we have identified a novel O:K permutation resulting through a process of major genome reorganization in a clade of E. coli. A multidrug-resistant, extended-spectrum ß-lactamase (ESBL)-producing strain – E. coli 26561 – represented a prototype of strains combining a locus variant of O89 and group 1 capsular polysaccharide. Specifically, the variant O89 locus in this strain was truncated at gnd, flanked by insertion sequences and located between nfsB and ybdK and we apply the term O89m for this variant. The prototype lacked colanic acid and O-antigen loci between yegH and hisI with this tandem polysaccharide locus being replaced with a group 1 capsule (G1C) which, rather than being a recognized E. coli capsule type, this locus matched to Klebsiella K10 capsule type. A genomic survey identified more than 200 E. coli strains which possessed the O89m locus variant with one of a variety of G1C types. Isolates from our collection with the combination of O89m and G1C all displayed a mucoid phenotype and E. coli 26561 was unusual in exhibiting a mucoviscous phenotype more recognized as a characteristic among Klebsiella strains. Despite the locus truncation and novel location, all O89m:G1C strains examined showed a ladder pattern typifying smooth LPS and also showed high molecular weight, alcian blue-staining polysaccharide in cellular and/or extra-cellular fractions. Expression of both O-antigen and capsule biosynthesis loci were confirmed in prototype strain 26561 through quantitative proteome analysis. Further in silico exploration of more than 200 E. coli strains possessing the O89m:G1C combination identified a very high prevalence of multidrug resistance (MDR) – 85% possessed resistance to three or more antibiotic classes and a high proportion (58%) of these carried ESBL and/or carbapenemase. The increasing isolation of O89m:G1C isolates from extra-intestinal infection sites suggests that these represents an emergent clade of invasive, MDR E. coli.


September 22, 2019

Draft genome assembly of the invasive cane toad, Rhinella marina.

The cane toad (Rhinella marina formerly Bufo marinus) is a species native to Central and South America that has spread across many regions of the globe. Cane toads are known for their rapid adaptation and deleterious impacts on native fauna in invaded regions. However, despite an iconic status, there are major gaps in our understanding of cane toad genetics. The availability of a genome would help to close these gaps and accelerate cane toad research.We report a draft genome assembly for R. marina, the first of its kind for the Bufonidae family. We used a combination of long-read Pacific Biosciences RS II and short-read Illumina HiSeq X sequencing to generate 359.5 Gb of raw sequence data. The final hybrid assembly of 31,392 scaffolds was 2.55 Gb in length with a scaffold N50 of 168 kb. BUSCO analysis revealed that the assembly included full length or partial fragments of 90.6% of tetrapod universal single-copy orthologs (n = 3950), illustrating that the gene-containing regions have been well assembled. Annotation predicted 25,846 protein coding genes with similarity to known proteins in Swiss-Prot. Repeat sequences were estimated to account for 63.9% of the assembly.The R. marina draft genome assembly will be an invaluable resource that can be used to further probe the biology of this invasive species. Future analysis of the genome will provide insights into cane toad evolution and enrich our understanding of their interplay with the ecosystem at large.


September 22, 2019

Evolutionary history of human Plasmodium vivax revealed by genome-wide analyses of related ape parasites.

Wild-living African apes are endemically infected with parasites that are closely related to human Plasmodium vivax, a leading cause of malaria outside Africa. This finding suggests that the origin of P. vivax was in Africa, even though the parasite is now rare in humans there. To elucidate the emergence of human P. vivax and its relationship to the ape parasites, we analyzed genome sequence data of P. vivax strains infecting six chimpanzees and one gorilla from Cameroon, Gabon, and Côte d’Ivoire. We found that ape and human parasites share nearly identical core genomes, differing by only 2% of coding sequences. However, compared with the ape parasites, human strains of P. vivax exhibit about 10-fold less diversity and have a relative excess of nonsynonymous nucleotide polymorphisms, with site-frequency spectra suggesting they are subject to greatly relaxed purifying selection. These data suggest that human P. vivax has undergone an extreme bottleneck, followed by rapid population expansion. Investigating potential host-specificity determinants, we found that ape P. vivax parasites encode intact orthologs of three reticulocyte-binding protein genes (rbp2d, rbp2e, and rbp3), which are pseudogenes in all human P. vivax strains. However, binding studies of recombinant RBP2e and RBP3 proteins to human, chimpanzee, and gorilla erythrocytes revealed no evidence of host-specific barriers to red blood cell invasion. These data suggest that, from an ancient stock of P. vivax parasites capable of infecting both humans and apes, a severely bottlenecked lineage emerged out of Africa and underwent rapid population growth as it spread globally. Copyright © 2018 the Author(s). Published by PNAS.


September 22, 2019

Genomic approaches for studying crop evolution.

Understanding how crop plants evolved from their wild relatives and spread around the world can inform about the origins of agriculture. Here, we review how the rapid development of genomic resources and tools has made it possible to conduct genetic mapping and population genetic studies to unravel the molecular underpinnings of domestication and crop evolution in diverse crop species. We propose three future avenues for the study of crop evolution: establishment of high-quality reference genomes for crops and their wild relatives; genomic characterization of germplasm collections; and the adoption of novel methodologies such as archaeogenetics, epigenomics, and genome editing.


September 22, 2019

Repeated inversions within a pannier intron drive diversification of intraspecific colour patterns of ladybird beetles.

How genetic information is modified to generate phenotypic variation within a species is one of the central questions in evolutionary biology. Here we focus on the striking intraspecific diversity of >200 aposematic elytral (forewing) colour patterns of the multicoloured Asian ladybird beetle, Harmonia axyridis, which is regulated by a tightly linked genetic locus h. Our loss-of-function analyses, genetic association studies, de novo genome assemblies, and gene expression data reveal that the GATA transcription factor gene pannier is the major regulatory gene located at the h locus, and suggest that repeated inversions and cis-regulatory modifications at pannier led to the expansion of colour pattern variation in H. axyridis. Moreover, we show that the colour-patterning function of pannier is conserved in the seven-spotted ladybird beetle, Coccinella septempunctata, suggesting that H. axyridis’ extraordinary intraspecific variation may have arisen from ancient modifications in conserved elytral colour-patterning mechanisms in ladybird beetles.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.