July 7, 2019

Genomic resources and their influence on the detection of the signal of positive selection in genome scans.

Genome scans represent powerful approaches to investigate the action of natural selection on the genetic variation of natural populations and to better understand local adaptation. This is very useful, for example, in the field of conservation biology and evolutionary biology. Thanks to Next Generation Sequencing, genomic resources are growing exponentially, improving genome scan analyses in non-model species. Thousands of SNPs called using Reduced Representation Sequencing are increasingly used in genome scans. Besides, genome sequences are also becoming increasingly available, allowing better processing of short-read data, offering physical localization of variants, and improving haplotype reconstruction and data imputation. Ultimately, genome sequences are also becoming the raw material for selection inferences. Here, we discuss how the increasing availability of such genomic resources, notably genome sequences, influences the detection of signals of selection. Mainly, increasing data density and having the information of physical linkage data expand genome scans by (i) improving the overall quality of the data, (ii) helping the reconstruction of demographic history for the population studied to decrease false-positive rates and (iii) improving the statistical power of methods to detect the signal of selection. Of particular importance, the availability of a high-quality reference genome can improve the detection of the signal of selection by (i) allowing matching the potential candidate loci to linked coding regions under selection, (ii) rapidly moving the investigation to the gene and function and (iii) ensuring that the highly variable regions of the genomes that include functional genes are also investigated. For all those reasons, using reference genomes in genome scan analyses is highly recommended. © 2015 John Wiley & Sons Ltd.


July 7, 2019

Assembly and characterization of the MHC class I region of the Yangtze finless porpoise (Neophocaena asiaeorientalis asiaeorientalis).

The Yangtze finless porpoise (Neophocaena asiaeorientalis asiaeorientalis; YFP) is the sole freshwater subspecies of N. asiaeorientalis and is now critically endangered. Major histocompatibility complex (MHC) is a family of highly polymorphic genes that play an important immunological role in antigen presentation in the vertebrates. Currently, however, little is known about MHC region in the genome of the YFP, which hampers conservation genetics and evolutionary ecology study using MHC genes. In this work, a nucleotide sequence of 774,811 bp covering the YFP MHC class I region was obtained by screening a YFP bacterial artificial chromosome (BAC) library, followed by sequencing and assembly of positive BAC clones. A total of 45 genes were successfully annotated, of which four were MHC class I genes. There are high similarities among the four YFP MHC class I genes (>94 %). Divergence in the coding region of the four YFP MHC class I genes is mainly localized to exons 2 and 3, which encode the antigen-binding sites of MHC class I genes. Additionally, comparison of the MHC structure in YFP to those of cattle, sheep, and pig showed that MHC class I genes are located in genome regions with regard to the conserved genes, and the YFP contains the fewest MHC class I genes among these species. This is the first report characterizing a cetacean MHC class I region and describing its organization, which would be valuable for further investigation of adaptation in natural populations of the YFP and other cetaceans.


July 7, 2019

Analysis of hepatitis C NS5A resistance associated polymorphisms using ultra deep single molecule real time (SMRT) sequencing.

Development of Hepatitis C virus (HCV) resistance against direct-acting antivirals (DAAs), including NS5A inhibitors, is an obstacle to successful treatment of HCV when DAAs are used in sub-optimal combinations. Furthermore, it has been shown that baseline (pre-existing) resistance against DAAs is present in treatment-naïve patients and this will potentially complicate future treatment strategies in different HCV genotypes (GTs). Thus the aim was to detect low levels of NS5A resistance-associated variants (RAVs) in a limited sample set of treatment-naïve patients of HCV GT1a and 3a, since such polymorphisms can display in vitro resistance as high as 60,000-fold. Ultra-deep single molecule real time (SMRT) sequencing with the Pacific Biosciences (PacBio) RSII instrument was used to detect these RAVs. The SMRT sequencing was conducted on ten samples: three of them positive with Sanger sequencing (GT1a Q30H and Y93N, and GT3a Y93H), five GT1a samples, and two GT3a non-positive samples. The same methods were applied to the HCV GT1a H77-plasmid in a dilution series, in order to determine the error rates of replication, which in turn were used to determine the limit of detection (LOD), as defined by mean + 3SD, of minority variants down to 0.24%. We found important baseline NS5A RAVs at levels between 0.24 and 0.5%, which could potentially have clinical relevance. This new method with low level detection of baseline RAVs could be useful in predicting the most cost-efficient combination of DAA treatment, and reduce the treatment duration for an HCV infected individual. Copyright © 2015 Elsevier B.V. All rights reserved.
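
The "mean + 3SD" limit of detection mentioned in the abstract can be illustrated with a minimal Python sketch. The error frequencies below are hypothetical placeholders, not data from the study, and the use of the sample standard deviation is an assumption; the abstract does not say which estimator the authors used.

```python
from statistics import mean, stdev


def limit_of_detection(error_rates):
    """Return mean + 3 * sample standard deviation of background error rates."""
    return mean(error_rates) + 3 * stdev(error_rates)


def is_detectable(variant_frequency, lod):
    """A minority variant is called only if its frequency exceeds the LOD."""
    return variant_frequency > lod


# Hypothetical background error frequencies (as fractions) from a control plasmid.
control_errors = [0.0010, 0.0012, 0.0008, 0.0011, 0.0009]
lod = limit_of_detection(control_errors)

print(f"LOD = {lod:.4%}")
print(is_detectable(0.0024, lod))  # variant observed at 0.24%
```

With real data, the control measurements would come from sequencing a clonal template such as the plasmid dilution series described above, so that any observed variation reflects method error rather than true variants.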


July 7, 2019

Refinement of the canine CD1 locus topology and investigation of antibody binding to recombinant canine CD1 isoforms.

CD1 molecules are antigen-presenting glycoproteins primarily found on dendritic cells (DCs) responsible for lipid antigen presentation to CD1-restricted T cells. Despite their pivotal role in immunity, little is known about CD1 protein expression in dogs, notably due to lack of isoform-specific antibodies. The canine (Canis familiaris) CD1 locus was previously found to contain three functional CD1A genes: canCD1A2, canCD1A6, and canCD1A8, where two variants of canCD1A8, canCD1A8.1 and canCD1A8.2, were assumed to be allelic variants. However, we hypothesized that these rather represented two separate genes. Sequencing of three overlapping bacterial artificial chromosomes (BACs) spanning the entire canine CD1 locus revealed canCD1A8.2 and canCD1A8.1 to be located in tandem between canCD1A7 and canCD1C, and canCD1A8.1 was consequently renamed canCD1A9. Green fluorescent protein (GFP)-fused canine CD1 transcripts were recombinantly expressed in 293T cells. All proteins showed a highly positive GFP expression except for canine CD1d and a splice variant of canine CD1a8 lacking exon 3. Probing with a panel of anti-CD1 monoclonal antibodies (mAbs) showed that Ca13.9H11 and Ca9.AG5 only recognized canine CD1a8 and CD1a9 isoforms, and Fe1.5F4 mAb solely recognized canine CD1a6. Anti-CD1b mAbs recognized the canine CD1b protein, but also bound CD1a2, CD1a8, and CD1a9. Interestingly, Ca9.AG5 showed allele specificity based on a single nucleotide polymorphism (SNP) located at position 321. Our findings have refined the structure of the canine CD1 locus and available antibody specificity against canine CD1 proteins. These are important fundamentals for future investigation of the role of canine CD1 in lipid immunity.


July 7, 2019

Multiple and diverse vsp and vlp sequences in Borrelia miyamotoi, a hard tick-borne zoonotic pathogen.

Based on chromosome sequences, the human pathogen Borrelia miyamotoi phylogenetically clusters with species that cause relapsing fever. But atypically for relapsing fever agents, B. miyamotoi is transmitted not by soft ticks but by hard ticks, which also are vectors of Lyme disease Borrelia species. To further assess the relationships of B. miyamotoi to species that cause relapsing fever, I investigated extrachromosomal sequences of a North American strain with specific attention on plasmid-borne vsp and vlp genes, which are the underpinnings of antigenic variation during relapsing fever. For a hybrid approach to achieve assemblies that spanned more than one of the paralogous vsp and vlp genes, a database of short-reads from next-generation sequencing was supplemented with long-reads obtained with real-time DNA sequencing from single polymerase molecules. This yielded three contigs of 31, 16, and 11 kb, which each contained multiple and diverse sequences that were homologous to vsp and vlp genes of the relapsing fever agent B. hermsii. Two plasmid fragments had coding sequences for plasmid partition proteins that differed from each other and from paralogous proteins for the megaplasmid and a small plasmid of B. miyamotoi. One of 4 vsp genes, vsp1, was present at two loci, one of which was downstream of a candidate prokaryotic promoter. A limited RNA-seq analysis of a population growing in the blood of mice indicated that of the 4 different vsp genes vsp1 was the one that was expressed. The findings indicate that B. miyamotoi has at least four types of plasmids, two or more of which bear vsp and vlp gene sequences that are as numerous and diverse as those of relapsing fever Borrelia. The database and insights from these findings provide a foundation for further investigations of the immune responses to this pathogen and of the capability of B. miyamotoi for antigenic variation.


July 7, 2019

Colistin-Nonsusceptible Pseudomonas aeruginosa Sequence Type 654 with blaNDM-1 Arrives in North America.

This study describes 3 different blaNDM-1 genetic platforms in 3 different species obtained from the same patient who was directly transferred to an institution in Calgary, Alberta, Canada, following a prolonged hospital stay in India. The blaNDM-1 in the Escherichia coli isolate was located on a 176-kb IncA/C plasmid contained within an ISCR1 region. The blaNDM-1 in the Providencia rettgeri isolate was located on a 117-kb IncT plasmid contained within Tn3000, while the blaNDM-1 in the Pseudomonas aeruginosa isolate was located on the chromosome within an ISCR3 region. This report highlights the plasticity of the genetic regions and environments associated with blaNDM-1. To the best of our knowledge, this is the first report of P. aeruginosa with blaNDM-1 identified in North America and the first report of blaOXA-181 in P. rettgeri. The P. aeruginosa isolate belonged to the international high-risk sequence type 654 clone and was nonsusceptible to colistin. This case emphasizes the need for the use of appropriate infection prevention and control measures and vigilant screening for carbapenem-resistant Gram-negative bacteria in patients with a history of travel to areas of endemicity, such as the Indian subcontinent. Copyright © 2016, American Society for Microbiology. All Rights Reserved.


July 7, 2019

Rapid emergence and evolution of Staphylococcus aureus clones harbouring fusC-containing Staphylococcal cassette chromosome elements.

The prevalence of fusidic acid (FA) resistance amongst Staphylococcus aureus in New Zealand (NZ) is amongst the highest reported globally, with a recent study describing a resistance rate of approximately 28%. Three FA-resistant S. aureus clones (ST5 MRSA, ST1 MSSA and ST1 MRSA) have emerged over the past decade and now predominate in NZ, and in all three clones FA resistance is mediated by the fusC gene. In particular, ST5 MRSA has rapidly become the dominant MRSA clone in NZ, although the origin of FA-resistant ST5 MRSA has not been explored, and the genetic context of fusC in FA-resistant NZ isolates is unknown. To better understand the rapid emergence of FA-resistant S. aureus, we used population-based comparative genomics to characterise a collection of FA-resistant and FA-susceptible isolates from NZ. FA-resistant NZ ST5 MRSA displayed minimal genetic diversity, and represented a phylogenetically distinct clade within a global population model of clonal complex 5 (CC5) S. aureus. In all lineages, fusC was invariably located within staphylococcal cassette chromosome (SCC) elements, suggesting that SCC-mediated horizontal transfer is the primary mechanism of fusC dissemination. The genotypic association of fusC with mecA has important implications for the emergence of MRSA clones in populations with high usage of fusidic acid. In addition, we found that fusC was co-located with a recently described virulence factor (tirS) in dominant NZ S. aureus clones, suggesting a potential fitness advantage. This study points to the likely molecular mechanisms responsible for the successful emergence and spread of FA-resistant S. aureus. Copyright © 2016 Baines et al.


July 7, 2019

Genome analysis of three Pneumocystis species reveals adaptation mechanisms to life exclusively in mammalian hosts.

Pneumocystis jirovecii is a major cause of life-threatening pneumonia in immunosuppressed patients including transplant recipients and those with HIV/AIDS, yet surprisingly little is known about the biology of this fungal pathogen. Here we report near complete genome assemblies for three Pneumocystis species that infect humans, rats and mice. Pneumocystis genomes are highly compact relative to other fungi, with substantial reductions of ribosomal RNA genes, transporters, transcription factors and many metabolic pathways, but contain expansions of surface proteins, especially a unique and complex surface glycoprotein superfamily, as well as proteases and RNA processing proteins. Unexpectedly, the key fungal cell wall components chitin and outer chain N-mannans are absent, based on genome content and experimental validation. Our findings suggest that Pneumocystis has developed unique mechanisms of adaptation to life exclusively in mammalian hosts, including dependence on the lungs for gas and nutrients and highly efficient strategies to escape both host innate and acquired immune defenses.


July 7, 2019

Insights into adaptations to a near-obligate nematode endoparasitic lifestyle from the finished genome of Drechmeria coniospora.

Nematophagous fungi employ three distinct predatory strategies: nematode trapping, parasitism of females and eggs, and endoparasitism. While endoparasites play key roles in controlling nematode populations in nature, their application for integrated pest management is hindered by the limited understanding of their biology. We present a comparative analysis of a high quality finished genome assembly of Drechmeria coniospora, a model endoparasitic nematophagous fungus, integrated with a transcriptomic study. Adaptation of D. coniospora to its almost completely obligate endoparasitic lifestyle led to the simplification of many orthologous gene families involved in the saprophytic trophic mode, while maintaining orthologs of most known fungal pathogen-host interaction proteins, stress response circuits and putative effectors of the small secreted protein type. The need to adhere to and penetrate the host cuticle led to a selective radiation of surface proteins and hydrolytic enzymes. Although the endoparasite has a simplified secondary metabolome, it produces a novel peptaibiotic family that shows antibacterial, antifungal and nematicidal activities. Our analyses emphasize the basic malleability of the D. coniospora genome: loss of genes advantageous for the saprophytic lifestyle; modulation of elements that its cohort species utilize for entomopathogenesis; and expansion of protein families necessary for the nematode endoparasitic lifestyle.


July 7, 2019

Global phylogeography and evolutionary history of Shigella dysenteriae type 1

Together with plague, smallpox and typhus, epidemics of dysentery have been a major scourge of human populations for centuries [1]. A previous genomic study concluded that Shigella dysenteriae type 1 (Sd1), the epidemic dysentery bacillus, emerged and spread worldwide after the First World War, with no clear pattern of transmission [2]. This is not consistent with the massive cyclic dysentery epidemics reported in Europe during the eighteenth and nineteenth centuries [1,3,4] and the first isolation of Sd1 in Japan in 1897 [5]. Here, we report a whole-genome analysis of 331 Sd1 isolates from around the world, collected between 1915 and 2011, providing us with unprecedented insight into the historical spread of this pathogen. We show here that Sd1 has existed since at least the eighteenth century and that it swept the globe at the end of the nineteenth century, diversifying into distinct lineages associated with the First World War, Second World War and various conflicts or natural disasters across Africa, Asia and Central America. We also provide a unique historical perspective on the evolution of antibiotic resistance over a 100-year period, beginning decades before the antibiotic era, and identify a prevalent multiple antibiotic-resistant lineage in South Asia that was transmitted in several waves to Africa, where it caused severe outbreaks of disease.


July 7, 2019

Horizontal gene acquisitions, mobile element proliferation, and genome decay in the host-restricted plant pathogen Erwinia tracheiphila.

Modern industrial agriculture depends on high-density cultivation of genetically similar crop plants, creating favorable conditions for the emergence of novel pathogens with increased fitness in managed compared with ecologically intact settings. Here, we present the genome sequence of six strains of the cucurbit bacterial wilt pathogen Erwinia tracheiphila (Enterobacteriaceae) isolated from infected squash plants in New York, Pennsylvania, Kentucky, and Michigan. These genomes exhibit a high proportion of recent horizontal gene acquisitions, invasion and remarkable amplification of mobile genetic elements, and pseudogenization of approximately 20% of the coding sequences. These genome attributes indicate that E. tracheiphila recently emerged as a host-restricted pathogen. Furthermore, chromosomal rearrangements associated with phage and transposable element proliferation contribute to substantial differences in gene content and genetic architecture between the six E. tracheiphila strains and other Erwinia species. Together, these data lead us to hypothesize that E. tracheiphila has undergone recent evolution through both genome decay (pseudogenization) and genome expansion (horizontal gene transfer and mobile element amplification). Despite evidence of dramatic genomic changes, the six strains are genetically monomorphic, suggesting a recent population bottleneck and emergence into E. tracheiphila’s current ecological niche. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Campylobacter fetus subspecies contain conserved type IV secretion systems on multiple genomic islands and plasmids.

The features contributing to differences in pathogenicity of the Campylobacter fetus subspecies are unknown. Putative factors involved in pathogenesis are located in genomic islands that encode a type IV secretion system (T4SS) and fic domain (filamentation induced by cyclic AMP) proteins, which may disrupt host cell processes. In the genomes of 27 C. fetus strains, three phylogenetically different T4SS-encoding regions (T4SSs) were identified: one was located in both the chromosome and in extra-chromosomal plasmids; one was located exclusively in the chromosome; and one exclusively in extra-chromosomal plasmids. We observed that C. fetus strains can contain multiple T4SSs and that homologous T4SSs can be present both in chromosomal genomic islands (GIs) and on plasmids in the C. fetus strains. The GIs of the chromosomally located T4SS differed mainly by the presence of fic genes, insertion sequence elements and phage-related or hypothetical proteins. Comparative analysis showed that T4SS sequences, inserted in the same locations, were conserved in the studied C. fetus genomes. Using phylogenetic analysis of the T4SSs, it was shown that C. fetus may have acquired the T4SS regions from other Campylobacter species by horizontal gene transfer. The identified T4SSs and fic genes were found in Cff and Cfv strains, although the presence of T4SSs and fic genes was significantly associated with Cfv strains. The T4SSs and fic genes could not be associated with S-layer serotypes or geographical origin of the strains.


July 7, 2019

Comparative genomic analyses of the Moraxella catarrhalis serosensitive and seroresistant lineages demonstrate their independent evolution.

The bacterial species Moraxella catarrhalis has been hypothesized as being composed of two distinct lineages (referred to as the seroresistant [SR] and serosensitive [SS]) with separate evolutionary histories based on several molecular typing methods, whereas 16S ribotyping has suggested an additional split within the SS lineage. Previously, we characterized whole-genome sequences of 12 SR-lineage isolates, which revealed a relatively small supragenome when compared with other opportunistic nasopharyngeal pathogens, suggestive of a relatively short evolutionary history. Here, we performed whole-genome sequencing on 18 strains from both ribotypes of the SS lineage, an additional SR strain, as well as four previously identified highly divergent strains based on multilocus sequence typing analyses. All 35 strains were subjected to a battery of comparative genomic analyses which clearly show that there are three lineages: the SR, SS, and the divergent. The SR and SS lineages are closely related, but distinct from each other based on three different methods of comparison: allelic differences observed among core genes; possession of lineage-specific sets of core and distributed genes; and an alignment of concatenated core sequences irrespective of gene annotation. All these methods show that the SS lineage has much longer interstrain branches than the SR lineage, indicating that this lineage has likely been evolving either longer or faster than the SR lineage. There is evidence of extensive horizontal gene transfer (HGT) within both of these lineages, and to a lesser degree between them. In particular, we identified very high rates of HGT between these two lineages for β-lactamase genes. The four divergent strains are sui generis, being much more distantly related to both the SR and SS groups than these other two groups are to each other. Based on average nucleotide identities, gene content, GC content, and genome size, this group could be considered as a separate taxonomic group. The SR and SS lineages, although distinct, clearly form a single species based on multiple criteria including a large common core genome, average nucleotide identity values, GC content, and genome size. Although neither of these lineages arose from within the other based on phylogenetic analyses, the question of how and when these lineages split and then subsequently reunited in the human nasopharynx is explored. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Stability of the encoding plasmids and surface expression of CS6 differs in enterotoxigenic Escherichia coli (ETEC) encoding different heat-stable (ST) enterotoxins (STh and STp).

Enterotoxigenic Escherichia coli (ETEC), one of the most common causes of diarrhea among infants and children in developing countries, causes disease by expression of either or both of the enterotoxins heat-labile (LT) and heat-stable (ST; divided into human-type [STh] and porcine-type [STp] variants), and colonization factors (CFs), among which CS6 is one of the most prevalent. In this study we show that ETEC isolates expressing CS6+STh have higher copy numbers of the cssABCD operon encoding CS6 than those expressing CS6+STp. Long term cultivation of up to ten over-night passages of ETEC isolates harboring CS6+STh (n = 10) or CS6+STp (n = 15) showed instability of phenotypic expression of CS6 in a majority of the CS6+STp isolates, whereas most of the CS6+STh isolates retained CS6 expression. The observed instability was correlated with loss of genes cssA and cssD as examined by PCR. Mobilization of the CS6 plasmid from an unstable CS6+STp isolate into a laboratory E. coli strain resulted in loss of the plasmid after a single over-night passage whereas the plasmid from a CS6+STh strain was retained in the laboratory strain during 10 passages. A sequence comparison between the CS6 plasmids from a stable and an unstable ETEC isolate revealed that genes necessary for plasmid stabilization, for example pemI, pemK, stbA, stbB and parM, were not present in the unstable ETEC isolate. Our results indicate that stable retention of CS6 may in part be affected by the stability of the plasmid on which both CS6 and STp or STh are located.


July 7, 2019

Complete genome sequence of Enterococcus faecium commensal isolate E1002.

The emergence of vancomycin-resistant enterococci (VRE) has been associated with an increase in multidrug-resistant nosocomial infections. Here, we report the 2.614-Mb genome sequence of the Enterococcus faecium commensal isolate E1002, which will be instrumental in further understanding the determinants of the commensal and pathogenic lifestyle of E. faecium. Copyright © 2016 Tytgat et al.

