Menu
September 22, 2019

Improved full-length killer cell immunoglobulin-like receptor transcript discovery in Mauritian cynomolgus macaques.

Killer cell immunoglobulin-like receptors (KIRs) modulate disease progression of pathogens including HIV, malaria, and hepatitis C. Cynomolgus and rhesus macaques are widely used as nonhuman primate models to study human pathogens, and so, considerable effort has been put into characterizing their KIR genetics. However, previous studies have relied on cDNA cloning and Sanger sequencing that lack the throughput of current sequencing platforms. In this study, we present a high throughput, full-length allele discovery method utilizing Pacific Biosciences circular consensus sequencing (CCS). We also describe a new approach to Macaque Exome Sequencing (MES) and the development of the Rhexome1.0, an adapted target capture reagent that includes macaque-specific capture probe sets. By using sequence reads generated by whole genome sequencing (WGS) and MES to inform primer design, we were able to increase the sensitivity of KIR allele discovery. We demonstrate this increased sensitivity by defining nine novel alleles within a cohort of Mauritian cynomolgus macaques (MCM), a geographically isolated population with restricted KIR genetics that was thought to be completely characterized. Finally, we describe an approach to genotyping KIRs directly from sequence reads generated using WGS/MES reads. The findings presented here expand our understanding of KIR genetics in MCM by associating new genes with all eight KIR haplotypes and demonstrating the existence of at least one KIR3DS gene associated with every haplotype.


September 22, 2019

Predominant contribution of cis-regulatory divergence in the evolution of mouse alternative splicing.

Divergence of alternative splicing represents one of the major driving forces to shape phenotypic diversity during evolution. However, the extent to which these divergences could be explained by the evolving cis-regulatory versus trans-acting factors remains unresolved. To globally investigate the relative contributions of the two factors for the first time in mammals, we measured splicing difference between C57BL/6J and SPRET/EiJ mouse strains and allele-specific splicing pattern in their F1 hybrid. Out of 11,818 alternative splicing events expressed in the cultured fibroblast cells, we identified 796 with significant difference between the parental strains. After integrating allele-specific data from F1 hybrid, we demonstrated that these events could be predominately attributed to cis-regulatory variants, including those residing at and beyond canonical splicing sites. Contrary to previous observations in Drosophila, such predominant contribution was consistently observed across different types of alternative splicing. Further analysis of liver tissues from the same mouse strains and reanalysis of published datasets on other strains showed similar trends, implying in general the predominant contribution of cis-regulatory changes in the evolution of mouse alternative splicing. © 2015 The Authors. Published under the terms of the CC BY 4.0 license.


September 22, 2019

Divergent brain gene expression profiles between alternative behavioural helper types in a cooperative breeder.

Juveniles of the cooperatively breeding cichlid fish Neolamprologus pulcher either consistently provide help in form of alloparental egg care (“cleaners”) or consistently abstain from helping (“noncleaners”). These phenotypes are not based on heritable genetic differences. Instead, they arise during ontogeny, which should lead to differences in brain structure or physiology, a currently untested prediction. We compared brain gene expression profiles of cleaners and noncleaners in two experimental conditions, a helping opportunity and a control condition. We aimed to identify (a) expression differences between cleaners and noncleaners in the control, (b) changes in gene expression induced by the opportunity and (c) differences in plasticity of gene expression between cleaners and noncleaners. Control cleaners and noncleaners differed in the expression of a single gene, irx2, which regulates neural differentiation. During the opportunity, cleaners and noncleaners had three upregulated genes in common, which were implicated in neuroplasticity, hormonal signalling and cell proliferation. Thus, the stimulus in the opportunity was sufficiently salient. Cleaners also showed higher expression of seven additional genes that were unique to the opportunity. One of these cleaner-specific genes is implicated in neuropeptide metabolism, indicating that this process is associated with cleaning performance. This suggests that the two types employed different pathways to integrate social information, preparing them for accelerated reaction to future opportunities. Interestingly, three developmental genes were downregulated between the control and the opportunity in cleaners only. Our results indicate that the two behavioural types responded differently to the helping opportunity and that only cleaners responded by downregulating developmental genes.© 2018 John Wiley & Sons Ltd.


September 22, 2019

Single molecule, full-length transcript sequencing provides insight into the extreme metabolism of ruby-throated hummingbird Archilochus colubris

Hummingbirds oxidize ingested nectar sugars directly to fuel foraging but cannot sustain this fuel use during fasting periods, such as during the night or during long-distance migratory flights. Instead, fasting hummingbirds switch to oxidizing stored lipids, derived from ingested sugars. The hummingbird liver plays a key role in moderating energy homeostasis and this remarkable capacity for fuel switching. Additionally, liver is the principle location of de novo lipogenesis, which can occur at exceptionally high rates, such as during premigratory fattening. Yet understanding how this tissue and whole organism moderates energy turnover is hampered by a lack of information regarding how relevant enzymes differ in sequence, expression, and regulation. We generated a de novo transcriptome of the hummingbird liver using PacBio full-length cDNA sequencing (Iso-Seq), yielding a total of 8.6Gb of sequencing data, or 2.6M reads from 4 different size fractions. We analyzed data using the SMRTAnalysis v3.1 Iso-Seq pipeline, then clustered isoforms into gene families to generate de novo gene contigs using Cogent. We performed orthology analysis to identify closely related sequences between our transcriptome and other avian and human gene sets. Finally, we closely examined homology of critical lipid metabolism genes between our transcriptome data and avian and human genomes. We confirmed high levels of sequence divergence within hummingbird lipogenic enzymes, suggesting a high probability of adaptive divergent function in the hepatic lipogenic pathways. Our results leverage cutting-edge technology and a novel bioinformatics pipeline to provide a first direct look at the transcriptome of this incredible organism.


September 22, 2019

Isoform evolution in primates through independent combination of alternative RNA processing events.

Recent RNA-seq technology revealed thousands of splicing events that are under rapid evolution in primates, whereas the reliability of these events, as well as their combination on the isoform level, have not been adequately addressed due to its limited sequencing length. Here, we performed comparative transcriptome analyses in human and rhesus macaque cerebellum using single molecule long-read sequencing (Iso-seq) and matched RNA-seq. Besides 359 million RNA-seq reads, 4,165,527 Iso-seq reads were generated with a mean length of 14,875?bp, covering 11,466 human genes, and 10,159 macaque genes. With Iso-seq data, we substantially expanded the repertoire of alternative RNA processing events in primates, and found that intron retention and alternative polyadenylation are surprisingly more prevalent in primates than previously estimated. We then investigated the combinatorial mode of these alternative events at the whole-transcript level, and found that the combination of these events is largely independent along the transcript, leading to thousands of novel isoforms missed by current annotations. Notably, these novel isoforms are selectively constrained in general, and 1,119 isoforms have even higher expression than the previously annotated major isoforms in human, indicating that the complexity of the human transcriptome is still significantly underestimated. Comparative transcriptome analysis further revealed 502 genes encoding selectively constrained, lineage-specific isoforms in human but not in rhesus macaque, linking them to some lineage-specific functions. Overall, we propose that the independent combination of alternative RNA processing events has contributed to complex isoform evolution in primates, which provides a new foundation for the study of phenotypic difference among primates.© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


September 22, 2019

Genome and secretome analysis of Pochonia chlamydosporia provide new insight into egg-parasitic mechanisms.

Pochonia chlamydosporia infects eggs and females of economically important plant-parasitic nematodes. The fungal isolates parasitizing different nematodes are genetically distinct. To understand their intraspecific genetic differentiation, parasitic mechanisms, and adaptive evolution, we assembled seven putative chromosomes of P. chlamydosporia strain 170 isolated from root-knot nematode eggs (~44?Mb, including 7.19% of transposable elements) and compared them with the genome of the strain 123 (~41?Mb) isolated from cereal cyst nematode. We focus on secretomes of the fungus, which play important roles in pathogenicity and fungus-host/environment interactions, and identified 1,750 secreted proteins, with a high proportion of carboxypeptidases, subtilisins, and chitinases. We analyzed the phylogenies of these genes and predicted new pathogenic molecules. By comparative transcriptome analysis, we found that secreted proteins involved in responses to nutrient stress are mainly comprised of proteases and glycoside hydrolases. Moreover, 32 secreted proteins undergoing positive selection and 71 duplicated gene pairs encoding secreted proteins are identified. Two duplicated pairs encoding secreted glycosyl hydrolases (GH30), which may be related to fungal endophytic process and lost in many insect-pathogenic fungi but exist in nematophagous fungi, are putatively acquired from bacteria by horizontal gene transfer. The results help understanding genetic origins and evolution of parasitism-related genes.


September 22, 2019

Analysis of the hybrid genomes of two field isolates of the soil-borne fungal species Verticillium longisporum.

Brassica plant species are attacked by a number of pathogens; among them, the ones with a soil-borne lifestyle have become increasingly important. Verticillium stem stripe caused by Verticillium longisporum is one example. This fungal species is thought to be of a hybrid origin, having a genome composed of combinations of lineages denominated A and D. In this study we report the draft genomes of 2 V. longisporum field isolates sequenced using the Illumina technology. Genomic characterization and lineage composition, followed by selected gene analysis to facilitate the comprehension of its genomic features and potential effector categories were performed.The draft genomes of 2 Verticillium longisporum single spore isolates (VL1 and VL2) have an estimated ungapped size of about 70 Mb. The total number of protein encoding genes identified in VL1 was 20,793, whereas 21,072 gene models were predicted in VL2. The predicted genome size, gene contents, including the gene families coding for carbohydrate active enzymes were almost double the numbers found in V. dahliae and V. albo-atrum. Single nucleotide polymorphisms (SNPs) were frequently distributed in the two genomes but the distribution of heterozygosity and depth was not independent. Further analysis of potential parental lineages suggests that the V. longisporum genome is composed of two parts, A1 and D1, where A1 is more ancient than the parental lineage genome D1, the latter being more closer related to V. dahliae. Presence of the mating-type genes MAT1-1-1 and MAT1-2-1 in the V. longisporum genomes were confirmed. However, the MAT genes in V. dahliae, V. albo-atrum and V. longisporum have experienced extensive nucleotide changes at least partly explaining the present asexual nature of these fungal species.The established draft genome of V. longisporum is comparatively large compared to other studied ascomycete fungi. Consequently, high numbers of genes were predicted in the two V. longisporum genomes, among them many secreted proteins and carbohydrate active enzyme (CAZy) encoding genes. The genome is composed of two parts, where one lineage is more ancient than the part being more closely related to V. dahliae. Dissimilar mating-type sequences were identified indicating possible ancient hybridization events.


September 22, 2019

Comparative genomics reveals cotton-specific virulence factors in flexible genomic regions in Verticillium dahliae and evidence of horizontal gene transfer from Fusarium.

Verticillium dahliae isolates are most virulent on the host from which they were originally isolated. Mechanisms underlying these dominant host adaptations are currently unknown. We sequenced the genome of V. dahliae Vd991, which is highly virulent on its original host, cotton, and performed comparisons with the reference genomes of JR2 (from tomato) and VdLs.17 (from lettuce). Pathogenicity-related factor prediction, orthology and multigene family classification, transcriptome analyses, phylogenetic analyses, and pathogenicity experiments were performed. The Vd991 genome harbored several exclusive, lineage-specific (LS) genes within LS regions (LSRs). Deletion mutants of the seven genes within one LSR (G-LSR2) in Vd991 were less virulent only on cotton. Integration of G-LSR2 genes individually into JR2 and VdLs.17 resulted in significantly enhanced virulence on cotton but did not affect virulence on tomato or lettuce. Transcription levels of the seven LS genes in Vd991 were higher during the early stages of cotton infection, as compared with other hosts. Phylogenetic analyses suggested that G-LSR2 was acquired from Fusarium oxysporum f. sp. vasinfectum through horizontal gene transfer. Our results provide evidence that horizontal gene transfer from Fusarium to Vd991 contributed significantly to its adaptation to cotton and may represent a significant mechanism in the evolution of an asexual plant pathogen.© 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.


September 22, 2019

Translating genomics into practice for real-time surveillance and response to carbapenemase-producing Enterobacteriaceae: evidence from a complex multi-institutional KPC outbreak.

Until recently, Klebsiella pneumoniae carbapenemase (KPC)-producing Enterobacteriaceae were rarely identified in Australia. Following an increase in the number of incident cases across the state of Victoria, we undertook a real-time combined genomic and epidemiological investigation. The scope of this study included identifying risk factors and routes of transmission, and investigating the utility of genomics to enhance traditional field epidemiology for informing management of established widespread outbreaks.All KPC-producing Enterobacteriaceae isolates referred to the state reference laboratory from 2012 onwards were included. Whole-genome sequencing was performed in parallel with a detailed descriptive epidemiological investigation of each case, using Illumina sequencing on each isolate. This was complemented with PacBio long-read sequencing on selected isolates to establish high-quality reference sequences and interrogate characteristics of KPC-encoding plasmids.Initial investigations indicated that the outbreak was widespread, with 86 KPC-producing Enterobacteriaceae isolates (K. pneumoniae 92%) identified from 35 different locations across metropolitan and rural Victoria between 2012 and 2015. Initial combined analyses of the epidemiological and genomic data resolved the outbreak into distinct nosocomial transmission networks, and identified healthcare facilities at the epicentre of KPC transmission. New cases were assigned to transmission networks in real-time, allowing focussed infection control efforts. PacBio sequencing confirmed a secondary transmission network arising from inter-species plasmid transmission. Insights from Bayesian transmission inference and analyses of within-host diversity informed the development of state-wide public health and infection control guidelines, including interventions such as an intensive approach to screening contacts following new case detection to minimise unrecognised colonisation.A real-time combined epidemiological and genomic investigation proved critical to identifying and defining multiple transmission networks of KPC Enterobacteriaceae, while data from either investigation alone were inconclusive. The investigation was fundamental to informing infection control measures in real-time and the development of state-wide public health guidelines on carbapenemase-producing Enterobacteriaceae surveillance and management.


September 22, 2019

Pangenome analyses of the wheat pathogen Zymoseptoria tritici reveal the structural basis of a highly plastic eukaryotic genome.

Structural variation contributes substantially to polymorphism within species. Chromosomal rearrangements that impact genes can lead to functional variation among individuals and influence the expression of phenotypic traits. Genomes of fungal pathogens show substantial chromosomal polymorphism that can drive virulence evolution on host plants. Assessing the adaptive significance of structural variation is challenging, because most studies rely on inferences based on a single reference genome sequence.We constructed and analyzed the pangenome of Zymoseptoria tritici, a major pathogen of wheat that evolved host specialization by chromosomal rearrangements and gene deletions. We used single-molecule real-time sequencing and high-density genetic maps to assemble multiple genomes. We annotated the gene space based on transcriptomics data that covered the infection life cycle of each strain. Based on a total of five telomere-to-telomere genomes, we constructed a pangenome for the species and identified a core set of 9149 genes. However, an additional 6600 genes were exclusive to a subset of the isolates. The substantial accessory genome encoded on average fewer expressed genes but a larger fraction of the candidate effector genes that may interact with the host during infection. We expanded our analyses of the pangenome to a worldwide collection of 123 isolates of the same species. We confirmed that accessory genes were indeed more likely to show deletion polymorphisms and loss-of-function mutations compared to core genes.The pangenome construction of a highly polymorphic eukaryotic pathogen showed that a single reference genome significantly underestimates the gene space of a species. The substantial accessory genome provides a cradle for adaptive evolution.


September 22, 2019

The draft genome assembly of Dermatophagoides pteronyssinus supports identification of novel allergen isoforms in Dermatophagoides species.

Background: Dermatophagoides pteronyssinus (DP) and Dermatophagoides farinae (DF) are highly similar disease-asso- ciated mites with frequently overlapping geographic distributions. A draft genome of DP was assembled to identify the candidate allergens in DP that are homologous to those in DF, investigate allergen isoforms, and facilitate comparisons with related Acari. Methods: PacBio and Illumina whole-genome sequencing was performed on DP. Assembly and reconstruction of the genomes were optimized for isoform identification in a heterogeneous population. Bioinformatic analyses of Acari genomes were performed. Results: The predicted size of the DP nuclear genome is 52.5 Mb. A predicted set of 19,368 proteins was identified, including all 19 currently recognized allergens from this species. Orthologs for 12 allergens established for DF were found. The population of DP mites showed a high level of heterozygosity that allowed the identification of 43 new isoforms for both established and candidate allergens in DP including a new isoform for the major allergen Der p 23. Reanalyzing the previous DF data assuming heterozygosity, 14 new allergen isoforms could be identified. Some new isoforms were observed in both species, suggesting that these isoforms predated speciation. The high quality of both genomes allowed an examination of synteny which showed that many allergen orthologs are physically clustered but with species-specific exon/intron structures. Comparative genomic analyses of other Acariformes mites showed that most of the allergen homologs are widely conserved within this Superorder. Conclusions: Candidate allergens in DP were identified to facilitate future serological studies. While DP and DF are highly similar genetically, species-specific allergen isoforms exist to facilitate molecular differentiation.


September 22, 2019

Sequence analysis of European maize inbred line F2 provides new insights into molecular and chromosomal characteristics of presence/absence variants.

Maize is well known for its exceptional structural diversity, including copy number variants (CNVs) and presence/absence variants (PAVs), and there is growing evidence for the role of structural variation in maize adaptation. While PAVs have been described in this important crop species, they have been only scarcely characterized at the sequence level and the extent of presence/absence variation and relative chromosomal landscape of inbred-specific regions remain to be elucidated.De novo genome sequencing of the French F2 maize inbred line revealed 10,044 novel genomic regions larger than 1 kb, making up 88 Mb of DNA, that are present in F2 but not in B73 (PAV). This set of maize PAV sequences allowed us to annotate PAV content and to analyze sequence breakpoints. Using PAV genotyping on a collection of 25 temperate lines, we also analyzed Linkage Disequilibrium in PAVs and flanking regions, and PAV frequencies within maize genetic groups.We highlight the possible role of MMEJ-type double strand break repair in maize PAV formation and discover 395 new genes with transcriptional support. Pattern of linkage disequilibrium within PAVs strikingly differs from this of flanking regions and is in accordance with the intuition that PAVs may recombine less than other genomic regions. We show that most PAVs are ancient, while some are found only in European Flint material, thus pinpointing structural features that may be at the origin of adaptive traits involved in the success of this material. Characterization of such PAVs will provide useful material for further association genetic studies in European and temperate maize.


September 22, 2019

Insights on a founder effect: the case of Xylella fastidiosa in the Salento area of Apulia, Italy

Xylella fastidiosa causing disease on different plant species has been reported in several European countries, since 2013. Based on multilocus sequence typing (MLST) results, there is evidence of repeated introductions of the pathogen in Spain and France. In contrast, in the Salento area of Apulia (Puglia) in Southern Italy, the existence of a unique Apulian MLST genotype of X. fastidiosa, causing the olive quick decline syndrome (OQDS; also referred to as “CoDiRO” or “ST53”) was proven, and this was tentatively ascribed to X. fastidiosa subsp. pauca. In order to acquire information on intra population diversity European Food Safety Authority (EFSA) has strongly called for the characterization of X. fastidiosa isolates from Apulia to produce the necessary data to better understand strain diversity and evolution. In this work, for the first time the existence of sub-variants within a set of 14 “ST53” isolates of X. fastidiosa collected from different locations was searched using DNA typing methods targeting the whole pathogen genome. Invariably, VNTR, RAPD and rep-PCR (ERIC and BOX motifs) analyses indicated that all tested isolates possessed the same genomic fingerprint, supporting the existence of predominant epidemiological strain in Apulia. To further explore the degree of clonality within this population, two isolates from two different Salento areas (Taviano and Ugento) were completely sequenced using PacBio SMRT technology. The whole genome map and sequence comparisons revealed that both isolates are nearly identical, showing less than 0.001% nucleotide diversity. However, the complete and circularized Salento-1 and Salento-2 genome sequences were different, in genome and plasmid size, from the reference strain 9a5c of X. fastidiosa subsp. pauca (from citrus), and showed a PCR-proved large genome inversion of about 1.7 Mb. Genome-wide indices ANIm and dDDH indicated that the three isolates of X. fastidiosa from Salento (Apulia, Italy), namely Salento-1, Salento-2, and De Donno, whose complete genome sequence has been recently released, share a very recent common ancestor. This highlights the importance of continuous and extensive monitoring of molecular variation of this invasive pathogen to understand evolution of adaptive traits, and the necessity for adoption of all possible measures to reduce the risk of new introductions that may augment pathogen diversity.


September 22, 2019

De novo assembly and phasing of dikaryotic genomes from two isolates of Puccinia coronata f. sp. avenae, the causal agent of oat crown rust.

Oat crown rust, caused by the fungus Pucinnia coronata f. sp. avenae, is a devastating disease that impacts worldwide oat production. For much of its life cycle, P. coronata f. sp. avenae is dikaryotic, with two separate haploid nuclei that may vary in virulence genotype, highlighting the importance of understanding haplotype diversity in this species. We generated highly contiguous de novo genome assemblies of two P. coronata f. sp. avenae isolates, 12SD80 and 12NC29, from long-read sequences. In total, we assembled 603 primary contigs for 12SD80, for a total assembly length of 99.16 Mbp, and 777 primary contigs for 12NC29, for a total length of 105.25 Mbp; approximately 52% of each genome was assembled into alternate haplotypes. This revealed structural variation between haplotypes in each isolate equivalent to more than 2% of the genome size, in addition to about 260,000 and 380,000 heterozygous single-nucleotide polymorphisms in 12SD80 and 12NC29, respectively. Transcript-based annotation identified 26,796 and 28,801 coding sequences for isolates 12SD80 and 12NC29, respectively, including about 7,000 allele pairs in haplotype-phased regions. Furthermore, expression profiling revealed clusters of coexpressed secreted effector candidates, and the majority of orthologous effectors between isolates showed conservation of expression patterns. However, a small subset of orthologs showed divergence in expression, which may contribute to differences in virulence between 12SD80 and 12NC29. This study provides the first haplotype-phased reference genome for a dikaryotic rust fungus as a foundation for future studies into virulence mechanisms in P. coronata f. sp. avenaeIMPORTANCE Disease management strategies for oat crown rust are challenged by the rapid evolution of Puccinia coronata f. sp. avenae, which renders resistance genes in oat varieties ineffective. Despite the economic importance of understanding P. coronata f. sp. avenae, resources to study the molecular mechanisms underpinning pathogenicity and the emergence of new virulence traits are lacking. Such limitations are partly due to the obligate biotrophic lifestyle of P. coronata f. sp. avenae as well as the dikaryotic nature of the genome, features that are also shared with other important rust pathogens. This study reports the first release of a haplotype-phased genome assembly for a dikaryotic fungal species and demonstrates the amenability of using emerging technologies to investigate genetic diversity in populations of P. coronata f. sp. avenae. Copyright © 2018 Miller et al.


September 22, 2019

Genome analysis of clinical multilocus sequence Type 11 Klebsiella pneumoniae from China.

The increasing prevalence of KPC-producing Klebsiella pneumoniae strains in clinical settings has been largely attributed to dissemination of organisms of specific multilocus sequence types, such as ST258 and ST11. Compared with the ST258 clone, which is prevalent in North America and Europe, ST11 is common in China but information regarding its genetic features remains scarce. In this study, we performed detailed genetic characterization of ST11 K. pneumoniae strains by analyzing whole-genome sequences of 58 clinical strains collected from diverse geographic locations in China. The ST11 genomes were found to be highly heterogeneous and clustered into at least three major lineages based on the patterns of single-nucleotide polymorphisms. Exhibiting five different capsular types, these ST11 strains were found to harbor multiple resistance and virulence determinants such as the blaKPC-2 gene, which encodes carbapenemase, and the yersiniabactin-associated virulence genes irp, ybt and fyu. Moreover, genes encoding the virulence factor aerobactin and the regulator of the mucoid phenotype (rmpA) were detectable in six genomes, whereas genes encoding salmochelin were found in three genomes. In conclusion, our data indicated that carriage of a wide range of resistance and virulence genes constitutes the underlying basis of the high level of prevalence of ST11 in clinical settings. Such findings provide insight into the development of novel strategies for prevention, diagnosis and treatment of K. pneumoniae infections.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.