Haplotype Archives - Page 18 of 49

September 22, 2019

Analysis of the hybrid genomes of two field isolates of the soil-borne fungal species Verticillium longisporum.

Brassica plant species are attacked by a number of pathogens; among them, the ones with a soil-borne lifestyle have become increasingly important. Verticillium stem stripe caused by Verticillium longisporum is one example. This fungal species is thought to be of a hybrid origin, having a genome composed of combinations of lineages denominated A and D. In this study we report the draft genomes of 2 V. longisporum field isolates sequenced using the Illumina technology. Genomic characterization and lineage composition, followed by selected gene analysis to facilitate the comprehension of its genomic features and potential effector categories were performed.The draft genomes of 2 Verticillium longisporum single spore isolates (VL1 and VL2) have an estimated ungapped size of about 70 Mb. The total number of protein encoding genes identified in VL1 was 20,793, whereas 21,072 gene models were predicted in VL2. The predicted genome size, gene contents, including the gene families coding for carbohydrate active enzymes were almost double the numbers found in V. dahliae and V. albo-atrum. Single nucleotide polymorphisms (SNPs) were frequently distributed in the two genomes but the distribution of heterozygosity and depth was not independent. Further analysis of potential parental lineages suggests that the V. longisporum genome is composed of two parts, A1 and D1, where A1 is more ancient than the parental lineage genome D1, the latter being more closer related to V. dahliae. Presence of the mating-type genes MAT1-1-1 and MAT1-2-1 in the V. longisporum genomes were confirmed. However, the MAT genes in V. dahliae, V. albo-atrum and V. longisporum have experienced extensive nucleotide changes at least partly explaining the present asexual nature of these fungal species.The established draft genome of V. longisporum is comparatively large compared to other studied ascomycete fungi. Consequently, high numbers of genes were predicted in the two V. longisporum genomes, among them many secreted proteins and carbohydrate active enzyme (CAZy) encoding genes. The genome is composed of two parts, where one lineage is more ancient than the part being more closely related to V. dahliae. Dissimilar mating-type sequences were identified indicating possible ancient hybridization events.

September 22, 2019

Extreme haplotype variation in the desiccation-tolerant clubmoss Selaginella lepidophylla.

Plant genome size varies by four orders of magnitude, and most of this variation stems from dynamic changes in repetitive DNA content. Here we report the small 109?Mb genome of Selaginella lepidophylla, a clubmoss with extreme desiccation tolerance. Single-molecule sequencing enables accurate haplotype assembly of a single heterozygous S. lepidophylla plant, revealing extensive structural variation. We observe numerous haplotype-specific deletions consisting of largely repetitive and heavily methylated sequences, with enrichment in young Gypsy LTR retrotransposons. Such elements are active but rapidly deleted, suggesting “bloat and purge” to maintain a small genome size. Unlike all other land plant lineages, Selaginella has no evidence of a whole-genome duplication event in its evolutionary history, but instead shows unique tandem gene duplication patterns reflecting adaptation to extreme drying. Gene expression changes during desiccation in S. lepidophylla mirror patterns observed across angiosperm resurrection plants.

September 22, 2019

MUMmer4: A fast and versatile genome alignment system.

The MUMmer system and the genome sequence aligner nucmer included within it are among the most widely used alignment packages in genomics. Since the last major release of MUMmer version 3 in 2004, it has been applied to many types of problems including aligning whole genome sequences, aligning reads to a reference genome, and comparing different assemblies of the same genome. Despite its broad utility, MUMmer3 has limitations that can make it difficult to use for large genomes and for the very large sequence data sets that are common today. In this paper we describe MUMmer4, a substantially improved version of MUMmer that addresses genome size constraints by changing the 32-bit suffix tree data structure at the core of MUMmer to a 48-bit suffix array, and that offers improved speed through parallel processing of input query sequences. With a theoretical limit on the input size of 141Tbp, MUMmer4 can now work with input sequences of any biologically realistic length. We show that as a result of these enhancements, the nucmer program in MUMmer4 is easily able to handle alignments of large genomes; we illustrate this with an alignment of the human and chimpanzee genomes, which allows us to compute that the two species are 98% identical across 96% of their length. With the enhancements described here, MUMmer4 can also be used to efficiently align reads to reference genomes, although it is less sensitive and accurate than the dedicated read aligners. The nucmer aligner in MUMmer4 can now be called from scripting languages such as Perl, Python and Ruby. These improvements make MUMer4 one the most versatile genome alignment packages available.

September 22, 2019

Comparative mapping of the ASTRINGENCY locus controlling fruit astringency in hexaploid persimmon (Diospyros kaki Thunb.) with the diploid D. lotus reference genome

Persimmon (Diospyros kaki) is a tree crop species that originated in East Asia, consists mainly of hexaploid individuals (2n = 6x = 90) with some nonaploid individuals. One of the unique characteristics of persimmon is the continuous accumulation of proanthocyanidins (PAs) in its fruit until the middle of fruit development, resulting in a strong astringent taste even at commercial fruit maturity. Among persimmon cultivars, pollination-constant and non-astringent (PCNA) types cease PA accumulation in early fruit development and become non-astringent at commercial maturity. PCNA is an allelic trait to non-PCNA and is controlled by a single locus called the ASTRINGENCY (AST) locus. Previous segregation analyses indicated that the AST locus shows hexasomic inheritance; a recessive allele, ast, at this locus confers PCNA. Here, we report a shuttle mapping approach to delimit the AST locus region in the hexaploid persimmon genome by using D. lotus, a diploid relative of D. kaki, as a reference. A D. lotus F1 population of 333 individuals and 296 D. kaki siblings segregating for the PCNA trait were used to map the AST region using haplotype-specific markers covering the AST region. This indicated that the AST locus is syntenic to an approximately 915-kb region of the D. lotus genome. In this 915-kb region, we found several candidates for AST that were revealed from the fruit transcriptome of a population segregating for the PCNA trait. These results could provide important clues for the isolation of AST in hexaploid persimmon.

September 22, 2019

Predicting an HLA-DPB1 expression marker based on standard DPB1 genotyping: Linkage analysis of over 32,000 samples.

The risk of acute graft-versus-host disease (GvHD) after hematopoietic stem cell transplantation is increased with donor-recipient HLA-DPB1 allele mismatching. The single-nucleotide polymorphism (SNP) rs9277534 within the 3′ untranslated region (UTR) correlates with HLA-DPB1 allotype expression and serves as a marker for permissive HLA-DPB1 mismatches. Since rs9277534 is not routinely typed, we analyzed 32,681 samples of mostly European ancestry to investigate if the rs9277534 allele can be reliably imputed from standard DPB1 genotyping. We confirmed the previously-defined linkages between rs9277534 and 18 DPB1 alleles and established additional linkages for 46 DPB1 alleles. Based on these linkages, the rs9277534 allele could be predicted for 99.6% of the samples based on DPB1 genotypes (99.99% concordance). We demonstrate that 100% prediction accuracy could be achieved if the prediction utilized exon 3 sequence information. DPB1 genotyping based on exon 2 data alone allows no unambiguous rs9277534 allele prediction but was estimated to maintain 99% accuracy for samples of European descent. We conclude that DPB1 genotyping is sufficient to infer the DPB1 expression marker rs9277534 with high accuracy. This information could be used to select donors with permissive HLA-DPB1 mismatches without directly screening for rs9277534. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.

September 22, 2019

Sooty mangabey genome sequence provides insight into AIDS resistance in a natural SIV host.

In contrast to infections with human immunodeficiency virus (HIV) in humans and simian immunodeficiency virus (SIV) in macaques, SIV infection of a natural host, sooty mangabeys (Cercocebus atys), is non-pathogenic despite high viraemia. Here we sequenced and assembled the genome of a captive sooty mangabey. We conducted genome-wide comparative analyses of transcript assemblies from C. atys and AIDS-susceptible species, such as humans and macaques, to identify candidates for host genetic factors that influence susceptibility. We identified several immune-related genes in the genome of C. atys that show substantial sequence divergence from macaques or humans. One of these sequence divergences, a C-terminal frameshift in the toll-like receptor-4 (TLR4) gene of C. atys, is associated with a blunted in vitro response to TLR-4 ligands. In addition, we found a major structural change in exons 3-4 of the immune-regulatory protein intercellular adhesion molecule 2 (ICAM-2); expression of this variant leads to reduced cell surface expression of ICAM-2. These data provide a resource for comparative genomic studies of HIV and/or SIV pathogenesis and may help to elucidate the mechanisms by which SIV-infected sooty mangabeys avoid AIDS.

September 22, 2019

An ancient integration in a plant NLR is maintained as a trans-species polymorphism

Plant immune receptors are under constant selective pressure to maintain resistance to plant pathogens. Nucleotide-binding leucine-rich repeat (NLR) proteins are one class of cytoplasmic immune receptors whose genes commonly show signatures of adaptive evolution. While it is known that balancing selection contributes to maintaining high intraspecific allelic diversity, the evolutionary mechanism that influences the transmission of alleles during speciation remains unclear. The barley Mla locus has over 30 described alleles conferring isolate-specific resistance to barley powdery mildew and contains three NLR families (RGH1, RGH2, and RGH3). We discovered (using sequence capture and RNAseq) the presence of a novel integrated Exo70 domain in RGH2 in the Mla3 haplotype. Allelic variation across barley accessions includes presence/absence of the integrated domain in RGH2. Expanding our search to several Poaceae species, we found shared interspecific conservation in the RGH2-Exo70 integration. We hypothesise that balancing selection has maintained allelic variation at Mla as a trans-species polymorphism over 24 My, thus contributing to and preserving interspecific allelic diversity during speciation.

September 22, 2019

SimulaTE: simulating complex landscapes of transposable elements of populations.

Motivation Estimating the abundance of transposable elements (TEs) in populations (or tissues) promises to answer many open research questions. However, progress is hampered by the lack of concordance between different approaches for TE identification and thus potentially unreliable results. Results To address this problem, we developed SimulaTE a tool that generates TE landscapes for populations using a newly developed domain specific language (DSL). The simple syntax of our DSL allows for easily building even complex TE landscapes that have, for example, nested, truncated and highly diverged TE insertions. Reads may be simulated for the populations using different sequencing technologies (PacBio, Illumina paired-ends) and strategies (sequencing individuals and pooled populations). The comparison between the expected (i.e. simulated) and the observed results will guide researchers in finding the most suitable approach for a particular research question. Availability and implementation SimulaTE is implemented in Python and available at https://sourceforge.net/projects/simulates/. Manual https://sourceforge.net/p/simulates/wiki/Home/#manual; Test data and tutorials https://sourceforge.net/p/simulates/wiki/Home/#walkthrough; Validation https://sourceforge.net/p/simulates/wiki/Home/#validation. Contact robert.kofler@vetmeduni.ac.at

September 22, 2019

The draft genome assembly of Dermatophagoides pteronyssinus supports identification of novel allergen isoforms in Dermatophagoides species.

Background: Dermatophagoides pteronyssinus (DP) and Dermatophagoides farinae (DF) are highly similar disease-asso- ciated mites with frequently overlapping geographic distributions. A draft genome of DP was assembled to identify the candidate allergens in DP that are homologous to those in DF, investigate allergen isoforms, and facilitate comparisons with related Acari. Methods: PacBio and Illumina whole-genome sequencing was performed on DP. Assembly and reconstruction of the genomes were optimized for isoform identification in a heterogeneous population. Bioinformatic analyses of Acari genomes were performed. Results: The predicted size of the DP nuclear genome is 52.5 Mb. A predicted set of 19,368 proteins was identified, including all 19 currently recognized allergens from this species. Orthologs for 12 allergens established for DF were found. The population of DP mites showed a high level of heterozygosity that allowed the identification of 43 new isoforms for both established and candidate allergens in DP including a new isoform for the major allergen Der p 23. Reanalyzing the previous DF data assuming heterozygosity, 14 new allergen isoforms could be identified. Some new isoforms were observed in both species, suggesting that these isoforms predated speciation. The high quality of both genomes allowed an examination of synteny which showed that many allergen orthologs are physically clustered but with species-specific exon/intron structures. Comparative genomic analyses of other Acariformes mites showed that most of the allergen homologs are widely conserved within this Superorder. Conclusions: Candidate allergens in DP were identified to facilitate future serological studies. While DP and DF are highly similar genetically, species-specific allergen isoforms exist to facilitate molecular differentiation.

September 22, 2019

The sea lamprey germline genome provides insights into programmed genome rearrangement and vertebrate evolution.

The sea lamprey (Petromyzon marinus) serves as a comparative model for reconstructing vertebrate evolution. To enable more informed analyses, we developed a new assembly of the lamprey germline genome that integrates several complementary data sets. Analysis of this highly contiguous (chromosome-scale) assembly shows that both chromosomal and whole-genome duplications have played significant roles in the evolution of ancestral vertebrate and lamprey genomes, including chromosomes that carry the six lamprey HOX clusters. The assembly also contains several hundred genes that are reproducibly eliminated from somatic cells during early development in lamprey. Comparative analyses show that gnathostome (mouse) homologs of these genes are frequently marked by polycomb repressive complexes (PRCs) in embryonic stem cells, suggesting overlaps in the regulatory logic of somatic DNA elimination and bivalent states that are regulated by early embryonic PRCs. This new assembly will enhance diverse studies that are informed by lampreys’ unique biology and evolutionary/comparative perspective.

September 22, 2019

Jointly aligning a group of DNA reads improves accuracy of identifying large deletions.

Performing sequence alignment to identify structural variants, such as large deletions, from genome sequencing data is a fundamental task, but current methods are far from perfect. The current practice is to independently align each DNA read to a reference genome. We show that the propensity of genomic rearrangements to accumulate in repeat-rich regions imposes severe ambiguities in these alignments, and consequently on the variant calls-with current read lengths, this affects more than one third of known large deletions in the C. Venter genome. We present a method to jointly align reads to a genome, whereby alignment ambiguity of one read can be disambiguated by other reads. We show this leads to a significant improvement in the accuracy of identifying large deletions (=20 bases), while imposing minimal computational overhead and maintaining an overall running time that is at par with current tools. A software implementation is available as an open-source Python program called JRA at https://bitbucket.org/jointreadalignment/jra-src.

September 22, 2019

De novo assembly and phasing of dikaryotic genomes from two isolates of Puccinia coronata f. sp. avenae, the causal agent of oat crown rust.

Oat crown rust, caused by the fungus Pucinnia coronata f. sp. avenae, is a devastating disease that impacts worldwide oat production. For much of its life cycle, P. coronata f. sp. avenae is dikaryotic, with two separate haploid nuclei that may vary in virulence genotype, highlighting the importance of understanding haplotype diversity in this species. We generated highly contiguous de novo genome assemblies of two P. coronata f. sp. avenae isolates, 12SD80 and 12NC29, from long-read sequences. In total, we assembled 603 primary contigs for 12SD80, for a total assembly length of 99.16 Mbp, and 777 primary contigs for 12NC29, for a total length of 105.25 Mbp; approximately 52% of each genome was assembled into alternate haplotypes. This revealed structural variation between haplotypes in each isolate equivalent to more than 2% of the genome size, in addition to about 260,000 and 380,000 heterozygous single-nucleotide polymorphisms in 12SD80 and 12NC29, respectively. Transcript-based annotation identified 26,796 and 28,801 coding sequences for isolates 12SD80 and 12NC29, respectively, including about 7,000 allele pairs in haplotype-phased regions. Furthermore, expression profiling revealed clusters of coexpressed secreted effector candidates, and the majority of orthologous effectors between isolates showed conservation of expression patterns. However, a small subset of orthologs showed divergence in expression, which may contribute to differences in virulence between 12SD80 and 12NC29. This study provides the first haplotype-phased reference genome for a dikaryotic rust fungus as a foundation for future studies into virulence mechanisms in P. coronata f. sp. avenaeIMPORTANCE Disease management strategies for oat crown rust are challenged by the rapid evolution of Puccinia coronata f. sp. avenae, which renders resistance genes in oat varieties ineffective. Despite the economic importance of understanding P. coronata f. sp. avenae, resources to study the molecular mechanisms underpinning pathogenicity and the emergence of new virulence traits are lacking. Such limitations are partly due to the obligate biotrophic lifestyle of P. coronata f. sp. avenae as well as the dikaryotic nature of the genome, features that are also shared with other important rust pathogens. This study reports the first release of a haplotype-phased genome assembly for a dikaryotic fungal species and demonstrates the amenability of using emerging technologies to investigate genetic diversity in populations of P. coronata f. sp. avenae. Copyright © 2018 Miller et al.

September 22, 2019

Emergence of an extensively drug-resistant Salmonella enterica serovar Typhi clone harboring a promiscuous plasmid encoding resistance to fluoroquinolones and third-generation cephalosporins.

Antibiotic resistance is a major problem in Salmonella enterica serovar Typhi, the causative agent of typhoid. Multidrug-resistant (MDR) isolates are prevalent in parts of Asia and Africa and are often associated with the dominant H58 haplotype. Reduced susceptibility to fluoroquinolones is also widespread, and sporadic cases of resistance to third-generation cephalosporins or azithromycin have also been reported. Here, we report the first large-scale emergence and spread of a novel S. Typhi clone harboring resistance to three first-line drugs (chloramphenicol, ampicillin, and trimethoprim-sulfamethoxazole) as well as fluoroquinolones and third-generation cephalosporins in Sindh, Pakistan, which we classify as extensively drug resistant (XDR). Over 300 XDR typhoid cases have emerged in Sindh, Pakistan, since November 2016. Additionally, a single case of travel-associated XDR typhoid has recently been identified in the United Kingdom. Whole-genome sequencing of over 80 of the XDR isolates revealed remarkable genetic clonality and sequence conservation, identified a large number of resistance determinants, and showed that these isolates were of haplotype H58. The XDR S. Typhi clone encodes a chromosomally located resistance region and harbors a plasmid encoding additional resistance elements, including the blaCTX-M-15 extended-spectrum ß-lactamase, and carrying the qnrS fluoroquinolone resistance gene. This antibiotic resistance-associated IncY plasmid exhibited high sequence identity to plasmids found in other enteric bacteria isolated from widely distributed geographic locations. This study highlights three concerning problems: the receding antibiotic arsenal for typhoid treatment, the ability of S. Typhi to transform from MDR to XDR in a single step by acquisition of a plasmid, and the ability of XDR clones to spread globally. IMPORTANCE Typhoid fever is a severe disease caused by the Gram-negative bacterium Salmonella enterica serovar Typhi. Antibiotic-resistant S. Typhi strains have become increasingly common. Here, we report the first large-scale emergence and spread of a novel extensively drug-resistant (XDR) S. Typhi clone in Sindh, Pakistan. The XDR S. Typhi is resistant to the majority of drugs available for the treatment of typhoid fever. This study highlights the evolving threat of antibiotic resistance in S. Typhi and the value of antibiotic susceptibility testing and whole-genome sequencing in understanding emerging infectious diseases. We genetically characterized the XDR S. Typhi to investigate the phylogenetic relationship between these isolates and a global collection of S. Typhi isolates and to identify multiple genes linked to antibiotic resistance. This S. Typhi clone harbored a promiscuous antibiotic resistance plasmid previously identified in other enteric bacteria. The increasing antibiotic resistance in S. Typhi observed here adds urgency to the need for typhoid prevention measures.

September 22, 2019

Anisogamy evolved with a reduced sex-determining region in volvocine green algae

Male and female gametes differing in size—anisogamy—emerged independently from isogamous ancestors in various eukaryotic lineages, although genetic bases of this emergence are still unknown. Volvocine green algae are a model lineage for investigating the transition from isogamy to anisogamy. Here we focus on two closely related volvocine genera that bracket this transition—isogamous Yamagishiella and anisogamous Eudorina. We generated de novo nuclear genome assemblies of both sexes of Yamagishiella and Eudorina to identify the dimorphic sex-determining chromosomal region or mating-type locus (MT) from each. In contrast to the large (>1?Mb) and complex MT of oogamous Volvox, Yamagishiella and Eudorina MT are smaller (7–268?kb) and simpler with only two sex-limited genes—the minus/male-limited MID and the plus/female-limited FUS1. No prominently dimorphic gametologs were identified in either species. Thus, the first step to anisogamy in volvocine algae presumably occurred without an increase in MT size and complexity.

September 22, 2019

Conventional and single-molecule targeted sequencing method for specific variant detection in IKBKG while bypassing the IKBKGP1 pseudogene.

In addition to Sanger sequencing, next-generation sequencing of gene panels and exomes has emerged as a standard diagnostic tool in many laboratories. However, these captures can miss regions, have poor efficiency, or capture pseudogenes, which hamper proper diagnoses. One such example is the primary immunodeficiency-associated gene IKBKG. Its pseudogene IKBKGP1 makes traditional capture methods aspecific. We therefore developed a long-range PCR method to efficiently target IKBKG, as well as two associated genes (IRAK4 and MYD88), while bypassing the IKBKGP1 pseudogene. Sequencing accuracy was evaluated using both conventional short-read technology and a newer long-read, single-molecule sequencer. Different mapping and variant calling options were evaluated in their capability to bypass the pseudogene using both sequencing platforms. Based on these evaluations, we determined a robust diagnostic application for unambiguous sequencing and variant calling in IKBKG, IRAK4, and MYD88. This method allows rapid identification of selected primary immunodeficiency diseases in patients suffering from life-threatening invasive pyogenic bacterial infections. Copyright © 2018 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.

Auto Tag: Haplotype

Analysis of the hybrid genomes of two field isolates of the soil-borne fungal species Verticillium longisporum.

Extreme haplotype variation in the desiccation-tolerant clubmoss Selaginella lepidophylla.

MUMmer4: A fast and versatile genome alignment system.

Comparative mapping of the ASTRINGENCY locus controlling fruit astringency in hexaploid persimmon (Diospyros kaki Thunb.) with the diploid D. lotus reference genome

Predicting an HLA-DPB1 expression marker based on standard DPB1 genotyping: Linkage analysis of over 32,000 samples.

Sooty mangabey genome sequence provides insight into AIDS resistance in a natural SIV host.

An ancient integration in a plant NLR is maintained as a trans-species polymorphism

SimulaTE: simulating complex landscapes of transposable elements of populations.

The draft genome assembly of Dermatophagoides pteronyssinus supports identification of novel allergen isoforms in Dermatophagoides species.

The sea lamprey germline genome provides insights into programmed genome rearrangement and vertebrate evolution.

Jointly aligning a group of DNA reads improves accuracy of identifying large deletions.

De novo assembly and phasing of dikaryotic genomes from two isolates of Puccinia coronata f. sp. avenae, the causal agent of oat crown rust.

Emergence of an extensively drug-resistant Salmonella enterica serovar Typhi clone harboring a promiscuous plasmid encoding resistance to fluoroquinolones and third-generation cephalosporins.

Anisogamy evolved with a reduced sex-determining region in volvocine green algae

Conventional and single-molecule targeted sequencing method for specific variant detection in IKBKG while bypassing the IKBKGP1 pseudogene.

Subscribe for blog updates:

Filter by topic

Talk with an expert

Antimicrobial resistance research

Subscribe for blog updates:

Filter by topic

Talk with an expert