High-quality genome Archives - Page 11 of 13

September 22, 2019

Creating a functional single-chromosome yeast.

Eukaryotic genomes are generally organized in multiple chromosomes. Here we have created a functional single-chromosome yeast from a Saccharomyces cerevisiae haploid cell containing sixteen linear chromosomes, by successive end-to-end chromosome fusions and centromere deletions. The fusion of sixteen native linear chromosomes into a single chromosome results in marked changes to the global three-dimensional structure of the chromosome due to the loss of all centromere-associated inter-chromosomal interactions, most telomere-associated inter-chromosomal interactions and 67.4% of intra-chromosomal interactions. However, the single-chromosome and wild-type yeast cells have nearly identical transcriptome and similar phenome profiles. The giant single chromosome can support cell life, although this strain shows reduced growth across environments, competitiveness, gamete production and viability. This synthetic biology study demonstrates an approach to exploration of eukaryote evolution with respect to chromosome structure and function.

September 22, 2019

A miR172 target-deficient AP2-like gene correlates with the double flower phenotype in roses.

One of the well-known floral abnormalities in flowering plants is the double-flower phenotype, which corresponds to flowers that develop extra petals, sometimes even containing entire flowers within flowers. Because of their highly priced ornamental value, spontaneous double-flower variants have been found and selected for in a wide range of ornamental species. Previously, double flower formation in roses was associated with a restriction of AGAMOUS expression domain toward the centre of the meristem, leading to extra petals. Here, we characterized the genomic region containing the mutation associated with the switch from simple to double flowers in the rose. An APETALA2-like gene (RcAP2L), a member of the Target Of EAT-type (TOE-type) subfamily, lies within this interval. In the double flower rose, two alleles of RcAP2L are present, one of which harbours a transposable element inserted into intron 8. This insertion leads to the creation of a miR172 resistant RcAP2L variant. Analyses of the presence of this variant in a set of simple and double flower roses demonstrate a correlation between the presence of this allele and the double flower phenotype. These data suggest a role of this miR172 resistant RcAP2L variant in regulating RcAGAMOUS expression and double flower formation in Rosa sp.

September 22, 2019

Genomic analysis of Sparus aurata reveals the evolutionary dynamics of sex-biased genes in a sequential hermaphrodite fish

Sexual dimorphism is a fascinating subject in evolutionary biology and mostly results from sex-biased expression of genes, which have been shown to evolve faster in gonochoristic species. We report here genome and sex-specific transcriptome sequencing of Sparus aurata, a sequential hermaphrodite fish. Evolutionary comparative analysis reveals that sex-biased genes in S. aurata are similar in number and function, but evolved following strikingly divergent patterns compared with gonochoristic species, showing overall slower rates because of stronger functional constraints. Fast evolution is observed only for highly ovary-biased genes due to female-specific patterns of selection that are related to the peculiar reproduction mode of S. aurata, first maturing as male, then as female. To our knowledge, these findings represent the first genome-wide analysis on sex-biased loci in a hermaphrodite vertebrate species, demonstrating how having two sexes in the same individual profoundly affects the fate of a large set of evolutionarily relevant genes.

September 22, 2019

Groundnut entered post-genome sequencing era: Opportunities and challenges in translating genomic information from genome to field

Cultivated groundnut or peanut (Arachis hypogaea) is an allopolyploid crop with a large complex genome and genetic barrier for exchanging genetic diversity from its wild relatives due to ploidy differences. Optimum genetic and genomic resources are key for accelerating the process for trait mapping and gene discovery and deploying diagnostic markers in genomics-assisted breeding. The better utilization of different aspects of peanut biology such as genetics, genomics, transcriptomics, proteomics, epigenomics, metabolomics, and interactomics can be of great help to groundnut genetic improvement program across the globe. The availability of high-quality reference genome is core to all the “omics” approaches, and hence optimum genomic resources are a must for fully exploiting the potential of modern science into conventional breeding. In this context, groundnut is passing through a very critical and transformational phase by making available the required genetic and genomic resources such as reference genomes of progenitors, resequencing of diverse lines, transcriptome resources, germplasm diversity panel, and multi-parent genetic populations for conducting high-resolution trait mapping, identification of associated markers, and development of diagnostic markers for selected traits. Lastly, the available resources have been deployed in translating genomic information from genome to field by developing improved groundnut lines with enhanced resistance to root-knot nematode, rust, and late leaf spot and high oleic acid. In addition, the International Peanut Genome Initiative (IPGI) have made available the high-quality reference genome for cultivated tetraploid groundnut which will facilitate better utilization of genetic resources in groundnut improvement. In parallel, the development of high-density genotyping platforms, such as Axiom_Arachis array with 58 K SNPs, and constitution of training population will initiate the deployment of the modern breeding approach, genomic selection, for achieving higher genetic gains in less time with more precision.

September 22, 2019

Comparative genomics of Salmonella enterica serovar Montevideo reveals lineage-specific gene differences that may influence ecological niche association.

Salmonella enterica serovar Montevideo has been linked to recent foodborne illness outbreaks resulting from contamination of products such as fruits, vegetables, seeds and spices. Studies have shown that Montevideo also is frequently associated with healthy cattle and can be isolated from ground beef, yet human salmonellosis outbreaks of Montevideo associated with ground beef contamination are rare. This disparity fuelled our interest in characterizing the genomic differences between Montevideo strains isolated from healthy cattle and beef products, and those isolated from human patients and outbreak sources. To that end, we sequenced 13 Montevideo strains to completion, producing high-quality genome assemblies of isolates from human patients (n=8) or from healthy cattle at slaughter (n=5). Comparative analysis of sequence data from this study and publicly available sequences (n=72) shows that Montevideo falls into four previously established clades, differentially occupied by cattle and human strains. The results of these analyses reveal differences in metabolic islands, environmental adhesion determinants and virulence factors within each clade, and suggest explanations for the infrequent association between bovine isolates and human illnesses.

September 22, 2019

The chromosome-level genome assemblies of two rattans (Calamus simplicifolius and Daemonorops jenkinsiana).

Calamus simplicifolius and Daemonorops jenkinsiana are two representative rattans, the most significant material sources for the rattan industry. However, the lack of reference genome sequences is a major obstacle for basic and applied biology on rattan.We produced two chromosome-level genome assemblies of C. simplicifolius and D. jenkinsiana using Illumina, Pacific Biosciences, and Hi-C sequencing data. A total of ~730 Gb and ~682 Gb of raw data covered the predicted genome lengths (~1.98 Gb of C. simplicifolius and ~1.61 Gb of D. jenkinsiana) to ~372 × and ~426 × read depths, respectively. The two de novo genome assemblies, ~1.94 Gb and ~1.58 Gb, were generated with scaffold N50s of ~160 Mb and ~119 Mb in C. simplicifolius and D. jenkinsiana, respectively. The C. simplicifolius and D. jenkinsiana genomes were predicted to harbor ?51,235 and ?53,342 intact protein-coding gene models, respectively. Benchmarking Universal Single-Copy Orthologs evaluation demonstrated that genome completeness reached 96.4% and 91.3% in the C. simplicifolius and D. jenkinsiana genomes, respectively. Genome evolution showed that four Arecaceae plants clustered together, and the divergence time between the two rattans was ~19.3 million years ago. Additionally, we identified 193 and 172 genes involved in the lignin biosynthesis pathway in the C. simplicifolius and D. jenkinsiana genomes, respectively.We present the first de novo assemblies of two rattan genomes (C. simplicifolius and D. jenkinsiana). These data will not only provide a fundamental resource for functional genomics, particularly in promoting germplasm utilization for breeding, but also serve as reference genomes for comparative studies between and among different species.

September 22, 2019

Forward genetics by genome sequencing uncovers the central role of the Aspergillus niger goxB locus in hydrogen peroxide induced glucose oxidase expression.

Aspergillus niger is an industrially important source for gluconic acid and glucose oxidase (GOx), a secreted commercially important flavoprotein which catalyses the oxidation of ß-D-glucose by molecular oxygen to D-glucolactone and hydrogen peroxide. Expression of goxC, the GOx encoding gene and the concomitant two step conversion of glucose to gluconic acid requires oxygen and the presence of significant amounts of glucose in the medium and is optimally induced at pH 5.5. The molecular mechanisms underlying regulation of goxC expression are, however, still enigmatic. Genetic studies aimed at understanding GOx induction have indicated the involvement of at least seven complementation groups, for none of which the molecular basis has been resolved. In this study, a mapping-by-sequencing forward genetics approach was used to uncover the molecular role of the goxB locus in goxC expression. Using the Illumina and PacBio sequencing platforms a hybrid high quality draft genome assembly of laboratory strain N402 was obtained and used as a reference for mapping of genomic reads obtained from the derivative NW103:goxB mutant strain. The goxB locus encodes a thioredoxin reductase. A deletion of the encoding gene in the N402 parent strain led to a high constitutive expression level of the GOx and the lactonase encoding genes required for the two-step conversion of glucose in gluconic acid and of the catR gene encoding catalase R. This high constitutive level of expression was observed to be irrespective of the carbon source and oxidative stress applied. A model clarifying the role of GoxB in the regulation of the expression of goxC involving hydrogen peroxide as second messenger is presented.

September 22, 2019

Comparative genomics of degradative Novosphingobium strains with special reference to the microcystin-degrading Novosphingobium sp. THN1

Bacteria in genus Novosphingobium associated with biodegradation of substrates are prevalent in environments such as lakes, soil, sea, wood and sediments. To better understand the characteristics linked to their wide distribution and metabolic versatility, we report the whole genome sequence of Novosphingobium sp. THN1, a microcystin-degrading strain previously isolated by Jiang et al. (2011) from cyanobacteria-blooming water samples from Lake Taihu, China. We performed a genomic comparison analysis of Novosphingobium sp. THN1 with 21 other degradative Novosphingobium strains downloaded from GenBank. Phylogenetic trees were constructed using 16S rRNA genes, core genes, protein-coding sequences, and average nucleotide identity of whole genomes. Orthologous protein analysis showed that the 22 genomes contained 674 core genes and each strain contained a high proportion of distributed genes that are shared by a subset of strains. Inspection of their genomic plasticity revealed a high number of insertion sequence elements and genomic islands that were distributed on both chromosomes and plasmids. We also compared the predicted functional profiles of the Novosphingobium protein-coding genes. The flexible genes and all protein-coding genes produced the same heatmap clusters. The COG annotations were used to generate a dendrogram correlated with the compounds degraded. Furthermore, the metabolic profiles predicted from KEGG pathways showed that the majority of genes involved in central carbon metabolism, nitrogen, phosphate, sulfate metabolism, energy metabolism and cell mobility (above 62.5%) are located on chromosomes. Whereas, a great many of genes involved in degradation pathways (21–50%) are located on plasmids. The abundance and distribution of aromatics-degradative mono- and dioxygenases varied among 22 Novosphingoibum strains. Comparative analysis of the microcystin-degrading mlr gene cluster provided evidence for horizontal acquisition of this cluster. The Novosphingobium sp. THN1 genome sequence contained all the functional genes crucial for microcystin degradation and the mlr gene cluster shared high sequence similarity (=85%) with the sequences of other microcystin-degrading genera isolated from cyanobacteria-blooming water. Our results indicate that Novosphingobium species have high genomic and functional plasticity, rearranging their genomes according to environment variations and shaping their metabolic profiles by the substrates they are exposed to, to better adapt to their environments.

September 22, 2019

The genome of tapeworm Taenia multiceps sheds light on understanding parasitic mechanism and control of coenurosis disease.

Coenurosis, caused by the larval coenurus of the tapeworm Taenia multiceps, is a fatal central nervous system disease in both sheep and humans. Though treatment and prevention options are available, the control of coenurosis still faces presents great challenges. Here, we present a high-quality genome sequence of T. multiceps in which 240 Mb (96%) of the genome has been successfully assembled using Pacbio single-molecule real-time (SMRT) and Hi-C data with a N50 length of 44.8 Mb. In total, 49.5 Mb (20.6%) repeat sequences and 13, 013 gene models were identified. We found that Taenia spp. have an expansion of transposable elements and recent small-scale gene duplications following the divergence of Taenia from Echinococcus, but not in Echinococcus genomes, and the genes underlying environmental adaptability and dosage effect tend to be over-retained in the T. multiceps genome. Moreover, we identified several genes encoding proteins involved in proglottid formation and interactions with the host central nervous system, which may contribute to the adaption of T. multiceps to its parasitic life style. Our study not only provides insights into the biology and evolution of T. multiceps, but also identifies a set of species-specific gene targets for developing novel treatment and control tools for coenurosis.

September 22, 2019

Convergent evolution of complex genomic rearrangements in two fungal meiotic drive elements.

Meiotic drive is widespread in nature. The conflict it generates is expected to be an important motor for evolutionary change and innovation. In this study, we investigated the genomic consequences of two large multi-gene meiotic drive elements, Sk-2 and Sk-3, found in the filamentous ascomycete Neurospora intermedia. Using long-read sequencing, we generated the first complete and well-annotated genome assemblies of large, highly diverged, non-recombining regions associated with meiotic drive elements. Phylogenetic analysis shows that, even though Sk-2 and Sk-3 are located in the same chromosomal region, they do not form sister clades, suggesting independent origins or at least a long evolutionary separation. We conclude that they have in a convergent manner accumulated similar patterns of tandem inversions and dense repeat clusters, presumably in response to similar needs to create linkage between genes causing drive and resistance.

September 22, 2019

The pathogenic mechanisms of Tilletia horrida as revealed by comparative and functional genomics.

Tilletia horrida is a soil-borne, mononucleate basidiomycete fungus with a biotrophic lifestyle that causes rice kernel smut, a disease that is distributed throughout hybrid rice growing areas worldwide. Here we report on the high-quality genome sequence of T. horrida; it is composed of 23.2?Mb that encode 7,729 predicted genes and 6,973 genes supported by RNA-seq. The genome contains few repetitive elements that account for 8.45% of the total. Evolutionarily, T. horrida lies close to the Ustilago fungi, suggesting grass species as potential hosts, but co-linearity was not observed between T. horrida and the barley smut Ustilago hordei. Genes and functions relevant to pathogenicity were presumed. T. horrida possesses a smaller set of carbohydrate-active enzymes and secondary metabolites, which probably reflect the specific characteristics of its infection and biotrophic lifestyle. Genes that encode secreted proteins and enzymes of secondary metabolism, and genes that are represented in the pathogen-host interaction gene database genes, are highly expressed during early infection; this is consistent with their potential roles in pathogenicity. Furthermore, among the 131 candidate pathogen effectors identified according to their expression patterns and functionality, we validated two that trigger leaf cell death in Nicotiana benthamiana. In summary, we have revealed new molecular mechanisms involved in the evolution, biotrophy, and pathogenesis of T. horrida.

September 22, 2019

The landscape of repetitive elements in the refined genome of chilli anthracnose fungus Colletotrichum truncatum.

The ascomycete fungus Colletotrichum truncatum is a major phytopathogen with a broad host range which causes anthracnose disease of chilli. The genome sequencing of this fungus led to the discovery of functional categories of genes that may play important roles in fungal pathogenicity. However, the presence of gaps in C. truncatum draft assembly prevented the accurate prediction of repetitive elements, which are the key players to determine the genome architecture and drive evolution and host adaptation. We re-sequenced its genome using single-molecule real-time (SMRT) sequencing technology to obtain a refined assembly with lesser and smaller gaps and ambiguities. This enabled us to study its genome architecture by characterising the repetitive sequences like transposable elements (TEs) and simple sequence repeats (SSRs), which constituted 4.9 and 0.38% of the assembled genome, respectively. The comparative analysis among different Colletotrichum species revealed the extensive repeat rich regions, dominated by Gypsy superfamily of long terminal repeats (LTRs), and the differential composition of SSRs in their genomes. Our study revealed a recent burst of LTR amplification in C. truncatum, C. higginsianum, and C. scovillei. TEs in C. truncatum were significantly associated with secretome, effectors and genes in secondary metabolism clusters. Some of the TE families in C. truncatum showed cytosine to thymine transitions indicative of repeat-induced point mutation (RIP). C. orbiculare and C. graminicola showed strong signatures of RIP across their genomes and “two-speed” genomes with extensive AT-rich and gene-sparse regions. Comparative genomic analyses of Colletotrichum species provided an insight into the species-specific SSR profiles. The SSRs in the coding and non-coding regions of the genome revealed the composition of trinucleotide repeat motifs in exons with potential to alter the translated protein structure through amino acid repeats. This is the first genome-wide study of TEs and SSRs in C. truncatum and their comparative analysis with six other Colletotrichum species, which would serve as a useful resource for future research to get insights into the potential role of TEs in genome expansion and evolution of Colletotrichum fungi and for development of SSR-based molecular markers for population genomic studies.

September 22, 2019

How long are long tandem repeats? A challenge for current methods of whole-genome sequence assembly: The case of satellites in Caenorhabditis elegans.

Repetitive genome regions have been difficult to sequence, mainly because of the comparatively small size of the fragments used in assembly. Satellites or tandem repeats are very abundant in nematodes and offer an excellent playground to evaluate different assembly methods. Here, we compare the structure of satellites found in three different assemblies of the Caenorhabditis elegans genome: the original sequence obtained by Sanger sequencing, an assembly based on PacBio technology, and an assembly using Nanopore sequencing reads. In general, satellites were found in equivalent genomic regions, but the new long-read methods (PacBio and Nanopore) tended to result in longer assembled satellites. Important differences exist between the assemblies resulting from the two long-read technologies, such as the sizes of long satellites. Our results also suggest that the lengths of some annotated genes with internal repeats which were assembled using Sanger sequencing are likely to be incorrect.

September 22, 2019

Repeat elements organise 3D genome structure and mediate transcription in the filamentous fungus Epichloë festucae.

Structural features of genomes, including the three-dimensional arrangement of DNA in the nucleus, are increasingly seen as key contributors to the regulation of gene expression. However, studies on how genome structure and nuclear organisation influence transcription have so far been limited to a handful of model species. This narrow focus limits our ability to draw general conclusions about the ways in which three-dimensional structures are encoded, and to integrate information from three-dimensional data to address a broader gamut of biological questions. Here, we generate a complete and gapless genome sequence for the filamentous fungus, Epichloë festucae. We use Hi-C data to examine the three-dimensional organisation of the genome, and RNA-seq data to investigate how Epichloë genome structure contributes to the suite of transcriptional changes needed to maintain symbiotic relationships with the grass host. Our results reveal a genome in which very repeat-rich blocks of DNA with discrete boundaries are interspersed by gene-rich sequences that are almost repeat-free. In contrast to other species reported to date, the three-dimensional structure of the genome is anchored by these repeat blocks, which act to isolate transcription in neighbouring gene-rich regions. Genes that are differentially expressed in planta are enriched near the boundaries of these repeat-rich blocks, suggesting that their three-dimensional orientation partly encodes and regulates the symbiotic relationship formed by this organism.

September 22, 2019

A continuous genome assembly of the corkwing wrasse (Symphodus melops).

The wrasses (Labridae) are one of the most successful and species-rich families of the Perciformes order of teleost fish. Its members display great morphological diversity, and occupy distinct trophic levels in coastal waters and coral reefs. The cleaning behaviour displayed by some wrasses, such as corkwing wrasse (Symphodus melops), is of particular interest for the salmon aquaculture industry to combat and control sea lice infestation as an alternative to chemicals and pharmaceuticals. There are still few genome assemblies available within this fish family for comparative and functional studies, despite the rapid increase in genome resources generated during the past years. Here, we present a highly continuous genome assembly of the corkwing wrasse using PacBio SMRT sequencing (x28.8) followed by error correction with paired-end Illumina data (x132.9). The present genome assembly consists of 5040 contigs (N50?=?461,652?bp) and a total size of 614 Mbp, of which 8.5% of the genome sequence encode known repeated elements. The genome assembly covers 94.21% of highly conserved genes across ray-finned fish species. We find evidence for increased copy numbers specific for corkwing wrasse possibly highlighting diversification and adaptive processes in gene families including N-linked glycosylation (ST8SIA6) and stress response kinases (HIPK1). By comparative analyses, we discover that de novo repeats, often not properly investigated during genome annotation, encode hundreds of immune-related genes. This new genomic resource, together with the ballan wrasse (Labrus bergylta), will allow for in-depth comparative genomics as well as population genetic analyses for the understudied wrasses. Copyright © 2018 Elsevier Inc. All rights reserved.

Auto Tag: High-quality genome

Creating a functional single-chromosome yeast.

A miR172 target-deficient AP2-like gene correlates with the double flower phenotype in roses.

Genomic analysis of Sparus aurata reveals the evolutionary dynamics of sex-biased genes in a sequential hermaphrodite fish

Groundnut entered post-genome sequencing era: Opportunities and challenges in translating genomic information from genome to field

Comparative genomics of Salmonella enterica serovar Montevideo reveals lineage-specific gene differences that may influence ecological niche association.

The chromosome-level genome assemblies of two rattans (Calamus simplicifolius and Daemonorops jenkinsiana).

Forward genetics by genome sequencing uncovers the central role of the Aspergillus niger goxB locus in hydrogen peroxide induced glucose oxidase expression.

Comparative genomics of degradative Novosphingobium strains with special reference to the microcystin-degrading Novosphingobium sp. THN1

The genome of tapeworm Taenia multiceps sheds light on understanding parasitic mechanism and control of coenurosis disease.

Convergent evolution of complex genomic rearrangements in two fungal meiotic drive elements.

The pathogenic mechanisms of Tilletia horrida as revealed by comparative and functional genomics.

The landscape of repetitive elements in the refined genome of chilli anthracnose fungus Colletotrichum truncatum.

How long are long tandem repeats? A challenge for current methods of whole-genome sequence assembly: The case of satellites in Caenorhabditis elegans.

Repeat elements organise 3D genome structure and mediate transcription in the filamentous fungus Epichloë festucae.

A continuous genome assembly of the corkwing wrasse (Symphodus melops).

Subscribe for blog updates:

Filter by topic

Talk with an expert

Antimicrobial resistance research

Subscribe for blog updates:

Filter by topic

Talk with an expert