Menu
September 22, 2019

The chromosome-level quality genome provides insights into the evolution of the biosynthesis genes for aroma compounds of Osmanthus fragrans.

Sweet osmanthus (Osmanthus fragrans) is a very popular ornamental tree species throughout Southeast Asia and USA particularly for its extremely fragrant aroma. We constructed a chromosome-level reference genome of O. fragrans to assist in studies of the evolution, genetic diversity, and molecular mechanism of aroma development. A total of over 118?Gb of polished reads was produced from HiSeq (45.1?Gb) and PacBio Sequel (73.35?Gb), giving 100× depth coverage for long reads. The combination of Illumina-short reads, PacBio-long reads, and Hi-C data produced the final chromosome quality genome of O. fragrans with a genome size of 727?Mb and a heterozygosity of 1.45 %. The genome was annotated using de novo and homology comparison and further refined with transcriptome data. The genome of O. fragrans was predicted to have?45,542 genes, of which 95.68 % were functionally annotated. Genome annotation found 49.35 % as the repetitive sequences, with long terminal repeats (LTR) being the richest (28.94 %). Genome evolution analysis indicated the evidence of whole-genome duplication 15 million years ago, which contributed to the current content of 45,242 genes. Metabolic analysis revealed that linalool, a monoterpene is the main aroma compound. Based on the genome and transcriptome, we further demonstrated the direct connection between terpene synthases (TPSs) and the rich aromatic molecules in O. fragrans. We identified three new flower-specific TPS genes, of which the expression coincided with the production of linalool. Our results suggest that the high number of TPS genes and the flower tissue- and stage-specific TPS genes expressions might drive the strong unique aroma production of O. fragrans.


September 22, 2019

Genome sequence of the potato pathogenic fungus Alternaria solani HWC-168 reveals clues for its conidiation and virulence.

Alternaria solani is a known air-born deuteromycete fungus with a polycyclic life cycle and is the causal agent of early blight that causes significant yield losses of potato worldwide. However, the molecular mechanisms underlying the conidiation and pathogenicity remain largely unknown.We produced a high-quality genome assembly of A. solani HWC-168 that was isolated from a major potato-producing region of Northern China, which facilitated a comprehensive gene annotation, the accurate prediction of genes encoding secreted proteins and identification of conidiation-related genes. The assembled genome of A. solani HWC-168 has a genome size 32.8 Mb and encodes 10,358 predicted genes that are highly similar with related Alternaria species including Alternaria arborescens and Alternaria brassicicola. We identified conidiation-related genes in the genome of A. solani HWC-168 by searching for sporulation-related homologues identified from Aspergillus nidulans. A total of 975 secreted protein-encoding genes, which might act as virulence factors, were identified in the genome of A. solani HWC-168. The predicted secretome of A. solani HWC-168 possesses 261 carbohydrate-active enzymes (CAZy), 119 proteins containing RxLx[EDQ] motif and 27 secreted proteins unique to A. solani.Our findings will facilitate the identification of conidiation- and virulence-related genes in the genome of A. solani. This will permit new insights into understanding the molecular mechanisms underlying the A. solani-potato pathosystem and will add value to the global fungal genome database.


September 22, 2019

An improved genome assembly for Larimichthys crocea reveals hepcidin gene expansion with diversified regulation and function.

Larimichthys crocea (large yellow croaker) is a type of perciform fish well known for its peculiar physiological properties and economic value. Here, we constructed an improved version of the L. crocea genome assembly, which contained 26,100 protein-coding genes. Twenty-four pseudo-chromosomes of L. crocea were also reconstructed, comprising 90% of the genome assembly. This improved assembly revealed several expansions in gene families associated with olfactory detection, detoxification, and innate immunity. Specifically, six hepcidin genes (LcHamps) were identified in L. crocea, possibly resulting from lineage-specific gene duplication. All LcHamps possessed similar genomic structures and functional domains, but varied substantially with respect to expression pattern, transcriptional regulation, and biological function. LcHamp1 was associated specifically with iron metabolism, while LcHamp2s were functionally diverse, involving in antibacterial activity, antiviral activity, and regulation of intracellular iron metabolism. This functional diversity among gene copies may have allowed L. crocea to adapt to diverse environmental conditions.


September 22, 2019

Improved reference genome for the domestic horse increases assembly contiguity and composition.

Recent advances in genomic sequencing technology and computational assembly methods have allowed scientists to improve reference genome assemblies in terms of contiguity and composition. EquCab2, a reference genome for the domestic horse, was released in 2007. Although of equal or better quality compared to other first-generation Sanger assemblies, it had many of the shortcomings common to them. In 2014, the equine genomics research community began a project to improve the reference sequence for the horse, building upon the solid foundation of EquCab2 and incorporating new short-read data, long-read data, and proximity ligation data. Here, we present EquCab3. The count of non-N bases in the incorporated chromosomes is improved from 2.33?Gb in EquCab2 to 2.41?Gb in EquCab3. Contiguity has also been improved nearly 40-fold with a contig N50 of 4.5?Mb and scaffold contiguity enhanced to where all but one of the 32 chromosomes is comprised of a single scaffold.


September 22, 2019

Cryptocurrencies and Zero Mode Wave guides: An unclouded path to a more contiguous Cannabis sativa L. genome assembly

We describe the use ofa Decentralized Autonomous Organization (DAO) to crypto- fund the single molecule sequencing and publication ofa Type ll Cannabis plant. This resulted in the construction of the most contiguous Cannabis genome assembly to date. The combined use of the Dash cryptocurrency, DAOs, and Pacific Biosciences sequencing delivered a 1.03 Gb genome with a N50 of 665Kb in 77 days from funding to public upload. This represents a 230 fold improvement in the contiguity of the first cannabis assemblies in 2011 and a 4 fold improvement over all cannabis assemblies to date. 34Gb ofadditional sequencing pushed the assembly to a N50 of 3.8Mb. Hi-C data from Phase Genomics further scaffolded the assembly to 35 contigs at an N50 of 74Mb but requires additional curation. The genome is partially phased and larger than previously reported (2N : 1.33Gb). The CBCA, THCA and CBDA synthase gene clusters have been phased onto respective contigs demonstrating tandem repeat expansions.


September 22, 2019

A mcr-1-carrying conjugative IncX4 plasmid in colistin-resistant Escherichia coli ST278 strain isolated from dairy cow feces in Shanghai, China.

Enterobacteriaceae, including Escherichia coli, has been shown to acquire the colistin resistance gene mcr-1. A strain of E. coli, EC11, which is resistant to colistin, polymyxin B and trimethoprim-sulfamethoxazole, was isolated in 2016 from the feces of a dairy cow in Shanghai, China. Strain EC11 identifies with sequence type ST278 and is susceptible to 19 frequently used antibiotics. Whole genome sequencing of strain EC11 showed that this strain contains a 31-kb resistance plasmid, pEC11b, which belongs to the IncX4 group. The mcr-1 gene was shown to be inserted into a 2.6-kb mcr-1-pap2 cassette of pEC11b. Plasmid pEC11b also contained putative conjugal transfer components, including an oriT-like region, relaxase, type IV coupling protein, and type IV secretion system. We were successful in transferring pEC11b to E. coli C600 with an average transconjugation efficiency of 4.6 × 10-5. Additionally, a MLST-based analysis comparing EC11 and other reported mcr-positive E. coli populations showed high genotypic diversity. The discovery of the E. coli strain EC11 with resistance to colistin in Shanghai emphasizes the importance of vigilance in detecting new threats like mcr genes to public health. Detection of mcr genes helps in tracking, slowing, and responding to the emergence of antibiotic resistance in Chinese livestock farming.


September 22, 2019

Genomic Tandem Quadruplication is Associated with Ketoconazole Resistance in Malassezia pachydermatis.

Malassezia pachydermatis is a commensal yeast found on the skin of dogs. However, M. pachydermatis is also considered an opportunistic pathogen and is associated with various canine skin diseases including otitis externa and atopic dermatitis, which usually require treatment using an azole antifungal drug, such as ketoconazole. In this study, we isolated a ketoconazole-resistant strain of M. pachydermatis, designated “KCTC 27587,” from the external ear canal of a dog with otitis externa and analyzed its resistance mechanism. To understand the mechanism underlying ketoconazole resistance of the clinical isolate M. pachydermatis KCTC 27587, the whole genome of the yeast was sequenced using the PacBio platform and was compared with M. pachydermatis type strain CBS 1879. We found that a ~84-kb region in chromosome 4 of M. pachydermatis KCTC 27587 was tandemly quadruplicated. The quadruplicated region contains 52 protein coding genes, including the homologs of ERG4 and ERG11, whose overexpression is known to be associated with azole resistance. Our data suggest that the quadruplication of the ~84-kb region may be the cause of the ketoconazole resistance in M. pachydermatis KCTC 27587.


September 22, 2019

Genome sequences of two diploid wild relatives of cultivated sweetpotato reveal targets for genetic improvement

Sweetpotato [Ipomoea batatas (L.) Lam.] is a globally important staple food crop, especially for sub-Saharan Africa. Agronomic improvement of sweetpotato has lagged behind other major food crops due to a lack of genomic and genetic resources and inherent challenges in breeding a heterozygous, clonally propagated polyploid. Here, we report the genome sequences of its two diploid relatives, I. trifida and I. triloba, and show that these high-quality genome assemblies are robust references for hexaploid sweetpotato. Comparative and phylogenetic analyses reveal insights into the ancient whole-genome triplication history of Ipomoea and evolutionary relationships within the Batatas complex. Using resequencing data from 16 genotypes widely used in African breeding programs, genes and alleles associated with carotenoid biosynthesis in storage roots are identified, which may enable efficient breeding of varieties with high provitamin A content. These resources will facilitate genome-enabled breeding in this important food security crop.


September 22, 2019

Emergence of pathogenic and multiple-antibiotic-resistant Macrococcus caseolyticus in commercial broiler chickens.

Macrococcus caseolyticus is generally considered to be a non-pathogenic bacterium that does not cause human or animal diseases. However, recently, a strain of M. caseolyticus (SDLY strain) that causes high mortality rates was isolated from commercial broiler chickens in China. The main pathological changes caused by SDLY included caseous exudation in cranial cavities, inflammatory infiltration, haemorrhages and multifocal necrosis in various organs. The whole genome of the SDLY strain was sequenced and was compared with that of the non-pathogenic JCSC5402 strain of M. caseolyticus. The results showed that the SDLY strain harboured a large quantity of mutations, antibiotic resistance genes and numerous insertions and deletions of virulence genes. In particular, among the inserted genes, there is a cluster of eight connected genes associated with the synthesis of capsular polysaccharide. This cluster encodes a transferase and capsular polysaccharide synthase, promotes the formation of capsules and causes changes in pathogenicity. Electron microscopy revealed a distinct capsule surrounding the SDLY strain. The pathogenicity test showed that the SDLY strain could cause significant clinical symptoms and pathological changes in both SPF chickens and mice. In addition, these clinical symptoms and pathological changes were the same as those observed in field cases. Furthermore, the anti-microbial susceptibility test demonstrated that the SDLY strain exhibits multiple-antibiotic resistance. The emergence of pathogenic M. caseolyticus indicates that more attention should be paid to the effects of this micro-organism on both poultry and public health.© 2018 Blackwell Verlag GmbH.


September 22, 2019

Comparative genomic analysis revealed rapid differentiation in the pathogenicity-related gene repertoires between Pyricularia oryzae and Pyricularia penniseti isolated from a Pennisetum grass.

A number of Pyricularia species are known to infect different grass species. In the case of Pyricularia oryzae (syn. Magnaporthe oryzae), distinct populations are known to be adapted to a wide variety of grass hosts, including rice, wheat and many other grasses. The genome sizes of Pyricularia species are typical for filamentous ascomycete fungi [~?40 Mbp for P. oryzae, and ~?45 Mbp for P. grisea]. Genome plasticity, mediated in part by deletions promoted by recombination between repetitive elements [Genome Res 26:1091-1100, 2016, Nat Rev Microbiol 10:417-430,2012] and transposable elements [Annu Rev Phytopathol 55:483-503,2017] contributes to host adaptation. Therefore, comparisons of genome structure of individual species will provide insight into the evolution of host specificity. However, except for the P. oryzae subgroup, little is known about the gene content or genome organization of other Pyricularia species, such as those infecting Pennisetum grasses.Here, we report the genome sequence of P. penniseti strain P1609 isolated from a Pennisetum grass (JUJUNCAO) using PacBio SMRT sequencing technology. Phylogenomic analysis of 28 Magnaporthales species and 5 non-Magnaporthales species indicated that P1609 belongs to a Pyricularia subclade, which is genetically distant from P. oryzae. Comparative genomic analysis revealed that the pathogenicity-related gene repertoires had diverged between P1609 and the P. oryzae strain 70-15, including the known avirulence genes, other putative secreted proteins, as well as some other predicted Pathogen-Host Interaction (PHI) genes. Genomic sequence comparison also identified many genomic rearrangements relative to P. oryzae.Our results suggested that the genomic sequence of the P. penniseti P1609 could be a useful resource for the genetic study of the Pennisetum-infecting Pyricularia species and provide new insight into evolution of pathogen genomes during host adaptation.


September 22, 2019

Insights into the biology of acidophilic members of the Acidiferrobacteraceae family derived from comparative genomic analyses.

The family Acidiferrobacteraceae (order Acidiferrobacterales) currently contains Gram negative, neutrophilic sulfur oxidizers such as Sulfuricaulis and Sulfurifustis, as well as acidophilic iron and sulfur oxidizers belonging to the Acidiferrobacter genus. The diversity and taxonomy of the genus Acidiferrobacter has remained poorly explored. Although several metagenome and bioleaching studies have identified its presence worldwide, only two strains, namely Acidiferrobacter thiooxydans DSM 2932T, and Acidiferrobacter spp. SP3/III have been isolated and made publically available. Using 16S rRNA sequence data publically available for the Acidiferrobacteraceae, we herein shed light into the molecular taxonomy of this family. Results obtained support the presence of three clades Acidiferrobacter, Sulfuricaulis and Sulfurifustis. Genomic analyses of the genome sequences of A. thiooxydansT and Acidiferrobacter spp. SP3/III indicate that ANI relatedness between the SPIII/3 strain and A. thiooxydansT is below 95-96%, supporting the classification of strain SP3/III as a new species within this genus. In addition, approximately 70% of Acidiferrobacter sp. SPIII/3 predicted genes have a conserved ortholog in A. thiooxydans strains. A comparative analysis of iron, sulfur oxidation pathways, genome plasticity and cell-cell communication mechanisms of Acidiferrobacter spp. are also discussed. Copyright © 2018 The Authors. Published by Elsevier Masson SAS.. All rights reserved.


September 22, 2019

Achieving Accurate Sequence and Annotation Data for Caulobacter vibrioides CB13.

Annotated sequence data are instrumental in nearly all realms of biology. However, the advent of next-generation sequencing has rapidly facilitated an imbalance between accurate sequence data and accurate annotation data. To increase the annotation accuracy of the Caulobacter vibrioides CB13b1a (CB13) genome, we compared the PGAP and RAST annotations of the CB13 genome. A total of 64 unique genes were identified in the PGAP annotation that were either completely or partially absent in the RAST annotation, and a total of 16 genes were identified in the RAST annotation that were not included in the PGAP annotation. Moreover, PGAP identified 73 frameshifted genes and 22 genes with an internal stop. In contrast, RAST annotated the larger segment of these frameshifted genes without indicating a change in reading frame may have occurred. The RAST annotation did not include any genes with internal stop codons, since it chose start codons that were after the internal stop. To confirm the discrepancies between the two annotations and verify the accuracy of the CB13 genome sequence data, we re-sequenced and re-annotated the entire genome and obtained an identical sequence, except in a small number of homopolymer regions. A genome sequence comparison between the two versions allowed us to determine the correct number of bases in each homopolymer region, which eliminated frameshifts for 31 genes annotated as frameshifted genes and removed 24 pseudogenes from the PGAP annotation. Both annotation systems correctly identified genes that were missed by the other system. In addition, PGAP identified conserved gene fragments that represented the beginning of genes, but it employed no corrective method to adjust the reading frame of frameshifted genes or the start sites of genes harboring an internal stop codon. In doing so, the PGAP annotation identified a large number of pseudogenes, which may reflect evolutionary history but likely do not produce gene products. These results demonstrate that re-sequencing and annotation comparisons can be used to increase the accuracy of genomic data and the corresponding gene annotation.


September 22, 2019

Genome-scale analysis of Acetobacterium bakii reveals the cold adaptation of psychrotolerant acetogens by post-transcriptional regulation.

Acetogens synthesize acetyl-CoA via CO2 or CO fixation, producing organic compounds. Despite their ecological and industrial importance, their transcriptional and post-transcriptional regulation has not been systematically studied. With completion of the genome sequence of Acetobacterium bakii (4.28-Mb), we measured changes in the transcriptome of this psychrotolerant acetogen in response to temperature variations under autotrophic and heterotrophic growth conditions. Unexpectedly, acetogenesis genes were highly up-regulated at low temperatures under heterotrophic, as well as autotrophic, growth conditions. To mechanistically understand the transcriptional regulation of acetogenesis genes via changes in RNA secondary structures of 5′-untranslated regions (5′-UTR), the primary transcriptome was experimentally determined, and 1379 transcription start sites (TSS) and 1100 5′-UTR were found. Interestingly, acetogenesis genes contained longer 5′-UTR with lower RNA-folding free energy than other genes, revealing that the 5′-UTRs control the RNA abundance of the acetogenesis genes under low temperature conditions. Our findings suggest that post-transcriptional regulation via RNA conformational changes of 5′-UTRs is necessary for cold-adaptive acetogenesis.© 2018 Shin et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.


September 22, 2019

Desiccation Tolerance Evolved through Gene Duplication and Network Rewiring in Lindernia.

Although several resurrection plant genomes have been sequenced, the lack of suitable dehydration-sensitive outgroups has limited genomic insights into the origin of desiccation tolerance. Here, we utilized a comparative system of closely related desiccation-tolerant (Lindernia brevidens) and -sensitive (Lindernia subracemosa) species to identify gene- and pathway-level changes associated with the evolution of desiccation tolerance. The two high-quality Lindernia genomes we assembled are largely collinear, and over 90% of genes are conserved. L. brevidens and L. subracemosa have evidence of an ancient, shared whole-genome duplication event, and retained genes have neofunctionalized, with desiccation-specific expression in L. brevidens Tandem gene duplicates also are enriched in desiccation-associated functions, including a dramatic expansion of early light-induced proteins from 4 to 26 copies in L. brevidens A comparative differential gene coexpression analysis between L. brevidens and L. subracemosa supports extensive network rewiring across early dehydration, desiccation, and rehydration time courses. Many LATE EMBRYOGENESIS ABUNDANT genes show significantly higher expression in L. brevidens compared with their orthologs in L. subracemosa Coexpression modules uniquely upregulated during desiccation in L. brevidens are enriched with seed-specific and abscisic acid-associated cis-regulatory elements. These modules contain a wide array of seed-associated genes that have no expression in the desiccation-sensitive L. subracemosa Together, these findings suggest that desiccation tolerance evolved through a combination of gene duplications and network-level rewiring of existing seed desiccation pathways.© 2018 American Society of Plant Biologists. All rights reserved.


September 22, 2019

The genome of the tegu lizard Salvator merianae: combining Illumina, PacBio, and optical mapping data to generate a highly contiguous assembly.

Reptiles are a species-rich group with great phenotypic and life history diversity but are highly underrepresented among the vertebrate species with sequenced genomes.Here, we report a high-quality genome assembly of the tegu lizard, Salvator merianae, the first lacertoid with a sequenced genome. We combined 74X Illumina short-read, 29.8X Pacific Biosciences long-read, and optical mapping data to generate a high-quality assembly with a scaffold N50 value of 55.4 Mb. The contig N50 value of this assembly is 521 Kb, making it the most contiguous reptile assembly so far. We show that the tegu assembly has the highest completeness of coding genes and conserved non-exonic elements (CNEs) compared to other reptiles. Furthermore, the tegu assembly has the highest number of evolutionarily conserved CNE pairs, corroborating a high assembly contiguity in intergenic regions. As in other reptiles, long interspersed nuclear elements comprise the most abundant transposon class. We used transcriptomic data, homology- and de novo gene predictions to annotate 22,413 coding genes, of which 16,995 (76%) likely have human orthologs as inferred by CESAR-derived gene mappings. Finally, we generated a multiple genome alignment comprising 10 squamates and 7 other amniote species and identified conserved regions that are under evolutionary constraint. CNEs cover 38 Mb (1.8%) of the tegu genome, with 3.3 Mb in these elements being squamate specific. In contrast to placental mammal-specific CNEs, very few of these squamate-specific CNEs (<20 Kb) overlap transposons, highlighting a difference in how lineage-specific CNEs originated in these two clades.The tegu lizard genome together with the multiple genome alignment and comprehensive conserved element datasets provide a valuable resource for comparative genomic studies of reptiles and other amniotes.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.