Menu
April 21, 2020

ICESsuHN105, a Novel Multiple Antibiotic Resistant ICE in Streptococcus suis Serotype 5 Strain HN105.

Streptococcussuis serotype 5, an emerging zoonosis bacterial pathogen, has been isolated from infections in both pigs and humans. In this study, we sequenced the first complete genome of a virulent, multidrug-resistant SS5 strain HN105. The strain HN105 displayed enhanced pathogenicity in zebrafish and BABL/c mouse infection models. Comparative genome analysis identified a novel 80K integrative conjugative element (ICE), ICESsuHN105, as required for the multidrug resistance phenotype. Six corresponding antibiotic resistance genes in this ICE were identified, namely tet (O), tet (M), erm (two copies), aph, and spc. Phylogenetic analysis classified the element as a homolog of the ICESa2603 family, containing the typical family backbone and insertion DNA. DNA hybrids mediated by natural transformation between HN105 and ZY05719 verified the antibiotic resistant genes of ICESsuHN105 that could be transferred successfully, while they were dispersedly inserted with a single gene in different genomic locations of ZY05719(HN105) transformants. To further identify the horizontal transfer of ICESsuHN105 as a whole mobile genetic element, a circular intermediate form of ICESsuHN105 was detected by PCR. However, the effective conjugation using serotype 2 S. suis as recipients was not observed in current assays in vitro. Further studies confirmed the presence of the complete lantibiotic locus encoded in ICESsuHN105 that effectively inhibits the growth of other streptococci. In summary, this study demonstrated the presence of antibiotic resistance genes in ICE that are able to transfer between different clinical isolates and adapt to a broader range of Streptococcus serotype or species.


April 21, 2020

Comparative analysis of the chicken IFITM locus by targeted genome sequencing reveals evolution of the locus and positive selection in IFITM1 and IFITM3.

The interferon-induced transmembrane (IFITM) protein family comprises a class of restriction factors widely characterised in humans for their potent antiviral activity. Their biological activity is well documented in several animal species, but their genetic variation and biological mechanism is less well understood, particularly in avian species.Here we report the complete sequence of the domestic chicken Gallus gallus IFITM locus from a wide variety of chicken breeds to examine the detailed pattern of genetic variation of the locus on chromosome 5, including the flanking genes ATHL1 and B4GALNT4. We have generated chIFITM sequences from commercial breeds (supermarket-derived chicken breasts), indigenous chickens from Nigeria (Nsukka) and Ethiopia, European breeds and inbred chicken lines from the Pirbright Institute, totalling of 206 chickens. Through mapping of genetic variants to the latest chIFITM consensus sequence our data reveal that the chIFITM locus does not show structural variation in the locus across the populations analysed, despite spanning diverse breeds from different geographic locations. However, single nucleotide variants (SNVs) in functionally important regions of the proteins within certain groups of chickens were detected, in particular the European breeds and indigenous birds from Ethiopia and Nigeria. In addition, we also found that two out of four SNVs located in the chIFITM1 (Ser36 and Arg77) and chIFITM3 (Val103) proteins were simultaneously under positive selection.Together these data suggest that IFITM genetic variation may contribute to the capacities of different chicken populations to resist virus infection.


April 21, 2020

Hybrid-Transcriptome Sequencing and Associated Metabolite Analysis Reveal Putative Genes Involved in Flower Color Difference in Rose Mutants.

Gene mutation is a common phenomenon in nature that often leads to phenotype differences, such as the variations in flower color that frequently occur in roses. With the aim of revealing the genomic information and inner mechanisms, the differences in the levels of both transcription and secondary metabolism between a pair of natural rose mutants were investigated by using hybrid RNA-sequencing and metabolite analysis. Metabolite analysis showed that glycosylated derivatives of pelargonidin, e.g., pelargonidin 3,5 diglucoside and pelargonidin 3-glucoside, which were not detected in white flowers (Rosa ‘Whilte Mrago Koster’), constituted the major pigments in pink flowers. Conversely, the flavonol contents of petal, such as kaempferol-3-glucoside, quercetin 3-glucoside, and rutin, were higher in white flowers. Hybrid RNA-sequencing obtained a total of 107,280 full-length transcripts in rose petal which were annotated in major databases. Differentially expressed gene (DEG) analysis showed that the expression of genes involved in the flavonoid biosynthesis pathway was significantly different, e.g., CHS, FLS, DFR, LDOX, which was verified by qRT-PCR during flowering. Additionally, two MYB transcription factors were found and named RmMYBAN2 and RmMYBPA1, and their expression patterns during flowering were also analyzed. These findings indicate that these genes may be involved in the flower color difference in the rose mutants, and competition between anthocyanin and flavonol biosynthesis is a primary cause of flower color variation, with its regulation reflected by transcriptional and secondary metabolite levels.


April 21, 2020

Full-length transcriptome sequencing and methyl jasmonate-induced expression profile analysis of genes related to patchoulol biosynthesis and regulation in Pogostemon cablin.

Pogostemon cablin (Blanco) Benth. (Patchouli) is an important aromatic and medicinal plant and widely used in traditional Chinese medicine as well as in the perfume industry. Patchoulol is the primary bioactive component in P. cablin, its biosynthesis has attracted widespread interests. Previous studies have surveyed the putative genes involved in patchoulol biosynthesis using next-generation sequencing method; however, technical limitations generated by short-read sequencing restrict the yield of full-length genes. Additionally, little is known about the expression pattern of genes especially patchoulol biosynthesis related genes in response to methyl jasmonate (MeJA). Our understanding of patchoulol biosynthetic pathway still remained largely incomplete to date.In this study, we analyzed the morphological character and volatile chemical compounds of P. cablin cv. ‘Zhanxiang’, and 39 volatile chemical components were detected in the patchouli leaf using GC-MS, most of which were sesquiterpenes. Furthermore, high-quality RNA isolated from leaves and stems of P. cablin were used to generate the first full-length transcriptome of P. cablin using PacBio isoform sequencing (Iso-Seq). In total, 9.7 Gb clean data and 82,335 full-length UniTransModels were captured. 102 transcripts were annotated as 16 encoding enzymes involved in patchouli alcohol biosynthesis. Accorded with the uptrend of patchoulol content, the vast majority of genes related to the patchoulol biosynthesis were up-regulated after MeJA treatment, indicating that MeJA led to an increasing synthesis of patchoulol through activating the expression level of genes involved in biosynthesis pathway of patchoulol. Moreover, expression pattern analysis also revealed that transcription factors participated in JA regulation of patchoulol biosynthesis were differentially expressed.The current study comprehensively reported the morphological specificity, volatile chemical compositions and transcriptome characterization of the Chinese-cultivated P. cablin cv. ‘Zhanxiang’, these results contribute to our better understanding of the physiological and molecular features of patchouli, especially the molecular mechanism of biosynthesis of patchoulol. Our full-length transcriptome data also provides a valuable genetic resource for further studies in patchouli.


April 21, 2020

Chromosome-level assembly of the water buffalo genome surpasses human and goat genomes in sequence contiguity.

Rapid innovation in sequencing technologies and improvement in assembly algorithms have enabled the creation of highly contiguous mammalian genomes. Here we report a chromosome-level assembly of the water buffalo (Bubalus bubalis) genome using single-molecule sequencing and chromatin conformation capture data. PacBio Sequel reads, with a mean length of 11.5?kb, helped to resolve repetitive elements and generate sequence contiguity. All five B. bubalis sub-metacentric chromosomes were correctly scaffolded with centromeres spanned. Although the index animal was partly inbred, 58% of the genome was haplotype-phased by FALCON-Unzip. This new reference genome improves the contig N50 of the previous short-read based buffalo assembly more than a thousand-fold and contains only 383 gaps. It surpasses the human and goat references in sequence contiguity and facilitates the annotation of hard to assemble gene clusters such as the major histocompatibility complex (MHC).


April 21, 2020

Analysis of Transcriptome and Epitranscriptome in Plants Using PacBio Iso-Seq and Nanopore-Based Direct RNA Sequencing.

Nanopore sequencing from Oxford Nanopore Technologies (ONT) and Pacific BioSciences (PacBio) single-molecule real-time (SMRT) long-read isoform sequencing (Iso-Seq) are revolutionizing the way transcriptomes are analyzed. These methods offer many advantages over most widely used high-throughput short-read RNA sequencing (RNA-Seq) approaches and allow a comprehensive analysis of transcriptomes in identifying full-length splice isoforms and several other post-transcriptional events. In addition, direct RNA-Seq provides valuable information about RNA modifications, which are lost during the PCR amplification step in other methods. Here, we present a comprehensive summary of important applications of these technologies in plants, including identification of complex alternative splicing (AS), full-length splice variants, fusion transcripts, and alternative polyadenylation (APA) events. Furthermore, we discuss the impact of the newly developed nanopore direct RNA-Seq in advancing epitranscriptome research in plants. Additionally, we summarize computational tools for identifying and quantifying full-length isoforms and other co/post-transcriptional events and discussed some of the limitations with these methods. Sequencing of transcriptomes using these new single-molecule long-read methods will unravel many aspects of transcriptome complexity in unprecedented ways as compared to previous short-read sequencing approaches. Analysis of plant transcriptomes with these new powerful methods that require minimum sample processing is likely to become the norm and is expected to uncover novel co/post-transcriptional gene regulatory mechanisms that control biological outcomes during plant development and in response to various stresses.


April 21, 2020

Genome assembly of Nannochloropsis oceanica provides evidence of host nucleus overthrow by the symbiont nucleus during speciation.

The species of the genus Nannochloropsis are unique in their maintenance of a nucleus-plastid continuum throughout their cell cycle, non-motility and asexual reproduction. These characteristics should have been endorsed in their gene assemblages (genomes). Here we show that N. oceanica has a genome of 29.3?Mb consisting of 32 pseudochromosomes and containing 7,330 protein-coding genes; and the host nucleus may have been overthrown by an ancient red alga symbiont nucleus during speciation through secondary endosymbiosis. In addition, N. oceanica has lost its flagella and abilities to undergo meiosis and sexual reproduction, and adopted a genome reduction strategy during speciation. We propose that N. oceanica emerged through the active fusion of a host protist and a photosynthesizing ancient red alga and the symbiont nucleus became dominant over the host nucleus while the chloroplast was wrapped by two layers of endoplasmic reticulum. Our findings evidenced an alternative speciation pathway of eukaryotes.


April 21, 2020

Transcriptome analysis based on a combination of sequencing platforms provides insights into leaf pigmentation in Acer rubrum.

Red maple (Acer rubrum L.) is one of the most common and widespread trees with colorful leaves. We found a mutant with red, yellow, and green leaf phenotypes in different branches, which provided ideal materials with the same genetic relationship, and little interference from the environment, for the study of complex metabolic networks that underly variations in the coloration of leaves. We applied a combination of NGS and SMRT sequencing to various red maple tissues.A total of 125,448 unigenes were obtained, of which 46 and 69 were thought to be related to the synthesis of anthocyanins and carotenoids, respectively. In addition, 88 unigenes were presumed to be involved in the chlorophyll metabolic pathway. Based on a comprehensive analysis of the pigment gene expression network, the mechanisms of leaf color were investigated. The massive accumulation of Cy led to its higher content and proportion than other pigments, which caused the redness of leaves. Yellow coloration was the result of the complete decomposition of chlorophyll pigments, the unmasking of carotenoid pigments, and a slight accumulation of Cy.This study provides a systematic analysis of color variations in the red maple. Moreover, mass sequence data obtained by deep sequencing will provide references for the controlled breeding of red maple.


April 21, 2020

Long-read sequencing reveals a 4.4 kb tandem repeat region in the mitogenome of Echinococcus granulosus (sensu stricto) genotype G1.

Echinococcus tapeworms cause a severe helminthic zoonosis called echinococcosis. The genus comprises various species and genotypes, of which E. granulosus (sensu stricto) represents a significant global public health and socioeconomic burden. Mitochondrial (mt) genomes have provided useful genetic markers to explore the nature and extent of genetic diversity within Echinococcus and have underpinned phylogenetic and population structure analyses of this genus. Our recent work indicated a sequence gap (>?1 kb) in the mt genomes of E. granulosus genotype G1, which could not be determined by PCR-based Sanger sequencing. The aim of the present study was to define the complete mt genome, irrespective of structural complexities, using a long-read sequencing method.We extracted high molecular weight genomic DNA from protoscoleces from a single cyst of E. granulosus genotype G1 from a sheep from Australia using a conventional method and sequenced it using PacBio Sequel (long-read) technology, complemented by BGISEQ-500 short-read sequencing. Sequence data obtained were assembled using a recently-developed workflow.We assembled a complete mt genome sequence of 17,675 bp, which is >?4 kb larger than the complete mt genomes known for E. granulosus genotype G1. This assembly includes a previously-elusive tandem repeat region, which is 4417 bp long and consists of ten near-identical 441-445 bp repeat units, each harbouring a 184 bp non-coding region and adjacent regions. We also identified a short non-coding region of 183 bp, which includes an inverted repeat.We report what we consider to be the first complete mt genome of E. granulosus genotype G1 and characterise all repeat regions in this genome. The numbers, sizes, sequences and functions of tandem repeat regions remain to be studied in different isolates of genotype G1 and in other genotypes and species. The discovery of such ‘new’ repeat elements in the mt genome of genotype G1 by PacBio sequencing raises a question about the completeness of some published genomes of taeniid cestodes assembled from conventional or short-read sequence datasets. This study shows that long-read sequencing readily overcomes the challenges of assembling repeat elements to achieve improved genomes.


April 21, 2020

Long-read based assembly and synteny analysis of a reference Drosophila subobscura genome reveals signatures of structural evolution driven by inversions recombination-suppression effects.

Drosophila subobscura has long been a central model in evolutionary genetics. Presently, its use is hindered by the lack of a reference genome. To bridge this gap, here we used PacBio long-read technology, together with the available wealth of genetic marker information, to assemble and annotate a high-quality nuclear and complete mitochondrial genome for the species. With the obtained assembly, we performed the first synteny analysis of genome structure evolution in the subobscura subgroup.We generated a highly-contiguous ~?129?Mb-long nuclear genome, consisting of six pseudochromosomes corresponding to the six chromosomes of a female haploid set, and a complete 15,764?bp-long mitogenome, and provide an account of their numbers and distributions of codifying and repetitive content. All 12 identified paracentric inversion differences in the subobscura subgroup would have originated by chromosomal breakage and repair, with some associated duplications, but no evidence of direct gene disruptions by the breakpoints. Between lineages, inversion fixation rates were 10 times higher in continental D. subobscura than in the two small oceanic-island endemics D. guanche and D. madeirensis. Within D. subobscura, we found contrasting ratios of chromosomal divergence to polymorphism between the A sex chromosome and the autosomes.We present the first high-quality, long-read sequencing of a D. subobscura genome. Our findings generally support genome structure evolution in this species being driven indirectly, through the inversions’ recombination-suppression effects in maintaining sets of adaptive alleles together in the face of gene flow. The resources developed will serve to further establish the subobscura subgroup as model for comparative genomics and evolutionary indicator of global change.


April 21, 2020

De novo genome sequencing and secretome analysis of Tilletia indica inciting Karnal bunt of wheat provides pathogenesis-related genes.

Tilletia indica is an internationally quarantined fungal pathogen causing Karnal bunt of wheat. The present study carried out that the whole genome of T. indica was sequenced and identified transposable elements, pathogenicity-related genes using a comparative genomics approach. The T. indica genome assembly size of 33.7 MB was generated using Illumina and Pac Bio platforms with GC content of 55.0%. A total of 1737 scaffolds were obtained with N50 of 58,667 bp. The ab initio gene prediction was performed using Ustilago maydis as the reference species. A total number of 10,113 genes were predicted with an average gene size of 1945 bp out of which functionally annotated genes were 7262. A total number of 3216 protein-coding genes were assigned in different categories. Out of a total number of 1877 transposable elements, gypsy had the highest count (573). Total 5772 simple sequence repeats were identified in the genome assembly, and the most abundant simple sequence repeat type was trinucleotide having 42% of total SSRs. The comparative genome analysis suggested 3751 proteins of T. indica had orthologs in five fungi, whereas 126 proteins were unique to T. indica. Secretome analysis revealed the presence of 1014 secretory proteins and few carbohydrate-active enzymes in the genome. Some putative candidate pathogenicity-related genes were identified in the genome. The whole genome of T. indica will provide a window to understand the pathogenesis mechanism, fungal life cycle, survival of teliospores, and novel strategies for management of Karnal bunt disease of wheat.


April 21, 2020

Iso-Seq analysis of the Taxus cuspidata transcriptome reveals the complexity of Taxol biosynthesis.

Taxus cuspidata is well known worldwide for its ability to produce Taxol, one of the top-selling natural anticancer drugs. However, current Taxol production cannot match the increasing needs of the market, and novel strategies should be considered to increase the supply of Taxol. Since the biosynthetic mechanism of Taxol remains largely unknown, elucidating this pathway in detail will be very helpful in exploring alternative methods for Taxol production.Here, we sequenced Taxus cuspidata transcriptomes with next-generation sequencing (NGS) and third-generation sequencing (TGS) platforms. After correction with Illumina reads and removal of redundant reads, more than 180,000 nonredundant transcripts were generated from the raw Iso-Seq data. Using Cogent software and an alignment-based method, we identified a total of 139 cytochrome P450s (CYP450s), 31 BAHD acyltransferases (ACTs) and 1940 transcription factors (TFs). Based on phylogenetic and coexpression analysis, we identified 9 CYP450s and 7 BAHD ACTs as potential lead candidates for Taxol biosynthesis and 6 TFs that are possibly involved in the regulation of this process. Using coexpression analysis of genes known to be involved in Taxol biosynthesis, we elucidated the stem biosynthetic pathway. In addition, we analyzed the expression patterns of 12 characterized genes in the Taxol pathway and speculated that the isoprene precursors for Taxol biosynthesis were mainly synthesized via the MEP pathway. In addition, we found and confirmed that the alternative splicing patterns of some genes varied in different tissues, which may be an important tissue-specific method of posttranscriptional regulation.A strategy was developed to generate corrected full-length or nearly full-length transcripts without assembly to ensure sequence accuracy, thus greatly improving the reliability of coexpression and phylogenetic analysis and greatly facilitating gene cloning and characterization. This strategy was successfully utilized to elucidate the Taxol biosynthetic pathway, which will greatly contribute to the goals of improving the Taxol content in Taxus spp. using molecular breeding or plant management strategies and synthesizing Taxol in microorganisms using synthetic biological technology.


April 21, 2020

The sequencing and de novo assembly of the Larimichthys crocea genome using PacBio and Hi-C technologies.

Larimichthys crocea is an endemic marine fish in East Asia that belongs to Sciaenidae in Perciformes. L. crocea has now been recognized as an “iconic” marine fish species in China because not only is it a popular food fish in China, it is a representative victim of overfishing and still provides high value fish products supported by the modern large-scale mariculture industry. Here, we report a chromosome-level reference genome of L. crocea generated by employing the PacBio single molecule sequencing technique (SMRT) and high-throughput chromosome conformation capture (Hi-C) technologies. The genome sequences were assembled into 1,591 contigs with a total length of 723.86?Mb and a contig N50 length of 2.83?Mb. After chromosome-level scaffolding, 24 scaffolds were constructed with a total length of 668.67?Mb (92.48% of the total length). Genome annotation identified 23,657 protein-coding genes and 7262 ncRNAs. This highly accurate, chromosome-level reference genome of L. crocea provides an essential genome resource to support the development of genome-scale selective breeding and restocking strategies of L. crocea.


April 21, 2020

The sequence and de novo assembly of Takifugu bimaculatus genome using PacBio and Hi-C technologies.

Takifugu bimaculatus is a native teleost species of the southeast coast of China where it has been cultivated as an important edible fish in the last decade. Genetic breeding programs, which have been recently initiated for improving the aquaculture performance of T. bimaculatus, urgently require a high-quality reference genome to facilitate genome selection and related genetic studies. To address this need, we produced a chromosome-level reference genome of T. bimaculatus using the PacBio single molecule sequencing technique (SMRT) and High-through chromosome conformation capture (Hi-C) technologies. The genome was assembled into 2,193 contigs with a total length of 404.21?Mb and a contig N50 length of 1.31?Mb. After chromosome-level scaffolding, 22 chromosomes with a total length of 371.68?Mb were constructed. Moreover, a total of 21,117 protein-coding genes and 3,471 ncRNAs were annotated in the reference genome. The highly accurate, chromosome-level reference genome of T. bimaculatus provides an essential genome resource for not only the genome-scale selective breeding of T. bimaculatus but also the exploration of the evolutionary basis of the speciation and local adaptation of the Takifugu genus.


April 21, 2020

Mitochondrial and chloroplast genomes provide insights into the evolutionary origins of quinoa (Chenopodium quinoa Willd.).

Quinoa has recently gained international attention because of its nutritious seeds, prompting the expansion of its cultivation into new areas in which it was not originally selected as a crop. Improving quinoa production in these areas will benefit from the introduction of advantageous traits from free-living relatives that are native to these, or similar, environments. As part of an ongoing effort to characterize the primary and secondary germplasm pools for quinoa, we report the complete mitochondrial and chloroplast genome sequences of quinoa accession PI 614886 and the identification of sequence variants in additional accessions from quinoa and related species. This is the first reported mitochondrial genome assembly in the genus Chenopodium. Inference of phylogenetic relationships among Chenopodium species based on mitochondrial and chloroplast variants supports the hypotheses that 1) the A-genome ancestor was the cytoplasmic donor in the original tetraploidization event, and 2) highland and coastal quinoas were independently domesticated.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.