Menu
April 21, 2020

The red bayberry genome and genetic basis of sex determination.

Morella rubra, red bayberry, is an economically important fruit tree in south China. Here, we assembled the first high-quality genome for both a female and a male individual of red bayberry. The genome size was 313-Mb, and 90% sequences were assembled into eight pseudo chromosome molecules, with 32 493 predicted genes. By whole-genome comparison between the female and male and association analysis with sequences of bulked and individual DNA samples from female and male, a 59-Kb region determining female was identified and located on distal end of pseudochromosome 8, which contains abundant transposable element and seven putative genes, four of them are related to sex floral development. This 59-Kb female-specific region was likely to be derived from duplication and rearrangement of paralogous genes and retained non-recombinant in the female-specific region. Sex-specific molecular markers developed from candidate genes co-segregated with sex in a genetically diverse female and male germplasm. We propose sex determination follow the ZW model of female heterogamety. The genome sequence of red bayberry provides a valuable resource for plant sex chromosome evolution and also provides important insights for molecular biology, genetics and modern breeding in Myricaceae family. © 2018 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.


April 21, 2020

Phased genome sequence of an interspecific hybrid flowering cherry, ‘Somei-Yoshino’ (Cerasus × yedoensis).

We report the phased genome sequence of an interspecific hybrid, the flowering cherry ‘Somei-Yoshino’ (Cerasus × yedoensis). The sequence data were obtained by single-molecule real-time sequencing technology, split into two subsets based on genome information of the two probable ancestors, and assembled to obtain two haplotype phased genome sequences of the interspecific hybrid. The resultant genome assembly consisting of the two haplotype sequences spanned 690.1 Mb with 4,552 contigs and an N50 length of 1.0 Mb. We predicted 95,076 high-confidence genes, including 94.9% of the core eukaryotic genes. Based on a high-density genetic map, we established a pair of eight pseudomolecule sequences, with highly conserved structures between the two haplotype sequences with 2.4 million sequence variants. A whole genome resequencing analysis of flowering cherries suggested that ‘Somei-Yoshino’ might be derived from a cross between C. spachiana and either C. speciosa or its relatives. A time-course transcriptome analysis of floral buds and flowers suggested comprehensive changes in gene expression in floral bud development towards flowering. These genome and transcriptome data are expected to provide insights into the evolution and cultivation of flowering cherry and the molecular mechanism underlying flowering. © The Author(s) 2019. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.


April 21, 2020

A 12-kb structural variation in progressive myoclonic epilepsy was newly identified by long-read whole-genome sequencing.

We report a family with progressive myoclonic epilepsy who underwent whole-exome sequencing but was negative for pathogenic variants. Similar clinical courses of a devastating neurodegenerative phenotype of two affected siblings were highly suggestive of a genetic etiology, which indicates that the survey of genetic variation by whole-exome sequencing was not comprehensive. To investigate the presence of a variant that remained unrecognized by standard genetic testing, PacBio long-read sequencing was performed. Structural variant (SV) detection using low-coverage (6×) whole-genome sequencing called 17,165 SVs (7,216 deletions and 9,949 insertions). Our SV selection narrowed down potential candidates to only five SVs (two deletions and three insertions) on the genes tagged with autosomal recessive phenotypes. Among them, a 12.4-kb deletion involving the CLN6 gene was the top candidate because its homozygous abnormalities cause neuronal ceroid lipofuscinosis. This deletion included the initiation codon and was found in a GC-rich region containing multiple repetitive elements. These results indicate the presence of a causal variant in a difficult-to-sequence region and suggest that such variants that remain enigmatic after the application of current whole-exome sequencing technology could be uncovered by unbiased application of long-read whole-genome sequencing.


April 21, 2020

Full-length transcriptome analysis of Litopenaeus vannamei reveals transcript variants involved in the innate immune system.

To better understand the immune system of shrimp, this study combined PacBio isoform sequencing (Iso-Seq) and Illumina paired-end short reads sequencing methods to discover full-length immune-related molecules of the Pacific white shrimp, Litopenaeus vannamei. A total of 72,648 nonredundant full-length transcripts (unigenes) were generated with an average length of 2545 bp from five main tissues, including the hepatopancreas, cardiac stomach, heart, muscle, and pyloric stomach. These unigenes exhibited a high annotation rate (62,164, 85.57%) when compared against NR, NT, Swiss-Prot, Pfam, GO, KEGG and COG databases. A total of 7544 putative long noncoding RNAs (lncRNAs) were detected and 1164 nonredundant full-length transcripts (449 UniTransModels) participated in the alternative splicing (AS) events. Importantly, a total of 5279 nonredundant full-length unigenes were successfully identified, which were involved in the innate immune system, including 9 immune-related processes, 19 immune-related pathways and 10 other immune-related systems. We also found wide transcript variants, which increased the number and function complexity of immune molecules; for example, toll-like receptors (TLRs) and interferon regulatory factors (IRFs). The 480 differentially expressed genes (DEGs) were significantly higher or tissue-specific expression patterns in the hepatopancreas compared with that in other four tested tissues (FDR <0.05). Furthermore, the expression levels of six selected immune-related DEGs and putative IRFs were validated using real-time PCR technology, substantiating the reliability of the PacBio Iso-seq results. In conclusion, our results provide new genetic resources of long-read full-length transcripts data and information for identifying immune-related genes, which are an invaluable transcriptomic resource as genomic reference, especially for further exploration of the innate immune and defense mechanisms of shrimp. Copyright © 2019 Elsevier Ltd. All rights reserved.


April 21, 2020

Genome Sequence of Jaltomata Addresses Rapid Reproductive Trait Evolution and Enhances Comparative Genomics in the Hyper-Diverse Solanaceae.

Within the economically important plant family Solanaceae, Jaltomata is a rapidly evolving genus that has extensive diversity in flower size and shape, as well as fruit and nectar color, among its ~80 species. Here, we report the whole-genome sequencing, assembly, and annotation, of one representative species (Jaltomata sinuosa) from this genus. Combining PacBio long reads (25×) and Illumina short reads (148×) achieved an assembly of ~1.45?Gb, spanning ~96% of the estimated genome. Ninety-six percent of curated single-copy orthologs in plants were detected in the assembly, supporting a high level of completeness of the genome. Similar to other Solanaceous species, repetitive elements made up a large fraction (~80%) of the genome, with the most recently active element, Gypsy, expanding across the genome in the last 1-2 Myr. Computational gene prediction, in conjunction with a merged transcriptome data set from 11 tissues, identified 34,725 protein-coding genes. Comparative phylogenetic analyses with six other sequenced Solanaceae species determined that Jaltomata is most likely sister to Solanum, although a large fraction of gene trees supported a conflicting bipartition consistent with substantial introgression between Jaltomata and Capsicum after these species split. We also identified gene family dynamics specific to Jaltomata, including expansion of gene families potentially involved in novel reproductive trait development, and loss of gene families that accompanied the loss of self-incompatibility. This high-quality genome will facilitate studies of phenotypic diversification in this rapidly radiating group and provide a new point of comparison for broader analyses of genomic evolution across the Solanaceae.


April 21, 2020

One Aeromonas salmonicida subsp. salmonicida isolate with a pAsa5 variant bearing antibiotic resistance and a pRAS3 variant making a link with a swine pathogen.

The Gram-negative bacterium Aeromonas salmonicida subsp. salmonicida is an aquatic pathogen which causes furunculosis to salmonids, especially in fish farms. The emergence of strains of this bacterium exhibiting antibiotic resistance is increasing, limiting the effectiveness of antibiotherapy as a treatment against this worldwide disease. In the present study, we discovered an isolate of A. salmonicida subsp. salmonicida that harbors two novel plasmids variants carrying antibiotic resistance genes. The use of long-read sequencing (PacBio) allowed us to fully characterize those variants, named pAsa5-3432 and pRAS3-3432, which both differ from their classic counterpart through their content in mobile genetic elements. The plasmid pAsa5-3432 carries a new multidrug region composed of multiple mobile genetic elements, including a Class 1 integron similar to an integrated element of Salmonella enterica. With this new region, probably acquired through plasmid recombination, pAsa5-3432 is the first reported plasmid of this bacterium that bears both an essential virulence factor (the type three secretion system) and multiple antibiotic resistance genes. As for pRAS3-3432, compared to the classic pRAS3, it carries a new mobile element that has only been identified in Chlamydia suis. Hence, with the identification of those two novel plasmids harboring mobile genetic elements that are normally encountered in other bacterial species, the present study puts emphasis on the important impact of mobile genetic elements in the genomic plasticity of A. salmonicida subsp. salmonicida and suggests that this aquatic bacterium could be an important reservoir of antibiotic resistance genes that can be exchanged with other bacteria, including human and animal pathogens. Copyright © 2019 Elsevier B.V. All rights reserved.


April 21, 2020

Function and Distribution of a Lantipeptide in Strawberry Fusarium Wilt Disease-Suppressive Soils.

Streptomyces griseus S4-7 is representative of strains responsible for the specific soil suppressiveness of Fusarium wilt of strawberry caused by Fusarium oxysporum f. sp. fragariae. Members of the genus Streptomyces secrete diverse secondary metabolites including lantipeptides, heat-stable lanthionine-containing compounds that can exhibit antibiotic activity. In this study, a class II lantipeptide provisionally named grisin, of previously unknown biological function, was shown to inhibit F. oxysporum. The inhibitory activity of grisin distinguishes it from other class II lantipeptides from Streptomyces spp. Results of quantitative reverse transcription-polymerase chain reaction with lanM-specific primers showed that the density of grisin-producing Streptomyces spp. in the rhizosphere of strawberry was positively correlated with the number of years of monoculture and a minimum of seven years was required for development of specific soil suppressiveness to Fusarium wilt disease. We suggest that lanM can be used as a diagnostic marker of whether a soil is conducive or suppressive to the disease.


April 21, 2020

Hybrid sequencing-based personal full-length transcriptomic analysis implicates proteostatic stress in metastatic ovarian cancer.

Comprehensive molecular characterization of myriad somatic alterations and aberrant gene expressions at personal level is key to precision cancer therapy, yet limited by current short-read sequencing technology, individualized catalog of complete genomic and transcriptomic features is thus far elusive. Here, we integrated second- and third-generation sequencing platforms to generate a multidimensional dataset on a patient affected by metastatic epithelial ovarian cancer. Whole-genome and hybrid transcriptome dissection captured global genetic and transcriptional variants at previously unparalleled resolution. Particularly, single-molecule mRNA sequencing identified a vast array of unannotated transcripts, novel long noncoding RNAs and gene chimeras, permitting accurate determination of transcription start, splice, polyadenylation and fusion sites. Phylogenetic and enrichment inference of isoform-level measurements implicated early functional divergence and cytosolic proteostatic stress in shaping ovarian tumorigenesis. A complementary imaging-based high-throughput drug screen was performed and subsequently validated, which consistently pinpointed proteasome inhibitors as an effective therapeutic regime by inducing protein aggregates in ovarian cancer cells. Therefore, our study suggests that clinical application of the emerging long-read full-length analysis for improving molecular diagnostics is feasible and informative. An in-depth understanding of the tumor transcriptome complexity allowed by leveraging the hybrid sequencing approach lays the basis to reveal novel and valid therapeutic vulnerabilities in advanced ovarian malignancies.


April 21, 2020

Symbiotic organs shaped by distinct modes of genome evolution in cephalopods.

Microbes have been critical drivers of evolutionary innovation in animals. To understand the processes that influence the origin of specialized symbiotic organs, we report the sequencing and analysis of the genome of Euprymna scolopes, a model cephalopod with richly characterized host-microbe interactions. We identified large-scale genomic reorganization shared between E. scolopes and Octopus bimaculoides and posit that this reorganization has contributed to the evolution of cephalopod complexity. To reveal genomic signatures of host-symbiont interactions, we focused on two specialized organs of E. scolopes: the light organ, which harbors a monoculture of Vibrio fischeri, and the accessory nidamental gland (ANG), a reproductive organ containing a bacterial consortium. Our findings suggest that the two symbiotic organs within E. scolopes originated by different evolutionary mechanisms. Transcripts expressed in these microbe-associated tissues displayed their own unique signatures in both coding sequences and the surrounding regulatory regions. Compared with other tissues, the light organ showed an abundance of genes associated with immunity and mediating light, whereas the ANG was enriched in orphan genes known only from E. scolopes Together, these analyses provide evidence for different patterns of genomic evolution of symbiotic organs within a single host. Copyright © 2019 the Author(s). Published by PNAS.


April 21, 2020

PacBio full-length cDNA sequencing integrated with RNA-seq reads drastically improves the discovery of splicing transcripts in rice.

In eukaryotes, alternative splicing (AS) greatly expands the diversity of transcripts. However, it is challenging to accurately determine full-length splicing isoforms. Recently, more studies have taken advantage of Pacific Bioscience (PacBio) long-read sequencing to identify full-length transcripts. Nevertheless, the high error rate of PacBio reads seriously offsets the advantages of long reads, especially for accurately identifying splicing junctions. To best capitalize on the features of long reads, we used Illumina RNA-seq reads to improve PacBio circular consensus sequence (CCS) quality and to validate splicing patterns in the rice transcriptome. We evaluated the impact of CCS accuracy on the number and the validation rate of splicing isoforms, and integrated a comprehensive pipeline of splicing transcripts analysis by Iso-Seq and RNA-seq (STAIR) to identify the full-length multi-exon isoforms in rice seedling transcriptome (Oryza sativa L. ssp. japonica). STAIR discovered 11 733 full-length multi-exon isoforms, 6599 more than the SMRT Portal RS_IsoSeq pipeline did. Of these splicing isoforms identified, 4453 (37.9%) were missed in assembled transcripts from RNA-seq reads, and 5204 (44.4%), including 268 multi-exon long non-coding RNAs (lncRNAs), were not reported in the MSU_osa1r7 annotation. Some randomly selected unreported splicing junctions were verified by polymerase chain reaction (PCR) amplification. In addition, we investigated alternative polyadenylation (APA) events in transcripts and identified 829 major polyadenylation [poly(A)] site clusters (PACs). The analysis of splicing isoforms and APA events will facilitate the annotation of the rice genome and studies on the expression and polyadenylation of AS genes in different developmental stages or growth conditions of rice. © 2018 The Authors The Plant Journal © 2018 John Wiley & Sons Ltd.


April 21, 2020

MSC: a metagenomic sequence classification algorithm.

Metagenomics is the study of genetic materials directly sampled from natural habitats. It has the potential to reveal previously hidden diversity of microscopic life largely due to the existence of highly parallel and low-cost next-generation sequencing technology. Conventional approaches align metagenomic reads onto known reference genomes to identify microbes in the sample. Since such a collection of reference genomes is very large, the approach often needs high-end computing machines with large memory which is not often available to researchers. Alternative approaches follow an alignment-free methodology where the presence of a microbe is predicted using the information about the unique k-mers present in the microbial genomes. However, such approaches suffer from high false positives due to trading off the value of k with the computational resources. In this article, we propose a highly efficient metagenomic sequence classification (MSC) algorithm that is a hybrid of both approaches. Instead of aligning reads to the full genomes, MSC aligns reads onto a set of carefully chosen, shorter and highly discriminating model sequences built from the unique k-mers of each of the reference sequences.Microbiome researchers are generally interested in two objectives of a taxonomic classifier: (i) to detect prevalence, i.e. the taxa present in a sample, and (ii) to estimate their relative abundances. MSC is primarily designed to detect prevalence and experimental results show that MSC is indeed a more effective and efficient algorithm compared to the other state-of-the-art algorithms in terms of accuracy, memory and runtime. Moreover, MSC outputs an approximate estimate of the abundances.The implementations are freely available for non-commercial purposes. They can be downloaded from https://drive.google.com/open?id=1XirkAamkQ3ltWvI1W1igYQFusp9DHtVl. © The Author(s) 2019. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.


April 21, 2020

Comparative analysis of KPC-2-encoding chimera plasmids with multi-replicon IncR:IncpA1763-KPC:IncN1 or IncFIIpHN7A8:IncpA1763-KPC:IncN1.

IncR, IncFII, IncpA1763-KPC, and IncN1 plasmids have been increasingly found among Enterobacteriaceae species, but plasmids with hybrid structures derived from the above-mentioned incompatibility groups have not yet been described.Plasmids p721005-KPC, p504051-KPC, and pA3295-KPC were fully sequenced and compared with previously sequenced related plasmids pHN84KPC (IncR), pKPHS2 (IncFIIK), pKOX_NDM1 (IncFIIY), pHN7A8 (IncFIIpHN7A8), and R46 (IncN1).The backbone of p721005-KPC/p504051-KPC was a hybrid of the entire 10-kb IncR-type backbone from pHN84KPC, the entire 64.3-kb IncFIIK-type maintenance, and conjugal transfer regions from pKPHS2, a 15.5-kb IncFIIY-type maintenance region from pKOX_NDM1 and a 5.6-kb IncpA1763-KPC-type backbone region from pA1763-KPC, and it contained a primary IncR replicon and two auxiliary IncpA1763-KPC and IncN1 replicons. The backbone of pA3295-KPC was a hybrid of a 7.2-kb IncFIIpHN7A8-type backbone region from pHN7A8, the almost entire 33.3-kb IncN1-type maintenance and conjugal transfer regions highly similar to R46, a 26.2-kb IncFIIK-type maintenance regions from pKPHS2, the above 15.5-kb IncFIIY-type maintenance region, and the above 5.6-kb IncpA1763-KPC-type backbone region, and it contained a primary Inc-FIIpHN7A8 replicon and two auxiliary IncpA1763-KPC and IncN1 replicons. Each of p721005-KPC, p504051-KPC, and pA3295-KPC acquired a wealth of accessory modules, carrying a range of intact and residue mobile elements (such as insertion sequences, unit transposons, and integrons) and resistance markers (such as blaKPC, tetA, dfrA, and qnr).In each of p721005-KPC, p504051-KPC, and pA3295-KPC, multiple replicons in coordination with maintenance and conjugation regions of various origins would maintain a broad host range and a stable replication at a steady-state plasmid copy number.


April 21, 2020

Complete mitochondrial genome of Hemiptelea davidii (Ulmaceae) and phylogenetic analysis

Hemiptelea davidii (Hance) Planch is a potential valuable forest tree in arid sandy environments. Here, the complete mitochondrial genome of H. davidii was assembled using a combination of the PacBio Sequel data and the Illumina Hiseq data. The mitochondrial genome is 460,941bp in length, including 37 protein-coding genes, 19 tRNA genes, and three rRNA genes. The GC content of the whole mito- chondrial genome is 44.84%. Phylogenetic analyses indicated that H. davidii is close with Cannabis and Morus species.


April 21, 2020

A systematic review of the Trypanosoma cruzi genetic heterogeneity, host immune response and genetic factors as plausible drivers of chronic chagasic cardiomyopathy.

Chagas disease is a complex tropical pathology caused by the kinetoplastid Trypanosoma cruzi. This parasite displays massive genetic diversity and has been classified by international consensus in at least six Discrete Typing Units (DTUs) that are broadly distributed in the American continent. The main clinical manifestation of the disease is the chronic chagasic cardiomyopathy (CCC) that is lethal in the infected individuals. However, one intriguing feature is that only 30-40% of the infected individuals will develop CCC. Some authors have suggested that the immune response, host genetic factors, virulence factors and even the massive genetic heterogeneity of T. cruzi are responsible of this clinical pattern. To date, no conclusive data support the reason why a few percentages of the infected individuals will develop CCC. Therefore, we decided to conduct a systematic review analysing the host genetic factors, immune response, cytokine production, virulence factors and the plausible association of the parasite DTUs and CCC. The epidemiological and clinical implications are herein discussed.


April 21, 2020

High Quality Draft Genome of Arogyapacha (Trichopus zeylanicus), an Important Medicinal Plant Endemic to Western Ghats of India.

Arogyapacha, the local name of Trichopus zeylanicus, is a rare, indigenous medicinal plant of India. This plant is famous for its traditional use as an instant energy stimulant. So far, no genomic resource is available for this important plant and hence its metabolic pathways are poorly understood. Here, we report on a high-quality draft assembly of approximately 713.4 Mb genome of T. zeylanicus, first draft genome from the genus Trichopus The assembly was generated in a hybrid approach using Illumina short-reads and Pacbio longer-reads. The total assembly comprised of 22601 scaffolds with an N50 value of 433.3 Kb. We predicted 34452 protein coding genes in T. zeylanicus genome and found that a significant portion of these predicted genes were associated with various secondary metabolite biosynthetic pathways. Comparative genome analysis revealed extensive gene collinearity between T. zeylanicus and its closely related plant species. The present genome and annotation data provide an essential resource to speed-up the research on secondary metabolism, breeding and molecular evolution of T. zeylanicus. Copyright © 2019 Chellappan et al.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.