Long-read Archives - Page 51 of 66

April 21, 2020

The CF Canada-Sick Kids Program in individual CF therapy: A resource for the advancement of personalized medicine in CF.

Therapies targeting certain CFTR mutants have been approved, yet variations in clinical response highlight the need for in-vitro and genetic tools that predict patient-specific clinical outcomes. Toward this goal, the CF Canada-Sick Kids Program in Individual CF Therapy (CFIT) is generating a “first of its kind”, comprehensive resource containing patient-specific cell cultures and data from 100 CF individuals that will enable modeling of therapeutic responses.The CFIT program is generating: 1) nasal cells from drug naïve patients suitable for culture and the study of drug responses in vitro, 2) matched gene expression data obtained by sequencing the RNA from the primary nasal tissue, 3) whole genome sequencing of blood derived DNA from each of the 100 participants, 4) induced pluripotent stem cells (iPSCs) generated from each participant’s blood sample, 5) CRISPR-edited isogenic control iPSC lines and 6) prospective clinical data from patients treated with CF modulators.To date, we have recruited 57 of 100 individuals to CFIT, most of whom are homozygous for F508del (to assess in-vitro: in-vivo correlations with respect to ORKAMBI response) or heterozygous for F508del and a minimal function mutation. In addition, several donors are homozygous for rare nonsense and missense mutations. Nasal epithelial cell cultures and matched iPSC lines are available for many of these donors.This accessible resource will enable development of tools that predict individual outcomes to current and emerging modulators targeting F508del-CFTR and facilitate therapy discovery for rare CF causing mutations.Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.

April 21, 2020

Characterization and analysis of the transcriptome in Gymnocypris selincuoensis on the Qinghai-Tibetan Plateau using single-molecule long-read sequencing and RNA-seq.

The lakes on the Qinghai-Tibet Plateau (QTP) are the largest and highest lake group in the world. Gymnocypris selincuoensis is the only cyprinid fish living in lake Selincuo, the largest lake on QTP. However, its genetic resource is still blank, limiting studies on molecular and genetic analysis. In this study, the transcriptome of G. selincuoensis was first generated by using PacBio Iso-Seq and Illumina RNA-seq. A full-length (FL) transcriptome with 75,435 transcripts was obtained by Iso-Seq with N50 length of 3,870 bp. Among all transcripts, 75,016 were annotated to public databases, 64,710 contain complete open reading frames and 2,811 were long non-coding RNAs. Based on all- vs.-all BLAST, 2,069 alternative splicing events were detected, and 80% of them were validated by reverse transcription polymerase chain reaction (RT-PCR). Tissue gene expression atlas showed that the number of detected expressed transcripts ranged from 37,397 in brain to 19,914 in muscle, with 10,488 transcripts detected in all seven tissues. Comparative genomic analysis with other cyprinid fishes identified 77 orthologous genes with potential positive selection (Ka/Ks > 0.3). A total of 56,696 perfect simple sequence repeats were identified from FL transcripts. Our results provide valuable genetic resources for further studies on adaptive evolution, gene expression and population genetics in G. selincuoensis and other congeneric fishes. © The Author(s) 2019. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

April 21, 2020

rMETL: sensitive mobile element insertion detection with long read realignment.

Mobile element insertion (MEI) is a major category of structure variations (SVs). The rapid development of long read sequencing technologies provides the opportunity to detect MEIs sensitively. However, the signals of MEI implied by noisy long reads are highly complex due to the repetitiveness of mobile elements as well as the high sequencing error rates. Herein, we propose the Realignment-based Mobile Element insertion detection Tool for Long read (rMETL). Benchmarking results of simulated and real datasets demonstrate that rMETL enables to handle the complex signals to discover MEIs sensitively. It is suited to produce high-quality MEI callsets in many genomics studies.rMETL is available from https://github.com/hitbc/rMETL.Supplementary data are available at Bioinformatics online. © The Author(s) 2019. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

April 21, 2020

Full-length transcriptome analysis of Litopenaeus vannamei reveals transcript variants involved in the innate immune system.

To better understand the immune system of shrimp, this study combined PacBio isoform sequencing (Iso-Seq) and Illumina paired-end short reads sequencing methods to discover full-length immune-related molecules of the Pacific white shrimp, Litopenaeus vannamei. A total of 72,648 nonredundant full-length transcripts (unigenes) were generated with an average length of 2545 bp from five main tissues, including the hepatopancreas, cardiac stomach, heart, muscle, and pyloric stomach. These unigenes exhibited a high annotation rate (62,164, 85.57%) when compared against NR, NT, Swiss-Prot, Pfam, GO, KEGG and COG databases. A total of 7544 putative long noncoding RNAs (lncRNAs) were detected and 1164 nonredundant full-length transcripts (449 UniTransModels) participated in the alternative splicing (AS) events. Importantly, a total of 5279 nonredundant full-length unigenes were successfully identified, which were involved in the innate immune system, including 9 immune-related processes, 19 immune-related pathways and 10 other immune-related systems. We also found wide transcript variants, which increased the number and function complexity of immune molecules; for example, toll-like receptors (TLRs) and interferon regulatory factors (IRFs). The 480 differentially expressed genes (DEGs) were significantly higher or tissue-specific expression patterns in the hepatopancreas compared with that in other four tested tissues (FDR <0.05). Furthermore, the expression levels of six selected immune-related DEGs and putative IRFs were validated using real-time PCR technology, substantiating the reliability of the PacBio Iso-seq results. In conclusion, our results provide new genetic resources of long-read full-length transcripts data and information for identifying immune-related genes, which are an invaluable transcriptomic resource as genomic reference, especially for further exploration of the innate immune and defense mechanisms of shrimp. Copyright © 2019 Elsevier Ltd. All rights reserved.

April 21, 2020

TranscriptClean: variant-aware correction of indels, mismatches and splice junctions in long-read transcripts.

Long-read, single-molecule sequencing platforms hold great potential for isoform discovery and characterization of multi-exon transcripts. However, their high error rates are an obstacle to distinguishing novel transcript isoforms from sequencing artifacts. Therefore, we developed the package TranscriptClean to correct mismatches, microindels and noncanonical splice junctions in mapped transcripts using the reference genome while preserving known variants.Our method corrects nearly all mismatches and indels present in a publically available human PacBio Iso-seq dataset, and rescues 39% of noncanonical splice junctions.All Python and R scripts used in this paper are available at https://github.com/dewyman/TranscriptClean.

April 21, 2020

Genome Sequence of Jaltomata Addresses Rapid Reproductive Trait Evolution and Enhances Comparative Genomics in the Hyper-Diverse Solanaceae.

Within the economically important plant family Solanaceae, Jaltomata is a rapidly evolving genus that has extensive diversity in flower size and shape, as well as fruit and nectar color, among its ~80 species. Here, we report the whole-genome sequencing, assembly, and annotation, of one representative species (Jaltomata sinuosa) from this genus. Combining PacBio long reads (25×) and Illumina short reads (148×) achieved an assembly of ~1.45?Gb, spanning ~96% of the estimated genome. Ninety-six percent of curated single-copy orthologs in plants were detected in the assembly, supporting a high level of completeness of the genome. Similar to other Solanaceous species, repetitive elements made up a large fraction (~80%) of the genome, with the most recently active element, Gypsy, expanding across the genome in the last 1-2 Myr. Computational gene prediction, in conjunction with a merged transcriptome data set from 11 tissues, identified 34,725 protein-coding genes. Comparative phylogenetic analyses with six other sequenced Solanaceae species determined that Jaltomata is most likely sister to Solanum, although a large fraction of gene trees supported a conflicting bipartition consistent with substantial introgression between Jaltomata and Capsicum after these species split. We also identified gene family dynamics specific to Jaltomata, including expansion of gene families potentially involved in novel reproductive trait development, and loss of gene families that accompanied the loss of self-incompatibility. This high-quality genome will facilitate studies of phenotypic diversification in this rapidly radiating group and provide a new point of comparison for broader analyses of genomic evolution across the Solanaceae.

April 21, 2020

Heterochromatin-enriched assemblies reveal the sequence and organization of the Drosophila melanogaster Y chromosome.

Heterochromatic regions of the genome are repeat-rich and poor in protein coding genes, and are therefore underrepresented in even the best genome assemblies. One of the most difficult regions of the genome to assemble are sex-limited chromosomes. The Drosophila melanogaster Y chromosome is entirely heterochromatic, yet has wide-ranging effects on male fertility, fitness, and genome-wide gene expression. The genetic basis of this phenotypic variation is difficult to study, in part because we do not know the detailed organization of the Y chromosome. To study Y chromosome organization in D. melanogaster, we develop an assembly strategy involving the in silico enrichment of heterochromatic long single-molecule reads and use these reads to create targeted de novo assemblies of heterochromatic sequences. We assigned contigs to the Y chromosome using Illumina reads to identify male-specific sequences. Our pipeline extends the D. melanogaster reference genome by 11.9 Mb, closes 43.8% of the gaps, and improves overall contiguity. The addition of 10.6 MB of Y-linked sequence permitted us to study the organization of repeats and genes along the Y chromosome. We detected a high rate of duplication to the pericentric regions of the Y chromosome from other regions in the genome. Most of these duplicated genes exist in multiple copies. We detail the evolutionary history of one sex-linked gene family, crystal-Stellate While the Y chromosome does not undergo crossing over, we observed high gene conversion rates within and between members of the crystal-Stellate gene family, Su(Ste), and PCKR, compared to genome-wide estimates. Our results suggest that gene conversion and gene duplication play an important role in the evolution of Y-linked genes. Copyright © 2019 Chang and Larracuente.

April 21, 2020

Two large reciprocal translocations characterized in the disease resistance-rich burmannica genetic group of Musa acuminata.

Banana cultivars are derived from hybridizations involving Musa acuminata subspecies. The latter diverged following geographical isolation in distinct South-east Asian continental regions and islands. Observation of chromosome pairing irregularities in meiosis of hybrids between these subspecies suggested the presence of large chromosomal structural variations. The aim of this study was to characterize such rearrangements.Marker (single nucleotide polymorphism) segregation in a self-progeny of the ‘Calcutta 4’ accession and mate-pair sequencing were used to search for chromosomal rearrangements in comparison with the M. acuminata ssp. malaccensis genome reference sequence. Signature segment junctions of the revealed chromosome structures were identified and searched in whole-genome sequencing data from 123 wild and cultivated Musa accessions.Two large reciprocal translocations were characterized in the seedy banana M. acuminata ssp. burmannicoides ‘Calcutta 4’ accession. One consisted of an exchange of a 240 kb distal region of chromosome 2 with a 7.2 Mb distal region of chromosome 8. The other involved an exchange of a 20.8 Mb distal region of chromosome 1 with a 11.6 Mb distal region of chromosome 9. Both translocations were found only in wild accessions belonging to the burmannicoides/burmannica/siamea subspecies. Only two of the 87 cultivars analysed displayed the 2/8 translocation, while none displayed the 1/9 translocation.Two large reciprocal translocations were identified that probably originated in the burmannica genetic group. Accurate characterization of these translocations should enhance the use of this disease resistance-rich burmannica group in breeding programmes. © The Author(s) 2019. Published by Oxford University Press on behalf of the Annals of Botany Company.

April 21, 2020

Genomic investigation of Staphylococcus aureus recovered from Gambian women and newborns following an oral dose of intra-partum azithromycin.

Oral azithromycin given during labour reduces carriage of bacteria responsible for neonatal sepsis, including Staphylococcus aureus. However, there is concern that this may promote drug resistance.Here, we combine genomic and epidemiological data on S. aureus isolated from mothers and babies in a randomized intra-partum azithromycin trial (PregnAnZI) to describe bacterial population dynamics and resistance mechanisms.Participants from both arms of the trial, who carried S. aureus in day 3 and day 28 samples post-intervention, were included. Sixty-six S. aureus isolates (from 7 mothers and 10 babies) underwent comparative genome analyses and the data were then combined with epidemiological data. Trial registration (main trial): ClinicalTrials.gov Identifier NCT01800942.Seven S. aureus STs were identified, with ST5 dominant (n?=?40, 61.0%), followed by ST15 (n?=?11, 17.0%). ST5 predominated in the placebo arm (73.0% versus 49.0%, P?=?0.039) and ST15 in the azithromycin arm (27.0% versus 6.0%, P?=?0.022). In azithromycin-resistant isolates, msr(A) was the main macrolide resistance gene (n?=?36, 80%). Ten study participants, from both trial arms, acquired azithromycin-resistant S. aureus after initially harbouring a susceptible isolate. In nine (90%) of these cases, the acquired clone was an msr(A)-containing ST5 S. aureus. Long-read sequencing demonstrated that in ST5, msr(A) was found on an MDR plasmid.Our data reveal in this Gambian population the presence of a dominant clone of S. aureus harbouring plasmid-encoded azithromycin resistance, which was acquired by participants in both arms of the study. Understanding these resistance dynamics is crucial to defining the public health drug resistance impacts of azithromycin prophylaxis given during labour in Africa. © The Author(s) 2019. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy.

April 21, 2020

One Aeromonas salmonicida subsp. salmonicida isolate with a pAsa5 variant bearing antibiotic resistance and a pRAS3 variant making a link with a swine pathogen.

The Gram-negative bacterium Aeromonas salmonicida subsp. salmonicida is an aquatic pathogen which causes furunculosis to salmonids, especially in fish farms. The emergence of strains of this bacterium exhibiting antibiotic resistance is increasing, limiting the effectiveness of antibiotherapy as a treatment against this worldwide disease. In the present study, we discovered an isolate of A. salmonicida subsp. salmonicida that harbors two novel plasmids variants carrying antibiotic resistance genes. The use of long-read sequencing (PacBio) allowed us to fully characterize those variants, named pAsa5-3432 and pRAS3-3432, which both differ from their classic counterpart through their content in mobile genetic elements. The plasmid pAsa5-3432 carries a new multidrug region composed of multiple mobile genetic elements, including a Class 1 integron similar to an integrated element of Salmonella enterica. With this new region, probably acquired through plasmid recombination, pAsa5-3432 is the first reported plasmid of this bacterium that bears both an essential virulence factor (the type three secretion system) and multiple antibiotic resistance genes. As for pRAS3-3432, compared to the classic pRAS3, it carries a new mobile element that has only been identified in Chlamydia suis. Hence, with the identification of those two novel plasmids harboring mobile genetic elements that are normally encountered in other bacterial species, the present study puts emphasis on the important impact of mobile genetic elements in the genomic plasticity of A. salmonicida subsp. salmonicida and suggests that this aquatic bacterium could be an important reservoir of antibiotic resistance genes that can be exchanged with other bacteria, including human and animal pathogens. Copyright © 2019 Elsevier B.V. All rights reserved.

April 21, 2020

Function and Distribution of a Lantipeptide in Strawberry Fusarium Wilt Disease-Suppressive Soils.

Streptomyces griseus S4-7 is representative of strains responsible for the specific soil suppressiveness of Fusarium wilt of strawberry caused by Fusarium oxysporum f. sp. fragariae. Members of the genus Streptomyces secrete diverse secondary metabolites including lantipeptides, heat-stable lanthionine-containing compounds that can exhibit antibiotic activity. In this study, a class II lantipeptide provisionally named grisin, of previously unknown biological function, was shown to inhibit F. oxysporum. The inhibitory activity of grisin distinguishes it from other class II lantipeptides from Streptomyces spp. Results of quantitative reverse transcription-polymerase chain reaction with lanM-specific primers showed that the density of grisin-producing Streptomyces spp. in the rhizosphere of strawberry was positively correlated with the number of years of monoculture and a minimum of seven years was required for development of specific soil suppressiveness to Fusarium wilt disease. We suggest that lanM can be used as a diagnostic marker of whether a soil is conducive or suppressive to the disease.

April 21, 2020

Hybrid sequencing-based personal full-length transcriptomic analysis implicates proteostatic stress in metastatic ovarian cancer.

Comprehensive molecular characterization of myriad somatic alterations and aberrant gene expressions at personal level is key to precision cancer therapy, yet limited by current short-read sequencing technology, individualized catalog of complete genomic and transcriptomic features is thus far elusive. Here, we integrated second- and third-generation sequencing platforms to generate a multidimensional dataset on a patient affected by metastatic epithelial ovarian cancer. Whole-genome and hybrid transcriptome dissection captured global genetic and transcriptional variants at previously unparalleled resolution. Particularly, single-molecule mRNA sequencing identified a vast array of unannotated transcripts, novel long noncoding RNAs and gene chimeras, permitting accurate determination of transcription start, splice, polyadenylation and fusion sites. Phylogenetic and enrichment inference of isoform-level measurements implicated early functional divergence and cytosolic proteostatic stress in shaping ovarian tumorigenesis. A complementary imaging-based high-throughput drug screen was performed and subsequently validated, which consistently pinpointed proteasome inhibitors as an effective therapeutic regime by inducing protein aggregates in ovarian cancer cells. Therefore, our study suggests that clinical application of the emerging long-read full-length analysis for improving molecular diagnostics is feasible and informative. An in-depth understanding of the tumor transcriptome complexity allowed by leveraging the hybrid sequencing approach lays the basis to reveal novel and valid therapeutic vulnerabilities in advanced ovarian malignancies.

April 21, 2020

Penicillium purpurogenum Produces a Set of Endoxylanases: Identification, Heterologous Expression, and Characterization of a Fourth Xylanase, XynD, a Novel Enzyme Belonging to Glycoside Hydrolase Family 10.

The fungus Penicillium purpurogenum grows on a variety of natural carbon sources and secretes a large number of enzymes which degrade the polysaccharides present in lignocellulose. In this work, the gene coding for a novel endoxylanase has been identified in the genome of the fungus. This gene (xynd) possesses four introns. The cDNA has been expressed in Pichia pastoris and characterized. The enzyme, XynD, belongs to family 10 of the glycoside hydrolases. Mature XynD has a calculated molecular weight of 40,997. It consists of 387 amino acid residues with an N-terminal catalytic module, a linker rich in ser and thr residues, and a C-terminal family 1 carbohydrate-binding module. XynD shows the highest identity (97%) to a putative endoxylanase from Penicillium subrubescens but its highest identity to a biochemically characterized xylanase (XYND from Penicillium funiculosum) is only 68%. The enzyme has a temperature optimum of 60 °C, and it is highly stable in its pH optimum range of 6.5-8.5. XynD is the fourth biochemically characterized endoxylanase from P. purpurogenum, confirming the rich potential of this fungus for lignocellulose biodegradation. XynD, due to its wide pH optimum and stability, may be a useful enzyme in biotechnological procedures related to this biodegradation process.

April 21, 2020

PacBio full-length cDNA sequencing integrated with RNA-seq reads drastically improves the discovery of splicing transcripts in rice.

In eukaryotes, alternative splicing (AS) greatly expands the diversity of transcripts. However, it is challenging to accurately determine full-length splicing isoforms. Recently, more studies have taken advantage of Pacific Bioscience (PacBio) long-read sequencing to identify full-length transcripts. Nevertheless, the high error rate of PacBio reads seriously offsets the advantages of long reads, especially for accurately identifying splicing junctions. To best capitalize on the features of long reads, we used Illumina RNA-seq reads to improve PacBio circular consensus sequence (CCS) quality and to validate splicing patterns in the rice transcriptome. We evaluated the impact of CCS accuracy on the number and the validation rate of splicing isoforms, and integrated a comprehensive pipeline of splicing transcripts analysis by Iso-Seq and RNA-seq (STAIR) to identify the full-length multi-exon isoforms in rice seedling transcriptome (Oryza sativa L. ssp. japonica). STAIR discovered 11 733 full-length multi-exon isoforms, 6599 more than the SMRT Portal RS_IsoSeq pipeline did. Of these splicing isoforms identified, 4453 (37.9%) were missed in assembled transcripts from RNA-seq reads, and 5204 (44.4%), including 268 multi-exon long non-coding RNAs (lncRNAs), were not reported in the MSU_osa1r7 annotation. Some randomly selected unreported splicing junctions were verified by polymerase chain reaction (PCR) amplification. In addition, we investigated alternative polyadenylation (APA) events in transcripts and identified 829 major polyadenylation [poly(A)] site clusters (PACs). The analysis of splicing isoforms and APA events will facilitate the annotation of the rice genome and studies on the expression and polyadenylation of AS genes in different developmental stages or growth conditions of rice. © 2018 The Authors The Plant Journal © 2018 John Wiley & Sons Ltd.

April 21, 2020

A prophage and two ICESa2603-family integrative and conjugative elements (ICEs) carrying optrA in Streptococcus suis.

To investigate the presence and transfer of the oxazolidinone/phenicol resistance gene optrA and identify the genetic elements involved in the horizontal transfer of the optrA gene in Streptococcus suis.A total of 237 S. suis isolates were screened for the presence of the optrA gene by PCR. Whole-genome DNA of three optrA-positive strains was completely sequenced using the Illumina MiSeq and Pacbio RSII platforms. MICs were determined by broth microdilution. Transferability of the optrA gene in S. suis was investigated by conjugation. The presence of circular intermediates was examined by inverse PCR.The optrA gene was present in 11.8% (28/237) of the S. suis strains. In three strains, the optrA gene was flanked by two copies of IS1216 elements in the same orientation, located either on a prophage or on ICESa2603-family integrative and conjugative elements (ICEs), including one tandem ICE. In one isolate, the optrA-carrying ICE transferred with a frequency of 2.1?×?10-8. After the transfer, the transconjugant displayed elevated MICs of the respective antimicrobial agents. Inverse PCRs revealed that circular intermediates of different sizes were formed in the three optrA-carrying strains, containing one copy of the IS1216E element and the optrA gene alone or in combination with other resistance genes.A prophage and two ICESa2603-family ICEs (including one tandem ICE) associated with the optrA gene were identified in S. suis. The association of the optrA gene with the IS1216E elements and its location on either a prophage or ICEs will aid its horizontal transfer. © The Author(s) 2019. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Auto Tag: Long-read

Characterization and analysis of the transcriptome in Gymnocypris selincuoensis on the Qinghai-Tibetan Plateau using single-molecule long-read sequencing and RNA-seq.

rMETL: sensitive mobile element insertion detection with long read realignment.

Full-length transcriptome analysis of Litopenaeus vannamei reveals transcript variants involved in the innate immune system.

TranscriptClean: variant-aware correction of indels, mismatches and splice junctions in long-read transcripts.

Genome Sequence of Jaltomata Addresses Rapid Reproductive Trait Evolution and Enhances Comparative Genomics in the Hyper-Diverse Solanaceae.

Heterochromatin-enriched assemblies reveal the sequence and organization of the Drosophila melanogaster Y chromosome.

Two large reciprocal translocations characterized in the disease resistance-rich burmannica genetic group of Musa acuminata.

Genomic investigation of Staphylococcus aureus recovered from Gambian women and newborns following an oral dose of intra-partum azithromycin.

One Aeromonas salmonicida subsp. salmonicida isolate with a pAsa5 variant bearing antibiotic resistance and a pRAS3 variant making a link with a swine pathogen.

Function and Distribution of a Lantipeptide in Strawberry Fusarium Wilt Disease-Suppressive Soils.

Hybrid sequencing-based personal full-length transcriptomic analysis implicates proteostatic stress in metastatic ovarian cancer.

Penicillium purpurogenum Produces a Set of Endoxylanases: Identification, Heterologous Expression, and Characterization of a Fourth Xylanase, XynD, a Novel Enzyme Belonging to Glycoside Hydrolase Family 10.

PacBio full-length cDNA sequencing integrated with RNA-seq reads drastically improves the discovery of splicing transcripts in rice.

A prophage and two ICESa2603-family integrative and conjugative elements (ICEs) carrying optrA in Streptococcus suis.

Subscribe for blog updates:

Filter by topic

Talk with an expert

Antimicrobial resistance research

Subscribe for blog updates:

Filter by topic

Talk with an expert