Menu
April 21, 2020

A critical comparison of technologies for a plant genome sequencing project.

A high-quality genome sequence of any model organism is an essential starting point for genetic and other studies. Older clone-based methods are slow and expensive, whereas faster, cheaper short-read-only assemblies can be incomplete and highly fragmented, which minimizes their usefulness. The last few years have seen the introduction of many new technologies for genome assembly. These new technologies and associated new algorithms are typically benchmarked on microbial genomes or, if they scale appropriately, on larger (e.g., human) genomes. However, plant genomes can be much more repetitive and larger than the human genome, and plant biochemistry often makes obtaining high-quality DNA that is free from contaminants difficult. Reflecting their challenging nature, we observe that plant genome assembly statistics are typically poorer than for vertebrates.Here, we compare Illumina short read, Pacific Biosciences long read, 10x Genomics linked reads, Dovetail Hi-C, and BioNano Genomics optical maps, singly and combined, in producing high-quality long-range genome assemblies of the potato species Solanum verrucosum. We benchmark the assemblies for completeness and accuracy, as well as DNA compute requirements and sequencing costs.The field of genome sequencing and assembly is reaching maturity, and the differences we observe between assemblies are surprisingly small. We expect that our results will be helpful to other genome projects, and that these datasets will be used in benchmarking by assembly algorithm developers. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

Genetic Diversity of Salmonella Derby from the Poultry Sector in Europe.

Salmonella Derby (S. Derby) is emerging in Europe as a predominant serovar in fattening turkey flocks. This serovar was recorded as being predominant in the turkey sector in 2014 in the United Kingdom (UK). Only two years later, in 2016, it was also recorded in the turkey and broiler sectors in Ireland and Spain. These S. Derby isolates were characterised as members of the multilocus sequence type (MLST) profile 71 (ST71). For the first time, we characterise by whole genome sequencing (WGS) analysis a panel of 90 S. Derby ST71 genomes to understand the routes of transmission of this emerging pathogen within the poultry/turkey food trade. Selected panel included strains isolated as early as 2010 in five leading European g countries for turkey meat production. Twenty-one of the 90 genomes were extracted from a public database-Enterobase. Five of these originated from the United States (n=3), China (n=1) and Taiwan (n=1) isolated between 1986 and 2016. A phylogenomic analysis at the core-genome level revealed the presence of three groups. The largest group contained 97.5% of the European strains and included both, turkey and human isolates that were genetically related by an average of 35 ± 15 single nucleotide polymorphism substitutions (SNPs). To illustrate the diversity, the presence of antimicrobial resistance genes and phages were characteised in 30, S. Derby ST71 genomes, including 11 belonging to this study This study revealed an emergent turkey-related S. Derby ST71 clone circulating in at least five European countries (the UK, Germany, Poland, Italy, and France) since 2010 that causes human gastroenteritis. A matter of concern is the identification of a gyrA mutation involved in resistance to quinolone, present in the Italian genomes. Interestingly, the diversity of phages seems to be related to the geographic origins. These results constitute a baseline for following the spread of this emerging pathogen and identifying appropriate monitoring and prevention measures.


April 21, 2020

Comparative Transcriptomic Profiling of Yersinia enterocolitica O:3 and O:8 Reveals Major Expression Differences of Fitness- and Virulence-Relevant Genes Indicating Ecological Separation.

Yersinia enterocolitica is a zoonotic pathogen and an important cause of bacterial gastrointestinal infections in humans. Large-scale population genomic analyses revealed genetic and phenotypic diversity of this bacterial species, but little is known about the differences in the transcriptome organization, small RNA (sRNA) repertoire, and transcriptional output. Here, we present the first comparative high-resolution transcriptome analysis of Y. enterocolitica strains representing highly pathogenic phylogroup 2 (serotype O:8) and moderately pathogenic phylogroup 3 (serotype O:3) grown under four infection-relevant conditions. Our transcriptome sequencing (RNA-seq) approach revealed 1,299 and 1,076 transcriptional start sites and identified strain-specific sRNAs that could contribute to differential regulation among the phylogroups. Comparative transcriptomics further uncovered major gene expression differences, in particular, in the temperature-responsive regulon. Multiple virulence-relevant genes are differentially regulated between the two strains, supporting an ecological separation of phylogroups with certain niche-adapted properties. Strong upregulation of the ystA enterotoxin gene in combination with constitutive high expression of cell invasion factor InvA further showed that the toxicity of recent outbreak O:3 strains has increased. Overall, our report provides new insights into the specific transcriptome organization of phylogroups 2 and 3 and reveals gene expression differences contributing to the substantial phenotypic differences that exist between the lineages. IMPORTANCE Yersinia enterocolitica is a major diarrheal pathogen and is associated with a large range of gut-associated diseases. Members of this species have evolved into different phylogroups with genotypic variations. We performed the first characterization of the Y. enterocolitica transcriptional landscape and tracked the consequences of the genomic variations between two different pathogenic phylogroups by comparing their RNA repertoire, promoter usage, and expression profiles under four different virulence-relevant conditions. Our analysis revealed major differences in the transcriptional outputs of the closely related strains, pointing to an ecological separation in which one is more adapted to an environmental lifestyle and the other to a mostly mammal-associated lifestyle. Moreover, a variety of pathoadaptive alterations, including alterations in acid resistance genes, colonization factors, and toxins, were identified which affect virulence and host specificity. This illustrates that comparative transcriptomics is an excellent approach to discover differences in the functional output from closely related genomes affecting niche adaptation and virulence, which cannot be directly inferred from DNA sequences.


April 21, 2020

Genome Sequencing of Cladobotryum protrusum Provides Insights into the Evolution and Pathogenic Mechanisms of the Cobweb Disease Pathogen on Cultivated Mushroom.

Cladobotryum protrusum is one of the mycoparasites that cause cobweb disease on cultivated edible mushrooms. However, the molecular mechanisms of evolution and pathogenesis of C. protrusum on mushrooms are largely unknown. Here, we report a high-quality genome sequence of C. protrusum using the single-molecule, real-time sequencing platform of PacBio and perform a comparative analysis with closely related fungi in the family Hypocreaceae. The C. protrusum genome, the first complete genome to be sequenced in the genus Cladobotryum, is 39.09 Mb long, with an N50 of 4.97 Mb, encoding 11,003 proteins. The phylogenomic analysis confirmed its inclusion in Hypocreaceae, with its evolutionary divergence time estimated to be ~170.1 million years ago. The genome encodes a large and diverse set of genes involved in secreted peptidases, carbohydrate-active enzymes, cytochrome P450 enzymes, pathogen?host interactions, mycotoxins, and pigments. Moreover, C. protrusum harbors arrays of genes with the potential to produce bioactive secondary metabolites and stress response-related proteins that are significant for adaptation to hostile environments. Knowledge of the genome will foster a better understanding of the biology of C. protrusum and mycoparasitism in general, as well as help with the development of effective disease control strategies to minimize economic losses from cobweb disease in cultivated edible mushrooms.


April 21, 2020

The Modern View of B Chromosomes Under the Impact of High Scale Omics Analyses.

Supernumerary B chromosomes (Bs) are extra karyotype units in addition to A chromosomes, and are found in some fungi and thousands of animals and plant species. Bs are uniquely characterized due to their non-Mendelian inheritance, and represent one of the best examples of genomic conflict. Over the last decades, their genetic composition, function and evolution have remained an unresolved query, although a few successful attempts have been made to address these phenomena. A classical concept based on cytogenetics and genetics is that Bs are selfish and abundant with DNA repeats and transposons, and in most cases, they do not carry any function. However, recently, the modern quantum development of high scale multi-omics techniques has shifted B research towards a new-born field that we call “B-omics”. We review the recent literature and add novel perspectives to the B research, discussing the role of new technologies to understand the mechanistic perspectives of the molecular evolution and function of Bs. The modern view states that B chromosomes are enriched with genes for many significant biological functions, including but not limited to the interesting set of genes related to cell cycle and chromosome structure. Furthermore, the presence of B chromosomes could favor genomic rearrangements and influence the nuclear environment affecting the function of other chromatin regions. We hypothesize that B chromosomes might play a key function in driving their transmission and maintenance inside the cell, as well as offer an extra genomic compartment for evolution.


April 21, 2020

Clinical and laboratory-induced colistin-resistance mechanisms in Acinetobacter baumannii.

The increasing incidence and emergence of multi-drug resistant (MDR) Acinetobacter baumannii has become a major global health concern. Colistin is a historic antimicrobial that has become commonly used as a treatment for MDR A. baumannii infections. The increase in colistin usage has been mirrored by an increase in colistin resistance. We aimed to identify the mechanisms associated with colistin resistance in A. baumannii using multiple high-throughput-sequencing technologies, including transposon-directed insertion site sequencing (TraDIS), RNA sequencing (RNAseq) and whole-genome sequencing (WGS) to investigate the genotypic changes of colistin resistance in A. baumannii. Using TraDIS, we found that genes involved in drug efflux (adeIJK), and phospholipid (mlaC, mlaF and mlaD) and lipooligosaccharide synthesis (lpxC and lpsO) were required for survival in sub-inhibitory concentrations of colistin. Transcriptomic (RNAseq) analysis revealed that expression of genes encoding efflux proteins (adeI, adeC, emrB, mexB and macAB) was enhanced in in vitro generated colistin-resistant strains. WGS of these organisms identified disruptions in genes involved in lipid A (lpxC) and phospholipid synthesis (mlaA), and in the baeS/R two-component system (TCS). We additionally found that mutations in the pmrB TCS genes were the primary colistin-resistance-associated mechanisms in three Vietnamese clinical colistin-resistant A. baumannii strains. Our results outline the entire range of mechanisms employed in A. baumannii for resistance against colistin, including drug extrusion and the loss of lipid A moieties by gene disruption or modification.


April 21, 2020

Complete Genome Sequence Analysis and Characterization of Selected Iron Regulation Genes of Pasteurella Multocida Serotype A Strain PMTB2.1.

Although more than 100 genome sequences of Pasteurella multocida are available, comprehensive and complete genome sequence analysis is limited. This study describes the analysis of complete genome sequence and pathogenomics of P. multocida strain PMTB2.1. The genome of PMTB2.1 has 2176 genes with more than 40 coding sequences associated with iron regulation and 140 virulence genes including the complete tad locus. The tad locus includes several previously uncharacterized genes such as flp2, rcpC and tadV genes. A transposable phage resembling to Mu phages was identified in P. multocida that has not been identified in any other serotype yet. The multi-locus sequence typing analysis assigned the PMTB2.1 genome sequence as type ST101, while the comparative genome analysis showed that PMTB2.1 is closely related to other P. multocida strains with the genomic distance of less than 0.13. The expression profiling of iron regulating-genes of PMTB2.1 was characterized under iron-limited environment. Results showed significant changes in the expression profiles of iron-regulating genes (p < 0.05) whereas the highest expression of fecE gene (281 fold) at 30 min suggests utilization of the outer-membrane proteins system in iron acquisition at an early stage of growth. This study showed the phylogenomic relatedness of P. multocida and improved annotation of important genes and functional characterization of iron-regulating genes of importance to the bacterial growth.


April 21, 2020

DNA Methylation Patterns in the Social Spider, Stegodyphus dumicola.

Variation in DNA methylation patterns among genes, individuals, and populations appears to be highly variable among taxa, but our understanding of the functional significance of this variation is still incomplete. We here present the first whole genome bisulfite sequencing of a chelicerate species, the social spider Stegodyphus dumicola. We show that DNA methylation occurs mainly in CpG context and is concentrated in genes. This is a pattern also documented in other invertebrates. We present RNA sequence data to investigate the role of DNA methylation in gene regulation and show that, within individuals, methylated genes are more expressed than genes that are not methylated and that methylated genes are more stably expressed across individuals than unmethylated genes. Although no causal association is shown, this lends support for the implication of DNA CpG methylation in regulating gene expression in invertebrates. Differential DNA methylation between populations showed a small but significant correlation with differential gene expression. This is consistent with a possible role of DNA methylation in local adaptation. Based on indirect inference of the presence and pattern of DNA methylation in chelicerate species whose genomes have been sequenced, we performed a comparative phylogenetic analysis. We found strong evidence for exon DNA methylation in the horseshoe crab Limulus polyphemus and in all spider and scorpion species, while most Parasitiformes and Acariformes species seem to have lost DNA methylation.


April 21, 2020

Genome sequence of Malania oleifera, a tree with great value for nervonic acid production.

Malania oleifera, a member of the Olacaceae family, is an IUCN red listed tree, endemic and restricted to the Karst region of southwest China. This tree’s seed is valued for its high content of precious fatty acids (especially nervonic acid). However, studies on its genetic makeup and fatty acid biogenesis are severely hampered by a lack of molecular and genetic tools.We generated 51 Gb and 135 Gb of raw DNA sequences, using Pacific Biosciences (PacBio) single-molecule real-time and 10× Genomics sequencing, respectively. A final genome assembly, with a scaffold N50 size of 4.65 Mb and a total length of 1.51 Gb, was obtained by primary assembly based on PacBio long reads plus scaffolding with 10× Genomics reads. Identified repeats constituted ~82% of the genome, and 24,064 protein-coding genes were predicted with high support. The genome has low heterozygosity and shows no evidence for recent whole genome duplication. Metabolic pathway genes relating to the accumulation of long-chain fatty acid were identified and studied in detail.Here, we provide the first genome assembly and gene annotation for M. oleifera. The availability of these resources will be of great importance for conservation biology and for the functional genomics of nervonic acid biosynthesis. © The Author(s) 2019. Published by Oxford University Press.


April 21, 2020

Divergent evolution in the genomes of closely related lacertids, Lacerta viridis and L. bilineata, and implications for speciation.

Lacerta viridis and Lacerta bilineata are sister species of European green lizards (eastern and western clades, respectively) that, until recently, were grouped together as the L. viridis complex. Genetic incompatibilities were observed between lacertid populations through crossing experiments, which led to the delineation of two separate species within the L. viridis complex. The population history of these sister species and processes driving divergence are unknown. We constructed the first high-quality de novo genome assemblies for both L. viridis and L. bilineata through Illumina and PacBio sequencing, with annotation support provided from transcriptome sequencing of several tissues. To estimate gene flow between the two species and identify factors involved in reproductive isolation, we studied their evolutionary history, identified genomic rearrangements, detected signatures of selection on non-coding RNA, and on protein-coding genes.Here we show that gene flow was primarily unidirectional from L. bilineata to L. viridis after their split at least 1.15 million years ago. We detected positive selection of the non-coding repertoire; mutations in transcription factors; accumulation of divergence through inversions; selection on genes involved in neural development, reproduction, and behavior, as well as in ultraviolet-response, possibly driven by sexual selection, whose contribution to reproductive isolation between these lacertid species needs to be further evaluated.The combination of short and long sequence reads resulted in one of the most complete lizard genome assemblies. The characterization of a diverse array of genomic features provided valuable insights into the demographic history of divergence among European green lizards, as well as key species differences, some of which are candidates that could have played a role in speciation. In addition, our study generated valuable genomic resources that can be used to address conservation-related issues in lacertids. © The Author(s) 2018. Published by Oxford University Press.


April 21, 2020

Genome assembly and annotation of the Trichoplusia ni Tni-FNL insect cell line enabled by long-read technologies.

Trichoplusiani derived cell lines are commonly used to enable recombinant protein expression via baculovirus infection to generate materials approved for clinical use and in clinical trials. In order to develop systems biology and genome engineering tools to improve protein expression in this host, we performed de novo genome assembly of the Trichoplusiani-derived cell line Tni-FNL.By integration of PacBio single-molecule sequencing, Bionano optical mapping, and 10X Genomics linked-reads data, we have produced a draft genome assembly of Tni-FNL.Our assembly contains 280 scaffolds, with a N50 scaffold size of 2.3 Mb and a total length of 359 Mb. Annotation of the Tni-FNL genome resulted in 14,101 predicted genes and 93.2% of the predicted proteome contained recognizable protein domains. Ortholog searches within the superorder Holometabola provided further evidence of high accuracy and completeness of the Tni-FNL genome assembly.This first draft Tni-FNL genome assembly was enabled by complementary long-read technologies and represents a high-quality, well-annotated genome that provides novel insight into the complexity of this insect cell line and can serve as a reference for future large-scale genome engineering work in this and other similar recombinant protein production hosts.


April 21, 2020

Diverse Vectors and Mechanisms Spread New Delhi Metallo-ß-Lactamases among Carbapenem-Resistant Enterobacteriaceae in the Greater Boston Area.

New Delhi metallo-beta-lactamases (NDMs) are an uncommon but emerging cause of carbapenem resistance in the United States. Genomic factors promoting their domestic spread remain poorly characterized. A prospective genomic surveillance program among Boston-area hospitals identified multiple new occurrences of NDM-carrying strains of Escherichia coli and Enterobacter cloacae complex in inpatient and outpatient settings, representing the first occurrences of NDM-mediated resistance since initiating genomic surveillance in 2011. Cases included domestic patients with no international exposures. PacBio sequencing of isolates identified strain characteristics, resistance genes, and the complement of mobile vectors mediating spread. Analyses revealed a common 3,114-bp region containing the blaNDM gene, with carriage of this conserved region among unique strains by diverse transposon and plasmid backbones. Functional studies revealed a broad capacity for blaNDM transmission by conjugation, transposition, and complex interplasmid recombination events. NDMs represent a rapidly spreading form of drug resistance that can occur in inpatient and outpatient settings and in patients without international exposures. In contrast to Tn4401-based spread of Klebsiella pneumoniae carbapenemases (KPCs), diverse transposable elements mobilize NDM enzymes, commonly with other resistance genes, enabling naive strains to acquire multi- and extensively drug-resistant profiles with single transposition or plasmid conjugation events. Genomic surveillance provides effective means to rapidly identify these gene-level drivers of resistance and mobilization in order to inform clinical decisions to prevent further spread.Copyright © 2019 American Society for Microbiology.


April 21, 2020

A Genome-Wide Epstein-Barr Virus Polyadenylation Map and Its Antisense RNA to EBNA.

Epstein-Barr virus (EBV) is a ubiquitous human pathogen associated with Burkitt’s lymphoma and nasopharyngeal carcinoma. Although the EBV genome harbors more than a hundred genes, a full transcription map with EBV polyadenylation profiles remains unknown. To elucidate the 3′ ends of all EBV transcripts genome-wide, we performed the first comprehensive analysis of viral polyadenylation sites (pA sites) using our previously reported polyadenylation sequencing (PA-seq) technology. We identified that EBV utilizes a total of 62?pA sites in JSC-1, 60 in Raji, and 53 in Akata cells for the expression of EBV genes from both plus and minus DNA strands; 42 of these pA sites are commonly used in all three cell lines. The majority of identified pA sites were mapped to the intergenic regions downstream of previously annotated EBV open reading frames (ORFs) and viral promoters. pA sites lacking an association with any known EBV genes were also identified, mostly for the minus DNA strand within the EBNA locus, a major locus responsible for maintenance of viral latency and cell transformation. The expression of these novel antisense transcripts to EBNA were verified by 3′ rapid amplification of cDNA ends (RACE) and Northern blot analyses in several EBV-positive (EBV+) cell lines. In contrast to EBNA RNA expressed during latency, expression of EBNA-antisense transcripts, which is restricted in latent cells, can be significantly induced by viral lytic infection, suggesting potential regulation of viral gene expression by EBNA-antisense transcription during lytic EBV infection. Our data provide the first evidence that EBV has an unrecognized mechanism that regulates EBV reactivation from latency.IMPORTANCE Epstein-Barr virus represents an important human pathogen with an etiological role in the development of several cancers. By elucidation of a genome-wide polyadenylation landscape of EBV in JSC-1, Raji, and Akata cells, we have redefined the EBV transcriptome and mapped individual polymerase II (Pol II) transcripts of viral genes to each one of the mapped pA sites at single-nucleotide resolution as well as the depth of expression. By unveiling a new class of viral lytic RNA transcripts antisense to latent EBNAs, we provide a novel mechanism of how EBV might control the expression of viral latent genes and lytic infection. Thus, this report takes another step closer to understanding EBV gene structure and expression and paves a new path for antiviral approaches.This is a work of the U.S. Government and is not subject to copyright protection in the United States. Foreign copyrights may apply.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.