Menu
July 19, 2019

Insight into the recent genome duplication of the halophilic yeast Hortaea werneckii: combining an improved genome with gene expression and chromatin structure.

Extremophilic organisms demonstrate the flexibility and adaptability of basic biological processes by highlighting how cell physiology adapts to environmental extremes. Few eukaryotic extremophiles have been well studied and only a small number are amenable to laboratory cultivation and manipulation. A detailed characterization of the genome architecture of such organisms is important to illuminate how they adapt to environmental stresses. One excellent example of a fungal extremophile is the halophile Hortaea werneckii (Pezizomycotina, Dothideomycetes, Capnodiales), a yeast-like fungus able to thrive at near-saturating concentrations of sodium chloride and which is also tolerant to both UV irradiation and desiccation. Given its unique lifestyle and its remarkably recent whole genome duplication, H. werneckii provides opportunities for testing the role of genome duplications and adaptability to extreme environments. We previously assembled the genome of H. werneckii using short-read sequencing technology and found a remarkable degree of gene duplication. Technology limitations, however, precluded high-confidence annotation of the entire genome. We therefore revisited the H. wernickii genome using long-read, single-molecule sequencing and provide an improved genome assembly which, combined with transcriptome and nucleosome analysis, provides a useful resource for fungal halophile genomics. Remarkably, the ~50 Mb H. wernickii genome contains 15,974 genes of which 95% (7608) are duplicates formed by a recent whole genome duplication (WGD), with an average of 5% protein sequence divergence between them. We found that the WGD is extraordinarily recent, and compared to Saccharomyces cerevisiae, the majority of the genome’s ohnologs have not diverged at the level of gene expression of chromatin structure. Copyright © 2017 Sinha et al.


July 19, 2019

The draft genome of Globodera ellingtonae.

Globodera ellingtonae is a newly described potato cyst nematode (PCN) found in Idaho, Oregon, and Argentina. Here, we present a genome assembly for G. ellingtonae, a relative of the quarantine nematodes G. pallida and G. rostochiensis, produced using data from Illumina and Pacific Biosciences DNA sequencing technologies.


July 19, 2019

Genome sequencing of strain Cellulosimicrobium sp. TH-20 with ginseng biotransformation ability.

Biotransformation for increasing the pharmaceutical effect of ginsenosides is getting more and more attractions. Strain Cellulosimicrobium sp. TH-20 isolated from ginseng soil samples was identified to produce enzymes contributing to its excellent biotransformation activity against ginsenosides, the main active components of ginseng. Based on phylogenetic tree and homology analysis, the strain can be designated as Cellulosimicrobium sp. Genome sequencing was performed using the Illumina Miseq to explore the functional genes involved in ginsenoside transformation. The draft genome of Cellulosimicrobium sp. TH-20 encoded 3450 open reading frames, 51 tRNA, and 9 rRNA. All ORFs were annotated using NCBI BLAST with non-redundant proteins, Gene Ontology, Cluster of Orthologous Gene, and Kyoto Encyclopedia of Genes and Genomes databases. A total of 11 genes were selected based on the functional annotation analysis. These genes are relevant to ginsenoside biotransformation, including 6 for beta-glucosidase, 1 for alpha-N-arabinofuranosidase, 1 for alpha-1,6-glucosidase, 1 for endo-1,4-beta-xylanase, 1 for alpha-L-arabinofuranosidase, and 1 for beta-galactosidase. These glycosidases were predicted to catalyze the hydrolysis of sugar moieties attached to the aglycon of ginsenosides and led to the transformation of PPD-type and PPT-type ginsenosides.


July 19, 2019

Omics approaches to study gene regulatory networks for development in echinoderms.

Gene regulatory networks (GRNs) describe the interactions for a developmental process at a given time and space. Historically, perturbation experiments represent one of the key methods for analyzing and reconstructing a GRN, and the GRN governing early development in the sea urchin embryo stands as one of the more deeply dissected so far. As technology progresses, so do the methods used to address different biological questions. Next-generation sequencing (NGS) has become a standard experimental technique for genome and transcriptome sequencing and studies of protein-DNA interactions and DNA accessibility. While several efforts have been made toward the integration of different omics approaches for the study of the regulatory genome in many animals, in a few cases, these are applied with the purpose of reconstructing and experimentally testing developmental GRNs. Here, we review emerging approaches integrating multiple NGS technologies for the prediction and validation of gene interactions within echinoderm GRNs. These approaches can be applied to both ‘model’ and ‘non-model’ organisms. Although a number of issues still need to be addressed, advances in NGS applications, such as assay for transposase-accessible chromatin sequencing, combined with the availability of embryos belonging to different species, all separated by various evolutionary distances and accessible to experimental regulatory biology, place echinoderms in an unprecedented position for the reconstruction and evolutionary comparison of developmental GRNs. We conclude that sequencing technologies and integrated omics approaches allow the examination of GRNs on a genome-wide scale only if biological perturbation and cis-regulatory analyses are experimentally accessible, as in the case of echinoderm embryos.© The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please email: journals.permissions@oup.com.


July 19, 2019

Characterisation of MHC class I genes in the koala.

Koala (Phascolarctos cinereus) populations are on the decline across the majority of Australia’s mainland. Two major diseases threatening the long-term survival of affected koala populations are caused by obligate intracellular pathogens: Chlamydia and koala retrovirus (KoRV). To improve our understanding of the koala immune system, we characterised their major histocompatibility complex (MHC) class I genes, which are centrally involved in presenting foreign peptides derived from intracellular pathogens to cytotoxic T cells. A total of 11 class I genes were identified in the koala genome. Three genes, Phci-UA, UB and UC, showed relatively high genetic variability and were expressed in all 12 examined tissues, whereas the other eight genes had tissue-specific expression and limited polymorphism. Evidence of diversifying selection was detected in Phci-UA and UC, while gene conversion may have played a role in creating new alleles at Phci-UB. We propose that Phci-UA, UB and UC are likely classical MHC genes of koalas, and further research is needed to understand their role in koala chlamydial and KoRV infections.


July 19, 2019

Monitoring microevolution of OXA-48-producing Klebsiella pneumoniae ST147 in a hospital setting by SMRT sequencing.

Carbapenemase-producing Klebsiella pneumoniae pose an increasing risk for healthcare facilities worldwide. A continuous monitoring of ST distribution and its association with resistance and virulence genes is required for early detection of successful K. pneumoniae lineages. In this study, we used WGS to characterize MDR blaOXA-48-positive K. pneumoniae isolated from inpatients at the University Medical Center Göttingen, Germany, between March 2013 and August 2014.Closed genomes for 16 isolates of carbapenemase-producing K. pneumoniae were generated by single molecule real-time technology using the PacBio RSII platform.Eight of the 16 isolates showed identical XbaI macrorestriction patterns and shared the same MLST, ST147. The eight ST147 isolates differed by only 1-25 SNPs of their core genome, indicating a clonal origin. Most of the eight ST147 isolates carried four plasmids with sizes of 246.8, 96.1, 63.6 and 61.0?kb and a novel linear plasmid prophage, named pKO2, of 54.6?kb. The blaOXA-48 gene was located on a 63.6?kb IncL plasmid and is part of composite transposon Tn1999.2. The ST147 isolates expressed the yersinabactin system as a major virulence factor. The comparative whole-genome analysis revealed several rearrangements of mobile genetic elements and losses of chromosomal and plasmidic regions in the ST147 isolates.Single molecule real-time sequencing allowed monitoring of the genetic and epigenetic microevolution of MDR OXA-48-producing K. pneumoniae and revealed in addition to SNPs, complex rearrangements of genetic elements.© The Author 2017. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please email: journals.permissions@oup.com.


July 19, 2019

The distribution of miniature impala elements and SIX genes in the Fusarium genus is suggestive of horizontal gene transfer.

The mimp family of miniature inverted-repeat transposable elements was previously found only in genomes of Fusarium oxysporum and is contextually associated with virulence genes in this species. Through extensive comparative analysis of 83 F. oxysporum and 52 other Fusarium genomes, we uncovered the distribution of different mimp families throughout the genus. We show that (i) mimps are not exclusive to F. oxysporum; (ii) pathogenic isolates generally possess more mimps than non-pathogenic strains and (iii) two isolates of F. hostae and one F. proliferatum isolate display evidence for horizontal transfer of genetic material to or from F. oxysporum. Multiple instances of mimp elements identical to F. oxysporum mimps were encountered in the genomes of these isolates. Moreover, homologs of effector genes (SIX1, 2, 6, 7, 11 and FomAVR2) were discovered here, several with very high (97-100%) pairwise nucleotide sequence identity scores. These three strains were isolated from infected flower bulbs (Hyacinthus and Lilium spp.). Their ancestors may thus have lived in close proximity to pathogenic strains of F. oxysporum f. sp. hyacinthi and f. sp. lilii. The Fo f. sp. lycopersici SIX2 effector gene was found to be widely distributed (15/18 isolates) throughout the F. fujikuroi species complex, exhibiting a predominantly vertical inheritance pattern. These findings shed light on the potential evolutionary mechanism underlying plant-pathogenicity in Fusarium and show that interspecies horizontal gene transfer may have occurred.


July 19, 2019

A novel approach using long-read sequencing and ddPCR to investigate gonadal mosaicism and estimate recurrence risk in two families with developmental disorders.

De novo mutations contribute significantly to severe early-onset genetic disorders. Even if the mutation is apparently de novo, there is a recurrence risk due to parental germ line mosaicism, depending on in which gonadal generation the mutation occurred.We demonstrate the power of using SMRT sequencing and ddPCR to determine parental origin and allele frequencies of de novo mutations in germ cells in two families whom had undergone assisted reproduction.In the first family, a TCOF1 variant c.3156C>T was identified in the proband with Treacher Collins syndrome. The variant affects splicing and was determined to be of paternal origin. It was present in <1% of the paternal germ cells, suggesting a very low recurrence risk. In the second family, the couple had undergone several unsuccessful pregnancies where a de novo mutation PTPN11 c.923A>C causing Noonan syndrome was identified. The variant was present in 40% of the paternal germ cells suggesting a high recurrence risk.Our findings highlight a successful strategy to identify the parental origin of mutations and to investigate the recurrence risk in couples that have undergone assisted reproduction with an unknown donor or in couples with gonadal mosaicism that will undergo preimplantation genetic diagnosis.© 2017 The Authors Prenatal Diagnosis published by John Wiley & Sons Ltd.


July 19, 2019

Exonization of an intronic LINE-1 element causing Becker muscular dystrophy as a novel mutational mechanism in dystrophin gene.

A broad mutational spectrum in the dystrophin (DMD) gene, from large deletions/duplications to point mutations, causes Duchenne/Becker muscular dystrophy (D/BMD). Comprehensive genotyping is particularly relevant considering the mutation-centered therapies for dystrophinopathies. We report the genetic characterization of a patient with disease onset at age 13 years, elevated creatine kinase levels and reduced dystrophin labeling, where multiplex-ligation probe amplification (MLPA) and genomic sequencing failed to detect pathogenic variants. Bioinformatic, transcriptomic (real time PCR, RT-PCR), and genomic approaches (Southern blot, long-range PCR, and single molecule real-time sequencing) were used to characterize the mutation. An aberrant transcript was identified, containing a 103-nucleotide insertion between exons 51 and 52, with no similarity with the DMD gene. This corresponded to the partial exonization of a long interspersed nuclear element (LINE-1), disrupting the open reading frame. Further characterization identified a complete LINE-1 (~6 kb with typical hallmarks) deeply inserted in intron 51. Haplotyping and segregation analysis demonstrated that the mutation had a de novo origin. Besides underscoring the importance of mRNA studies in genetically unsolved cases, this is the first report of a disease-causing fully intronic LINE-1 element in DMD, adding to the diversity of mutational events that give rise to D/BMD.


July 19, 2019

Functional Analysis of the Glucan Degradation Locus in Caldicellulosiruptor bescii Reveals Essential Roles of Component Glycoside Hydrolases in Plant Biomass Deconstruction.

The ability to hydrolyze microcrystalline cellulose is an uncommon feature in the microbial world, but it can be exploited for conversion of lignocellulosic feedstocks into biobased fuels and chemicals. Understanding the physiological and biochemical mechanisms by which microorganisms deconstruct cellulosic material is key to achieving this objective. The glucan degradation locus (GDL) in the genomes of extremely thermophilic Caldicellulosiruptor species encodes polysaccharide lyases (PLs), unique cellulose binding proteins (tapirins), and putative posttranslational modifying enzymes, in addition to multidomain, multifunctional glycoside hydrolases (GHs), thereby representing an alternative paradigm for plant biomass degradation compared to fungal or cellulosomal systems. To examine the individual and collective in vivo roles of the glycolytic enzymes, the six GH genes in the GDL of Caldicellulosiruptor bescii were systematically deleted, and the extents to which the resulting mutant strains could solubilize microcrystalline cellulose (Avicel) and plant biomass (switchgrass or poplar) were examined. Three of the GDL enzymes, Athe_1867 (CelA) (GH9-CBM3-CBM3-CBM3-GH48), Athe_1859 (GH5-CBM3-CBM3-GH44), and Athe_1857 (GH10-CBM3-CBM3-GH48), acted synergistically in vivo and accounted for 92% of naked microcrystalline cellulose (Avicel) degradation. However, the relative importance of the GDL GHs varied for the plant biomass substrates tested. Furthermore, mixed cultures of mutant strains showed that switchgrass solubilization depended on the secretome-bound enzymes collectively produced by the culture, not on the specific strain from which they came. These results demonstrate that certain GDL GHs are primarily responsible for the degradation of microcrystalline cellulose-containing substrates by C. bescii and provide new insights into the workings of a novel microbial mechanism for lignocellulose utilization.IMPORTANCE The efficient and extensive degradation of complex polysaccharides in lignocellulosic biomass, particularly microcrystalline cellulose, remains a major barrier to its use as a renewable feedstock for the production of fuels and chemicals. Extremely thermophilic bacteria from the genus Caldicellulosiruptor rapidly degrade plant biomass to fermentable sugars at temperatures of 70 to 78°C, although the specific mechanism by which this occurs is not clear. Previous comparative genomic studies identified a genomic locus found only in certain Caldicellulosiruptor species that was hypothesized to be mainly responsible for microcrystalline cellulose degradation. By systematically deleting genes in this locus in Caldicellulosiruptor bescii, the nuanced, substrate-specific in vivo roles of glycolytic enzymes in deconstructing crystalline cellulose and plant biomasses could be discerned. The results here point to synergism of three multidomain cellulases in C. bescii, working in conjunction with the aggregate secreted enzyme inventory, as the key to the plant biomass degradation ability of this extreme thermophile. Copyright © 2017 American Society for Microbiology.


July 19, 2019

Amplification-free, CRISPR-Cas9 targeted enrichment and SMRT Sequencing of repeat-expansion disease causative genomic regions

Targeted sequencing has proven to be an economical means of obtaining sequence information for one or more defined regions of a larger genome. However, most target enrichment methods require amplification. Some genomic regions, such as those with extreme GC content and repetitive sequences, are recalcitrant to faithful amplification. Yet, many human genetic disorders are caused by repeat expansions, including difficult to sequence tandem repeats. We have developed a novel, amplification-free enrichment technique that employs the CRISPR-Cas9 system for specific targeting multiple genomic loci. This method, in conjunction with long reads generated through Single Molecule, Real-Time (SMRT) sequencing and unbiased coverage, enables enrichment and sequencing of complex genomic regions that cannot be investigated with other technologies. Using human genomic DNA samples, we demonstrate successful targeting of causative loci for Huntingtontextquoterights disease (HTT; CAG repeat), Fragile X syndrome (FMR1; CGG repeat), amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (C9orf72; GGGGCC repeat), and spinocerebellar ataxia type 10 (SCA10) (ATXN10; variable ATTCT repeat). The method, amenable to multiplexing across multiple genomic loci, uses an amplification-free approach that facilitates the isolation of hundreds of individual on-target molecules in a single SMRT Cell and accurate sequencing through long repeat stretches, regardless of extreme GC percent or sequence complexity content. Our novel targeted sequencing method opens new doors to genomic analyses independent of PCR amplification that will facilitate the study of repeat expansion disorders.


July 19, 2019

The draft genome of tropical fruit durian (Durio zibethinus).

Durian (Durio zibethinus) is a Southeast Asian tropical plant known for its hefty, spine-covered fruit and sulfury and onion-like odor. Here we present a draft genome assembly of D. zibethinus, representing the third plant genus in the Malvales order and first in the Helicteroideae subfamily to be sequenced. Single-molecule sequencing and chromosome contact maps enabled assembly of the highly heterozygous durian genome at chromosome-scale resolution. Transcriptomic analysis showed upregulation of sulfur-, ethylene-, and lipid-related pathways in durian fruits. We observed paleopolyploidization events shared by durian and cotton and durian-specific gene expansions in MGL (methionine ?-lyase), associated with production of volatile sulfur compounds (VSCs). MGL and the ethylene-related gene ACS (aminocyclopropane-1-carboxylic acid synthase) were upregulated in fruits concomitantly with their downstream metabolites (VSCs and ethylene), suggesting a potential association between ethylene biosynthesis and methionine regeneration via the Yang cycle. The durian genome provides a resource for tropical fruit biology and agronomy.


July 19, 2019

The composite 259-kb plasmid of Martelella mediterranea DSM 17316(T)-a natural replicon with functional RepABC modules from Rhodobacteraceae and Rhizobiaceae.

A multipartite genome organization with a chromosome and many extrachromosomal replicons (ECRs) is characteristic for Alphaproteobacteria. The best investigated ECRs of terrestrial rhizobia are the symbiotic plasmids for legume root nodulation and the tumor-inducing (Ti) plasmid of Agrobacterium tumefaciens. RepABC plasmids represent the most abundant alphaproteobacterial replicon type. The currently known homologous replication modules of rhizobia and Rhodobacteraceae are phylogenetically distinct. In this study, we surveyed type-strain genomes from the One Thousand Microbial Genomes (KMG-I) project and identified a roseobacter-specific RepABC-type operon in the draft genome of the marine rhizobium Martelella mediterranea DSM 17316(T). PacBio genome sequencing demonstrated the presence of three circular ECRs with sizes of 593, 259, and 170-kb. The rhodobacteral RepABC module is located together with a rhizobial equivalent on the intermediate sized plasmid pMM259, which likely originated in the fusion of a pre-existing rhizobial ECR with a conjugated roseobacter plasmid. Further evidence for horizontal gene transfer (HGT) is given by the presence of a roseobacter-specific type IV secretion system on the 259-kb plasmid and the rhodobacteracean origin of 62% of the genes on this plasmid. Functionality tests documented that the genuine rhizobial RepABC module from the Martelella 259-kb plasmid is only maintained in A. tumefaciens C58 (Rhizobiaceae) but not in Phaeobacter inhibens DSM 17395 (Rhodobacteraceae). Unexpectedly, the roseobacter-like replication system is functional and stably maintained in both host strains, thus providing evidence for a broader host range than previously proposed. In conclusion, pMM259 is the first example of a natural plasmid that likely mediates genetic exchange between roseobacters and rhizobia.


July 19, 2019

De novo PacBio long-read and phased avian genome assemblies correct and add to reference genes generated with intermediate and short reads.

Reference-quality genomes are expected to provide a resource for studying gene structure, function, and evolution. However, often genes of interest are not completely or accurately assembled, leading to unknown errors in analyses or additional cloning efforts for the correct sequences. A promising solution is long-read sequencing. Here we tested PacBio-based long-read sequencing and diploid assembly for potential improvements to the Sanger-based intermediate-read zebra finch reference and Illumina-based short-read Anna’s hummingbird reference, 2 vocal learning avian species widely studied in neuroscience and genomics. With DNA of the same individuals used to generate the reference genomes, we generated diploid assemblies with the FALCON-Unzip assembler, resulting in contigs with no gaps in the megabase range, representing 150-fold and 200-fold improvements over the current zebra finch and hummingbird references, respectively. These long-read and phased assemblies corrected and resolved what we discovered to be numerous misassemblies in the references, including missing sequences in gaps, erroneous sequences flanking gaps, base call errors in difficult-to-sequence regions, complex repeat structure errors, and allelic differences between the 2 haplotypes. These improvements were validated by single long-genome and transcriptome reads and resulted for the first time in completely resolved protein-coding genes widely studied in neuroscience and specialized in vocal learning species. These findings demonstrate the impact of long reads, sequencing of previously difficult-to-sequence regions, and phasing of haplotypes on generating the high-quality assemblies necessary for understanding gene structure, function, and evolution.© The Authors 2017. Published by Oxford University Press.


July 19, 2019

ALF: a strategy for identification of unauthorized GMOs in complex mixtures by a GW-NGS method and dedicated bioinformatics analysis.

The majority of feed products in industrialised countries contains materials derived from genetically modified organisms (GMOs). In parallel, the number of reports of unauthorised GMOs (UGMOs) is gradually increasing. There is a lack of specific detection methods for UGMOs, due to the absence of detailed sequence information and reference materials. In this research, an adapted genome walking approach was developed, called ALF: Amplification of Linearly-enriched Fragments. Coupling of ALF to NGS aims for simultaneous detection and identification of all GMOs, including UGMOs, in one sample, in a single analysis. The ALF approach was assessed on a mixture made of DNA extracts from four reference materials, in an uneven distribution, mimicking a real life situation. The complete insert and genomic flanking regions were known for three of the included GMO events, while for MON15985 only partial sequence information was available. Combined with a known organisation of elements, this GMO served as a model for a UGMO. We successfully identified sequences matching with this organisation of elements serving as proof of principle for ALF as new UGMO detection strategy. Additionally, this study provides a first outline of an automated, web-based analysis pipeline for identification of UGMOs containing known GM elements.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.