Menu
July 7, 2019

Complete plastid genomes of the genus Ammopiptanthus and identification of a novel 23-kb rearrangement

Ammopiptanthus is an endangered angiosperm genus with evergreen and broad leaves, grown in the semi-desert regions of eastern Central Asia. We decoded the complete plastid genomes of Ammopiptanthus mongolicus (AM) and Ammopiptanthus nanus (AN), the only two species in the genus Ammopiptanthus. The total length of AM plastome is 153,935 bp, comprising a 83,889-bp long single-copy region (LSC), a 18,022-bp short single-copy region (SSC) and two inverted repeat (IR) regions of 26,012 bp. The total length of AN plastome is 154,140 bp, including a LSC of 84,069 bp, a SSC of 17,971 bp and two IR regions of 26,000 bp, respectively. Each Ammopiptanthus plastome contains 116 unique functional genes, including 78 protein-coding genes, 4 rRNA and 34 tRNA genes. Plastome sequence alignment with other Papilionoid legumes and an outgroup revealed a novel 23-kb inversion between the ndhJ and petN loci in Ammopiptanthus plastomes.


July 7, 2019

MYB transcription factor gene involved in sex determination in Asparagus officinalis.

Dioecy is a plant mating system in which individuals of a species are either male or female. Although many flowering plants evolved independently from hermaphroditism to dioecy, the molecular mechanism underlying this transition remains largely unknown. Sex determination in the dioecious plant Asparagus officinalis is controlled by X and Y chromosomes; the male and female karyotypes are XY and XX, respectively. Transcriptome analysis of A. officinalis buds showed that a MYB-like gene, Male Specific Expression 1 (MSE1), is specifically expressed in males. MSE1 exhibits tight linkage with the Y chromosome, specific expression in early anther development and loss of function on the X chromosome. Knockout of the MSE1 orthologue in Arabidopsis induces male sterility. Thus, MSE1 acts in sex determination in A. officinalis.© 2016 Molecular Biology Society of Japan and John Wiley & Sons Australia, Ltd.


July 7, 2019

Genome sequencing reveals the origin of the allotetraploid Arabidopsis suecica.

Polyploidy is an example of instantaneous speciation when it involves the formation of a new cytotype that is incompatible with the parental species. Because new polyploid individuals are likely to be rare, establishment of a new species is unlikely unless polyploids are able to reproduce through self-fertilization (selfing), or asexually. Conversely, selfing (or asexuality) makes it possible for polyploid species to originate from a single individual-a bona fide speciation event. The extent to which this happens is not known. Here, we consider the origin of Arabidopsis suecica, a selfing allopolyploid between Arabidopsis thaliana and Arabidopsis arenosa, which has hitherto been considered to be an example of a unique origin. Based on whole-genome re-sequencing of 15 natural A. suecica accessions, we identify ubiquitous shared polymorphism with the parental species, and hence conclusively reject a unique origin in favor of multiple founding individuals. We further estimate that the species originated after the last glacial maximum in Eastern Europe or central Eurasia (rather than Sweden, as the name might suggest). Finally, annotation of the self-incompatibility loci in A. suecica revealed that both loci carry non-functional alleles. The locus inherited from the selfing A. thaliana is fixed for an ancestral non-functional allele, whereas the locus inherited from the outcrossing A. arenosa is fixed for a novel loss-of-function allele. Furthermore, the allele inherited from A. thaliana is predicted to transcriptionally silence the allele inherited from A. arenosa, suggesting that loss of self-incompatibility may have been instantaneous.© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.


July 7, 2019

Trichoderma reesei complete genome sequence, repeat-induced point mutation, and partitioning of CAZyme gene clusters.

Trichoderma reesei (Ascomycota, Pezizomycotina) QM6a is a model fungus for a broad spectrum of physiological phenomena, including plant cell wall degradation, industrial production of enzymes, light responses, conidiation, sexual development, polyketide biosynthesis, and plant-fungal interactions. The genomes of QM6a and its high enzyme-producing mutants have been sequenced by second-generation-sequencing methods and are publicly available from the Joint Genome Institute. While these genome sequences have offered useful information for genomic and transcriptomic studies, their limitations and especially their short read lengths make them poorly suited for some particular biological problems, including assembly, genome-wide determination of chromosome architecture, and genetic modification or engineering.We integrated Pacific Biosciences and Illumina sequencing platforms for the highest-quality genome assembly yet achieved, revealing seven telomere-to-telomere chromosomes (34,922,528 bp; 10877 genes) with 1630 newly predicted genes and >1.5 Mb of new sequences. Most new sequences are located on AT-rich blocks, including 7 centromeres, 14 subtelomeres, and 2329 interspersed AT-rich blocks. The seven QM6a centromeres separately consist of 24 conserved repeats and 37 putative centromere-encoded genes. These findings open up a new perspective for future centromere and chromosome architecture studies. Next, we demonstrate that sexual crossing readily induced cytosine-to-thymine point mutations on both tandem and unlinked duplicated sequences. We also show by bioinformatic analysis that T. reesei has evolved a robust repeat-induced point mutation (RIP) system to accumulate AT-rich sequences, with longer AT-rich blocks having more RIP mutations. The widespread distribution of AT-rich blocks correlates genome-wide partitions with gene clusters, explaining why clustering of genes has been reported to not influence gene expression in T. reesei.Compartmentation of ancestral gene clusters by AT-rich blocks might promote flexibilities that are evolutionarily advantageous in this fungus’ soil habitats and other natural environments. Our analyses, together with the complete genome sequence, provide a better blueprint for biotechnological and industrial applications.


July 7, 2019

Genomic and transcriptomic analyses of Agrobacterium tumefaciens S33 reveal the molecular mechanism of a novel hybrid nicotine-degrading pathway.

Agrobacterium tumefaciens S33 is able to degrade nicotine via a novel hybrid of the pyridine and pyrrolidine pathways. It can be utilized to remove nicotine from tobacco wastes and transform nicotine into important functionalized pyridine precursors for some valuable drugs and insecticides. However, the molecular mechanism of the hybrid pathway is still not completely clear. Here we report the genome analysis of strain S33 and its transcriptomes grown in glucose-ammonium medium and nicotine medium. The complete gene cluster involved in nicotine catabolism was found to be located on a genomic island composed of genes functionally similar but not in sequences to those of the pyridine and pyrrolidine pathways, as well as genes encoding plasmid partitioning and replication initiation proteins, conjugal transfer proteins and transposases. This suggests that the evolution of this hybrid pathway is not a simple fusion of the genes involved in the two pathways, but the result of a complicated lateral gene transfer. In addition, other genes potentially involved in the hybrid pathway could include those responsible for substrate sensing and transport, transcription regulation and electron transfer during nicotine degradation. This study provides new insights into the molecular mechanism of the novel hybrid pathway for nicotine degradation.


July 7, 2019

Whole genome sequence of the heterozygous clinical isolate Candida krusei 81-B-5.

Candida krusei is a diploid, heterozygous yeast that is an opportunistic fungal pathogen in immunocompromised patients. This species also is utilized for fermenting cocoa beans during chocolate production. One major concern in the clinical setting is the innate resistance of this species to the most commonly used antifungal drug fluconazole. Here we report a high-quality genome sequence and assembly for the first clinical isolate of C. krusei, strain 81-B-5, into 11 scaffolds generated with PacBio sequencing technology. Gene annotation and comparative analysis revealed a unique profile of transporters that could play a role in drug resistance or adaptation to different environments. In addition, we show that while 82% of the genome is highly heterozygous, a 2.0 Mb region of the largest scaffold has undergone loss of heterozygosity. This genome will serve as a reference for further genetic studies of this pathogen. Copyright © 2017 Author et al.


July 7, 2019

Updated reference genome sequence and annotation of Mycobacterium bovis AF2122/97.

We report here an update to the reference genome sequence of the bovine tuberculosis bacillus Mycobacterium bovis AF2122/97, generated using an integrative multiomics approach. The update includes 42 new coding sequences (CDSs), 14 modified annotations, 26 single-nucleotide polymorphism (SNP) corrections, and disclosure that the RD900 locus, previously described as absent from the genome, is in fact present. Copyright © 2017 Malone et al.


July 7, 2019

Genome-wide analysis of WOX genes in upland cotton and their expression pattern under different stresses.

WUSCHEL-related homeobox (WOX) family members play significant roles in plant growth and development, such as in embryo patterning, stem-cell maintenance, and lateral organ formation. The recently published cotton genome sequences allow us to perform comprehensive genome-wide analysis and characterization of WOX genes in cotton.In this study, we identified 21, 20, and 38 WOX genes in Gossypium arboreum (2n = 26, A2), G. raimondii (2n = 26, D5), and G. hirsutum (2n = 4x = 52, (AD)t), respectively. Sequence logos showed that homeobox domains were significantly conserved among the WOX genes in cotton, Arabidopsis, and rice. A total of 168 genes from three typical monocots and six dicots were naturally divided into three clades, which were further classified into nine sub-clades. A good collinearity was observed in the synteny analysis of the orthologs from At and Dt (t represents tetraploid) sub-genomes. Whole genome duplication (WGD) and segmental duplication within At and Dt sub-genomes played significant roles in the expansion of WOX genes, and segmental duplication mainly generated the WUS clade. Copia and Gypsy were the two major types of transposable elements distributed upstream or downstream of WOX genes. Furthermore, through comparison, we found that the exon/intron pattern was highly conserved between Arabidopsis and cotton, and the homeobox domain loci were also conserved between them. In addition, the expression pattern in different tissues indicated that the duplicated genes in cotton might have acquired new functions as a result of sub-functionalization or neo-functionalization. The expression pattern of WOX genes under different stress treatments showed that the different genes were induced by different stresses.In present work, WOX genes, classified into three clades, were identified in the upland cotton genome. Whole genome and segmental duplication were determined to be the two major impetuses for the expansion of gene numbers during the evolution. Moreover, the expression patterns suggested that the duplicated genes might have experienced a functional divergence. Together, these results shed light on the evolution of the WOX gene family, and would be helpful in future research.


July 7, 2019

Metatranscriptomics supports the mechanism for biocathode electroautotrophy by “Candidatus Tenderia electrophaga.”

Biocathodes provide a stable electron source to drive reduction reactions in electrotrophic microbial electrochemical systems. Electroautotrophic biocathode communities may be more robust than monocultures in environmentally relevant settings, but some members are not easily cultivated outside the electrode environment. We previously used metagenomics and metaproteomics to propose a pathway for coupling extracellular electron transfer (EET) to carbon fixation in “Candidatus Tenderia electrophaga,” an uncultivated but dominant member of an electroautotrophic biocathode community. Here we validate and refine this proposed pathway using metatranscriptomics of replicate aerobic biocathodes poised at the growth potential level of 310 mV and the suboptimal 470 mV (versus the standard hydrogen electrode). At both potentials, transcripts were more abundant from “Ca. Tenderia electrophaga” than from any other constituent, and its relative activity was positively correlated with current. Several genes encoding key components of the proposed “Ca. Tenderia electrophaga” EET pathway were more highly expressed at 470 mV, consistent with a need for cells to acquire more electrons to obtain the same amount of energy as at 310 mV. These included cyc2, encoding a homolog of a protein known to be involved in iron oxidation. Mean expression of all CO2 fixation-related genes is 0.27 log2-fold higher at 310 mV, indicating that reduced energy availability at 470 mV decreased CO2 fixation. Our results substantiate the claim that “Ca. Tenderia electrophaga” is the key electroautotroph, which will help guide further development of this community for microbial electrosynthesis. IMPORTANCE Bacteria that directly use electrodes as metabolic electron donors (biocathodes) have been proposed for applications ranging from microbial electrosynthesis to advanced bioelectronics for cellular communication with machines. However, just as we understand very little about oxidation of analogous natural insoluble electron donors, such as iron oxide, the organisms and extracellular electron transfer (EET) pathways underlying the electrode-cell direct electron transfer processes are almost completely unknown. Biocathodes are a stable biofilm cultivation platform to interrogate both the rate and mechanism of EET using electrochemistry and to study the electroautotrophic organisms that catalyze these reactions. Here we provide new evidence supporting the hypothesis that the uncultured bacterium “Candidatus Tenderia electrophaga” directly couples extracellular electron transfer to CO2 fixation. Our results provide insight into developing biocathode technology, such as microbial electrosynthesis, as well as advancing our understanding of chemolithoautotrophy.


July 7, 2019

A high-coverage draft genome of the mycalesine butterfly Bicyclus anynana.

The mycalesine butterfly Bicyclus anynana , the ‘Squinting bush brown’, is a model organism in the study of lepidopteran ecology, development and evolution. Here, we present a draft genome sequence for B. anynana to serve as a genomics resource for current and future studies of this important model species.Seven libraries with insert sizes ranging from 350 bp to 20 kb were constructed using DNA from an inbred female and sequenced using both Illumina and PacBio technology. 128 Gb raw Illumina data were filtered to 124 Gb and assembled to a final size of 475 Mb (~260X assembly coverage). Contigs were scaffolded using mate-pair, transcriptome and PacBio data into 10,800 sequences with an N50 of 638 kb (longest scaffold 5 Mb). The genome is comprised of 26% repetitive elements, and encodes a total of 22,642 predicted protein-coding genes. Recovery of a BUSCO set of core metazoan genes was almost complete (98%). Overall, these metrics compare well with other recently published lepidopteran genomes.We report a high-quality draft genome sequence for Bicyclus anynana . The genome assembly and annotated gene models are available at LepBase ( http://ensembl.lepbase.org/index.html ).


July 7, 2019

Proteogenomics produces comprehensive and highly accurate protein-coding gene annotation in a complete genome assembly of Malassezia sympodialis.

Complete and accurate genome assembly and annotation is a crucial foundation for comparative and functional genomics. Despite this, few complete eukaryotic genomes are available, and genome annotation remains a major challenge. Here, we present a complete genome assembly of the skin commensal yeast Malassezia sympodialis and demonstrate how proteogenomics can substantially improve gene annotation. Through long-read DNA sequencing, we obtained a gap-free genome assembly for M. sympodialis (ATCC 42132), comprising eight nuclear and one mitochondrial chromosome. We also sequenced and assembled four M. sympodialis clinical isolates, and showed their value for understanding Malassezia reproduction by confirming four alternative allele combinations at the two mating-type loci. Importantly, we demonstrated how proteomics data could be readily integrated with transcriptomics data in standard annotation tools. This increased the number of annotated protein-coding genes by 14% (from 3612 to 4113), compared to using transcriptomics evidence alone. Manual curation further increased the number of protein-coding genes by 9% (to 4493). All of these genes have RNA-seq evidence and 87% were confirmed by proteomics. The M. sympodialis genome assembly and annotation presented here is at a quality yet achieved only for a few eukaryotic organisms, and constitutes an important reference for future host-microbe interaction studies.© The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.


July 7, 2019

The origin, diversification and adaptation of a major mangrove clade (Rhizophoreae) revealed by whole-genome sequencing

Mangroves invade some very marginal habitats for woody plants—at the interface between land and sea. Since mangroves anchor tropical coastal communities globally, their origin, diversification and adaptation are of scientific significance, particularly at a time of global climate change. In this study, a combination of single-molecule long reads and the more conventional short reads are generated from Rhizophora apiculata for the de novo assembly of its genome to a near chromosome level. The longest scaffold, N50 and N90 for the R. apiculata genome, are 13.3 Mb, 5.4 Mb and 1.0 Mb, respectively. Short reads for the genomes and transcriptomes of eight related species are also generated. We find that the ancestor of Rhizophoreae experienced a whole-genome duplication ~70 Myrs ago, which is followed rather quickly by colonization and species diversification. Mangroves exhibit pan-exome modifications of amino acid (AA) usage as well as unusual AA substitutions among closely related species. The usage and substitution of AAs, unique among plants surveyed, is correlated with the rapid evolution of proteins in mangroves. A small subset of these substitutions is associated with mangroves’ highly specialized traits (vivipary and red bark) thought to be adaptive in the intertidal habitats. Despite the many adaptive features, mangroves are among the least genetically diverse plants, likely the result of continual habitat turnovers caused by repeated rises and falls of sea level in the geologically recent past. Mangrove genomes thus inform about their past evolutionary success as well as portend a possibly difficult future.


July 7, 2019

Tandem duplications lead to novel expression patterns through exon shuffling in Drosophila yakuba.

One common hypothesis to explain the impacts of tandem duplications is that whole gene duplications commonly produce additive changes in gene expression due to copy number changes. Here, we use genome wide RNA-seq data from a population sample of Drosophila yakuba to test this ‘gene dosage’ hypothesis. We observe little evidence of expression changes in response to whole transcript duplication capturing 5′ and 3′ UTRs. Among whole gene duplications, we observe evidence that dosage sharing across copies is likely to be common. The lack of expression changes after whole gene duplication suggests that the majority of genes are subject to tight regulatory control and therefore not sensitive to changes in gene copy number. Rather, we observe changes in expression level due to both shuffling of regulatory elements and the creation of chimeric structures via tandem duplication. Additionally, we observe 30 de novo gene structures arising from tandem duplications, 23 of which form with expression in the testes. Thus, the value of tandem duplications is likely to be more intricate than simple changes in gene dosage. The common regulatory effects from chimeric gene formation after tandem duplication may explain their contribution to genome evolution.


July 7, 2019

Whole genome sequencing predicts novel human disease models in rhesus macaques.

Rhesus macaques are an important pre-clinical model of human disease. To advance our understanding of genomic variation that may influence disease, we surveyed genome-wide variation in 21 rhesus macaques. We employed best-practice variant calling, validated with Mendelian inheritance. Next, we used alignment data from our cohort to detect genomic regions likely to produce inaccurate genotypes, potentially due to either gene duplication or structural variation between individuals. We generated a final dataset of >16 million high confidence variants, including 13 million in Chinese-origin rhesus macaques, an increasingly important disease model. We detected an average of 131 mutations predicted to severely alter protein coding per animal, and identified 45 such variants that coincide with known pathogenic human variants. These data suggest that expanded screening of existing breeding colonies will identify novel models of human disease, and that increased genomic characterization can help inform research studies in macaques. Copyright © 2017 Elsevier Inc. All rights reserved.


July 7, 2019

Genome comparisons of two Taiwanese community-associated methicillin-resistant Staphylococcus aureus ST59 clones support the multi-origin theory of CA-MRSA.

Sequence type (ST) 59 is an epidemic lineage of community-associated methicillin-resistant Staphylococcus aureus (CA-MRSA) in Asia. Two ST59 clones are prevalent in Taiwan: the Taiwan clone (TW) causes severe infections, whereas the Asian-Pacific clone (AP) is usually commensal. In this study, we sequenced the genome and transcriptome of the representative strains of these two clones and found their differences to focus on three mobile genetic elements: TW carries SCCmec Type VT, Panton-Valentine leucocidin (PVL)-encoding prophage FSa2, whereas AP carries SCCmec Type IV and staphylokinase (SAK)-encoding prophage FSa3. The anti-virulent role of SAK was confirmed using murine skin and bloodstream infection models. FSa3 usually integrates into the hlb gene, but in AP was found to be integrated at the genomic island ?Saß. The mutation of the attB site “TGTATCCAAACTGG” to “TGTATCCGAATTGG” led to a failure in the integration of FSa3 in hlb, prompting atypical integration at other sites. The sak gene possessed remarkably different patterns of distribution among the different STs of S. aureus. We conclude that the atypical integration of FSa3 may help S. aureus adapt to the human host habitat and that the subsequent loss of FSa3 contributes toward the development of a virulent CA-MRSA lineage for wider horizontal transmission. Copyright © 2017. Published by Elsevier B.V.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.