Menu
September 22, 2019

A manually annotated Actinidia chinensis var. chinensis (kiwifruit) genome highlights the challenges associated with draft genomes and gene prediction in plants.

Most published genome sequences are drafts, and most are dominated by computational gene prediction. Draft genomes typically incorporate considerable sequence data that are not assigned to chromosomes, and predicted genes without quality confidence measures. The current Actinidia chinensis (kiwifruit) ‘Hongyang’ draft genome has 164 Mb of sequences unassigned to pseudo-chromosomes, and omissions have been identified in the gene models.A second genome of an A. chinensis (genotype Red5) was fully sequenced. This new sequence resulted in a 554.0 Mb assembly with all but 6 Mb assigned to pseudo-chromosomes. Pseudo-chromosomal comparisons showed a considerable number of translocation events have occurred following a whole genome duplication (WGD) event some consistent with centromeric Robertsonian-like translocations. RNA sequencing data from 12 tissues and ab initio analysis informed a genome-wide manual annotation, using the WebApollo tool. In total, 33,044 gene loci represented by 33,123 isoforms were identified, named and tagged for quality of evidential support. Of these 3114 (9.4%) were identical to a protein within ‘Hongyang’ The Kiwifruit Information Resource (KIR v2). Some proportion of the differences will be varietal polymorphisms. However, as most computationally predicted Red5 models required manual re-annotation this proportion is expected to be small. The quality of the new gene models was tested by fully sequencing 550 cloned ‘Hort16A’ cDNAs and comparing with the predicted protein models for Red5 and both the original ‘Hongyang’ assembly and the revised annotation from KIR v2. Only 48.9% and 63.5% of the cDNAs had a match with 90% identity or better to the original and revised ‘Hongyang’ annotation, respectively, compared with 90.9% to the Red5 models.Our study highlights the need to take a cautious approach to draft genomes and computationally predicted genes. Our use of the manual annotation tool WebApollo facilitated manual checking and correction of gene models enabling improvement of computational prediction. This utility was especially relevant for certain types of gene families such as the EXPANSIN like genes. Finally, this high quality gene set will supply the kiwifruit and general plant community with a new tool for genomics and other comparative analysis.


September 22, 2019

Draft genome of the Peruvian scallop Argopecten purpuratus.

The Peruvian scallop, Argopecten purpuratus, is mainly cultured in southern Chile and Peru was introduced into China in the last century. Unlike other Argopecten scallops, the Peruvian scallop normally has a long life span of up to 7 to 10 years. Therefore, researchers have been using it to develop hybrid vigor. Here, we performed whole genome sequencing, assembly, and gene annotation of the Peruvian scallop, with an important aim to develop genomic resources for genetic breeding in scallops.A total of 463.19-Gb raw DNA reads were sequenced. A draft genome assembly of 724.78 Mb was generated (accounting for 81.87% of the estimated genome size of 885.29 Mb), with a contig N50 size of 80.11 kb and a scaffold N50 size of 1.02 Mb. Repeat sequences were calculated to reach 33.74% of the whole genome, and 26,256 protein-coding genes and 3,057 noncoding RNAs were predicted from the assembly.We generated a high-quality draft genome assembly of the Peruvian scallop, which will provide a solid resource for further genetic breeding and for the analysis of the evolutionary history of this economically important scallop.


September 22, 2019

Comparative genomics of bdelloid rotifers: Insights from desiccating and nondesiccating species.

Bdelloid rotifers are a class of microscopic invertebrates that have existed for millions of years apparently without sex or meiosis. They inhabit a variety of temporary and permanent freshwater habitats globally, and many species are remarkably tolerant of desiccation. Bdelloids offer an opportunity to better understand the evolution of sex and recombination, but previous work has emphasised desiccation as the cause of several unusual genomic features in this group. Here, we present high-quality whole-genome sequences of 3 bdelloid species: Rotaria macrura and R. magnacalcarata, which are both desiccation intolerant, and Adineta ricciae, which is desiccation tolerant. In combination with the published assembly of A. vaga, which is also desiccation tolerant, we apply a comparative genomics approach to evaluate the potential effects of desiccation tolerance and asexuality on genome evolution in bdelloids. We find that ancestral tetraploidy is conserved among all 4 bdelloid species, but homologous divergence in obligately aquatic Rotaria genomes is unexpectedly low. This finding is contrary to current models regarding the role of desiccation in shaping bdelloid genomes. In addition, we find that homologous regions in A. ricciae are largely collinear and do not form palindromic repeats as observed in the published A. vaga assembly. Consequently, several features interpreted as genomic evidence for long-term ameiotic evolution are not general to all bdelloid species, even within the same genus. Finally, we substantiate previous findings of high levels of horizontally transferred nonmetazoan genes in both desiccating and nondesiccating bdelloid species and show that this unusual feature is not shared by other animal phyla, even those with desiccation-tolerant representatives. These comparisons call into question the proposed role of desiccation in mediating horizontal genetic transfer.


September 22, 2019

Chinook salmon (Oncorhynchus tshawytscha) genome and transcriptome.

When unifying genomic resources among studies and comparing data between species, there is often no better resource than a genome sequence. Having a reference genome for the Chinook salmon (Oncorhynchus tshawytscha) will enable the extensive genomic resources available for Pacific salmon, Atlantic salmon, and rainbow trout to be leveraged when asking questions related to the Chinook salmon. The Chinook salmon’s wide distribution, long cultural impact, evolutionary history, substantial hatchery production, and recent wild-population decline make it an important research species. In this study, we sequenced and assembled the genome of a Chilliwack River Hatchery female Chinook salmon (gynogenetic and homozygous at all loci). With a reference genome sequence, new questions can be asked about the nature of this species, and its role in a rapidly changing world.


September 22, 2019

Genomic architecture of haddock (Melanogrammus aeglefinus) shows expansions of innate immune genes and short tandem repeats.

Increased availability of genome assemblies for non-model organisms has resulted in invaluable biological and genomic insight into numerous vertebrates, including teleosts. Sequencing of the Atlantic cod (Gadus morhua) genome and the genomes of many of its relatives (Gadiformes) demonstrated a shared loss of the major histocompatibility complex (MHC) II genes 100 million years ago. An improved version of the Atlantic cod genome assembly shows an extreme density of tandem repeats compared to other vertebrate genome assemblies. Highly contiguous assemblies are therefore needed to further investigate the unusual immune system of the Gadiformes, and whether the high density of tandem repeats found in Atlantic cod is a shared trait in this group.Here, we have sequenced and assembled the genome of haddock (Melanogrammus aeglefinus) – a relative of Atlantic cod – using a combination of PacBio and Illumina reads. Comparative analyses reveal that the haddock genome contains an even higher density of tandem repeats outside and within protein coding sequences than Atlantic cod. Further, both species show an elevated number of tandem repeats in genes mainly involved in signal transduction compared to other teleosts. A characterization of the immune gene repertoire demonstrates a substantial expansion of MCHI in Atlantic cod compared to haddock. In contrast, the Toll-like receptors show a similar pattern of gene losses and expansions. For the NOD-like receptors (NLRs), another gene family associated with the innate immune system, we find a large expansion common to all teleosts, with possible lineage-specific expansions in zebrafish, stickleback and the codfishes.The generation of a highly contiguous genome assembly of haddock revealed that the high density of short tandem repeats as well as expanded immune gene families is not unique to Atlantic cod – but possibly a feature common to all, or most, codfishes. A shared expansion of NLR genes in teleosts suggests that the NLRs have a more substantial role in the innate immunity of teleosts than other vertebrates. Moreover, we find that high copy number genes combined with variable genome assembly qualities may impede complete characterization of these genes, i.e. the number of NLRs in different teleost species might be underestimates.


September 22, 2019

The antibody loci of the domestic goat (Capra hircus).

The domestic goat (Capra hircus) is an important ruminant species both as a source of antibody-based reagents for research and biomedical applications and as an economically important animal for agriculture, particularly for developing nations that maintain most of the global goat population. Characterization of the loci encoding the goat immune repertoire would be highly beneficial for both vaccine and immune reagent development. However, in goat and other species whose reference genomes were generated using short-read sequencing technologies, the immune loci are poorly assembled as a result of their repetitive nature. Our recent construction of a long-read goat genome assembly (ARS1) has facilitated characterization of all three antibody loci with high confidence and comparative analysis to cattle. We observed broad similarity of goat and cattle antibody-encoding loci but with notable differences that likely influence formation of the functional antibody repertoire. The goat heavy-chain locus is restricted to only four functional and nearly identical IGHV genes, in contrast to the ten observed in cattle. Repertoire analysis indicates that light-chain usage is more balanced in goats, with greater representation of kappa light chains (~ 20-30%) compared to that in cattle (~ 5%). The present study represents the first characterization of the goat antibody loci and will help inform future investigations of their antibody responses to disease and vaccination.


September 22, 2019

Discovery of gorilla MHC-C expressing C1 ligand for KIR.

In comparison to humans and chimpanzees, gorillas show low diversity at MHC class I genes (Gogo), as reflected by an overall reduced level of allelic variation as well as the absence of a functionally important sequence motif that interacts with killer cell immunoglobulin-like receptors (KIR). Here, we use recently generated large-scale genomic sequence data for a reassessment of allelic diversity at Gogo-C, the gorilla orthologue of HLA-C. Through the combination of long-range amplifications and long-read sequencing technology, we obtained, among the 35 gorillas reanalyzed, three novel full-length genomic sequences including a coding region sequence that has not been previously described. The newly identified Gogo-C*03:01 allele has a divergent recombinant structure that sets it apart from other Gogo-C alleles. Domain-by-domain phylogenetic analysis shows that Gogo-C*03:01 has segments in common with Gogo-B*07, the additional B-like gene that is present on some gorilla MHC haplotypes. Identified in ~ 50% of the gorillas analyzed, the Gogo-C*03:01 allele exclusively encodes the C1 epitope among Gogo-C allotypes, indicating its important function in controlling natural killer cell (NK cell) responses via KIR. We further explored the hypothesis whether gorillas experienced a selective sweep which may have resulted in a general reduction of the gorilla MHC class I repertoire. Our results provide little support for a selective sweep but rather suggest that the overall low Gogo class I diversity can be best explained by drastic demographic changes gorillas experienced in the ancient and recent past.


September 22, 2019

Integrated proteomics, genomics, metabolomics approaches reveal oxalic acid as pathogenicity factor in Tilletia indica inciting Karnal bunt disease of wheat.

Tilletia indica incites Karnal bunt (KB) disease in wheat. To date, no KB resistant wheat cultivar could be developed due to non-availability of potential biomarkers related to pathogenicity/virulence for screening of resistant wheat genotypes. The present study was carried out to compare the proteomes of T. indica highly (TiK) and low (TiP) virulent isolates. Twenty one protein spots consistently observed as up-regulated/differential in the TiK proteome were selected for identification by MALDI-TOF/TOF. Identified sequences showed homology with fungal proteins playing essential role in plant infection and pathogen survival, including stress response, adhesion, fungal penetration, invasion, colonization, degradation of host cell wall, signal transduction pathway. These results were integrated with T. indica genome sequence for identification of homologs of candidate pathogenicity/virulence related proteins. Protein identified in TiK isolate as malate dehydrogenase that converts malate to oxaloacetate which is precursor of oxalic acid. Oxalic acid is key pathogenicity factor in phytopathogenic fungi. These results were validated by GC-MS based metabolic profiling of T. indica isolates indicating that oxalic acid was exclusively identified in TiK isolate. Thus, integrated omics approaches leads to identification of pathogenicity/virulence factor(s) that would provide insights into pathogenic mechanisms of fungi and aid in devising effective disease management strategies.


September 22, 2019

In vitro DNA SCRaMbLE.

The power of synthetic biology has enabled the expression of heterologous pathways in cells, as well as genome-scale synthesis projects. The complexity of biological networks makes rational de novo design a grand challenge. Introducing features that confer genetic flexibility is a powerful strategy for downstream engineering. Here we develop an in vitro method of DNA library construction based on structural variation to accomplish this goal. The “in vitro SCRaMbLE system” uses Cre recombinase mixed in a test tube with purified DNA encoding multiple loxPsym sites. Using a ß-carotene pathway designed for expression in yeast as an example, we demonstrate top-down and bottom-up in vitro SCRaMbLE, enabling optimization of biosynthetic pathway flux via the rearrangement of relevant transcription units. We show that our system provides a straightforward way to correlate phenotype and genotype and is potentially amenable to biochemical optimization in ways that the in vivo system cannot achieve.


September 22, 2019

A transposable element annotation pipeline and expression analysis reveal potentially active elements in the microalga Tisochrysis lutea.

Transposable elements (TEs) are mobile DNA sequences known as drivers of genome evolution. Their impacts have been widely studied in animals, plants and insects, but little is known about them in microalgae. In a previous study, we compared the genetic polymorphisms between strains of the haptophyte microalga Tisochrysis lutea and suggested the involvement of active autonomous TEs in their genome evolution.To identify potentially autonomous TEs, we designed a pipeline named PiRATE (Pipeline to Retrieve and Annotate Transposable Elements, download: https://doi.org/10.17882/51795 ), and conducted an accurate TE annotation on a new genome assembly of T. lutea. PiRATE is composed of detection, classification and annotation steps. Its detection step combines multiple, existing analysis packages representing all major approaches for TE detection and its classification step was optimized for microalgal genomes. The efficiency of the detection and classification steps was evaluated with data on the model species Arabidopsis thaliana. PiRATE detected 81% of the TE families of A. thaliana and correctly classified 75% of them. We applied PiRATE to T. lutea genomic data and established that its genome contains 15.89% Class I and 4.95% Class II TEs. In these, 3.79 and 17.05% correspond to potentially autonomous and non-autonomous TEs, respectively. Annotation data was combined with transcriptomic and proteomic data to identify potentially active autonomous TEs. We identified 17 expressed TE families and, among these, a TIR/Mariner and a TIR/hAT family were able to synthesize their transposase. Both these TE families were among the three highest expressed genes in a previous transcriptomic study and are composed of highly similar copies throughout the genome of T. lutea. This sum of evidence reveals that both these TE families could be capable of transposing or triggering the transposition of potential related MITE elements.This manuscript provides an example of a de novo transposable element annotation of a non-model organism characterized by a fragmented genome assembly and belonging to a poorly studied phylum at genomic level. Integration of multi-omics data enabled the discovery of potential mobile TEs and opens the way for new discoveries on the role of these repeated elements in genomic evolution of microalgae.


September 22, 2019

Signatures of host specialization and a recent transposable element burst in the dynamic one-speed genome of the fungal barley powdery mildew pathogen.

Powdery mildews are biotrophic pathogenic fungi infecting a number of economically important plants. The grass powdery mildew, Blumeria graminis, has become a model organism to study host specialization of obligate biotrophic fungal pathogens. We resolved the large-scale genomic architecture of B. graminis forma specialis hordei (Bgh) to explore the potential influence of its genome organization on the co-evolutionary process with its host plant, barley (Hordeum vulgare).The near-chromosome level assemblies of the Bgh reference isolate DH14 and one of the most diversified isolates, RACE1, enabled a comparative analysis of these haploid genomes, which are highly enriched with transposable elements (TEs). We found largely retained genome synteny and gene repertoires, yet detected copy number variation (CNV) of secretion signal peptide-containing protein-coding genes (SPs) and locally disrupted synteny blocks. Genes coding for sequence-related SPs are often locally clustered, but neither the SPs nor the TEs reside preferentially in genomic regions with unique features. Extended comparative analysis with different host-specific B. graminis formae speciales revealed the existence of a core suite of SPs, but also isolate-specific SP sets as well as congruence of SP CNV and phylogenetic relationship. We further detected evidence for a recent, lineage-specific expansion of TEs in the Bgh genome.The characteristics of the Bgh genome (largely retained synteny, CNV of SP genes, recently proliferated TEs and a lack of significant compartmentalization) are consistent with a “one-speed” genome that differs in its architecture and (co-)evolutionary pattern from the “two-speed” genomes reported for several other filamentous phytopathogens.


September 22, 2019

Long-read sequencing data analysis for yeasts.

Long-read sequencing technologies have become increasingly popular due to their strengths in resolving complex genomic regions. As a leading model organism with small genome size and great biotechnological importance, the budding yeast Saccharomyces cerevisiae has many isolates currently being sequenced with long reads. However, analyzing long-read sequencing data to produce high-quality genome assembly and annotation remains challenging. Here, we present a modular computational framework named long-read sequencing data analysis for yeasts (LRSDAY), the first one-stop solution that streamlines this process. Starting from the raw sequencing reads, LRSDAY can produce chromosome-level genome assembly and comprehensive genome annotation in a highly automated manner with minimal manual intervention, which is not possible using any alternative tool available to date. The annotated genomic features include centromeres, protein-coding genes, tRNAs, transposable elements (TEs), and telomere-associated elements. Although tailored for S. cerevisiae, we designed LRSDAY to be highly modular and customizable, making it adaptable to virtually any eukaryotic organism. When applying LRSDAY to an S. cerevisiae strain, it takes ~41 h to generate a complete and well-annotated genome from ~100× Pacific Biosciences (PacBio) running the basic workflow with four threads. Basic experience working within the Linux command-line environment is recommended for carrying out the analysis using LRSDAY.


September 22, 2019

Footprints of parasitism in the genome of the parasitic flowering plant Cuscuta campestris.

A parasitic lifestyle, where plants procure some or all of their nutrients from other living plants, has evolved independently in many dicotyledonous plant families and is a major threat for agriculture globally. Nevertheless, no genome sequence of a parasitic plant has been reported to date. Here we describe the genome sequence of the parasitic field dodder, Cuscuta campestris. The genome contains signatures of a fairly recent whole-genome duplication and lacks genes for pathways superfluous to a parasitic lifestyle. Specifically, genes needed for high photosynthetic activity are lost, explaining the low photosynthesis rates displayed by the parasite. Moreover, several genes involved in nutrient uptake processes from the soil are lost. On the other hand, evidence for horizontal gene transfer by way of genomic DNA integration from the parasite’s hosts is found. We conclude that the parasitic lifestyle has left characteristic footprints in the C. campestris genome.


September 22, 2019

Genome mining of the marine actinomycete Streptomyces sp. DUT11 and discovery of tunicamycins as anti-complement agents.

Marine actinobacteria are potential producers of various secondary metabolites with diverse bioactivities. Among various bioactive compounds, anti-complement agents have received great interest for drug discovery to treat numerous diseases caused by inappropriate activation of the human complement system. However, marine streptomycetes producing anti-complement agents are still poorly explored. In this study, a marine-derived strain Streptomyces sp. DUT11 showing superior anti-complement activity was focused, and its genome sequence was analyzed. Gene clusters showing high similarities to that of tunicamycin and nonactin were identified, and their corresponding metabolites were also detected. Subsequently, tunicamycin I, V, and VII were isolated from Streptomyces sp. DUT11. Anti-complement assay showed that tunicamycin I, V, VII inhibited complement activation through the classic pathway, whereas no anti-complement activity of nonactin was detected. This is the first time that tunicamycins are reported to have such activity. In addition, genome analysis indicates that Streptomyces sp. DUT11 has the potential to produce novel lassopeptides and lantibiotics. These results suggest that marine Streptomyces are rich sources of anti-complement agents for drug discovery.


September 22, 2019

Genome-wide DNA methylation and transcriptome changes in Mycobacterium tuberculosis with rifampicin and isoniazid resistance

We investigated the genome-wide DNA methylation and transcriptome changes in M. tuberculosis with rifampicin or isoniazid resistance. Single-molecule real-time (SMRT) sequencing and microarray technology were performed to expound DNA methylation profiles and differentially expressed genes in rifampicin or isoniazid resis- tant M. tuberculosis. Kyoto Encyclopedia of Genes and Genomes (KEGG) biological pathway analysis and meth- ylated regulatory network analysis were conducted by online forecasting databases. Integrated analysis of DNA methylation and transcriptome revealed that 335 differentially methylated genes (175 hypermethylated and 160 hypomethylated) and 132 significant differentially expressed genes (68 up-regulated and 63 down-regulated) were found to be regulated by both rifampicin and isoniazid in M. tuberculosis H37Rv. Correlation analysis showed that differential methylated genes were negatively correlated with their transcriptional levels in rifampicin or isoniazid resistant strains. KEGG pathway analysis indicated that nitrogen metabolism pathway is closely related to differ- entially methylated genes induced by rifampicin and isoniazid. KEGG also suggested that differentially expressed genes in rifampicin or isoniazid-resistant strains may play different roles in regulating signal transduction events. Furthermore, five differentially methylated candidate genes (Rv0840c, Rv2243, Rv0644c, Rv2386c and Rv1130) in rifampicin resistant strains and three genes (Rv0405, Rv0252 and Rv0908) in isoniazid-resistant strains were verified the existence of protein-protein interaction in STRING database. Integrated DNA methylation and transcrip- tome analyses provide an epigenetic overview of rifampicin and isoniazid-induced antibiotic resistance in M. tuber- culosis H37Rv. Several interesting genes and regulatory pathways may provide valuable resources for epigenetic studies in M. tuberculosis antibiotic resistance.


Talk with an expert

If you have a question, need to check the status of an order, or are interested in purchasing an instrument, we're here to help.