A comprehensive understanding of wheat responses to environmental stress will contribute to the long-term goal of feeding the planet. ALERNATIVE OXIDASE (AOX) genes encode proteins involved in a bypass of the electron transport chain and are also known to be involved in stress tolerance in multiple species. Here, we report the identification and characterization of the AOX gene family in diploid and hexaploid wheat. Four genes each were found in the diploid ancestors Triticum urartu, and Aegilops tauschii, and three in Aegilops speltoides. In hexaploid wheat (Triticum aestivum), 20 genes were identified, some with multiple splice variants, corresponding to a…
The seed is the pharmaceutical and breeding organ of Cassia obtusifolia, a well-known medical herb containing aurantio-obtusin (a kind of anthraquinone), food, and landscape. In order to understand the molecular mechanism of the biosynthesis of aurantio-obtusin, seed formation and development, and stress response of C. obtusifolia, it is necessary to understand the genomics information. Although previous seed transcriptome of C. obtusifolia has been carried out by short-read next-generation sequencing (NGS) technology, the vast majority of the resulting unigenes did not represent full-length cDNA sequences and supply enough gene expression profile information of the various organs or tissues. In this study,…
We report full-length draft de novo genome assemblies for 16 widely used inbred mouse strains and find extensive strain-specific haplotype variation. We identify and characterize 2,567 regions on the current mouse reference genome exhibiting the greatest sequence diversity. These regions are enriched for genes involved in pathogen defence and immunity and exhibit enrichment of transposable elements and signatures of recent retrotransposition events. Combinations of alleles and genes unique to an individual strain are commonly observed at these loci, reflecting distinct strain phenotypes. We used these genomes to improve the mouse reference genome, resulting in the completion of 10 new gene…
Nuclear genes of euglenids and marine diplonemids harbor atypical, nonconventional introns which are not observed in the genomes of other eukaryotes. Nonconventional introns do not have the conserved borders characteristic for spliceosomal introns or the sequence complementary to U1 snRNA at the 5′ end. They form a stable secondary structure bringing together both exon/intron junctions, nevertheless, this conformation does not resemble the form of self-splicing or tRNA introns. In the genes studied so far, frequent nonconventional introns insertions at new positions have been observed, whereas conventional introns have been either found at the conserved positions, or simply lost. In this…
The tick microbiome comprises communities of microorganisms, including viruses, bacteria and eukaryotes, and is being elucidated through modern molecular techniques. The advent of next-generation sequencing (NGS) technologies has enabled the genes and genomes within these microbial communities to be explored in a rapid and cost-effective manner. The advantages of using NGS to investigate microbiomes surpass the traditional non-molecular methods that are limited in their sensitivity, and conventional molecular approaches that are limited in their scalability. In recent years the number of studies using NGS to investigate the microbial diversity and composition of ticks has expanded. Here, we provide a review…
Pneumocystis species are ascomycete fungi adapted to live inside the lungs of mammals. These ascomycetes show extensive stenoxenism, meaning that each species of Pneumocystis infects a single species of host. Here, we study the effect exerted by natural selection on gene evolution in the genomes of three Pneumocystis species. We show that genes involved in host interaction evolve under positive selection. In the first place, we found strong evidence of episodic diversifying selection in Major surface glycoproteins (Msg). These proteins are located on the surface of Pneumocystis and are used for host attachment and probably for immune system evasion. Consistent…
Streptomyces are a genus of Actinobacteria capable of producing structurally diverse natural products. Here we report the isolation and characterization of a biosynthetically talented Streptomyces (Streptomyces sp. SD85) from tropical mangrove sediments. Whole-genome sequencing revealed that Streptomyces sp. SD85 harbors at least 52 biosynthetic gene clusters (BGCs), which constitute 21.2% of the 8.6-Mb genome. When cultivated under lab conditions, Streptomyces sp. SD85 produces sceliphrolactam, a 26-membered polyene macrolactam with unknown biosynthetic origin. Genome mining yielded a putative sceliphrolactam BGC (sce) that encodes a type I modular polyketide synthase (PKS) system, several ß-amino acid starter biosynthetic enzymes, transporters, and transcriptional regulators.…
Trypanosomatids (Trypanosomatidae, Kinetoplastida) are flagellated protozoa containing many parasites of medical or agricultural importance. Among those, Crithidia bombi and C. expoeki, are common parasites in bumble bees around the world, and phylogenetically close to Leishmania and Leptomonas. They have a simple and direct life cycle with one host, and partially castrate the founding queens greatly reducing their fitness. Here, we report the nuclear genome sequences of one clone of each species, extracted from a field-collected infection. Using a combination of Roche 454 FLX Titanium, Pacific Biosciences PacBio RS, and Illumina GA2 instruments for C. bombi, and PacBio for C. expoeki,…
Trypanosoma cruzi, a zoonotic kinetoplastid protozoan with a complex genome, is the causative agent of American trypanosomiasis (Chagas disease). The parasite uses a highly diverse repertoire of surface molecules, with roles in cell invasion, immune evasion and pathogenesis. Thus far, the genomic regions containing these genes have been impossible to resolve and it has been impossible to study the structure and function of the several thousand repetitive genes encoding the surface molecules of the parasite. We here present an improved genome assembly of a T. cruzi clade I (TcI) strain using high coverage PacBio single molecule sequencing, together with Illumina…
Pantoea vagans, a gram-negative bacterium from the genus Pantoea and family Enterobacteriaceae, is present in various natural environments and considered to be plant endophytes. We isolated the Pantoea vagans PV989 strain from the clinic and sequenced its whole genome. Besides a chromosome DNA molecule, it also harboured three large plasmids. A comparative genomics analysis was performed for the smallest plasmid, pPV989-94. It can be divided into four regions, including three conservative regions related to replication (R1), transfer conjugation (R2), and transfer leading (R3), and one variable region (R4). Further analysis showed that pPV989-94 is most similar to plasmids LA637P2 and…
Pneumocystis, a unique atypical fungus with an elusive lifestyle, has had an important medical history. It came to prominence as an opportunistic pathogen that not only can cause life-threatening pneumonia in patients with HIV infection and other immunodeficiencies but also can colonize the lungs of healthy individuals from a very early age. The genus Pneumocystis includes a group of closely related but heterogeneous organisms that have a worldwide distribution, have been detected in multiple mammalian species, are highly host species specific, inhabit the lungs almost exclusively, and have never convincingly been cultured in vitro, making Pneumocystis a fascinating but difficult-to-study…
The de Bruijn graph is a simple and efficient data structure that is used in many areas of sequence analysis including genome assembly, read error correction and variant calling. The data structure has a single parameter k, is straightforward to implement and is tractable for large genomes with high sequencing depth. It also enables representation of multiple samples simultaneously to facilitate comparison. However, unlike the string graph, a de Bruijn graph does not retain long range information that is inherent in the read data. For this reason, applications that rely on de Bruijn graphs can produce sub-optimal results given their…
Members of the genus Naegleria are free-living eukaryotes with the capability to transform from the amoeboid form into resting cysts or moving flagellates in response to environmental conditions. More than 40 species have been characterized, but only Naegleria fowleri (N. fowleri) is known as a human pathogen causing primary amoebic meningoencephalitis (PAM), a fast progressing and mostly fatal disease of the central nervous system. Several studies report an involvement of phospholipases and other molecular factors, but the mechanisms involved in pathogenesis are still poorly understood. To gain a better understanding of the relationships within the genus of Naegleria and to…
African trypanosomiasis is a vector-borne disease of humans and livestock caused by African trypanosomes (Trypanosoma spp.). Survival in the vertebrate bloodstream depends on antigenic variation of Variant Surface Glycoproteins (VSGs) coating the parasite surface. In T. brucei, a model for antigenic variation, monoallelic VSG expression originates from dedicated VSG expression sites (VES). Trypanosoma brucei VES have a conserved structure consisting of a telomeric VSG locus downstream of unique, repeat sequences, and an independent promoter. Additional protein-coding sequences, known as “Expression Site Associated Genes (ESAGs)”, are also often present and are implicated in diverse, bloodstream-stage functions. Trypanosoma congolense is a related…
Chagas disease is a complex illness caused by the protozoan Trypanosoma cruzi displaying highly diverse clinical outcomes. In this sense, the genome sequence elucidation and comparison between strains may lead to disease understanding. Here, two new T. cruzi strains, have been sequenced, Y using Illumina and Bug2148 using PacBio, assembled, analyzed and compared with the T. cruzi annotated genomes available to date. The assembly stats from the new sequences show effective improvement of T. cruzi genome over the actual ones. Such as, the largest contig assembled (1.3?Mb in Bug2148) in de novo attempts and the highest mean assembly coverage (71X…