Species-specific, new, or “orphan” genes account for 10%-30% of eukaryotic genomes. Although initially considered to have limited function, an increasing number of orphan genes have been shown to provide important phenotypic innovation. How new genes acquire regulatory sequences for proper temporal and spatial expression is unknown. Orphan gene regulation may rely in part on origination in open chromatin adjacent to preexisting promoters, although this has not yet been assessed by genome-wide analysis of chromatin states. Here, we combine taxon-rich nematode phylogenies with Iso-Seq, RNA-seq, ChIP-seq, and ATAC-seq to identify the gene structure and epigenetic signature of orphan genes in the…
A genome sequence is worthless if it cannot be deciphered; therefore, efforts to describe – or ‘annotate’ – genes began as soon as DNA sequences became available. Whereas early work focused on individual protein-coding genes, the modern genomic ocean is a complex maelstrom of alternative splicing, non-coding transcription and pseudogenes. Scientists – from clinicians to evolutionary biologists – need to navigate these waters, and this has led to the design of high-throughput, computationally driven annotation projects. The catalogues that are being produced are key resources for genome exploration, especially as they become integrated with expression, epigenomic and variation data sets.…
The development of multicellular organisms is coordinated by various gene regulatory mechanisms that ensure correct spatio-temporal patterns of gene expression. Recently, the role of antisense transcription in gene regulation has moved into the focus of research. To characterize genome-wide patterns of antisense transcription and to study their evolutionary conservation, we sequenced a strand-specific RNA-seq library of the nematode Pristionchus pacificus. We identified 1112 antisense configurations, of which the largest group represents 465 antisense transcripts (ASTs) that are fully embedded in introns of their host genes. We find that most ASTs show homology to protein-coding genes and are overrepresented in proteomic data. Together…
The chicken has long served as an important model organism in many fields, and continues to aid our understanding of animal development. Functional genomics studies aimed at probing the mechanisms that regulate development require high-quality genomes and transcript annotations. The quality of these resources has improved dramatically over the last several years, but many isoforms and genes have yet to be identified. We hope to contribute to the process of improving these resources with the data presented here: a set of long cDNA sequencing reads, and a curated set of new genes and transcript isoforms not currently represented in the…
The advantages of Pacific Biosciences (PacBio) single-molecule real-time (SMRT) technology include long reads, low systematic bias, and high consensus read accuracy. Here we use these attributes to improve on the genome annotation of the parasitic hookworm Ancylostoma ceylanicum using PacBio RNA-Seq. We sequenced 192,888 circular consensus sequences (CCS) derived from cDNAs generated using the CloneTech SMARTer system. These SMARTer-SMRT libraries were normalized and size-selected providing a robust population of expressed structural genes for subsequent genome annotation. We demonstrate PacBio mRNA sequences based genome annotation improvement, compared to genome annotation using conventional sequencing-by-synthesis alone, by identifying 1609 (9.2%) new genes, extended the…
Gilliamella apicola and Snodgrassella alvi are dominant members of the honey bee (Apis spp.) and bumble bee (Bombus spp.) gut microbiota. We generated complete genomes of the type strains G. apicola wkB1(T) and S. alvi wkB2(T) (isolated from Apis), as well as draft genomes for four other strains from Bombus. G. apicola and S. alvi were found to occupy very different metabolic niches: The former is a saccharolytic fermenter, whereas the latter is an oxidizer of carboxylic acids. Together, they may form a syntrophic network for partitioning of metabolic resources. Both species possessed numerous genes [type 6 secretion systems, repeats…
Community analyses of arbuscular mycorrhizal fungi (AMF) using ribosomal small subunit (SSU) or internal transcribed spacer (ITS) DNA sequences often suffer from low resolution or coverage. We developed a novel sequencing based approach for a highly resolving and specific profiling of AMF communities. We took advantage of previously established AMF-specific PCR primers that amplify a c. 1.5-kb long fragment covering parts of SSU, ITS and parts of the large ribosomal subunit (LSU), and we sequenced the resulting amplicons with single molecule real-time (SMRT) sequencing. The method was applicable to soil and root samples, detected all major AMF families and successfully…
Bermudagrass [Cynodon dactylon (L.) Pers.] is an important perennial warm-season turfgrass species with great economic value. However, the reference genome and transcriptome information are still deficient in bermudagrass, which severely impedes functional and molecular breeding studies. In this study, through analyzing a mixture sample of leaves, stolons, shoots, roots and flowers with single-molecule long-read sequencing technology from Pacific Biosciences (PacBio), we reported the first full-length transcriptome dataset of bermudagrass (C. dactylon cultivar Yangjiang) comprising 78,192 unigenes. Among the unigenes, 66,409 were functionally annotated, whereas 27,946 were found to have two or more isoforms. The annotated full-length unigenes provided many new…
Solanum sisymbriifolium, also known as “Litchi Tomato” or “Sticky Nightshade,” is an undomesticated and poorly researched plant related to potato and tomato. Unlike the latter species, S. sisymbriifolium induces eggs of the cyst nematode, Globodera pallida, to hatch and migrate into its roots, but then arrests further nematode maturation. In order to provide researchers with a partial blueprint of its genetic make-up so that the mechanism of this response might be identified, we used single molecule real time (SMRT) sequencing to compile a high quality de novo transcriptome of 41,189 unigenes drawn from individually sequenced bud, root, stem, and leaf…
House dust mites are common pests with an unusual evolutionary history, being descendants of a parasitic ancestor. Transition to parasitism is frequently accompanied by genome rearrangements, possibly to accommodate the genetic change needed to access new ecology. Transposable element (TE) activity is a source of genomic instability that can trigger large-scale genomic alterations. Eukaryotes have multiple transposon control mechanisms, one of which is RNA interference (RNAi). Investigation of the dust mite genome failed to identify a major RNAi pathway: the Piwi-associated RNA (piRNA) pathway, which has been replaced by a novel small-interfering RNA (siRNA)-like pathway. Co-opting of piRNA function by…
Pochonia chlamydosporia infects eggs and females of economically important plant-parasitic nematodes. The fungal isolates parasitizing different nematodes are genetically distinct. To understand their intraspecific genetic differentiation, parasitic mechanisms, and adaptive evolution, we assembled seven putative chromosomes of P. chlamydosporia strain 170 isolated from root-knot nematode eggs (~44 Mb, including 7.19% of transposable elements) and compared them with the genome of the strain 123 (~41 Mb) isolated from cereal cyst nematode. We focus on secretomes of the fungus, which play important roles in pathogenicity and fungus-host/environment interactions, and identified 1,750 secreted proteins, with a high proportion of carboxypeptidases, subtilisins, and chitinases. We analyzed…
Although the genus Clavibacter was originally proposed to accommodate all phytopathogenic coryneform bacteria containing B2γ diaminobutyrate in the peptidoglycan, reclassification of all but one species into other genera has resulted in the current monospecific status of the genus. The single species in the genus, Clavibacter michiganensis, has multiple subspecies, which are all highly host-specific plant pathogens. Whole genome analysis based on average nucleotide identity and digital DNA-DNA hybridization as well as multi-locus sequence analysis (MLSA) of seven housekeeping genes support raising each of the C. michiganensis subspecies to species status. On the basis of whole genome and MLSA data, we…
Nematophagous (NP) fungi are ecologically important components of the soil microbiome in natural ecosystems. Esteya vermicola (Ev) has been reported as an NP fungus with a poorly understood evolutionary history and mechanism of adaptation to parasitism. Furthermore, the genomic basis of the NP fungal lifestyle was still unclear. We sequenced and annotated the Ev genome (34.2 Mbp) and integrated genetic makeup and evolution of pathogenic genes to investigate NP fungi. The results revealed that NP fungi had some abundant pathogenic genes corresponding to their niche. A number of gene families involved in pathogenicity were expanded, and some pathogenic orthologous genes underwent positive…
The cultivated apple is susceptible to several pests including the rosy apple aphid (RAA; Dysaphis plantaginea Passerini), control of which is mainly based on chemical treatments. A few cases of resistance to aphids have been described in apple germplasm resources, laying the basis for the development of new resistant cultivars by breeding. The cultivar ‘Florina’ is resistant to RAA, and recently, the Dp-fl locus responsible for its resistance was mapped on linkage group 8 of the apple genome. In this paper, a chromosome walking approach was performed by using a ‘Florina’ bacterial artificial chromosome (BAC) library. The walking started from…
Thanks to a recent spate of sequencing projects, the Hemiptera are the first hemimetabolous insect order to achieve a critical mass of species with sequenced genomes, establishing the basis for comparative genomics of the bugs. However, as the most speciose hemimetabolous order, there is still a vast swathe of the hemipteran phylogeny that awaits genomic representation across subterranean, terrestrial, and aquatic habitats, and with lineage-specific and developmentally plastic cases of both wing polyphenisms and flightlessness. In this review, we highlight opportunities for taxonomic sampling beyond obvious pest species candidates, motivated by intriguing biological features of certain groups as well as…