Single-molecule full-length complementary DNA (cDNA) sequencing can aid genome annotation by revealing transcript structure and alternative splice forms, yet current annotation pipelines do not incorporate such information. Here we present long-read annotation (LoReAn) software, an automated annotation pipeline utilizing short- and long-read cDNA sequencing, protein evidence, and ab initio prediction to generate accurate genome annotations. Based on annotations of two fungal genomes (Verticillium dahliae and Plicaturopsis crispa) and two plant genomes (Arabidopsis [Arabidopsis thaliana] and Oryza sativa), we show that LoReAn outperforms popular annotation pipelines by integrating single-molecule cDNA-sequencing data generated from either the Pacific Biosciences or MinION sequencing platforms, correctly predicting gene structure, and capturing genes missed by other annotation pipelines. © 2019 American Society of Plant Biologists. All Rights Reserved.
Genomes of lethal Amanita and Galerina mushrooms have gradually become available in the past ten years; in contrast the other known amanitin-producing genus, Lepiota, is still vacant in this aspect. A fatal mushroom poisoning case in China has led to acquisition of fresh L. venenata fruiting bodies, based on which a draft genome was obtained through PacBio and Illumina sequencing platforms. Toxin-biosynthetic MSDIN family and Porlyl oligopeptidase B (POPB) genes were mined from the genome and used for phylogenetic and statistical studies to gain insights into the evolution of the biosynthetic pathway.The analysis of the genome data illustrated that only one MSDIN, named LvAMA1, exits in the genome, along with a POPB gene. No POPA homolog was identified by direct homology searching, however, one additional POP gene, named LvPOPC, was cloned and the gene structure determined. Similar to ApAMA1 in A. phalloides and GmAMA1 in G. marginata, LvAMA1 directly encodes a-amanitin. The two toxin genes were mapped to the draft genome, and the structures analyzed. Furthermore, phylogenetic and statistical analyses were conducted to study the evolution history of the POPB genes. Compared to our previous report, the phylogenetic trees unambiguously showed that a monophyletic POPB lineage clearly conflicted with the species phylogeny. In contrast, phylogeny of POPA genes resembled the species phylogeny. Topology and divergence tests showed that the POPB lineage was robust and these genes exhibited significantly shorter genetic distances than those of the house-keeping rbp2, a characteristic feature of genes with horizontal gene transfer (HGT) background. Consistently, same scenario applied to the only MSDIN, LvAMA1, in the genome.To the best of our knowledge, this is the first reported genome of Lepiota. The analyses of the toxin genes indicate that the cyclic peptides are synthesized through a ribosomal mechanism. The toxin genes, LvAMA1 and LvPOPB, are not in the vicinity of each other. Phylogenetic and evolutionary studies suggest that HGT is the underlining cause for the occurrence of POPB and MSDIN in Amanita, Galerina and Lepiota, which are allocated in three distantly-related families.