Annotation Archives - Page 23 of 34

April 21, 2020

A new reference genome sequence for Caenorhabditis elegans?

A new study “recompletes” the C. elegans genome sequence, revealing hitherto unseen genes.

April 21, 2020

Long-read sequencing identified intronic repeat expansions in SAMD12 from Chinese pedigrees affected with familial cortical myoclonic tremor with epilepsy.

The locus for familial cortical myoclonic tremor with epilepsy (FCMTE) has long been mapped to 8q24 in linkage studies, but the causative mutations remain unclear. Recently, expansions of intronic TTTCA and TTTTA repeat motifs within SAMD12 were found to be involved in the pathogenesis of FCMTE in Japanese pedigrees. We aim to identify the causative mutations of FCMTE in Chinese pedigrees. We performed genetic linkage analysis by microsatellite markers in a five-generation Chinese pedigree with 55 members. We also used array-comparative genomic hybridisation (CGH) and next-generation sequencing (NGS) technologies (whole-exome sequencing, capture region deep sequencing and whole-genome sequencing) to identify the causative mutations in the disease locus. Recently, we used low-coverage (~10×) long-read genome sequencing (LRS) on the PacBio Sequel and Oxford Nanopore platforms to identify the causative mutations, and used repeat-primed PCR for validation of the repeat expansions. Linkage analysis mapped the disease locus to 8q23.3-24.23. Array-CGH and NGS failed to identify causative mutations in this locus. LRS identified the intronic TTTCA and TTTTA repeat expansions in SAMD12 as the causative mutations, thus corroborating the recently published results in Japanese pedigrees. We identified the pentanucleotide repeat expansion in SAMD12 as the causative mutation in Chinese FCMTE pedigrees. Our study also suggested that LRS is an effective tool for molecular diagnosis of genetic disorders, especially for neurological diseases that cannot be positively diagnosed by conventional clinical microarray and NGS technologies. © Author(s) (or their employer(s)) 2019. No commercial re-use. See rights and permissions. Published by BMJ.

April 21, 2020

Complete chloroplast genome sequence of Carthamus tinctorius L. from PacBio Sequel Platform

Carthamus tinctorius L., also known as safflower, is an important oil crop planted worldwide. The complete chloroplast (cp) genome was reported in this study using the PacBio Sequel Platform. The cp genome with a total size of 152,963 bp consisted of two inverted repeats (25,128 bp) separated by a large single-copy region (84,124 bp) and a small single-copy region (18,583 bp). Further annotation revealed the cp genome contains 112 genes, including 79 protein-coding genes, 29 tRNA genes, and 4 rRNA genes. The information of the cp genome will be useful for investigation of evolution and molecular breeding of safflower in the future.
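The figures in this abstract can be cross-checked with simple arithmetic: a quadripartite chloroplast genome is the sum of two inverted-repeat copies plus the large and small single-copy regions. The sketch below is only an illustration of that check, using the numbers quoted above; the function name is hypothetical and not taken from the study.

```python
# Region sizes in base pairs, as quoted in the abstract above.
INVERTED_REPEAT = 25_128
LARGE_SINGLE_COPY = 84_124
SMALL_SINGLE_COPY = 18_583


def quadripartite_total(ir: int, lsc: int, ssc: int) -> int:
    """Total length of a cp genome made of two IR copies, one LSC and one SSC."""
    return 2 * ir + lsc + ssc


total_bp = quadripartite_total(INVERTED_REPEAT, LARGE_SINGLE_COPY, SMALL_SINGLE_COPY)

# Gene counts as quoted: protein-coding + tRNA + rRNA.
gene_total = 79 + 29 + 4

print(total_bp, gene_total)
```

Both sums agree with the reported totals of 152,963 bp and 112 genes.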

April 21, 2020

Carbohydrate catabolic capability of a Flavobacteriia bacterium isolated from hadal water.

Flavobacteriia are abundant in many marine environments including hadal waters, as demonstrated recently. However, it is unclear how this flavobacterial population adapts to hadal conditions. In this study, extensive comparative genomic analyses were performed for the flavobacterial strain Euzebyella marina RN62 isolated from the Mariana Trench hadal water in low abundance. The complete genome of RN62 possessed a considerable number of carbohydrate-active enzymes with a different composition. There was a predominance of GH family 13 proteins compared to closely related relatives, suggesting that RN62 has preserved a certain capacity for carbohydrate utilization and that the hadal ocean may hold an organic matter reservoir distinct from the surface ocean. Additionally, RN62 possessed potential intracellular cycling of the glycogen/starch pathway, which may serve as a strategy for carbon storage and consumption in response to nutrient pulse and starvation. Moreover, the discovery of higher glycoside hydrolase dissimilarities among Flavobacteriia, compared to peptidases and transporters, suggested variation in polysaccharide utilization related traits as an important ecophysiological factor in response to environmental alterations, such as decreased labile organic carbon in hadal waters. The presence of abundant toxin exporting, transcription and signal transduction related genes in RN62 may further help to survive in hadal conditions, including high pressure/low temperature. Copyright © 2019 Elsevier GmbH. All rights reserved.

April 21, 2020

Evolution of Goat’s Rue Rhizobia (Neorhizobium galegae): Analysis of Polymorphism of the Nitrogen Fixation and Nodule Formation Genes

The goat’s rue rhizobia (Neorhizobium galegae) represent a convenient model to study the evolution and speciation of symbiotic bacteria. This rhizobial species is composed of two biovars (bv. orientalis and bv. officinalis), which form N2-fixing nodules with certain species of goat’s rue (Galega orientalis and G. officinalis). The cross-inoculation between them results in the formation of nodules unable to fix nitrogen. On the basis of the data on the whole-genome sequencing, we studied the nucleotide polymorphism of 11 N. galegae strains isolated from the North Caucasus ecosystems, where G. orientalis has higher diversity than G. officinalis. The low level of differences in the polymorphism within the group of the sym genes in comparison with the nonsymbiotic genes can be associated with the active participation of host plants in the evolution of rhizobia. The intragenic polymorphism of bv. orientalis proved to be significantly higher than that of bv. officinalis. The level of polymorphism of nonsymbiotic genes was lower than that of the symbiotic genes, which are functionally more homogeneous. The divergence of the nitrogen fixation genes (nif/fix) is more pronounced than that of the nodule formation genes (nod) in the N. galegae biovars. These facts indicate the leading role of the host-specific nitrogen fixation in the evolution of the studied rhizobial species.

April 21, 2020

Genetic characterization and potential molecular dissemination mechanism of tet(31) gene in Aeromonas caviae from an oxytetracycline wastewater treatment system.

Recently, the rarely reported tet(31) tetracycline resistance determinant was commonly found in Aeromonas salmonicida, Gallibacterium anatis, and Oblitimonas alkaliphila isolated from farming animals and related environment. However, its distribution in other bacteria and potential molecular dissemination mechanism in environment are still unknown. The purpose of this study was to investigate the potential mechanism underlying dissemination of tet(31) by analysing the tet(31)-carrying fragments in A. caviae strains isolated from an aerobic biofilm reactor treating oxytetracycline bearing wastewater. Twenty-three A. caviae strains were screened for the tet(31) gene by polymerase chain reaction (PCR). Three strains (two harbouring tet(31), one not) were subjected to whole genome sequencing using the PacBio RSII platform. Seventeen A. caviae strains carried the tet(31) gene and exhibited high resistance levels to oxytetracycline with minimum inhibitory concentrations (MICs) ranging from 256 to 512 mg/L. tet(31) was comprised of the transposon Tn6432 on the chromosome of A. caviae, and Tn6432 was also found in 15 additional tet(31)-positive A. caviae isolates by PCR. More important, Tn6432 was located on an integrative conjugative element (ICE)-like element, which could mediate the dissemination of the tet(31)-carrying transposon Tn6432 between bacteria. Comparative analysis demonstrated that Tn6432 homologs with the structure ISCR2-ΔphzF-tetR(31)-tet(31)-ΔglmM-sul2 were also carried by A. salmonicida, G. anatis, and O. alkaliphila, suggesting that this transposon can be transferred between species and even genera. This work provides the first report on the identification of the tet(31) gene in A. caviae, and will be helpful in exploring the dissemination mechanisms of tet(31) in water environment. Copyright © 2018. Published by Elsevier B.V.

April 21, 2020

Iron-associated protein interaction networks reveal the key functional modules related to survival and virulence of Pasteurella multocida.

Pasteurella multocida causes respiratory infectious diseases in a multitude of birds and mammals. A number of virulence-associated genes were reported across different strains of P. multocida, including those involved in the iron transport and metabolism. Comparative iron-associated genes of P. multocida among different animal hosts towards their interaction networks have not been fully revealed. Therefore, this study aimed to identify the iron-associated genes from core- and pan-genomes of fourteen P. multocida strains and to construct iron-associated protein interaction networks using genome-scale network analysis which might be associated with the virulence. Results showed that these fourteen strains had 1587 genes in the core-genome and 3400 genes constituting their pan-genome. Out of these, 2651 genes associated with iron transport and metabolism were selected to construct the protein interaction networks and 361 genes were incorporated into the iron-associated protein interaction network (iPIN) consisting of nine different iron-associated functional modules. After comparing with the virulence factor database (VFDB), 21 virulence-associated proteins were determined and 11 of these belonged to the heme biosynthesis module. From this study, the core heme biosynthesis module and the core outer membrane hemoglobin receptor HgbA were proposed as candidate targets to design novel antibiotics and vaccines for preventing pasteurellosis across the serotypes or animal hosts for enhanced precision agriculture to ensure sustainability in food security. Copyright © 2018. Published by Elsevier Ltd.

April 21, 2020

Toxin and genome evolution in a Drosophila defensive symbiosis.

Defenses conferred by microbial symbionts play a vital role in the health and fitness of their animal hosts. An important outstanding question in the study of defensive symbiosis is what determines long term stability and effectiveness against diverse natural enemies. In this study, we combine genome and transcriptome sequencing, symbiont transfection and parasite protection experiments, and toxin activity assays to examine the evolution of the defensive symbiosis between Drosophila flies and their vertically transmitted Spiroplasma bacterial symbionts, focusing in particular on ribosome-inactivating proteins (RIPs), symbiont-encoded toxins that have been implicated in protection against both parasitic wasps and nematodes. Although many strains of Spiroplasma, including the male-killing symbiont (sMel) of Drosophila melanogaster, protect against parasitic wasps, only the strain (sNeo) that infects the mycophagous fly Drosophila neotestacea appears to protect against parasitic nematodes. We find that RIP repertoire is a major differentiating factor between strains that do and do not offer nematode protection, and that sMel RIPs do not show activity against nematode ribosomes in vivo. We also discovered a strain of Spiroplasma infecting a mycophagous phorid fly, Megaselia nigra. Although both the host and its Spiroplasma are distantly related to D. neotestacea and its symbiont, genome sequencing revealed that the M. nigra symbiont encodes abundant and diverse RIPs, including plasmid-encoded toxins that are closely related to the RIPs in sNeo. Our results suggest that distantly related Spiroplasma RIP toxins may perform specialized functions with regard to parasite specificity and suggest an important role for horizontal gene transfer in the emergence of novel defensive phenotypes.

April 21, 2020

Confident phylogenetic identification of uncultured prokaryotes through long read amplicon sequencing of the 16S-ITS-23S rRNA operon.

Amplicon sequencing of the 16S rRNA gene is the predominant method to quantify microbial compositions and to discover novel lineages. However, traditional short amplicons often do not contain enough information to confidently resolve their phylogeny. Here we present a cost-effective protocol that amplifies a large part of the rRNA operon and sequences the amplicons with PacBio technology. We tested our method on a mock community and developed a read-curation pipeline that reduces the overall read error rate to 0.18%. Applying our method on four environmental samples, we captured near full-length rRNA operon amplicons from a large diversity of prokaryotes. The method operated at moderately high-throughput (22,286-37,850 raw ccs reads) and generated a large amount of putative novel archaeal 23S rRNA gene sequences compared to the archaeal SILVA database. These long amplicons allowed for higher resolution during taxonomic classification by means of long (~1000 bp) 16S rRNA gene fragments and for substantially more confident phylogenies by means of combined near full-length 16S and 23S rRNA gene sequences, compared to shorter traditional amplicons (250 bp of the 16S rRNA gene). We recommend our method to those who wish to cost-effectively and confidently estimate the phylogenetic diversity of prokaryotes in environmental samples at high throughput. © 2019 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd.

April 21, 2020

Long-read sequence capture of the haemoglobin gene clusters across codfish species.

Combining high-throughput sequencing with targeted sequence capture has become an attractive tool to study specific genomic regions of interest. Most studies have so far focused on the exome using short-read technology. These approaches are not designed to capture intergenic regions needed to reconstruct genomic organization, including regulatory regions and gene synteny. Here, we demonstrate the power of combining targeted sequence capture with long-read sequencing technology for comparative genomic analyses of the haemoglobin (Hb) gene clusters across eight species separated by up to 70 million years. Guided by the reference genome assembly of the Atlantic cod (Gadus morhua) together with genome information from draft assemblies of selected codfishes, we designed probes covering the two Hb gene clusters. Use of custom-made barcodes combined with PacBio RSII sequencing led to highly continuous assemblies of the LA (~100 kb) and MN (~200 kb) clusters, which include syntenic regions of coding and intergenic sequences. Our results revealed an overall conserved genomic organization of the Hb genes within this lineage, yet with several, lineage-specific gene duplications. Moreover, for some of the species examined, we identified amino acid substitutions at two sites in the Hbb1 gene as well as length polymorphisms in its regulatory region, which has previously been linked to temperature adaptation in Atlantic cod populations. This study highlights the use of targeted long-read capture as a versatile approach for comparative genomic studies by generation of a cross-species genomic resource elucidating the evolutionary history of the Hb gene family across the highly divergent group of codfishes. © 2018 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.

April 21, 2020

Comparative genomic analysis of Lactobacillus mucosae LM1 identifies potential niche-specific genes and pathways for gastrointestinal adaptation.

Lactobacillus mucosae is currently of interest as putative probiotics due to their metabolic capabilities and ability to colonize host mucosal niches. L. mucosae LM1 has been studied in its functions in cell adhesion and pathogen inhibition, etc. It demonstrated unique abilities to use energy from carbohydrate and non-carbohydrate sources. Due to these functions, we report the first complete genome sequence of an L. mucosae strain, L. mucosae LM1. Analysis of the pan-genome in comparison with closely-related Lactobacillus species identified a complete glycogen metabolism pathway, as well as folate biosynthesis, complementing previous proteomic data on the LM1 strain. It also revealed common and unique niche-adaptation genes among the various L. mucosae strains. The aim of this study was to derive genomic information that would reveal the probable mechanisms underlying the probiotic effect of L. mucosae LM1, and provide a better understanding of the nature of L. mucosae sp. Copyright © 2017 Elsevier Inc. All rights reserved.

April 21, 2020

Population Genome Sequencing of the Scab Fungal Species Venturia inaequalis, Venturia pirina, Venturia aucupariae and Venturia asperata.

The Venturia genus comprises fungal species that are pathogens on Rosaceae host plants, including V. inaequalis and V. asperata on apple, V. aucupariae on sorbus and V. pirina on pear. Although the genetic structure of V. inaequalis populations has been investigated in detail, genomic features underlying these subdivisions remain poorly understood. Here, we report whole genome sequencing of 87 Venturia strains that represent each species and each population within V. inaequalis. We present a PacBio genome assembly for the V. inaequalis EU-B04 reference isolate. The size of selected genomes was determined by flow cytometry, and varied from 45 to 93 Mb. Genome assemblies of V. inaequalis and V. aucupariae contain a high content of transposable elements (TEs), most of which belong to the Gypsy or Copia LTR superfamilies and have been inactivated by Repeat-Induced Point mutations. The reference assembly of V. inaequalis presents a mosaic structure of GC-equilibrated regions that mainly contain predicted genes and AT-rich regions, mainly composed of TEs. Six pairs of strains were identified as clones. Single-Nucleotide Polymorphism (SNP) analysis between these clones revealed a high number of SNPs that are mostly located in AT-rich regions due to misalignments and allowed determining a false discovery rate. The availability of these genome sequences is expected to stimulate genetics and population genomics research of Venturia pathogens. Especially, it will help understanding the evolutionary history of Venturia species that are pathogenic on different hosts, a history that has probably been substantially influenced by TEs. Copyright © 2019 Le Cam et al.

April 21, 2020

High Quality Draft Genome of Arogyapacha (Trichopus zeylanicus), an Important Medicinal Plant Endemic to Western Ghats of India.

Arogyapacha, the local name of Trichopus zeylanicus, is a rare, indigenous medicinal plant of India. This plant is famous for its traditional use as an instant energy stimulant. So far, no genomic resource is available for this important plant and hence its metabolic pathways are poorly understood. Here, we report on a high-quality draft assembly of the approximately 713.4 Mb genome of T. zeylanicus, the first draft genome from the genus Trichopus. The assembly was generated in a hybrid approach using Illumina short reads and PacBio long reads. The total assembly comprised 22,601 scaffolds with an N50 value of 433.3 Kb. We predicted 34,452 protein-coding genes in the T. zeylanicus genome and found that a significant portion of these predicted genes were associated with various secondary metabolite biosynthetic pathways. Comparative genome analysis revealed extensive gene collinearity between T. zeylanicus and its closely related plant species. The present genome and annotation data provide an essential resource to speed up the research on secondary metabolism, breeding and molecular evolution of T. zeylanicus. Copyright © 2019 Chellappan et al.
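The N50 value quoted in this abstract is a standard assembly contiguity statistic: the length of the scaffold at which the running total, taken from longest to shortest, first reaches half of the assembly size. The sketch below shows one common way to compute it. The function name and the toy scaffold lengths are hypothetical illustrations, not data from the study.

```python
from typing import Iterable


def n50(lengths: Iterable[int]) -> int:
    """Return the N50 of a set of scaffold lengths.

    Scaffolds are visited from longest to shortest; the N50 is the length
    of the scaffold at which the cumulative sum first reaches at least
    half of the total assembly length.
    """
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("at least one scaffold length is required")
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    return ordered[-1]  # not reached for non-negative lengths


# Toy example (made-up lengths): total = 290, half = 145.
# Cumulative sums: 80, 150 -> the second scaffold (70) crosses the halfway point.
toy_scaffolds = [20, 80, 40, 70, 30, 50]
toy_n50 = n50(toy_scaffolds)
print(toy_n50)
```

A larger N50 for the same total size indicates that more of the assembly sits in long scaffolds, which is why it is commonly reported alongside the scaffold count.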

April 21, 2020

De Novo Genome Sequence Assembly of Dwarf Coconut (Cocos nucifera L. ‘Catigan Green Dwarf’) Provides Insights into Genomic Variation Between Coconut Types and Related Palm Species.

We report the first whole genome sequence (WGS) assembly and annotation of a dwarf coconut variety, ‘Catigan Green Dwarf’ (CATD). The genome sequence was generated using the PacBio SMRT sequencing platform at 15X coverage of the expected genome size of 2.15 Gbp, which was corrected with assembled 50X Illumina paired-end MiSeq reads of the same genome. The draft genome was improved through Chicago sequencing to generate a scaffold assembly that results in a total genome size of 2.1 Gbp consisting of 7,998 scaffolds with N50 of 570,487 bp. The final assembly covers around 97.6% of the estimated genome size of coconut ‘CATD’ based on homozygous k-mer peak analysis. A total of 34,958 high-confidence gene models were predicted and functionally associated to various economically important traits, such as pest/disease resistance, drought tolerance, coconut oil biosynthesis, and putative transcription factors. The assembled genome was used to infer the evolutionary relationship within the palm family based on genomic variations and synteny of coding gene sequences. Data show that at least three (3) rounds of whole genome duplication occurred and are commonly shared by these members of the Arecaceae family. A total of 7,139 unique SSR markers were designed to be used as a resource in marker-based breeding. In addition, we discovered 58,503 variants in coconut by aligning the Hainan Tall (HAT) WGS reads to the non-repetitive regions of the assembled CATD genome. The gene markers and genome-wide SSR markers established here will facilitate the development of varieties with resilience to climate change, resistance to pests and diseases, and improved oil yield and quality. Copyright © 2019 Lantican et al.

April 21, 2020

Complete Genome Sequence of Saccharospirillum mangrovi HK-33T Sheds Light on the Ecological Role of a Bacterium in Mangrove Sediment Environment.

We present the genome sequence of Saccharospirillum mangrovi HK-33T, isolated from a mangrove sediment sample in Haikou, China. The complete genome of S. mangrovi HK-33T consisted of a single-circular chromosome with the size of 3,686,911 bp as well as an average G + C content of 57.37%, and contained 3,383 protein-coding genes, 4 operons of 16S-23S-5S rRNA genes, and 52 tRNA genes. Genomic annotation indicated that the genome of S. mangrovi HK-33T had many genes related to oligosaccharide and polysaccharide degradation and utilization of polyhydroxyalkanoate. For nitrogen cycle, genes encoding nitrate and nitrite reductase, glutamate dehydrogenase, glutamate synthase, and glutamine synthetase could be found. For phosphorus cycle, genes related to polyphosphate kinases (ppk1 and ppk2), the high-affinity phosphate-specific transport (Pst) system, and the low-affinity inorganic phosphate transporter (pitA) were predicted. For sulfur cycle, cysteine synthase and type III acyl coenzyme A transferase (dddD) coding genes were searched out. This study provides evidence about carbon, nitrogen, phosphorus, and sulfur metabolic patterns of S. mangrovi HK-33T and broadens our understandings about ecological roles of this bacterium in the mangrove sediment environment.
