July 7, 2019

GenomeLandscaper: Landscape analysis of genome-fingerprints maps assessing chromosome architecture.

Assessing correctness of an assembled chromosome architecture is a central challenge. We create a geometric analysis method (called GenomeLandscaper) to conduct landscape analysis of genome-fingerprints maps (GFM), trace large-scale repetitive regions, and assess their impacts on the global architectures of assembled chromosomes. We develop an alignment-free method for phylogenetics analysis. The human Y chromosomes (GRCh.chrY, HuRef.chrY and YH.chrY) are analysed as a proof-of-concept study. We construct a galaxy of genome-fingerprints maps (GGFM) for them, and a landscape compatibility among relatives is observed. But a long sharp straight line on the GGFM breaks such a landscape compatibility, distinguishing GRCh38p1.chrY (and throughout GRCh38p7.chrY) from GRCh37p13.chrY, HuRef.chrY and YH.chrY. We delete a 1.30-Mbp target segment to rescue the landscape compatibility, matching the antecedent GRCh37p13.chrY. We re-locate it into the modelled centromeric and pericentromeric region of GRCh38p10.chrY, matching a gap placeholder of GRCh37p13.chrY. We decompose it into sub-constituents (such as BACs, interspersed repeats, and tandem repeats) and trace their homologues by phylogenetics analysis. We elucidate that most examined tandem repeats are of reasonable quality, but the BAC-sized repeats, 173U1020C (176.46 Kbp) and 5U41068C (205.34 Kbp), are likely over-repeated. These results offer unique insights into the centromeric and pericentromeric regions of the human Y chromosomes.


July 7, 2019

The odyssey of the ancestral Escherich strain through culture collections: an example of allopatric diversification.

More than a century ago, Theodor Escherich isolated the bacterium that was to become Escherichia coli, one of the most studied organisms. Not long after, the strain began an odyssey and landed in many laboratories across the world. As laboratory culture conditions could be responsible for major changes in bacterial strains, we conducted a genome analysis of isolates of this emblematic strain from different culture collections (England, France, the United States, Germany). Strikingly, many discrepancies between the isolates were observed, as revealed by multilocus sequence typing (MLST), the presence of virulence-associated genes, core genome MLST, and single nucleotide polymorphism/indel analyses. These differences are correlated with the phylogeographic history of the strain and were due to an unprecedented number of mutations in coding DNA repair functions such as mismatch repair (MutL) and oxidized guanine nucleotide pool cleaning (MutT), conferring a specific mutational spectrum and leading to a mutator phenotype. The mutator phenotype was probably acquired during subculturing and corresponded to second-order selection. Furthermore, all of the isolates exhibited hypersusceptibility to antibiotics due to mutations in efflux pump- and porin-encoding genes, as well as a specific mutation in the sigma factor-encoding gene rpoS. These defects reflect a self-preservation and nutritional competence tradeoff allowing survival under the starvation conditions imposed by storage. From a clinical point of view, dealing with such mutator strains can lead microbiologists to draw false conclusions about isolate relatedness and may impact therapeutic effectiveness. IMPORTANCE Mutator phenotypes have been described in laboratory-evolved bacteria, as well as in natural isolates. Several genes can be impacted, each of them being associated with a typical mutational spectrum.
By studying one of the oldest strains available, the ancestral Escherich strain, we were able to identify its mutator status leading to tremendous genetic diversity among the isolates from various collections and allowing us to reconstruct the phylogeographic history of the strain. This mutator phenotype was probably acquired during the storage of the strain, promoting adaptation to a specific environment. Other mutations in rpoS and efflux pump- and porin-encoding genes highlight the acclimatization of the strain through self-preservation and nutritional competence regulation. This strain history can be viewed as unintentional experimental evolution in culture collections all over the world since 1885, mimicking the long-term experimental evolution of E. coli of Lenski et al. (O. Tenaillon, J. E. Barrick, N. Ribeck, D. E. Deatherage, J. L. Blanchard, A. Dasgupta, G. C. Wu, S. Wielgoss, S. Cruveiller, C. Médigue, D. Schneider, and R. E. Lenski, Nature 536:165-170, 2016, https://doi.org/10.1038/nature18959) that shares numerous molecular features.


July 7, 2019

Development of molecular markers linked to powdery mildew resistance gene Pm4b by combining SNP discovery from transcriptome sequencing data with bulked segregant analysis (BSR-Seq) in wheat.

Powdery mildew resistance gene Pm4b, originating from Triticum persicum, is effective against the prevalent Blumeria graminis f. sp. tritici (Bgt) isolates from certain regions of wheat production in China. The lack of molecular markers tightly linked to the target gene prevents the precise identification of Pm4b during the application of molecular marker-assisted selection (MAS). The strategy that combines the RNA-Seq technique and the bulked segregant analysis (BSR-Seq) was applied in an F2:3 mapping population (237 families) derived from a pair of isogenic lines VPM1/7*Bainong 3217 F4 (carrying Pm4b) and Bainong 3217 to develop more closely linked molecular markers. RNA-Seq analysis of the two phenotypically contrasting RNA bulks prepared from the representative F2:3 families generated 20,745,939 and 25,867,480 high-quality read pairs, and 82.8 and 80.2% of them were uniquely mapped to the wheat whole genome draft assembly for the resistant and susceptible RNA bulks, respectively. Variant calling identified 283,866 raw single nucleotide polymorphisms (SNPs) and InDels between the two bulks. The SNPs that were closely associated with the powdery mildew resistance were concentrated on chromosome 2AL. Among the 84 variants that were potentially associated with the disease resistance trait, 46 variants were enriched in a region of about 25 Mb at the distal end of chromosome arm 2AL. Four Pm4b-linked SNP markers were developed from these variants. Based on the sequences of Chinese Spring where these polymorphic SNPs were located, 98 SSR primer pairs were designed to develop distal markers flanking the Pm4b gene. Three SSR markers, Xics13, Xics43, and Xics76, were incorporated in the new genetic linkage map, which located Pm4b in a 3.0 cM genetic interval spanning a 6.7 Mb physical genomic region. This region had a collinear relationship with Brachypodium distachyon chromosome 5, rice chromosome 4, and sorghum chromosome 6.
Seven genes associated with disease resistance were predicted in this collinear genomic region, which included C2 domain protein, peroxidase activity protein, protein kinases of PKc_like super family, Mlo family protein, and catalytic domain of the serine/threonine kinases (STKc_IRAK like super family). The markers developed in the present study facilitate identification of Pm4b during its MAS practice.


July 7, 2019

Identification and expression analysis of wheat TaGF14 genes.

The 14-3-3 gene family members play key roles in various cellular processes. However, little is known about the numbers and roles of 14-3-3 genes in wheat. The aims of this study were to identify the number of TaGF14 genes in wheat by searching its whole genome with BLAST, to study their phylogenetic relationships with those of other plant species, and to discuss the functions of TaGF14s. The results showed that common wheat harbored 20 TaGF14 genes, located on wheat chromosome groups 2, 3, 4, and 7. Of these, eighteen TaGF14s are non-ε proteins, and two, TaGF14i and TaGF14f, are ε proteins. Phylogenetic analysis indicated that these genes were divided into six clusters: cluster 1 (TaGF14d, TaGF14g, TaGF14j, TaGF14h, TaGF14c, and TaGF14n); cluster 2 (TaGF14k); cluster 3 (TaGF14b, TaGF14l, TaGF14m, and TaGF14s); cluster 4 (TaGF14a, TaGF14e, and TaGF14r); cluster 5 (TaGF14i and TaGF14f); and cluster 6 (TaGF14o, TaGF14p, TaGF14q, and TaGF14t). Tissue-specific gene expression suggested that all TaGF14s were likely constitutively expressed, except two genes, i.e., TaGF14p and TaGF14f. The highest levels of TaGF14 transcripts were observed in developing grains at 20 days post anthesis (DPA), especially for TaGF14j and TaGF14l. Five genes, i.e., TaGF14c, TaGF14d, TaGF14g, TaGF14h, and TaGF14j, were up-regulated under drought stress at both 1 and 6 h, suggesting that these genes play a vital role in combating drought stress. However, all the TaGF14s were down-regulated under heat stress at both 1 and 6 h, indicating that TaGF14s may be negatively associated with heat stress, either by reducing their expression to combat heat stress or through other pathways. These results suggested that cluster 1 genes, e.g., TaGF14j, may participate throughout wheat development, e.g., in grain filling (starch biosynthesis), and may also participate in combating drought stress.
Subsequently, a homolog of TaGF14j, TaGF14-JM22, was cloned by RACE and used to validate its function. Immunoblotting results showed that TaGF14-JM22 protein, closely related to TaGF14d, TaGF14g, and TaGF14j, can interact with AGP-L, SSI, SSII, SBEIIa, and SBEIIb in developing grains, suggesting that TaGF14s located on group 4 may be involved in starch biosynthesis. Therefore, it is possible to develop starch-rich wheat cultivars by modifying TaGF14s.


July 7, 2019

Complete genome sequence of uropathogenic Escherichia coli isolate UPEC 26-1.

Urinary tract infections (UTIs) are among the most common infections in humans, predominantly caused by uropathogenic Escherichia coli (UPEC). The diverse genomes of UPEC strains mostly impede disease prevention and control measures. In this study, we comparatively analyzed the whole genome sequence of a highly virulent UPEC strain, namely UPEC 26-1, which was isolated from a urine sample of a patient suffering from UTI in Korea. Whole genome analysis showed that the genome consists of one circular chromosome of 5,329,753 bp, comprising 5064 protein-coding genes, 122 RNA genes (94 tRNA, 22 rRNA and 6 ncRNA genes), and 100 pseudogenes, with an average G+C content of 50.56%. In addition, we identified 8 prophage regions (5 intact, 2 incomplete and 1 questionable) and 63 genomic islands (GIs), suggesting the possibility of horizontal gene transfer in this strain. Comparative genome analysis of UPEC 26-1 with the UPEC strain CFT073 revealed an average nucleotide identity of 99.7%. The genome comparison with CFT073 reveals major differences in the genome of UPEC 26-1 that would explain its increased virulence and biofilm formation. Nineteen of the total GIs were unique to UPEC 26-1 compared to CFT073, and nine of them harbored unique genes that are involved in virulence, multidrug resistance, biofilm formation and bacterial pathogenesis. The data from this study will assist in future studies of UPEC strains to develop effective control measures.
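
The average G+C content quoted above is the percentage of G and C bases among the unambiguous bases of a sequence. A minimal Python sketch of that calculation follows; the function name `gc_content` and the choice to ignore ambiguous bases such as N are assumptions made for illustration, not details taken from the study.

```python
def gc_content(seq):
    """Return the G+C percentage of a nucleotide sequence.

    Assumption: only A, C, G and T are counted; ambiguous symbols
    such as N are ignored rather than counted in the denominator.
    """
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    total = gc + at
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return 100.0 * gc / total


if __name__ == "__main__":
    # Toy sequence, not data from UPEC 26-1.
    print(gc_content("GGCCAATT"))
```

For a real genome, the same function could be applied to the concatenated chromosome sequence read from a FASTA file.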


July 7, 2019

scanPAV: a pipeline for extracting presence-absence variations in genome pairs.

The recent technological advances in genome sequencing techniques have resulted in an exponential increase in the number of sequenced human and non-human genomes. The ever increasing number of assemblies generated by novel de novo pipelines and strategies demands the development of new software to evaluate assembly quality and completeness. One way to determine the completeness of an assembly is by detecting its Presence-Absence variations (PAV) with respect to a reference, where PAVs between two assemblies are defined as the sequences present in one assembly but entirely missing in the other one. Beyond assembly error or technology bias, PAVs can also reveal real genome polymorphism, consequence of species or individual evolution, or horizontal transfer from viruses and bacteria. We present scanPAV, a pipeline for pairwise assembly comparison to identify and extract sequences present in one assembly but not the other. In this note, we use the GRCh38 reference assembly to assess the completeness of six human genome assemblies from various assembly strategies and sequencing technologies including Illumina short reads, 10x Genomics linked-reads, PacBio and Oxford Nanopore long reads, and Bionano optical maps. We also discuss the PAV polymorphism of seven Tasmanian devil whole genome assemblies of normal animal tissues and devil facial tumour 1 (DFT1) and 2 (DFT2) samples, and the identification of bacterial sequences as contamination in some of the tumorous assemblies. The pipeline is available under the MIT License at https://github.com/wtsi-hpag/scanPAV. Supplementary data are available at Bioinformatics online.
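
The PAV definition above (sequence present in one assembly but missing from the other) can be illustrated with a toy Python sketch. This is not how scanPAV works internally; it swaps in a simple exact k-mer lookup purely to show the idea, and the function names `kmers` and `absent_segments` as well as the default `k` are hypothetical.

```python
def kmers(seq, k):
    """Return the set of all length-k substrings of seq (uppercased)."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def absent_segments(query, reference, k=5):
    """Report maximal stretches of `query` not covered by any k-mer shared with `reference`.

    Toy illustration only: exact k-mer matching stands in for the
    alignment step a real pipeline would use.
    Returns a list of (start, end, sequence) with end exclusive.
    """
    ref_kmers = kmers(reference, k)
    query = query.upper()
    covered = [False] * len(query)
    # Mark every query base that lies inside a k-mer also found in the reference.
    for i in range(len(query) - k + 1):
        if query[i:i + k] in ref_kmers:
            for j in range(i, i + k):
                covered[j] = True
    # Collect maximal runs of uncovered bases.
    segments = []
    start = None
    for pos, is_covered in enumerate(covered):
        if not is_covered and start is None:
            start = pos
        elif is_covered and start is not None:
            segments.append((start, pos, query[start:pos]))
            start = None
    if start is not None:
        segments.append((start, len(query), query[start:]))
    return segments


if __name__ == "__main__":
    reference = "ACGTACGTTTGACCA"
    # The query carries a 10-base insertion that the reference lacks.
    query = reference[:8] + "GGGGGGGGGG" + reference[8:]
    print(absent_segments(query, reference, k=5))
```

Running the comparison in both directions (swapping `query` and `reference`) would give the sequences present in either assembly but absent from the other.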


July 7, 2019

Oryza rufipogon Griff.

Oryza rufipogon, the progenitor of present-day cultivated rice, O. sativa, is one of the most studied wild species of rice. It is a perennial plant commonly found in marsh or aquatic habitats of eastern and southern Asia. It has partial outcrossing behavior and is photoperiod sensitive. The flowering time usually ranges between September and November. It has been and is being exploited as a source of valuable genes and QTLs for yield components as well as resistance against biotic and abiotic stresses. A number of populations like chromosome segment substitution lines, backcross inbred lines, near-isogenic lines, and recombinant inbred lines have been developed from crosses between O. rufipogon and O. sativa as a prebreeding resource. These are being employed for broadening the genetic base of cultivated rice and diversifying the breeder’s pool. With the advent of sequencing technologies, a number of phylogenetic studies have been conducted to reveal the evolutionary relationship of O. rufipogon with cultivated rice O. sativa. Further, transcriptomic studies characterizing the effect of various abiotic stresses have been conducted on this wild species. The role of miRNA in the stress response has also been studied. Though the genetic, genomic, and transcriptomic resources are abundant, the proteomic resources for O. rufipogon are limited.


July 7, 2019

Oryza meridionalis NQ Ng

Oryza meridionalis is an AA genome species found in Northern Australia. Phylogenetic analysis places this as the most distant of the AA genome species from domesticated rice (Oryza sativa). This makes it a key genetic resource for rice improvement. A draft nuclear genome sequence is available, and the chloroplast genome has also been sequenced from many genotypes. The high amylose starch content in these taxa may be useful for developing new rice grain characteristics. Here we review the research advances made to date on this species.


July 7, 2019

Oryza glaberrima Steud.

Oryza glaberrima is the African cultivated rice species, domesticated from its wild ancestor by farmers living in the Inland Delta of the Niger River. Several studies have indicated that it has extremely narrow genetic diversity compared to both its wild progenitor, Oryza barthii, and the Asian rice, Oryza sativa, which can mainly be attributed to a severe domestication bottleneck. Despite its scarcity in farmers’ fields due to its low yield potential, high shattering and lodging susceptibility, O. glaberrima is of great value not only to Africa but also globally. Perhaps its greatest contribution to regional and global food security is as a source of genes, as it possesses resistance/tolerance to various biotic and abiotic stresses. It also has unique starch-related traits which give it good cooking and eating properties. Advances in DNA sequencing have provided useful genomic resources for African rice, key among them being whole genome sequences. Genomic tools are enabling greater understanding of the useful functional diversity found in this species. These advances have the potential to address some of the undesirable attributes found in this species which have led to its continued replacement by Asian rice. Development of a new generation of rice varieties for African farmers will therefore require the adoption of advanced molecular breeding tools, as these will allow efficient utilization of the wealth and resilience found in African rice in rice improvement.


July 7, 2019

IWTomics: testing high-resolution sequence-based ‘Omics’ data at multiple locations and scales.

With increased generation of high-resolution sequence-based ‘Omics’ data, detecting statistically significant effects at different genomic locations and scales has become key to addressing several scientific questions. IWTomics is an R/Bioconductor package (integrated in Galaxy) that, exploiting sophisticated Functional Data Analysis techniques (i.e. statistical techniques that deal with the analysis of curves), allows users to pre-process, visualize and test these data at multiple locations and scales. The package provides a friendly, flexible and complete workflow that can be employed in many genomic and epigenomic applications. IWTomics is freely available at the Bioconductor website (http://bioconductor.org/packages/IWTomics) and on the main Galaxy instance (https://usegalaxy.org/). Supplementary data are available at Bioinformatics online.


July 7, 2019

Paucibacter aquatile sp. nov. isolated from freshwater of the Nakdong River, Republic of Korea.

A Gram-negative, aerobic, motile, and rod-shaped bacterial strain designated CR182T was isolated from freshwater of the Nakdong River, Republic of Korea. Optimal growth conditions for this novel strain were found to be: 25-30 °C, pH 6.5-8.5, and 3% (w/v) NaCl. Phylogenetic analysis based on the 16S rRNA gene sequence indicates that strain CR182T belongs to the genus Paucibacter. Strain CR182T showed 98.0% 16S rRNA gene sequence similarity with Paucibacter oligotrophus CHU3T and formed a robust phylogenetic clade with this species. The average nucleotide identity value between strain CR182T and P. oligotrophus CHU3T was 78.4% and the genome-to-genome distance was 22.2% on average. The genomic DNA G+C content calculated from the genome sequence was 66.3 mol%. Predominant cellular fatty acids of strain CR182T were summed feature 3 (C16:1 ω7c and/or C16:1 ω6c) (31.2%) and C16:0 (16.0%). Its major respiratory quinone was ubiquinone Q-8. Its polar lipids consisted of diphosphatidylglycerol, phosphatidylethanolamine, and two unidentified phospholipids. Based on data obtained from this polyphasic taxonomic study, strain CR182T represents a novel species belonging to the genus Paucibacter, for which the name P. aquatile sp. nov. is proposed. The type strain is CR182T (= KCCM 90284T = NBRC 113032T).


July 7, 2019

The ‘gifted’ actinomycete Streptomyces leeuwenhoekii.

Streptomyces leeuwenhoekii strains C34T, C38, C58 and C79 were isolated from a soil sample collected from the Chaxa Lagoon, located in the Salar de Atacama in northern Chile. These streptomycetes produce a variety of new specialised metabolites with antibiotic, anti-cancer and anti-inflammatory activities. Moreover, genome mining performed on two of these strains has revealed the presence of biosynthetic gene clusters with the potential to produce new specialised metabolites. This review focusses on this new clade of Streptomyces strains, summarises the literature and presents new information on strain C34T.

