April 21, 2020

Genome-wide systematic identification of methyltransferase recognition and modification patterns.

Genome-wide analysis of DNA methylation patterns using single molecule real-time DNA sequencing has boosted the number of publicly available methylomes. However, there is a lack of tools coupling methylation patterns and the corresponding methyltransferase genes. Here we demonstrate a high-throughput method for coupling methyltransferases with their respective motifs, using automated cloning and analysing the methyltransferases in vectors carrying a strain-specific cassette containing all potential target sites. To validate the method, we analyse the genomes of the thermophile Moorella thermoacetica and the mesophile Acetobacterium woodii, two acetogenic bacteria having substantially modified genomes with 12 methylation motifs and a total of 23 methyltransferase genes. Using our method, we characterize the 23 methyltransferases, assign motifs to the respective enzymes and verify activity for 11 of the 12 motifs.


April 21, 2020

Multidrug Resistant Uropathogenic Escherichia coli ST405 With a Novel, Composite IS26 Transposon in a Unique Chromosomal Location.

Escherichia coli ST405 is an emerging urosepsis pathogen, noted for carriage of blaCTX-M, blaNDM, and a repertoire of virulence genes comparable with O25b:H4-ST131. Extraintestinal and multidrug resistant E. coli ST405 are poorly studied in Australia. Here we determined the genome sequence of a uropathogenic, multiple drug resistant E. coli ST405 (strain 2009-27) from the mid-stream urine of a hospital patient in Sydney, Australia, using a combination of Illumina and SMRT sequencing. The genome of strain 2009-27 assembled into two unitigs; a chromosome comprising 5,287,472 bp and an IncB/O plasmid, pSDJ2009-27, of 89,176 bp. In silico and phenotypic analyses showed that strain 2009-27 is a serotype O102:H6, phylogroup D ST405 resistant to ampicillin, azithromycin, kanamycin, streptomycin, trimethoprim, and sulphafurazole. The genes encoding resistance to these antibiotics reside within a novel, mobile IS26-flanked transposon, identified here as Tn6242, in the chromosomal gene yjdA. Tn6242 comprises four modules that each carries resistance genes flanked by IS26, including a class 1 integron with dfrA17 and aadA5 gene cassettes, a variant of Tn6029, and mphA. We exploited unique genetic signatures located within Tn6242 to identify strains of ST405 from Danish patients that also carry the transposon in the same chromosomal location. The acquisition of Tn6242 into yjdA in ST405 is significant because it (i) is vertically inheritable; (ii) represents a reservoir of resistance genes that can transpose onto resident/circulating plasmids; and (iii) is a site for the capture of further IS26-associated resistance gene cargo.


April 21, 2020

Extensive intraspecific gene order and gene structural variations in upland cotton cultivars.

Multiple cotton genomes (diploid and tetraploid) have been assembled. However, genomic variations between cultivars of allotetraploid upland cotton (Gossypium hirsutum L.), the most widely planted cotton species in the world, remain unexplored. Here, we use single-molecule long read and Hi-C sequencing technologies to assemble genomes of the two upland cotton cultivars TM-1 and zhongmiansuo24 (ZM24). Comparisons among TM-1 and ZM24 assemblies and the genomes of the diploid ancestors reveal a large amount of genetic variations. Among them, the top three longest structural variations are located on chromosome A08 of the tetraploid upland cotton, which account for ~30% total length of this chromosome. Haplotype analyses of the mapping population derived from these two cultivars and the germplasm panel show suppressed recombination rates in this region. This study provides additional genomic resources for the community, and the identified genetic variations, especially the reduced meiotic recombination on chromosome A08, will help future breeding.


April 21, 2020

Urinary tract colonization is enhanced by a plasmid that regulates uropathogenic Acinetobacter baumannii chromosomal genes.

Multidrug resistant (MDR) Acinetobacter baumannii poses a growing threat to global health. Research on Acinetobacter pathogenesis has primarily focused on pneumonia and bloodstream infections, even though one in five A. baumannii strains are isolated from urinary sites. In this study, we highlight the role of A. baumannii as a uropathogen. We develop the first A. baumannii catheter-associated urinary tract infection (CAUTI) murine model using UPAB1, a recent MDR urinary isolate. UPAB1 carries the plasmid pAB5, a member of the family of large conjugative plasmids that represses the type VI secretion system (T6SS) in multiple Acinetobacter strains. pAB5 confers niche specificity, as its carriage improves UPAB1 survival in a CAUTI model and decreases virulence in a pneumonia model. Comparative proteomic and transcriptomic analyses show that pAB5 regulates the expression of multiple chromosomally-encoded virulence factors besides T6SS. Our results demonstrate that plasmids can impact bacterial infections by controlling the expression of chromosomal genes.


April 21, 2020

Comprehensive identification of the full-length transcripts and alternative splicing related to the secondary metabolism pathways in the tea plant (Camellia sinensis).

Flavonoids, theanine and caffeine are the main secondary metabolites of the tea plant (Camellia sinensis), which account for the tea’s unique flavor quality and health benefits. The biosynthesis pathways of these metabolites have been extensively studied at the transcriptional level, but the regulatory mechanisms are still unclear. In this study, to explore the transcriptome diversity and complexity of tea plant, PacBio Iso-Seq and RNA-seq analysis were combined to obtain full-length transcripts and to profile the changes in gene expression during the leaf development. A total of 1,388,066 reads of insert (ROI) were generated with an average length of 1,762 bp, and more than 54% (755,716) of the ROIs were full-length non-chimeric (FLNC) reads. The Benchmarking Universal Single-Copy Orthologue (BUSCO) completeness was 92.7%. A total of 93,883 non-redundant transcripts were obtained, and 87,395 (93.1%) were new alternatively spliced isoforms. Meanwhile, 7,650 differential expression transcripts (DETs) were identified. A total of 28,980 alternative splicing (AS) events were predicted, including 1,297 differential AS (DAS) events. The transcript isoforms of the key genes involved in the flavonoid, theanine and caffeine biosynthesis pathways were characterized. Additionally, 5,777 fusion transcripts and 9,052 long non-coding RNAs (lncRNAs) were also predicted. Our results revealed that AS potentially plays a crucial role in the regulation of the secondary metabolism of the tea plant. These findings enhanced our understanding of the complexity of the secondary metabolic regulation of tea plants and provided a basis for the subsequent exploration of the regulatory mechanisms of flavonoid, theanine and caffeine biosynthesis in tea plants.
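As a quick arithmetic check of the proportions quoted in this abstract, the sketch below recomputes the two percentages from the reported counts. The function name `percent` is an illustrative choice and not part of the study's pipeline.

```python
def percent(part: int, whole: int) -> float:
    """Return part/whole expressed as a percentage."""
    return 100.0 * part / whole


# Counts as reported in the abstract.
total_roi = 1_388_066          # reads of insert (ROI)
flnc_reads = 755_716           # full-length non-chimeric reads
total_transcripts = 93_883     # non-redundant transcripts
new_isoforms = 87_395          # new alternatively spliced isoforms

flnc_pct = percent(flnc_reads, total_roi)
new_isoform_pct = percent(new_isoforms, total_transcripts)

print(f"FLNC share of ROIs: {flnc_pct:.1f}%")          # 54.4%
print(f"New isoform share:  {new_isoform_pct:.1f}%")   # 93.1%
```

Both values agree with the figures stated in the abstract ("more than 54%" and 93.1%).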


April 21, 2020

Programmable mutually exclusive alternative splicing for generating RNA and protein diversity.

Alternative splicing performs a central role in expanding genomic coding capacity and proteomic diversity. However, programming of splicing patterns in engineered biological systems remains underused. Synthetic approaches thus far have predominantly focused on controlling expression of a single protein through alternative splicing. Here, we describe a modular and extensible platform for regulating four programmable exons that undergo a mutually exclusive alternative splicing event to generate multiple functionally-distinct proteins. We present an intron framework that enforces the mutual exclusivity of two internal exons and demonstrate a graded series of consensus sequence elements of varying strengths that set the ratio of two mutually exclusive isoforms. We apply this framework to program the DNA-binding domains of modular transcription factors to differentially control downstream gene activation. This splicing platform advances an approach for generating diverse isoforms and can ultimately be applied to program modular proteins and increase coding capacity of synthetic biological systems.


April 21, 2020

Genome analysis of the rice coral Montipora capitata.

Corals comprise a biomineralizing cnidarian, dinoflagellate algal symbionts, and associated microbiome of prokaryotes and viruses. Ongoing efforts to conserve coral reefs by identifying the major stress response pathways and thereby laying the foundation to select resistant genotypes rely on a robust genomic foundation. Here we generated and analyzed a high quality long-read based ~886 Mbp nuclear genome assembly and transcriptome data from the dominant rice coral, Montipora capitata from Hawai’i. Our work provides insights into the architecture of coral genomes and shows how they differ in size and gene inventory, putatively due to population size variation. We describe a recent example of foreign gene acquisition via a bacterial gene transfer agent and illustrate the major pathways of stress response that can be used to predict regulatory components of the transcriptional networks in M. capitata. These genomic resources provide insights into the adaptive potential of these sessile, long-lived species in both natural and human influenced environments and facilitate functional and population genomic studies aimed at Hawaiian reef restoration and conservation.


April 21, 2020

Interspecies conservation of organisation and function between nonhomologous regional centromeres.

Despite the conserved essential function of centromeres, centromeric DNA itself is not conserved. The histone-H3 variant, CENP-A, is the epigenetic mark that specifies centromere identity. Paradoxically, CENP-A normally assembles on particular sequences at specific genomic locations. To gain insight into the specification of complex centromeres, here we take an evolutionary approach, fully assembling genomes and centromeres of related fission yeasts. Centromere domain organization, but not sequence, is conserved between Schizosaccharomyces pombe, S. octosporus and S. cryophilus with a central CENP-A^Cnp1 domain flanked by heterochromatic outer-repeat regions. Conserved syntenic clusters of tRNA genes and 5S rRNA genes occur across the centromeres of S. octosporus and S. cryophilus, suggesting conserved function. Interestingly, nonhomologous centromere central-core sequences from S. octosporus and S. cryophilus are recognized in S. pombe, resulting in cross-species establishment of CENP-A^Cnp1 chromatin and functional kinetochores. Therefore, despite the lack of sequence conservation, Schizosaccharomyces centromere DNA possesses intrinsic conserved properties that promote assembly of CENP-A chromatin.


April 21, 2020

Within-host evolution of Helicobacter pylori shaped by niche-specific adaptation, intragastric migrations and selective sweeps.

The human pathogen Helicobacter pylori displays extensive genetic diversity. While H. pylori is known to evolve during infection, population dynamics inside the gastric environment have not been extensively investigated. Here we obtained gastric biopsies from multiple stomach regions of 16 H. pylori-infected adults, and analyzed the genomes of 10 H. pylori isolates from each biopsy. Phylogenetic analyses suggest location-specific evolution and bacterial migration between gastric regions. Migration is significantly more frequent between the corpus and the fundus than with the antrum, suggesting that physiological differences between antral and oxyntic mucosa contribute to spatial partitioning of H. pylori populations. Associations between H. pylori gene polymorphisms and stomach niches suggest that chemotaxis, regulatory functions and outer membrane proteins contribute to specific adaptation to the antral and oxyntic mucosa. Moreover, we show that antibiotics can induce severe population bottlenecks and likely play a role in shaping the population structure of H. pylori.


April 21, 2020

Complete Genome Sequence of Sequevar 14M Ralstonia solanacearum Strain HA4-1 Reveals Novel Type III Effectors Acquired Through Horizontal Gene Transfer.

Ralstonia solanacearum, which causes bacterial wilt in a broad range of plants, is considered a “species complex” due to its significant genetic diversity. Recently, we have isolated a new R. solanacearum strain HA4-1 from Hong’an county in Hubei province of China and identified it as phylotype I, sequevar 14M (phylotype I-14M). Interestingly, we found that it can cause various disease symptoms among different potato genotypes and display different pathogenic behavior compared to a phylogenetically related strain, GMI1000. To dissect the pathogenic mechanisms of HA4-1, we sequenced its whole genome by combined sequencing technologies including Illumina HiSeq2000, PacBio RS II, and BAC-end sequencing. Genome assembly results revealed the presence of a conventional chromosome, a megaplasmid as well as a 143 kb plasmid in HA4-1. Comparative genome analysis between HA4-1 and GMI1000 shows high conservation of the general virulence factors such as secretion systems, motility, exopolysaccharides (EPS), and key regulatory factors, but significant variation in the repertoire and structure of type III effectors, which could be the determinants of their differential pathogenesis in certain potato species or genotypes. We have identified two novel type III effectors that were probably acquired through horizontal gene transfer (HGT). These novel R. solanacearum effectors display homology to several YopJ and XopAC family members. We named them RipBR and RipBS. Notably, the copy of RipBR on the plasmid is a pseudogene, while the other on the megaplasmid is normal. For RipBS, there are three copies located in the megaplasmid and plasmid, respectively. Our results have not only enriched the genome information on R. solanacearum species complex by sequencing the first sequevar 14M strain and the largest plasmid reported in R. solanacearum to date but also revealed the variation in the repertoire of type III effectors. This will greatly contribute to the future studies on the pathogenic evolution, host adaptation, and interaction between R. solanacearum and potato.


April 21, 2020

Arcobacter cryaerophilus Isolated From New Zealand Mussels Harbor a Putative Virulence Plasmid.

A wide range of Arcobacter species have been described from shellfish in various countries but their presence has not been investigated in Australasia, in which shellfish are a popular delicacy. Since several arcobacters are considered to be emerging pathogens, we undertook a small study to evaluate their presence in several different shellfish, including greenshell mussels, oysters, and abalone (paua) in New Zealand. Arcobacter cryaerophilus, a species associated with human gastroenteritis, was the only species isolated, from greenshell mussels. Whole-genome sequencing revealed a range of genomic traits in these strains that were known or associated virulence factors. Furthermore, we describe the first putative virulence plasmid in Arcobacter, containing lytic, immunoavoidance, adhesion, antibiotic resistance, and gene transfer traits, among others. Complete genome sequence determination using a combination of long- and short-read genome sequencing strategies, was needed to identify the plasmid, clearly identifying its benefits. The potential for plasmids to disseminate virulence traits among Arcobacter and other species warrants further consideration by researchers interested in the risks to public health from these organisms.


April 21, 2020

Genomic Analyses Reveal Evidence of Independent Evolution, Demographic History, and Extreme Environment Adaptation of Tibetan Plateau Agaricus bisporus.

Agaricus bisporus distributed in the Tibetan Plateau of China has high stress resistance that is valuable for breeding improvements. However, its evolutionary history, specialization, and adaptation to the extreme Tibetan Plateau environment are largely unknown. Here, we performed de novo genome sequencing of a representative Tibetan Plateau wild strain ABM and comparative genomic analysis with the reported European strains H97 and H39. The assembled ABM genome was 30.4 Mb in size, and comprised 8,562 protein-coding genes. The ABM genome shared highly conserved syntenic blocks and a few inversions with H97 and H39. The phylogenetic tree constructed from 1,276 single-copy orthologous genes in nine fungal species showed that the Tibetan Plateau and European A. bisporus diverged ~5.5 million years ago. Population genomic analysis using genome resequencing of 29 strains revealed that the Tibetan Plateau population underwent significant differentiation from the European and American populations and evolved independently, and that global climate changes critically shaped the demographic history of the Tibetan Plateau population. Moreover, we identified key genes, related to the cell wall and membrane system and to the development and defense systems, that regulate the adaptation of A. bisporus to the harsh Tibetan Plateau environment. These findings highlight the value of genomic data in assessing the evolution and adaptation of mushrooms and will enhance future genetic improvements of A. bisporus.


April 21, 2020

Multi-platform discovery of haplotype-resolved structural variation in human genomes.

The incomplete identification of structural variants (SVs) from whole-genome sequencing data limits studies of human genetic diversity and disease association. Here, we apply a suite of long-read, short-read, strand-specific sequencing technologies, optical mapping, and variant discovery algorithms to comprehensively analyze three trios to define the full spectrum of human genetic variation in a haplotype-resolved manner. We identify 818,054 indel variants (<50 bp) and 27,622 SVs (≥50 bp) per genome. We also discover 156 inversions per genome and 58 of the inversions intersect with the critical regions of recurrent microdeletion and microduplication syndromes. Taken together, our SV callsets represent a three to sevenfold increase in SV detection compared to most standard high-throughput sequencing studies, including those from the 1000 Genomes Project. The methods and the dataset presented serve as a gold standard for the scientific community allowing us to make recommendations for maximizing structural variation sensitivity for future genome sequencing studies.
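The abstract separates indels from SVs at a 50 bp size boundary. A minimal sketch of that size-based split is shown below; the function name, constant name, and example lengths are hypothetical and are not taken from the study's variant-calling code.

```python
SV_MIN_LENGTH_BP = 50  # boundary stated in the abstract: indel < 50 bp, SV >= 50 bp


def classify_variant(length_bp: int) -> str:
    """Label a variant as 'indel' or 'SV' based only on its length in bp."""
    if length_bp <= 0:
        raise ValueError("variant length must be a positive number of base pairs")
    return "SV" if length_bp >= SV_MIN_LENGTH_BP else "indel"


# Hypothetical variant lengths, for illustration only.
example_lengths = [1, 12, 49, 50, 3_000]
labels = [classify_variant(n) for n in example_lengths]
print(list(zip(example_lengths, labels)))
```

Real callsets usually also distinguish variant types (deletion, insertion, inversion); this sketch covers only the length threshold.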


April 21, 2020

Platanus-allee is a de novo haplotype assembler enabling a comprehensive access to divergent heterozygous regions.

The ultimate goal for diploid genome determination is to completely decode homologous chromosomes independently, and several phasing programs from consensus sequences have been developed. These methods work well for genomes with low heterozygosity, but many species have high heterozygosity. Additionally, there are highly divergent regions (HDRs), where the haplotype sequences differ considerably. Because HDRs are likely to direct various interesting biological phenomena, many genomic analysis targets fall within these regions. However, they cannot be accessed by existing phasing methods, and we have to adopt costly traditional methods. Here, we develop a de novo haplotype assembler, Platanus-allee ( http://platanus.bio.titech.ac.jp/platanus2 ), which initially constructs each haplotype sequence and then untangles the assembly graphs utilizing sequence links and synteny information. A comprehensive benchmark analysis reveals that Platanus-allee exhibits high recall and precision, particularly for HDRs. Using this approach, previously unknown HDRs are detected in the human genome, which may uncover novel aspects of genome variability.


April 21, 2020

A high-quality apple genome assembly reveals the association of a retrotransposon and red fruit colour.

A complete and accurate genome sequence provides a fundamental tool for functional genomics and DNA-informed breeding. Here, we assemble a high-quality genome (contig N50 of 6.99 Mb) of the apple anther-derived homozygous line HFTH1, including 22 telomere sequences, using a combination of PacBio single-molecule real-time (SMRT) sequencing, chromosome conformation capture (Hi-C) sequencing, and optical mapping. In comparison to the Golden Delicious reference genome, we identify 18,047 deletions, 12,101 insertions and 14 large inversions. We reveal that these extensive genomic variations are largely attributable to activity of transposable elements. Interestingly, we find that a long terminal repeat (LTR) retrotransposon insertion upstream of MdMYB1, a core transcriptional activator of anthocyanin biosynthesis, is associated with the red-skinned phenotype. This finding provides insights into the molecular mechanisms underlying red fruit coloration, and highlights the utility of this high-quality genome assembly in deciphering agriculturally important traits in apple.
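The contig N50 quoted above is a standard assembly contiguity statistic: the length of the shortest contig in the smallest set of longest contigs that together cover at least half of the assembly. The sketch below computes it for a made-up list of contig lengths; the function name and the toy data are illustrative assumptions, not values from the HFTH1 assembly.

```python
def n50(contig_lengths: list[int]) -> int:
    """Return the N50 of a collection of contig lengths."""
    if not contig_lengths:
        raise ValueError("at least one contig length is required")
    total = sum(contig_lengths)
    running = 0
    # Walk contigs from longest to shortest until half the assembly is covered.
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    raise AssertionError("unreachable: cumulative sum always reaches the total")


# Toy contig lengths (arbitrary units), total = 300, so half = 150.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_contigs))  # 70, because 80 + 70 = 150 covers half of 300
```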

