Barcoding Archives - Page 23 of 24

July 7, 2019

The complete chloroplast genome sequence of the medicinal plant Swertia mussotii using the PacBio RS II platform.

Swertia mussotii is an important medicinal plant that has great economic and medicinal value and is found on the Qinghai Tibetan Plateau. The complete chloroplast (cp) genome of S. mussotii is 153,431 bp in size, with a pair of inverted repeat (IR) regions of 25,761 bp each that separate a large single-copy (LSC) region of 83,567 bp and a small single-copy (SSC) region of 18,342 bp. The S. mussotii cp genome encodes 84 protein-coding genes, 37 transfer RNA (tRNA) genes, and eight ribosomal RNA (rRNA) genes. The identity, number, and GC content of S. mussotii cp genes were similar to those in the genomes of other Gentianales species. Via analysis of the repeat structure, 11 forward repeats, eight palindromic repeats, and one reverse repeat were detected in the S. mussotii cp genome. There are 45 SSRs in the S. mussotii cp genome, the majority of which are mononucleotides found in all other Gentianales species. An entire cp genome comparison study of S. mussotii and two other species in Gentianaceae was conducted. The complete cp genome sequence provides intragenic information for the cp genetic engineering of this medicinal plant.
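The region sizes quoted above can be checked against the reported total, since a quadripartite chloroplast genome consists of the LSC, the SSC, and two copies of the IR. A minimal sketch of that arithmetic follows; the function name is illustrative and not taken from the paper.

```python
# Region lengths in bp, as reported in the abstract above.
LSC = 83_567   # large single-copy region
SSC = 18_342   # small single-copy region
IR = 25_761    # length of each of the two inverted repeat copies


def cp_genome_length(lsc, ssc, ir):
    """Total length of a quadripartite cp genome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


total = cp_genome_length(LSC, SSC, IR)
print(total)  # 153431, matching the reported genome size
```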

July 7, 2019

The report of my death was an exaggeration: A review for researchers using microsatellites in the 21st century.

Microsatellites, or simple sequence repeats (SSRs), have long played a major role in genetic studies due to their typically high polymorphism. They have diverse applications, including genome mapping, forensics, ascertaining parentage, population and conservation genetics, identification of the parentage of polyploids, and phylogeography. We compare SSRs and newer methods, such as genotyping by sequencing (GBS) and restriction site associated DNA sequencing (RAD-Seq), and offer recommendations for researchers considering which genetic markers to use. We also review the variety of techniques currently used for identifying microsatellite loci and developing primers, with a particular focus on those that make use of next-generation sequencing (NGS). Additionally, we review software for microsatellite development and report on an experiment to assess the utility of currently available software for SSR development. Finally, we discuss the future of microsatellites and make recommendations for researchers preparing to use microsatellites. We argue that microsatellites still have an important place in the genomic age as they remain effective and cost-efficient markers.
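To make the idea of identifying microsatellite loci concrete, here is a minimal sketch of a perfect-repeat SSR scan using a regular expression. It is not one of the software packages the review evaluates; the function name and the minimum-repeat thresholds are assumptions chosen for illustration only.

```python
import re

# Hypothetical thresholds: motif length -> minimum number of tandem copies.
DEFAULT_MIN_REPEATS = {1: 10, 2: 6, 3: 5}


def find_ssrs(seq, min_repeats=DEFAULT_MIN_REPEATS):
    """Return sorted (start, motif, copies) tuples for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for motif_len, min_rep in min_repeats.items():
        # One motif copy followed by at least (min_rep - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "AA"),
            # so a homopolymer run is not also reported as a dinucleotide SSR.
            if motif in (motif + motif)[1:-1]:
                continue
            hits.append((match.start(), motif, len(match.group(0)) // motif_len))
    return sorted(hits)


example = "GGC" + "A" * 12 + "CGC" + "AT" * 6 + "GC"
print(find_ssrs(example))  # [(3, 'A', 12), (18, 'AT', 6)]
```

Real SSR tools also handle imperfect and compound repeats and design flanking primers, which this sketch does not attempt.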

July 7, 2019

An ultra-high density genetic linkage map of perennial ryegrass (Lolium perenne) using genotyping by sequencing (GBS) based on a reference shotgun genome assembly.

High density genetic linkage maps that are extensively anchored to assembled genome sequences of the organism in question are extremely useful in gene discovery. To facilitate this process in perennial ryegrass (Lolium perenne L.), a high density single nucleotide polymorphism (SNP)- and presence/absence variant (PAV)-based genetic linkage map has been developed in an F2 mapping population that has been used as a reference population in numerous studies. To provide a reference sequence to which to align genotyping by sequencing (GBS) reads, a shotgun assembly of one of the grandparents of the population, a tenth-generation inbred line, was created using Illumina-based sequencing. The assembly was based on paired-end Illumina reads, scaffolded by mate pair and long jumping distance reads in the range of 3-40 kb, with >200-fold initial genome coverage. A total of 169 individuals from an F2 mapping population were used to construct PstI-based GBS libraries tagged with unique 4-9 nucleotide barcodes, resulting in 284 million reads, with approx. 1.6 million reads per individual. A bioinformatics pipeline was employed to identify both SNPs and PAVs. A core genetic map was generated using high confidence SNPs, to which lower confidence SNPs and PAVs were subsequently fitted in a straightforward binning approach. The assembly comprises 424,750 scaffolds, covering 1.11 Gbp of the 2.5 Gbp perennial ryegrass genome, with a scaffold N50 of 25,212 bp and a contig N50 of 3790 bp. It is available for download, and access to a genome browser has been provided. Comparison of the assembly with available transcript and gene model data sets for perennial ryegrass indicates that approx. 570 Mbp of the gene-rich portion of the genome has been captured. An ultra-high density genetic linkage map with 3092 SNPs and 7260 PAVs was developed, anchoring just over 200 Mb of the reference assembly. The combined genetic map and assembly, combined with another recently released genome assembly, represent a significant resource for the perennial ryegrass genetics community. © The Author 2016. Published by Oxford University Press on behalf of the Annals of Botany Company. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
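The scaffold and contig N50 values quoted above summarize assembly contiguity: N50 is the length L such that sequences of length L or longer together contain at least half of the assembled bases. A minimal sketch of the calculation is shown below; the function name and the toy lengths are illustrative and are not data from the ryegrass assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Sequences are visited from longest to shortest; the N50 is the length
    of the sequence at which the running total first reaches half of the
    summed length of all sequences.
    """
    if not lengths:
        raise ValueError("n50() needs at least one sequence length")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Toy example (hypothetical lengths): total is 54, and 10 + 9 + 8 = 27
# reaches half of it, so the N50 is 8.
print(n50([2, 3, 4, 5, 6, 7, 8, 9, 10]))  # 8
```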

July 7, 2019

Association between progranulin and Gaucher disease.

Gaucher disease (GD) is a genetic disease caused by mutations in the GBA1 gene which result in reduced enzymatic activity of β-glucocerebrosidase (GCase). This study identified the progranulin (PGRN) gene (GRN) as another gene associated with GD. Serum levels of PGRN were measured from 115 GD patients and 99 healthy controls, whole GRN gene from 40 GD patients was sequenced, and the genotyping of 4 SNPs identified in GD patients was performed in 161 GD and 142 healthy control samples. Development of GD in PGRN-deficient mice was characterized, and the therapeutic effect of rPGRN on GD analyzed. Serum PGRN levels were significantly lower in GD patients (96.65 ± 53.45 ng/ml) than those in healthy controls of the general population (164.99 ± 43.16 ng/ml, p<0.0001) and of Ashkenazi Jews (150.64 ± 33.99 ng/ml, p<0.0001). Four GRN gene SNPs, including rs4792937, rs78403836, rs850713, and rs5848, and three point mutations, were identified in a full-length GRN gene sequencing in 40 GD patients. Large scale SNP genotyping in 161 GD and 142 healthy controls was conducted and the four SNP sites have significantly higher frequency in GD patients. In addition, "aged" and challenged adult PGRN null mice develop GD-like phenotypes, including typical Gaucher-like cells in lung, spleen, and bone marrow. Moreover, lysosomes in PGRN KO mice exhibit a tubular-like appearance. PGRN is required for the lysosomal appearance of GCase and its deficiency leads to GCase accumulation in the cytoplasm. More importantly, recombinant PGRN is therapeutic in various animal models of GD and human fibroblasts from GD patients. Our data demonstrate an unknown association between PGRN and GD and identify PGRN as an essential factor for GCase's lysosomal localization. These findings not only provide new insight into the pathogenesis of GD, but may also have implications for diagnosis and alternative targeted therapies for GD. Copyright © 2016 Forschungsgesellschaft für Arbeitsphysiologie und Arbeitschutz e.V. Published by Elsevier B.V. All rights reserved.

July 7, 2019

Complete circular genome sequence of successful ST8/SCCmecIV community-associated methicillin-resistant Staphylococcus aureus (OC8) in Russia: one-megabase genomic inversion, IS256’s spread, and evolution of Russia ST8-IV.

ST8/SCCmecIV community-associated methicillin-resistant Staphylococcus aureus (CA-MRSA) has been a common threat, with large USA300 epidemics in the United States. The global geographical structure of ST8/SCCmecIV has not yet been fully elucidated. We herein determined the complete circular genome sequence of ST8/SCCmecIVc strain OC8 from Siberian Russia. We found that 36.0% of the genome was inverted relative to USA300. Two IS256, oppositely oriented, at IS256-enriched hot spots were implicated in the one-megabase genomic inversion (MbIN) and νSaβ split. The behavior of IS256 was flexible: its insertion site (att) sequences on the genome and junction sequences of extrachromosomal circular DNA were all divergent, albeit with fixed sizes. A similar multi-IS256 system was detected, even in prevalent ST239 healthcare-associated MRSA in Russia, suggesting IS256’s strong transmission potential and advantage in evolution. Regarding epidemiology, all ST8/SCCmecIVc strains examined from European, Siberian, and Far Eastern Russia had MbIN, and geographical expansion accompanied divergent spa types and resistance to fluoroquinolones, chloramphenicol, and often rifampicin. Russia ST8/SCCmecIVc has been associated with life-threatening infections such as pneumonia and sepsis in both community and hospital settings. Regarding virulence, the OC8 genome carried a series of toxin and immune evasion genes, a truncated giant surface protein gene, and IS256 insertion adjacent to a pan-regulatory gene. These results suggest that unique single ST8/spa1(t008)/SCCmecIVc CA-MRSA (clade, Russia ST8-IVc) emerged in Russia, and this was followed by large geographical expansion, with MbIN as an epidemiological marker, and fluoroquinolone resistance, multiple virulence factors, and possibly a multi-IS256 system as selective advantages.

July 7, 2019

BAC-pool sequencing and analysis confirms growth-associated QTLs in the Asian seabass genome.

The Asian seabass is an important marine food fish that has been cultured for several decades in Asia Pacific. However, the lack of a high quality reference genome has hampered efforts to improve its selective breeding. A 3D BAC pool set generated in this study was screened using 22 SSR markers located on linkage group 2 which contains a growth-related QTL region. Seventy-two clones corresponding to 22 FPC contigs were sequenced by Illumina MiSeq technology. We co-assembled the MiSeq-derived scaffolds from each FPC contig with error-corrected PacBio reads, resulting in 187 sequences covering 9.7 Mb. Eleven genes annotated within this region were found to be potentially associated with growth and their tissue-specific expression was investigated. Correlation analysis demonstrated that SNPs in ctsb, skp1 and ppp2ca can be potentially used as markers for selecting fast-growing fingerlings. Conserved syntenies between seabass LG2 and five other teleosts were identified. This study i) provided a 10 Mb targeted genome assembly; ii) demonstrated NGS of BAC pools as a potential approach for mining candidates underlying QTLs of this species; iii) detected eleven genes potentially responsible for growth in the QTL region; and iv) identified useful SNP markers for selective breeding programs of Asian seabass.

July 7, 2019

Silicon content of individual cells of Synechococcus from the North Atlantic Ocean

The widely distributed marine cyanobacterium Synechococcus is thought to exert an influence on the marine silicon (Si) cycle through its high cellular Si relative to organic content. There are few measurements of Si in natural populations of Synechococcus, however, and the degree to which Synechococcus from various oligotrophic field sites and depths accumulate the element is unknown. We used synchrotron x-ray fluorescence to measure Si quotas in individual Synechococcus cells collected during three cruises in the western North Atlantic Ocean in the summer and fall, focusing on cells from the surface mixed layer (SML; <10 m) and the deep chlorophyll maximum (DCM). Individual cell quotas varied widely, from 1 to 4700 amol Si cell⁻¹, though the middle 50% of quotas ranged between 17 and 119 amol Si cell⁻¹. Mean station-specific quotas exhibited an even narrower range of 31–72 amol Si cell⁻¹. No significant differences in Si quotas were observed across cruises or among stations, and no effect of ambient silicic acid concentration on quotas was observed within the narrow range of silicic acid concentrations encountered (0.6–1.3 µM). Despite this small range in ambient silicic acid, cells collected from the SML had an average of two-fold more Si than cells collected from the DCM. Differences in Si content with depth may be related to observed differences in the dominant Synechococcus clades between the SML and DCM habitats, determined by petB gene sequencing.

July 7, 2019

Probabilistic viral quasispecies assembly

Viruses are pathogens that cause infectious diseases. The swarm of virions is subject to the host’s immune pressure and possibly antiviral therapy. It may escape this selective pressure and gain selective advantage by acquiring one or more of the following genomic alterations: single-nucleotide variants (SNVs), loss or gain of one or more amino acids, large deletions, for example, due to alternative splicing, or recombination of different strains. Genotypic antiretroviral drug resistance testing is performed via sequencing. Next-generation sequencing (NGS) technologies revolutionized assessing viral genetic diversity experimentally. In viral quasispecies analysis, there are two main goals: the identification of low-frequency variants and haplotype assembly on a whole-genome scale. PacBio performs single-molecule sequencing. This chapter elaborates human haplotyping and its relationship to probabilistic viral haplotype reconstruction methods. Viral quasispecies assembly has the potential to replace the current de facto diversity estimation by SNV calling. With advances in library preparation, increasing sensitivity of sequencing platforms, and more sophisticated models, it might be possible to detect all or most viral strains in a single individual.

July 7, 2019

The draft genome of whitefly Bemisia tabaci MEAM1, a global crop pest, provides novel insights into virus transmission, host adaptation, and insecticide resistance.

The whitefly Bemisia tabaci (Hemiptera: Aleyrodidae) is among the 100 worst invasive species in the world. As one of the most important crop pests and virus vectors, B. tabaci causes substantial crop losses and poses a serious threat to global food security. We report the 615-Mb high-quality genome sequence of B. tabaci Middle East-Asia Minor 1 (MEAM1), the first genome sequence in the Aleyrodidae family, which contains 15,664 protein-coding genes. The B. tabaci genome is highly divergent from other sequenced hemipteran genomes, sharing no detectable synteny. A number of known detoxification gene families, including cytochrome P450s and UDP-glucuronosyltransferases, are significantly expanded in B. tabaci. Other expanded gene families, including cathepsins, large clusters of tandemly duplicated B. tabaci-specific genes, and phosphatidylethanolamine-binding proteins (PEBPs), were found to be associated with virus acquisition and transmission and/or insecticide resistance, likely contributing to the global invasiveness and efficient virus transmission capacity of B. tabaci. The presence of 142 horizontally transferred genes from bacteria or fungi in the B. tabaci genome, including genes encoding hopanoid/sterol synthesis and xenobiotic detoxification enzymes that are not present in other insects, offers novel insights into the unique biological adaptations of this insect such as polyphagy and insecticide resistance. Interestingly, two adjacent bacterial pantothenate biosynthesis genes, panB and panC, have been co-transferred into B. tabaci and fused into a single gene that has acquired introns during its evolution. The B. tabaci genome contains numerous genetic novelties, including expansions in gene families associated with insecticide resistance, detoxification and virus transmission, as well as numerous horizontally transferred genes from bacteria and fungi. We believe these novelties likely have shaped B. tabaci as a highly invasive polyphagous crop pest and efficient vector of plant viruses. The genome serves as a reference for resolving the B. tabaci cryptic species complex, understanding fundamental biological novelties, and providing valuable genetic information to assist the development of novel strategies for controlling whiteflies and the viruses they transmit.

July 7, 2019

A simple thermoplastic substrate containing hierarchical silica lamellae for high-molecular-weight DNA extraction.

An inexpensive, magnetic thermoplastic nanomaterial is developed utilizing a hierarchical layering of micro- and nanoscale silica lamellae to create a high-surface-area and low-shear substrate capable of capturing vast amounts of ultrahigh-molecular-weight DNA. Extraction is performed via a simple 45 min process and is capable of achieving binding capacities up to 1 000 000 times greater than silica microparticles. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

July 7, 2019

DNA extraction protocols for whole-genome sequencing in marine organisms.

The marine environment harbors a large proportion of the total biodiversity on this planet, including the majority of the Earth’s different phyla and classes. Studying the genomes of marine organisms can bring interesting insights into genome evolution. Today, almost all marine organismal groups are understudied with respect to their genomes. One potential reason is that extraction of high-quality DNA in sufficient amounts is challenging for many marine species. This is due to high polysaccharide content, polyphenols and other secondary metabolites that will inhibit downstream DNA library preparations. Consequently, protocols developed for vertebrates and plants do not always perform well for invertebrates and algae. In addition, many marine species have large population sizes and, as a consequence, highly variable genomes. Thus, to facilitate the sequence read assembly process during genome sequencing, it is desirable to obtain enough DNA from a single individual, which is a challenge in many species of invertebrates and algae. Here, we present DNA extraction protocols for seven marine species (four invertebrates, two algae, and a marine yeast), optimized to provide sufficient DNA quality and yield for de novo genome sequencing projects.

July 7, 2019

D1FHS, the type strain of the ammonia-oxidizing bacterium Nitrosococcus wardiae spec. nov.: enrichment, isolation, phylogenetic, and growth physiological characterization.

An ammonia-oxidizing bacterium, strain D1FHS, was enriched into pure culture from a sediment sample retrieved in Jiaozhou Bay, a hyper-eutrophic semi-closed water body hosting the metropolitan area of Qingdao, China. Based on initial 16S rRNA gene sequence analysis, strain D1FHS was classified in the genus Nitrosococcus, family Chromatiaceae, order Chromatiales, class Gammaproteobacteria; the 16S rRNA gene sequence with highest level of identity to that of D1FHS was obtained from Nitrosococcus halophilus Nc4(T). The average nucleotide identity between the genomes of strain D1FHS and N. halophilus strain Nc4 is 89.5%. Known species in the genus Nitrosococcus are obligate aerobic chemolithotrophic ammonia-oxidizing bacteria adapted to and restricted to marine environments. The optimum growth (maximum nitrite production) conditions for D1FHS in a minimal salts medium are: 50 mM ammonium and 700 mM NaCl at a pH of 7.5 to 8.0 and at 37°C in the dark. Because pertinent conditions for other studied Nitrosococcus spp. are 100-200 mM ammonium and <700 mM NaCl at a pH of 7.5 to 8.0 and at 28-32°C, D1FHS is physiologically distinct from other Nitrosococcus spp. in terms of substrate, salt, and thermal tolerance.

July 7, 2019

Efficient, cost-effective, high-throughput, Multilocus Sequencing Typing (MLST) method, NGMLST, and the analytical software program MLSTEZ.

Multilocus sequence typing (MLST) has become the preferred method for genotyping many biological species. It can be used to identify major phylogenetic clades, molecular groups, or subpopulations of a species, as well as individual strains or clones. However, conventional MLST is costly and time consuming, which limits its power for genotyping large numbers of samples. Here, we describe a new MLST method that uses next-generation sequencing, a multiplexing protocol, and appropriate analytical software to provide accurate, rapid, and economical MLST genotyping of 96 or more isolates in a single assay.
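The core bookkeeping behind MLST genotyping is a two-step lookup: each locus sequence is matched to an allele number, and the resulting allelic profile is matched to a sequence type (ST). The sketch below illustrates only that general idea with toy data. It is not the NGMLST protocol or the MLSTEZ implementation, and all locus names, sequences, and tables are hypothetical.

```python
# Hypothetical allele database: locus -> {allele sequence: allele number}.
ALLELE_DB = {
    "locusA": {"ACGTAC": 1, "ACGTAT": 2},
    "locusB": {"GGATCC": 1, "GGATCA": 2},
}

# Hypothetical ST table: allelic profile (in LOCI order) -> sequence type.
LOCI = ("locusA", "locusB")
ST_TABLE = {(1, 1): 10, (2, 1): 11}


def allelic_profile(sample_seqs, allele_db=ALLELE_DB):
    """Map each locus sequence to its allele number (None if unknown)."""
    return {locus: allele_db[locus].get(seq) for locus, seq in sample_seqs.items()}


def sequence_type(profile, st_table=ST_TABLE, loci=LOCI):
    """Look up the ST for a profile; None means a novel allele or combination."""
    return st_table.get(tuple(profile[locus] for locus in loci))


isolate = {"locusA": "ACGTAT", "locusB": "GGATCC"}
profile = allelic_profile(isolate)
print(profile, sequence_type(profile))  # {'locusA': 2, 'locusB': 1} 11
```

In a multiplexed assay, a step like this would run once per isolate after reads have been assigned to isolates and a consensus sequence has been built for each locus.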

July 7, 2019

Enhancing the accuracy of next-generation sequencing for detecting rare and subclonal mutations.

Mutations, the fuel of evolution, are first manifested as rare DNA changes within a population of cells. Although next-generation sequencing (NGS) technologies have revolutionized the study of genomic variation between species and individual organisms, most have limited ability to accurately detect and quantify rare variants among the different genome copies in heterogeneous mixtures of cells or molecules. We describe the technical challenges in characterizing subclonal variants using conventional NGS protocols and the recent development of error correction strategies, both computational and experimental, including consensus sequencing of single DNA molecules. We also highlight major applications for low-frequency mutation detection in science and medicine, describe emerging methodologies and provide our vision for the future of DNA sequencing.
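As an illustration of consensus sequencing of single DNA molecules, the sketch below groups reads that share a molecular tag and takes a per-position majority vote, so that an error present in only one copy is voted out. It is a simplified toy under stated assumptions (reads are pre-aligned and of equal length; the function name, thresholds, and tag format are hypothetical) and does not reproduce any specific published protocol.

```python
from collections import Counter, defaultdict


def consensus_by_tag(tagged_reads, min_family_size=3, min_agreement=0.6):
    """Build one consensus sequence per molecular tag.

    tagged_reads: iterable of (tag, read) pairs; reads sharing a tag are
    assumed to be copies of the same original molecule.
    Families smaller than min_family_size are dropped. A position where
    the most common base falls below min_agreement is reported as "N".
    """
    families = defaultdict(list)
    for tag, read in tagged_reads:
        families[tag].append(read)

    consensus = {}
    for tag, reads in families.items():
        if len(reads) < min_family_size:
            continue
        if len({len(read) for read in reads}) != 1:
            continue  # this sketch assumes equal-length, pre-aligned reads
        bases = []
        for column in zip(*reads):
            base, count = Counter(column).most_common(1)[0]
            bases.append(base if count / len(reads) >= min_agreement else "N")
        consensus[tag] = "".join(bases)
    return consensus


reads = [
    ("AAT", "ACGT"), ("AAT", "ACGT"), ("AAT", "ACTT"),  # one copy has an error
    ("GGC", "ACGA"), ("GGC", "ACGA"),                    # family too small
]
print(consensus_by_tag(reads))  # {'AAT': 'ACGT'}
```

Raising `min_agreement` trades sensitivity for specificity: with a threshold of 0.9, the disputed third position in the example would be masked as "N" rather than called.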

July 7, 2019

High-quality complete and draft genome sequences for three Escherichia spp. and three Shigella spp. generated with Pacific Biosciences and Illumina sequencing and optical mapping.

Escherichia spp., including E. albertii and E. coli, Shigella dysenteriae, and S. flexneri are causative agents of foodborne disease. We report here reference-level whole-genome sequences of E. albertii (2014C-4356), E. coli (2011C-4315 and 2012C-4431), S. dysenteriae (BU53M1), and S. flexneri (94-3007 and 71-2783). Copyright © 2018 Schroeder et al.
