The widespread occurrence of repetitive stretches of DNA in genomes of organisms across the tree of life imposes fundamental challenges for sequencing, genome assembly, and automated annotation of genes and proteins. This multi-level problem can lead to errors in genome and protein databases that are often not recognized or acknowledged. As a consequence, end users working with sequences with repetitive regions are faced with ‘ready-to-use’ deposited data whose trustworthiness is difficult to determine, let alone to quantify. Here, we provide a review of the problems associated with tandem repeat sequences that originate from different stages during the sequencing-assembly-annotation-deposition workflow, and…
Since the first genome of a halophilic archaeon was sequenced in 2000, microbes inhabiting hypersaline environments have been investigated largely based on genomic characteristics. Salinigranum rubrum GX10T, the type species of the genus Salinigranum belonging to the euryarchaeal family Haloferacaceae, was isolated from the brine of Gangxi marine solar saltern near Weihai, China. Similar with most members of the class Halobacteria, S. rubrum GX10T is an extreme halophile requiring at least 1.5?M NaCl for growth and 3.1?M NaCl for optimum growth. We sequenced and annotated the complete genome of S. rubrum GX10T, which was found to be 4,973,118?bp and comprise…
Red maple (Acer rubrum L.) is one of the most common and widespread trees with colorful leaves. We found a mutant with red, yellow, and green leaf phenotypes in different branches, which provided ideal materials with the same genetic relationship, and little interference from the environment, for the study of complex metabolic networks that underly variations in the coloration of leaves. We applied a combination of NGS and SMRT sequencing to various red maple tissues.A total of 125,448 unigenes were obtained, of which 46 and 69 were thought to be related to the synthesis of anthocyanins and carotenoids, respectively. In…