Chromatin and extracellular vesicle associated sperm RNAs

A diverse pool of RNAs remains encapsulated within the transcriptionally silent spermatozoon despite the dramatic reduction in cellular and nuclear volume following cytoplasm/nucleoplasm expulsion. The impact of this pronounced restructuring on the distribution of transcripts inside the sperm remains essentially unknown. To define their compartmentalization, total RNA >100 nt was extracted from sonicated (SS) mouse spermatozoa and from detergent-demembranated, sucrose gradient-fractionated (Cs/Tx) sperm heads. Sperm RNAs predominantly localized toward the periphery. The corresponding distribution of transcripts, and thus their localization and complexity, was then inferred by RNA-seq. Interestingly, the number of annotated RNAs in the Cs/Tx sperm heads exhibiting reduced peripheral enrichment was restricted. However, this included Cabyr, the transcript encoding the calcium-binding tyrosine phosphorylation-regulated protein. It is present in murine zygotes prior to the maternal-to-zygotic transition yet absent in oocytes, consistent with the delivery of internally positioned sperm-borne RNAs to the embryo. In comparison, transcripts enriched in sonicated sperm contributed to the mitochondria and exosomes along with several nuclear transcripts including the metastasis associated lung adenocarcinoma transcript 1 (Malat1) and several small nucleolar RNAs. Their preferential peripheral localization suggests that chromatin remodeling during spermiogenesis is not limited to nucleoproteins as part of the nucleoprotein exchange.

  • Nucleic Acids Research
  • 10 years ago
  • Gene regulation, Chromatin and Epigenetics

Site-Specific Phosphorylation of VEGFR2 Is Mediated by Receptor Trafficking: Insights from a Computational Model

by Lindsay Wendel Clegg, Feilim Mac Gabhann

Matrix-binding isoforms and non-matrix-binding isoforms of vascular endothelial growth factor (VEGF) are both capable of stimulating vascular remodeling, but the resulting blood vessel networks are structurally and functionally different. Here, we develop and validate a computational model of the binding of soluble and immobilized ligands to VEGF receptor 2 (VEGFR2), the endosomal trafficking of VEGFR2, and site-specific VEGFR2 tyrosine phosphorylation to study differences in induced signaling between these VEGF isoforms. In capturing essential features of VEGFR2 signaling and trafficking, our model suggests that VEGFR2 trafficking parameters are largely consistent across multiple endothelial cell lines. Simulations demonstrate distinct localization of VEGFR2 phosphorylated on Y1175 and Y1214. This is the first model to clearly show that differences in site-specific VEGFR2 activation when stimulated with immobilized VEGF compared to soluble VEGF can be accounted for by altered trafficking of VEGFR2 without an intrinsic difference in receptor activation. The model predicts that Neuropilin-1 can induce differences in the surface-to-internal distribution of VEGFR2. Simulations also show that ligated VEGFR2 and phosphorylated VEGFR2 levels diverge over time following stimulation. Using this model, we identify multiple key levers that alter how VEGF binding to VEGFR2 results in different coordinated patterns of multiple downstream signaling pathways. Specifically, simulations predict that VEGF immobilization, interactions with Neuropilin-1, perturbations of VEGFR2 trafficking, and changes in expression or activity of phosphatases acting on VEGFR2 all affect the magnitude, duration, and relative strength of VEGFR2 phosphorylation on tyrosines 1175 and 1214, and they do so predictably within our single consistent model framework.

  • PLOS Computational Biology
  • 10 years ago

Stochastic Dynamics Underlying Cognitive Stability and Flexibility

by Kai Ueltzhöffer, Diana J. N. Armbruster-Genç, Christian J. Fiebach

Cognitive stability and flexibility are core functions in the successful pursuit of behavioral goals. While there is evidence for a common frontoparietal network underlying both functions and for a key role of dopamine in the modulation of flexible versus stable behavior, the exact neurocomputational mechanisms underlying those executive functions and their adaptation to environmental demands are still unclear. In this work we study the neurocomputational mechanisms underlying cue based task switching (flexibility) and distractor inhibition (stability) in a paradigm specifically designed to probe both functions. We develop a physiologically plausible, explicit model of neural networks that maintain the currently active task rule in working memory and implement the decision process. We simplify the four-choice decision network to a nonlinear drift-diffusion process that we canonically derive from a generic winner-take-all network model. By fitting our model to the behavioral data of individual subjects, we can reproduce their full behavior in terms of decisions and reaction time distributions in baseline as well as distractor inhibition and switch conditions. Furthermore, we predict the individual hemodynamic response timecourse of the rule-representing network and localize it to a frontoparietal network including the inferior frontal junction area and the intraparietal sulcus, using functional magnetic resonance imaging. This refines the understanding of task-switch-related frontoparietal brain activity as reflecting attractor-like working memory representations of task rules. Finally, we estimate the subject-specific stability of the rule-representing attractor states in terms of the minimal action associated with a transition between different rule states in the phase-space of the fitted models. 
This stability measure correlates with switching-specific thalamocorticostriatal activation, i.e., with a system associated with flexible working memory updating and dopaminergic modulation of cognitive flexibility. These results show that stochastic dynamical systems can implement the basic computations underlying cognitive stability and flexibility and explain neurobiological bases of individual differences.

  • PLOS Computational Biology
  • 10 years ago

DNA transposons have colonized the genome of the giant virus Pandoravirus salinus

Background: Transposable elements are mobile DNA sequences that are widely distributed in prokaryotic and eukaryotic genomes, where they represent a major force in genome evolution. However, transposable elements have rarely been documented in viruses, and their contribution to viral genome evolution remains largely unexplored. Pandoraviruses are recently described DNA viruses with genome sizes that exceed those of some prokaryotes, rivaling parasitic eukaryotes. These large genomes appear to include substantial noncoding intergenic spaces, which provide potential locations for transposable element insertions. However, no mobile genetic elements have yet been reported in pandoravirus genomes. Results: Here, we report a family of miniature inverted-repeat transposable elements (MITEs) in the Pandoravirus salinus genome, representing the first description of a virus populated with a canonical transposable element family that proliferated by transposition within the viral genome. The MITE family, which we name Submariner, includes 30 copies with all the hallmarks of MITEs: short length, terminal inverted repeats, TA target site duplication, and no coding capacity. Submariner elements show signs of transposition and are undetectable in the genome of Pandoravirus dulcis, the closest known relative of Pandoravirus salinus. We identified a DNA transposon related to Submariner in the genome of Acanthamoeba castellanii (a species thought to host pandoraviruses) that contains remnants of coding sequence for a Tc1/mariner transposase. These observations suggest that the Submariner MITEs of P. salinus (1) belong to the widespread Tc1/mariner superfamily, and (2) may have been mobilized by an amoebozoan host. Ten of the 30 MITEs in the P. salinus genome are located within coding regions of predicted genes, while others are close to genes, suggesting that these transposons may have contributed to viral genetic novelty.
Conclusions: Our discovery highlights the remarkable ability of DNA transposons to colonize and shape genomes from all domains of life, as well as giant viruses. Our findings continue to blur the division between viral and cellular genomes, adhering to the emerging view that the content, dynamics, and evolution of the genomes of giant viruses do not substantially differ from those of cellular organisms.

  • BMC Biology 2015, null:38
  • 10 years ago

[In Depth] HIV family trees reveal viral spread

New genetic studies that compare different HIV isolates with each other are allowing researchers to create sophisticated maps of transmission networks. These phylogenetic transmission maps, in turn, are pinpointing where prevention efforts can get the most bang for their buck. A meeting held at the New York Academy of Sciences in New York City on 4 June gathered researchers, representatives from public health organizations, and HIV/AIDS advocates to discuss the latest studies in this emerging field, which revealed several surprising findings about HIV's spread in many different geographic regions around the world. The meeting participants also discussed at length the ethical and legal dilemmas raised by such research, which even though it does not specify who infected whom might be misused by, for example, jurisdictions that allow criminal prosecution of HIV transmission.

  • Science
  • 10 years ago
  • Molecular Epidemiology

[Perspective] Microbial metabolite triggers antimicrobial defense

In general, innate immune responses are beneficial to the host. However, under certain circumstances, such as coinfections, these responses may contribute to disease progression. For instance, an initial infection may induce an immune response that renders the host susceptible to a subsequent infection by a different pathogen. For years, it has been understood that Neisseria gonorrhoeae and HIV coinfection increase viral shedding and transmission (1). However, the molecular mechanisms that control this phenomenon have been unclear. Neisseria spp. release a factor that can activate the transcription factor nuclear factor κB (NF-κB) in host cells and drive HIV gene expression (2). On page 1251 of this issue, Gaudet et al. (3) reveal the identity of this bacterial-derived factor, as well as the signaling axis through which it drives innate immune gene expression. This detection pathway may constitute a previously unknown innate immune signaling response with broader implications in microbial defense, as well as the pathogenesis of HIV.

  • Science
  • 10 years ago
  • Immunology

[Technical Comment] Comment on “Principles of ER cotranslational translocation revealed by proximity-specific ribosome profiling”

Jan et al. (Research Articles, 7 November 2014, p. 716) propose that ribosomes translating secretome messenger RNAs (mRNAs) traffic from the cytosol to the endoplasmic reticulum (ER) upon emergence of the signal peptide and return to the cytosol after termination. An accounting of controls demonstrates that mRNAs initiate translation on ER-bound ribosomes and that ribosomes are retained on the ER through many cycles of translation.

  • Science
  • 10 years ago

Unbalanced translocations arise from diverse mutational mechanisms including chromothripsis [RESEARCH]

Unbalanced translocations are a relatively common type of copy number variation and a major contributor to neurodevelopmental disorders. We analyzed the breakpoints of 57 unique unbalanced translocations to investigate the mechanisms of how they form. Fifty-one are simple unbalanced translocations between two different chromosome ends, and six rearrangements have more than three breakpoints involving two to five chromosomes. Sequencing 37 breakpoint junctions revealed that simple translocations have between 0 and 4 base pairs (bp) of microhomology (n = 26), short inserted sequences (n = 8), or paralogous repeats (n = 3) at the junctions, indicating that translocations do not arise primarily from nonallelic homologous recombination but instead form most often via nonhomologous end joining or microhomology-mediated break-induced replication. Three simple translocations fuse genes that are predicted to produce in-frame transcripts of SIRPG-WWOX, SMOC2-PROX1, and PIEZO2-MTA1, which may lead to gain of function. Three complex translocations have inversions, insertions, and multiple breakpoint junctions between only two chromosomes. Whole-genome sequencing and fluorescence in situ hybridization analysis of two de novo translocations revealed at least 18 and 33 breakpoints involving five different chromosomes. Breakpoint sequencing of one maternally inherited translocation involving four chromosomes uncovered multiple breakpoints with inversions and insertions. All of these breakpoint junctions had 0–4 bp of microhomology consistent with chromothripsis, and both de novo events occurred on paternal alleles. Together with other studies, these data suggest that germline chromothripsis arises in the paternal genome and may be transmitted maternally. Breakpoint sequencing of our large collection of chromosome rearrangements provides a comprehensive analysis of the molecular mechanisms behind translocation formation.

  • Genome Research
  • 10 years ago
  • RESEARCH

[Report] A male-determining factor in the mosquito Aedes aegypti

Sex determination in the mosquito Aedes aegypti is governed by a dominant male-determining factor (M factor) located within a Y chromosome–like region called the M locus. Here, we show that an M-locus gene, Nix, functions as an M factor in A. aegypti. Nix exhibits persistent M linkage and early embryonic expression, two characteristics required of an M factor. Nix knockout with clustered regularly interspaced short palindromic repeats (CRISPR)–Cas9 resulted in largely feminized genetic males and the production of female isoforms of two key regulators of sexual differentiation: doublesex and fruitless. Ectopic expression of Nix resulted in genetic females with nearly complete male genitalia. Thus, Nix is both required and sufficient to initiate male development. This study provides a foundation for mosquito control strategies that convert female mosquitoes into harmless males.

  • Science
  • 10 years ago

AllelicImbalance: an R/Bioconductor package for detecting, managing, and visualizing allele expression imbalance data from RNA sequencing

Background: One aspect in which RNA sequencing is more valuable than microarray-based methods is the ability to examine the allelic imbalance of the expression of a gene. This process is often a complex task that entails quality control, alignment, and the counting of reads over heterozygous single-nucleotide polymorphisms. Allelic imbalance analysis is subject to technical biases, due to differences in the sequences of the measured alleles. Flexible bioinformatics tools are needed to ease the workflow while retaining as much RNA sequencing information as possible throughout the analysis to detect and address the possible biases. Results: We present AllelicImbalance, a software program that is designed to detect, manage, and visualize allelic imbalances comprehensively. The purpose of this software is to allow users to pose genetic questions in any RNA sequencing experiment quickly, enhancing the general utility of RNA sequencing. The visualization features can reveal notable, non-trivial allelic imbalance behavior over specific regions, such as exons. Conclusions: The software provides a complete framework to perform allelic imbalance analyses of aligned RNA sequencing data, from detection to visualization, within the robust and versatile management class, ASEset.
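
The core idea behind detecting allelic imbalance at a single heterozygous SNP can be sketched as an exact binomial test on the reference and alternative read counts. The sketch below is an illustration in plain Python, not the API of the R package described above; the function name `binomial_two_sided_p` and the default expected reference fraction of 0.5 are assumptions made for this example.

```python
from math import comb


def binomial_two_sided_p(ref_count: int, alt_count: int, p_ref: float = 0.5) -> float:
    """Exact two-sided binomial p-value for read counts at one heterozygous SNP.

    Hypothetical helper for illustration; suited to modest read counts only,
    because the full probability mass function is enumerated.
    """
    n = ref_count + alt_count
    if n == 0:
        return 1.0  # no reads, no evidence of imbalance
    # Probability of observing k reference reads out of n under the null.
    pmf = [comb(n, k) * p_ref**k * (1 - p_ref) ** (n - k) for k in range(n + 1)]
    observed = pmf[ref_count]
    # Sum all outcomes that are at least as unlikely as the observed one.
    tail = sum(p for p in pmf if p <= observed * (1 + 1e-9))
    return min(1.0, tail)
```

With 10 reference reads and 0 alternative reads, `binomial_two_sided_p(10, 0)` evaluates to 2/1024, while a balanced 5:5 split gives a p-value of 1. Setting `p_ref` away from 0.5 is one simple way to account for a known reference-mapping bias of the kind the abstract mentions.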

  • BMC Bioinformatics 2015, null:194
  • 10 years ago

Host-virus interaction: the antiviral defense function of small interfering RNAs can be enhanced by host microRNA-7 in vitro

Small interfering RNAs (siRNAs) directed against poliovirus (PV) and other viruses effectively inhibit viral replication and have been developed as antiviral agents. Here, we demonstrate that a specific siRNA targeting the region between nucleotides 100–125 (siRNA-100) from the 5′-untranslated region (5′-UTR) of PV plays a critical role in inhibiting PV replication. Our data demonstrate that siRNA-100 treatment can greatly reduce PV titers, resulting in up-regulation of host microRNA-7 (miR-7), which, in turn, further enhances inhibition of PV infection. Moreover, our results suggest that siRNA-100 can also impair the spread of PV to uninfected cells by increasing host resistance to PV, resulting in decreased necrosis and cytopathic effect (CPE) levels, as well as prolonged survival of infected cells. Indeed, the active antiviral effect of siRNA-100 was potentially supplemented by the activity of miR-7, and both can serve as stabilizing factors for maintenance of cellular homeostasis. Results of this study identify a molecular mechanism of RNAi for antiviral defense, and extend our knowledge of the complex interplay between host and PV, which will provide a basis for the development of effective RNAi-based therapies designed to inhibit PV replication and protect host cells.

  • Scientific Reports 5
  • 10 years ago
  • Article

VRK1 regulates Cajal body dynamics and protects coilin from proteasomal degradation in cell cycle

Cajal bodies (CBs) are nuclear organelles associated with ribonucleoprotein functions and RNA maturation. CBs are assembled on coilin, their main scaffold protein, in a cell cycle-dependent manner. The Ser-Thr kinase VRK1 (vaccinia-related kinase 1), whose activity is also cell cycle regulated, interacts with and phosphorylates coilin, regulating assembly of CBs. Coilin phosphorylation is not necessary for its interaction with VRK1, but it occurs in mitosis and regulates coilin stability. Knockdown of VRK1 or VRK1 inactivation by serum deprivation causes a loss of coilin phosphorylation at Ser184 and of CB formation, which are rescued by an active VRK1, but not by kinase-dead VRK1. The phosphorylation of coilin at Ser184 occurs during mitosis before assembly of CBs. Loss of coilin phosphorylation results in disintegration of CBs and in coilin degradation that is prevented by proteasome inhibitors. After depletion of VRK1, coilin is ubiquitinated in nuclei, which is partly mediated by mdm2, but its proteasomal degradation occurs in the cytosol and is prevented by blocking its nuclear export. We conclude that VRK1 is a novel regulator of CB dynamics and stability in the cell cycle by protecting coilin from ubiquitination and degradation in the proteasome, and propose a model of CB dynamics.

  • Scientific Reports 5
  • 10 years ago
  • Article

Diversity and functions of bacterial community in drinking water biofilms revealed by high-throughput sequencing

The development of biofilms in drinking water (DW) systems may cause various problems to water quality. To investigate the community structure of biofilms on different pipe materials and the global/specific metabolic functions of DW biofilms, PCR-based 454 pyrosequencing data for 16S rRNA genes and Illumina metagenomic data were generated and analysed. Considerable differences in bacterial diversity and taxonomic structure were identified between biofilms formed on stainless steel and biofilms formed on plastics, indicating that the metallic materials facilitate the formation of higher diversity biofilms. Moreover, variations in several dominant genera were observed during biofilm formation. Based on PCA analysis, the global functions in the DW biofilms were similar to other DW metagenomes. Beyond the global functions, the occurrences and abundances of specific protective genes involved in the glutathione metabolism, the SoxRS system, the OxyR system, RpoS regulated genes, and the production/degradation of extracellular polymeric substances were also evaluated. A near-complete and low-contamination draft genome was constructed from the metagenome of the DW biofilm, based on the coverage and tetranucleotide frequencies, and identified as a Bradyrhizobiaceae-like bacterium according to a phylogenetic analysis. Our findings provide new insight into DW biofilms, especially in terms of their metabolic functions.

  • Scientific Reports 5
  • 10 years ago
  • Article

TRRUST: a reference database of human transcriptional regulatory interactions

The reconstruction of transcriptional regulatory networks (TRNs) is a long-standing challenge in human genetics. Numerous computational methods have been developed to infer regulatory interactions between human transcriptional factors (TFs) and target genes from high-throughput data, and their performance evaluation requires gold-standard interactions. Here we present a database of literature-curated human TF-target interactions, TRRUST (transcriptional regulatory relationships unravelled by sentence-based text-mining, http://www.grnpedia.org/trrust), which currently contains 8,015 interactions between 748 TF genes and 1,975 non-TF genes. A sentence-based text-mining approach was employed for efficient manual curation of regulatory interactions from approximately 20 million Medline abstracts. To the best of our knowledge, TRRUST is the largest publicly available database of literature-curated human TF-target interactions to date. TRRUST also has several useful features: i) information about the mode-of-regulation; ii) tests for target modularity of a query TF; iii) tests for TF cooperativity of a query target; iv) inferences about cooperating TFs of a query TF; and v) prioritizing associated pathways and diseases with a query TF. We observed high enrichment of TF-target pairs in TRRUST for top-scored interactions inferred from high-throughput data, which suggests that TRRUST provides a reliable benchmark for the computational reconstruction of human TRNs.

  • Scientific Reports 5
  • 10 years ago
  • Article

A Semiquantitative Framework for Gene Regulatory Networks: Increasing the Time and Quantitative Resolution of Boolean Networks

by Johan Kerkhofs, Liesbet Geris

Boolean models have been instrumental in predicting general features of gene networks and more recently also as explorative tools in specific biological applications. In this study we introduce a basic quantitative and a limited time resolution to a discrete (Boolean) framework. Quantitative resolution is improved through the employ of normalized variables in unison with an additive approach. Increased time resolution stems from the introduction of two distinct priority classes. Through the implementation of a previously published chondrocyte network and T helper cell network, we show that this addition of quantitative and time resolution broadens the scope of biological behaviour that can be captured by the models. Specifically, the quantitative resolution readily allows models to discern qualitative differences in dosage response to growth factors. The limited time resolution, in turn, can influence the reachability of attractors, delineating the likely long term system behaviour. Importantly, the information required for implementation of these features, such as the nature of an interaction, is typically obtainable from the literature. Nonetheless, a trade-off is always present between additional computational cost of this approach and the likelihood of extending the model’s scope. Indeed, in some cases the inclusion of these features does not yield additional insight. This framework, incorporating increased and readily available time and semi-quantitative resolution, can help in substantiating the litmus test of dynamics for gene networks, firstly by excluding unlikely dynamics and secondly by refining falsifiable predictions on qualitative behaviour.
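
For orientation, the plain Boolean baseline that the abstract sets out to extend can be sketched as follows. This is a minimal synchronous Boolean network with attractor detection, not the authors' semiquantitative framework: it has no normalized variables, additive rules, or priority classes. The three-gene wiring (A activates B, B activates C, C represses A) and all names are hypothetical.

```python
from typing import Callable, Dict, List, Tuple

State = Tuple[bool, ...]

# Hypothetical toy network: A activates B, B activates C, C represses A.
RULES: List[Callable[[State], bool]] = [
    lambda s: not s[2],  # A is on unless C is on
    lambda s: s[0],      # B copies A
    lambda s: s[1],      # C copies B
]


def step(state: State) -> State:
    """Synchronous update: every node reads the same previous state."""
    return tuple(rule(state) for rule in RULES)


def find_attractor(start: State) -> List[State]:
    """Iterate until a state repeats, then return the cycle of states."""
    seen: Dict[State, int] = {}
    trajectory: List[State] = []
    state = start
    while state not in seen:
        seen[state] = len(trajectory)
        trajectory.append(state)
        state = step(state)
    # The attractor is the part of the trajectory from the first repeat onward.
    return trajectory[seen[state]:]
```

Starting from all genes off, this negative feedback loop settles into a six-state cycle, whereas the start state `(True, False, True)` falls into a separate two-state cycle. Which attractor is reached depends on the start state and the update scheme, and this reachability is what the abstract says the added time resolution can influence.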

  • PloS one
  • 10 years ago

zflncRNApedia: A Comprehensive Online Resource for Zebrafish Long Non-Coding RNAs

by Heena Dhiman, Shruti Kapoor, Ambily Sivadas, Sridhar Sivasubbu, Vinod Scaria

Recent transcriptome annotation efforts using deep sequencing approaches have annotated a large number of long non-coding RNAs in zebrafish, a popular model organism for human diseases. These studies characterized lncRNAs in critical developmental stages as well as adult tissues. Each of the studies has uncovered a distinct set of lncRNAs, with minor overlaps. The availability of the raw RNA-Seq datasets in the public domain encompassing critical developmental time-points and adult tissues provides us with a unique opportunity to understand the spatiotemporal expression patterns of lncRNAs. In the present report, we created a catalog of lncRNAs in zebrafish, derived largely from the three annotation sets, as well as manual curation of literature to compile a total of 2,267 lncRNA transcripts in zebrafish. The lncRNAs were further classified based on the genomic context and relationship with protein coding gene neighbors into 4 categories. Analysis revealed a total of 86 intronic, 309 promoter associated, 485 overlapping and 1,386 lincRNAs. We created a comprehensive resource which houses the annotation of lncRNAs as well as associated information including expression levels, promoter epigenetic marks, genomic variants and retroviral insertion mutants. The resource also hosts a genome browser where the datasets can be browsed in the genome context. To the best of our knowledge, this is the first comprehensive resource providing a unified catalog of lncRNAs in zebrafish. The resource is freely available at URL: http://genome.igib.res.in/zflncRNApedia

  • PloS one
  • 10 years ago

Multiplexing of ChIP-Seq Samples in an Optimized Experimental Condition Has Minimal Impact on Peak Detection

by Thadeous J. Kacmarczyk, Caitlin Bourque, Xihui Zhang, Yanwen Jiang, Yariv Houvras, Alicia Alonso, Doron Betel

Multiplexing samples in sequencing experiments is a common approach to maximize information yield while minimizing cost. In most cases the number of samples that are multiplexed is determined by financial consideration or experimental convenience, with limited understanding of the effects on the experimental results. Here we set out to examine the impact of multiplexing ChIP-seq experiments on the ability to identify a specific epigenetic modification. We performed peak detection analyses to determine the effects of multiplexing. These include false discovery rates, size, position and statistical significance of peak detection, and changes in gene annotation. We found that, for the histone mark H3K4me3, one can multiplex up to 8 samples (7 IP + 1 input) at ~21 million single-end reads each and still detect over 90% of all peaks found when using a full lane per sample (~181 million reads). Furthermore, there are no variations introduced by indexing or lane batch effects and, importantly, there is no significant reduction in the number of genes with neighboring H3K4me3 peaks. We conclude that, for a well characterized antibody and, therefore, model IP condition, multiplexing 8 samples per lane is sufficient to capture most of the biological signal.
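
The two quantities discussed above, per-sample depth under multiplexing and the fraction of full-lane peaks still detected, can be sketched in a few lines. This is an illustrative calculation, not the authors' pipeline: the function names, the half-open interval convention, and the simple any-overlap criterion are all assumptions.

```python
from typing import List, Tuple

Peak = Tuple[str, int, int]  # (chromosome, start, end), half-open interval


def reads_per_sample(lane_reads: float, n_samples: int) -> float:
    """Expected depth per sample if a lane is split evenly."""
    return lane_reads / n_samples


def overlaps(a: Peak, b: Peak) -> bool:
    """True if two peaks share a chromosome and at least one base."""
    return a[0] == b[0] and a[1] < b[2] and b[1] < a[2]


def fraction_recovered(reference: List[Peak], test: List[Peak]) -> float:
    """Fraction of reference peaks overlapped by at least one test peak."""
    if not reference:
        return 0.0
    hits = sum(any(overlaps(r, t) for t in test) for r in reference)
    return hits / len(reference)
```

An even split of ~181 million reads across 8 samples gives `reads_per_sample(181e6, 8)`, about 22.6 million reads each, which is close to the ~21 million per sample reported in the abstract. Passing the full-lane peak calls as `reference` and the multiplexed calls as `test` would yield the kind of recovery fraction the abstract summarizes as "over 90%".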

  • PloS one
  • 10 years ago

Cooperative DNA Recognition Modulated by an Interplay between Protein-Protein Interactions and DNA-Mediated Allostery

by Felipe Merino, Benjamin Bouvier, Vlad Cojocaru

Highly specific transcriptional regulation depends on the cooperative association of transcription factors into enhanceosomes. Usually, their DNA-binding cooperativity originates from either direct interactions or DNA-mediated allostery. Here, we performed unbiased molecular simulations followed by simulations of protein-DNA unbinding and free energy profiling to study the cooperative DNA recognition by OCT4 and SOX2, key components of enhanceosomes in pluripotent cells. We found that SOX2 influences the orientation and dynamics of the DNA-bound configuration of OCT4. In addition SOX2 modifies the unbinding free energy profiles of both DNA-binding domains of OCT4, the POU specific and POU homeodomain, despite interacting directly only with the first. Thus, we demonstrate that the OCT4-SOX2 cooperativity is modulated by an interplay between protein-protein interactions and DNA-mediated allostery. Further, we estimated the change in OCT4-DNA binding free energy due to the cooperativity with SOX2, observed a good agreement with experimental measurements, and found that SOX2 affects the relative DNA-binding strength of the two OCT4 domains. Based on these findings, we propose that available interaction partners in different biological contexts modulate the DNA exploration routes of multi-domain transcription factors such as OCT4. We consider the OCT4-SOX2 cooperativity as a paradigm of how specificity of transcriptional regulation is achieved through concerted modulation of protein-DNA recognition by different types of interactions.

  • PLOS Computational Biology
  • 10 years ago

CMRegNet–An interspecies reference database for corynebacterial and mycobacterial regulatory networks

Background: Organisms utilize a multitude of mechanisms for responding to changing environmental conditions, maintaining their functional homeostasis, and overcoming stress situations. One of the most important mechanisms is transcriptional gene regulation. In-depth study of the transcriptional gene regulatory network can lead to various practical applications, creating a greater understanding of how organisms control their cellular behavior. Description: In this work, we present a new database, CMRegNet, for the gene regulatory networks of Corynebacterium glutamicum ATCC 13032 and Mycobacterium tuberculosis H37Rv. We furthermore transferred the known networks of these model organisms to 18 other non-model but phylogenetically close species (target organisms) of the CMNR group. In comparison to other network transfers, for the first time we utilized two model organisms, resulting in a more diverse and complete network of the target organisms. Conclusion: CMRegNet provides easy access to a total of 3,103 known regulations in C. glutamicum ATCC 13032 and M. tuberculosis H37Rv and to 38,940 evolutionarily conserved interactions for 18 non-model species of the CMNR group. This makes CMRegNet to date the most comprehensive database of regulatory interactions of CMNR bacteria. The content of CMRegNet is publicly available online via a web interface found at http://lgcm.icb.ufmg.br/cmregnet.

  • BMC Genomics 2015, null:452
  • 10 years ago

Population structure of mitochondrial genomes in Saccharomyces cerevisiae

Background: Rigorous study of mitochondrial functions and cell biology in the budding yeast, Saccharomyces cerevisiae has advanced our understanding of mitochondrial genetics. This yeast is now a powerful model for population genetics, owing to large genetic diversity and highly structured populations among wild isolates. Comparative mitochondrial genomic analyses between yeast species have revealed broad evolutionary changes in genome organization and architecture. A fine-scale view of recent evolutionary changes within S. cerevisiae has not been possible due to low numbers of complete mitochondrial sequences. Results: To address challenges of sequencing AT-rich and repetitive mitochondrial DNAs (mtDNAs), we sequenced two divergent S. cerevisiae mtDNAs using a single-molecule sequencing platform (PacBio RS). Using de novo assemblies, we generated highly accurate complete mtDNA sequences. These mtDNA sequences were compared with 98 additional mtDNA sequences gathered from various published collections. Phylogenies based on mitochondrial coding sequences and intron profiles revealed that intraspecific diversity in mitochondrial genomes generally recapitulated the population structure of nuclear genomes. Analysis of intergenic sequence indicated a recent expansion of mobile elements in certain populations. Additionally, our analyses revealed that certain populations lacked introns previously believed conserved throughout the species, as well as the presence of introns never before reported in S. cerevisiae. Conclusions: Our results revealed that the extensive variation in S. cerevisiae mtDNAs is often population specific, thus offering a window into the recent evolutionary processes shaping these genomes. In addition, we offer an effective strategy for sequencing these challenging AT-rich mitochondrial genomes for small scale projects.

  • BMC Genomics 2015, null:451
  • 10 years ago

The Eucalyptus terpene synthase gene family

Background: Terpenoids are abundant in the foliage of Eucalyptus, providing the characteristic smell as well as being valuable economically and influencing ecological interactions. Quantitative and qualitative inter- and intraspecific variation of terpenes is common in eucalypts. Results: The genome sequences of Eucalyptus grandis and E. globulus were mined for terpene synthase genes (TPS) and compared to other plant species. We investigated the relative expression of TPS in seven plant tissues and functionally characterized five TPS genes from E. grandis. Eucalyptus grandis has the largest number of putative functional TPS genes of any sequenced plant. We discovered 113 and 106 putative functional TPS genes in E. grandis and E. globulus, respectively. All but one TPS from E. grandis were expressed in at least one of the seven plant tissues examined. Genomic clusters of up to 20 genes were identified. Many TPS are expressed in tissues other than leaves, which invites a re-evaluation of the function of terpenes in Eucalyptus. Conclusions: Our data indicate that terpenes in Eucalyptus may play a wider role in biotic and abiotic interactions than previously thought. Tissue-specific expression is common, and the possibility of stress induction needs further investigation. Phylogenetic comparison of the two investigated Eucalyptus species gives insight into the recent evolution of different clades within the TPS gene family. While the majority of TPS genes occur in orthologous pairs, some clades show evidence of recent gene duplication as well as loss of function.

  • BMC Genomics 2015, null:450
  • 10 years ago

A composite genome approach to identify phylogenetically informative data from next-generation sequencing

Background: Improvements in sequencing technology now allow easy acquisition of large datasets; however, analyzing these data for phylogenetics can be challenging. We have developed a novel method to rapidly obtain homologous genomic data for phylogenetics directly from next-generation sequencing reads without the use of a reference genome. This software, called SISRS, avoids the time-consuming steps of de novo whole genome assembly, multiple genome alignment, and annotation. Results: In simulations, SISRS identified large numbers of loci containing variable sites with phylogenetic signal. For genomic data from apes, SISRS identified thousands of variable sites, from which we produced an accurate phylogeny. Finally, we used SISRS to identify phylogenetic markers, which we then used to estimate the phylogeny of placental mammals. We recovered eight phylogenies that resolved the basal relationships among mammals using datasets with different levels of missing data. The three alternate resolutions of the basal relationships are consistent with the major hypotheses for the relationships among mammals, all of which have been supported previously by different molecular datasets. Conclusions: SISRS has the potential to transform phylogenetic research. This method eliminates the need for expensive marker development in many studies by using whole genome shotgun sequence data directly. SISRS is open source and freely available at https://github.com/rachelss/SISRS/releases.
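
The idea of keeping only variable sites that carry phylogenetic signal can be illustrated with a minimal sketch. This is not the SISRS implementation: it assumes per-taxon base calls have already been laid out against shared positions, uses the generic notion of a parsimony-informative site (at least two bases, each seen in at least two taxa), and the function name `informative_sites` and the `max_missing` parameter are hypothetical.

```python
from collections import Counter


def informative_sites(calls, max_missing=0):
    """Return positions that are parsimony-informative.

    calls: dict mapping taxon name -> string of base calls, all the same
           length; 'N' marks a missing call (an assumption of this sketch).
    max_missing: maximum number of taxa allowed to be missing at a site.
    """
    if not calls:
        return []
    taxa = list(calls)
    length = len(calls[taxa[0]])
    sites = []
    for pos in range(length):
        column = [calls[taxon][pos] for taxon in taxa]
        missing = sum(base == "N" for base in column)
        if missing > max_missing:
            continue  # too much missing data at this site
        counts = Counter(base for base in column if base != "N")
        # Informative: at least two distinct bases, each shared by >= 2 taxa.
        shared = [base for base, n in counts.items() if n >= 2]
        if len(shared) >= 2:
            sites.append(pos)
    return sites


if __name__ == "__main__":
    example = {
        "taxonA": "ACGTA",
        "taxonB": "ACGTC",
        "taxonC": "ATGTC",
        "taxonD": "ATGNA",
    }
    print(informative_sites(example))
```

Raising `max_missing` admits sites with more missing calls, which loosely mirrors building datasets with different levels of missing data.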

  • BMC Bioinformatics 2015, null:193
  • 10 years ago

BitMapper: an efficient all-mapper based on bit-vector computing

Background: As next-generation sequencing (NGS) technologies produce hundreds of millions of reads every day, a tremendous computational challenge is to map NGS reads to a given reference genome efficiently. However, existing all-mappers, which aim to find all mapping locations of each read, are very time-consuming. The majority of existing all-mappers consist of two main parts, filtration and verification. This work significantly reduces verification time, which is the dominant part of the running time. Results: An efficient all-mapper, BitMapper, is developed based on a new vectorized bit-vector algorithm, which simultaneously calculates the edit distance of one read to multiple locations in a given reference genome. Experimental results on both simulated and real data sets show that BitMapper is from several times to an order of magnitude faster than the current state-of-the-art all-mappers, while achieving higher sensitivity, i.e., better quality solutions. Conclusions: We present BitMapper, which is designed to return all mapping locations of raw reads containing indels as well as mismatches. BitMapper is implemented in C under a GPL license. Binaries are freely available at http://home.ustc.edu.cn/%7Echhy.
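
To make the bit-vector idea concrete, the following is a minimal sketch of the classic bit-parallel edit distance recurrence (in the style of Myers and Hyyrö), which encodes one column of the dynamic-programming matrix as bit masks. It handles a single read against a single candidate sequence; it is not BitMapper's vectorized multi-location algorithm, and the function name `edit_distance_bitvector` is hypothetical.

```python
def edit_distance_bitvector(pattern, text):
    """Global (Levenshtein) edit distance via bit-parallel column updates."""
    m = len(pattern)
    if m == 0:
        return len(text)
    mask = (1 << m) - 1   # keeps only the m low bits
    high = 1 << (m - 1)   # bit for the last pattern row
    # peq[c] has bit i set when pattern[i] == c.
    peq = {}
    for i, ch in enumerate(pattern):
        peq[ch] = peq.get(ch, 0) | (1 << i)
    pv, mv = mask, 0      # vertical +1 / -1 deltas of the current column
    score = m             # distance between the pattern and an empty text
    for ch in text:
        eq = peq.get(ch, 0)
        xv = eq | mv
        xh = (((eq & pv) + pv) ^ pv) | eq
        ph = mv | (~(xh | pv) & mask)   # horizontal +1 deltas
        mh = pv & xh                    # horizontal -1 deltas
        if ph & high:
            score += 1
        elif mh & high:
            score -= 1
        # Shifting in a 1 models the top row growing by one per text
        # character, which gives the global rather than semi-global distance.
        ph = ((ph << 1) | 1) & mask
        mh = (mh << 1) & mask
        pv = mh | (~(xv | ph) & mask)
        mv = ph & xv
    return score


if __name__ == "__main__":
    print(edit_distance_bitvector("kitten", "sitting"))
```

Each text character costs a constant number of word operations when the read fits in a machine word, which is why verification benefits from this formulation; Python integers are unbounded, so the sketch works for any read length but without that hardware advantage.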

  • BMC Bioinformatics 2015, null:192
  • 10 years ago

Identification of two key genes controlling chill haze stability of beer in barley (Hordeum vulgare L)

Background: In bright beer, haze formation is a serious quality problem, degrading beer quality and reducing its shelf life. The quality of barley (Hordeum vulgare L) malt, as the main raw material for beer brewing, largely affects the colloidal stability of beer. Results: In this study, the genetic mechanism of the factors affecting beer haze stability in barley was studied. Quantitative trait loci (QTL) analysis of alcohol chill haze (ACH) in beer was carried out using a Franklin/Yerong double haploid (DH) population. One QTL, named qACH, was detected for ACH; it was located at about 108 cM on chromosome 4H and explains about 20% of the phenotypic variation. Two key haze active proteins, BATI-CMb and BATI-CMd, were identified by proteomics analysis. Bioinformatics analysis showed that BATI-CMb and BATI-CMd map to the same chromosomal position as qACH. It may be deduced that BATI-CMb and BATI-CMd are candidate genes for qACH, controlling the colloidal stability of beer. Polymorphism comparison between Yerong and Franklin in the nucleotide and amino acid sequences of BATI-CMb and BATI-CMd yielded corresponding gene-specific markers, which could be used in marker-assisted selection for malt barley breeding. Conclusions: We identified a novel QTL, qACH, controlling chill haze of beer, and two key haze active proteins, BATI-CMb and BATI-CMd. Further analysis showed that BATI-CMb and BATI-CMd might be the candidate genes associated with beer chill haze.

  • BMC Genomics 2015, null:449
  • 10 years ago