Inferential modeling of 3D chromatin structure

染色质结构的三维推理建模

For eukaryotic cells, the biological processes involving regulatory DNA elements play an important role in cell cycle. Understanding 3D spatial arrangements of chromosomes and revealing long-range chromatin interactions are critical to decipher these biological processes. In recent years, chromosome conformation capture (3C) related techniques have been developed to measure the interaction frequencies between long-range genome loci, which have provided a great opportunity to decode the 3D organization of the genome. In this paper, we develop a new Bayesian framework to derive the 3D architecture of a chromosome from 3C-based data. By modeling each chromosome as a polymer chain, we define the conformational energy based on our current knowledge on polymer physics and use it as prior information in the Bayesian framework. We also propose an expectation-maximization (EM) based algorithm to estimate the unknown parameters of the Bayesian model and infer an ensemble of chromatin structures based on interaction frequency data. We have validated our Bayesian inference approach through cross-validation and verified the computed chromatin conformations using the geometric constraints derived from fluorescence in situ hybridization (FISH) experiments. We have further confirmed the inferred chromatin structures using the known genetic interactions derived from other studies in the literature. Our test results have indicated that our Bayesian framework can compute an accurate ensemble of 3D chromatin conformations that best interpret the distance constraints derived from 3C-based data and also agree with other sources of geometric constraints derived from experimental evidence in the previous studies. The source code of our approach can be found in https://github.com/wangsy11/InfMod3DGen.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Methods Online

Highly efficient CRISPR/Cas9-mediated TAR cloning of genes and chromosomal loci from complex genomes in yeast

高效的CRISPR/Cas9基因和染色体介导的复杂基因组的焦油在酵母中的克隆

Transformation-associated recombination (TAR) protocol allowing the selective isolation of full-length genes complete with their distal enhancer regions and entire genomic loci with sizes up to 250 kb from complex genomes in yeast S. cerevisiae has been developed more than a decade ago. However, its wide spread usage has been impeded by a low efficiency (0.5–2%) of chromosomal region capture during yeast transformants which in turn requires a time-consuming screen of hundreds of colonies. Here, we demonstrate that pre-treatment of genomic DNA with CRISPR-Cas9 nucleases to generate double-strand breaks near the targeted genomic region results in a dramatic increase in the fraction of gene-positive colonies (up to 32%). As only a dozen or less yeast transformants need to be screened to obtain a clone with the desired chromosomal region, extensive experience with yeast is no longer required. A TAR-CRISPR protocol may help to create a bank of human genes, each represented by a genomic copy containing its native regulatory elements, that would lead to a significant advance in functional, structural and comparative genomics, in diagnostics, gene replacement, generation of animal models for human diseases and has a potential for gene therapy.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Methods Online

Gene-specific cell labeling using MiMIC transposons

基因特异性细胞采用模拟的转座子标签

Binary expression systems such as GAL4/UAS, LexA/LexAop and QF/QUAS have greatly enhanced the power of Drosophila as a model organism by allowing spatio-temporal manipulation of gene function as well as cell and neural circuit function. Tissue-specific expression of these heterologous transcription factors relies on random transposon integration near enhancers or promoters that drive the binary transcription factor embedded in the transposon. Alternatively, gene-specific promoter elements are directly fused to the binary factor within the transposon followed by random or site-specific integration. However, such insertions do not consistently recapitulate endogenous expression. We used Minos-Mediated Integration Cassette (MiMIC) transposons to convert host loci into reliable gene-specific binary effectors. MiMIC transposons allow recombinase-mediated cassette exchange to modify the transposon content. We developed novel exchange cassettes to convert coding intronic MiMIC insertions into gene-specific binary factor protein-traps. In addition, we expanded the set of binary factor exchange cassettes available for non-coding intronic MiMIC insertions. We show that binary factor conversions of different insertions in the same locus have indistinguishable expression patterns, suggesting that they reliably reflect endogenous gene expression. We show the efficacy and broad applicability of these new tools by dissecting the cellular expression patterns of the Drosophila serotonin receptor gene family.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Methods Online

Regulation of yeast DNA polymerase {delta}-mediated strand displacement synthesis by 5'-flaps

调节酵母DNA聚合酶{δ}介导链置换合成5’-皮瓣

The strand displacement activity of DNA polymerase is strongly stimulated by its interaction with proliferating cell nuclear antigen (PCNA). However, inactivation of the 3'–5' exonuclease activity is sufficient to allow the polymerase to carry out strand displacement even in the absence of PCNA. We have examined in vitro the basic biochemical properties that allow Pol -exo to carry out strand displacement synthesis and discovered that it is regulated by the 5'-flaps in the DNA strand to be displaced. Under conditions where Pol carries out strand displacement synthesis, the presence of long 5'-flaps or addition in trans of ssDNA suppress this activity. This suggests the presence of a secondary DNA binding site on the enzyme that is responsible for modulation of strand displacement activity. The inhibitory effect of a long 5'-flap can be suppressed by its interaction with single-stranded DNA binding proteins. However, this relief of flap-inhibition does not simply originate from binding of Replication Protein A to the flap and sequestering it. Interaction of Pol with PCNA eliminates flap-mediated inhibition of strand displacement synthesis by masking the secondary DNA site on the polymerase. These data suggest that in addition to enhancing the processivity of the polymerase PCNA is an allosteric modulator of other Pol activities.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Nucleic Acid Enzymes

Heme activation by DNA: isoguanine pentaplexes, but not quadruplexes, bind heme and enhance its oxidative activity

通过DNA血红素激活:pentaplexes异鸟,但不是四,结合血红素和提高其氧化活性

Guanine-rich, single-stranded, DNAs and RNAs are able to fold to form G-quadruplexes that are held together by guanine base quartets. G-quadruplexes are known to bind ferric heme [Fe(III)-protoporphyrin IX] and to strongly activate such bound hemes toward peroxidase (1-electron oxidation) as well as oxygenase/peroxygenase (2-electron oxidation) activities. However, much remains unknown about how such activation is effected. Herein, we investigated whether G-quadruplexes were strictly required for heme activation or whether related multi-stranded DNA/RNA structures such as isoguanine (iG) quadruplexes and pentaplexes could also bind and activate heme. We found that iG-pentaplexes did indeed bind and activate heme comparably to G-quadruplexes; however, iG-quadruplexes did neither. Earlier structural and computational studies had suggested that while the geometry of backbone-unconstrained iG-quintets templated by cations such as Na+ or NH4+ was planar, that of iG-quartets deviated from planarity. We hypothesize that the binding as well as activation of heme by DNA or RNA is strongly supported by the planarity of the nucleobase quartet or quintet that interacts directly with the heme.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Nucleic Acid Enzymes

Splicing factor SRSF1 negatively regulates alternative splicing of MDM2 under damage

剪接因子srsf1负调控剪接MDM2在损伤

Genotoxic stress induces alternative splicing of the oncogene MDM2 generating MDM2-ALT1, an isoform attributed with tumorigenic properties. However, the mechanisms underlying this event remain unclear. Here we explore MDM2 splicing regulation by utilizing a novel minigene that mimics endogenous MDM2 splicing in response to UV and cisplatinum-induced DNA damage. We report that exon 11 is necessary and sufficient for the damage-specific alternative splicing of the MDM2 minigene and that the splicing factor SRSF1 binds exon 11 at evolutionarily conserved sites. Interestingly, mutations disrupting this interaction proved sufficient to abolish the stress-induced alternative splicing of the MDM2 minigene. Furthermore, SRSF1 overexpression promoted exclusion of exon 11, while its siRNA-mediated knockdown prevented the stress-induced alternative splicing of endogenous MDM2. Additionally, we observed elevated SRSF1 levels under stress and in tumors correlating with the expression of MDM2-ALT1. Notably, we demonstrate that MDM2-ALT1 splicing can be blocked by targeting SRSF1 sites on exon 11 using antisense oligonucleotides. These results present conclusive evidence supporting a negative role for SRSF1 in MDM2 alternative splicing. Importantly, we define for the first time, a clear-cut mechanism for the regulation of damage-induced MDM2 splicing and present potential strategies for manipulating MDM2 expression via splicing modulation.

[详细]

  • Nucleic Acids Research
  • 10年前
  • RNA

The feedback control of UPF3 is crucial for RNA surveillance in plants

对UPF3反馈控制是植物中RNA监控至关重要

Nonsense-mediated-decay (NMD) is a eukaryotic RNA surveillance mechanism that controls the levels of both aberrant and normal transcripts. The regulation of this process is not well understood. The Arabidopsis NMD factor UPF3 is regulated by a negative feedback-loop that targets its own transcript for NMD. We investigated the functional significance of this control for the overall regulation of NMD in Arabidopsis. For this, we tested the ability of NMD-sensitive and -insensitive forms of UPF3, expressed under the control of UPF3 promoter, to complement NMD functionality in NMD-mutant plants and investigated their impact in wild-type (WT) plants. The sensitivity of UPF3 transcript to NMD was essential for efficient complementation of NMD in upf3 mutants. Upregulated UPF3 expression in WT plants resulted in over-degradation of certain transcripts and inhibited degradation of other transcripts. Our results demonstrate that, in contrast to mammalian cells, a delicate balance of UPF3 transcript levels by its feedback loop and by restriction of its transcription, are crucial for proper NMD regulation in Arabidopsis. Interestingly, the levels of many small-nucleolar-RNAs (snoRNAs) were decreased in upf1 and upf3 mutants and increased upon enhanced UPF3 expression. This suggests that proper snoRNA homeostasis in Arabidopsis depends on the integrity of the NMD pathway.

[详细]

  • Nucleic Acids Research
  • 10年前
  • RNA

RBM7 subunit of the NEXT complex binds U-rich sequences and targets 3'-end extended forms of snRNAs

在未来的复杂结合丰富的序列和3’端的目标扩展形式由rbm7亚基

The Nuclear Exosome Targeting (NEXT) complex is a key cofactor of the mammalian nuclear exosome in the removal of Promoter Upstream Transcripts (PROMPTs) and potentially aberrant forms of other noncoding RNAs, such as snRNAs. NEXT is composed of three subunits SKIV2L2, ZCCHC8 and RBM7. We have recently identified the NEXT complex in our screen for oligo(U) RNA-binding factors. Here, we demonstrate that NEXT displays preference for U-rich pyrimidine sequences and this RNA binding is mediated by the RNA recognition motif (RRM) of the RBM7 subunit. We solved the structure of RBM7 RRM and identified two phenylalanine residues that are critical for interaction with RNA. Furthermore, we showed that these residues are required for the NEXT interaction with snRNAs in vivo. Finally, we show that depletion of components of the NEXT complex alone or together with exosome nucleases resulted in the accumulation of mature as well as extended forms of snRNAs. Thus, our data suggest a new scenario in which the NEXT complex is involved in the surveillance of snRNAs and/or biogenesis of snRNPs.

[详细]

  • Nucleic Acids Research
  • 10年前
  • RNA

Nuclear pore components affect distinct stages of intron-containing gene expression

核孔复合体影响含有内含子的不同阶段的基因表达

Several nuclear pore-associated factors, including the SUMO-protease Ulp1, have been proposed to prevent the export of intron-containing messenger ribonucleoparticles (mRNPs) in yeast. However, the molecular mechanisms of this nuclear pore-dependent mRNA quality control, including the sumoylated targets of Ulp1, have remained unidentified. Here, we demonstrate that the apparent ‘pre-mRNA leakage’ phenotype arising upon ULP1 inactivation is shared by sumoylation mutants of the THO complex, an early mRNP biogenesis factor. Importantly, we establish that alteration of THO complex activity differentially impairs the expression of intronless and intron-containing reporter genes, rather than triggering bona fide ‘pre-mRNA leakage’. Indeed, we show that the presence of introns within THO target genes attenuates the effect of THO inactivation on their transcription. Epistasis analyses further clarify that different nuclear pore components influence intron-containing gene expression at distinct stages. Ulp1, whose maintenance at nuclear pores depends on the Nup84 complex, impacts on THO-dependent gene expression, whereas the nuclear basket-associated Mlp1/Pml39 proteins prevent pre-mRNA export at a later stage, contributing to mRNA quality control. Our study thus highlights the multiplicity of mechanisms by which nuclear pores contribute to gene expression, and further provides the first evidence that intronic sequences can alleviate early mRNP biogenesis defects.

[详细]

  • Nucleic Acids Research
  • 10年前
  • RNA

A common tRNA modification at an unusual location: the discovery of wyosine biosynthesis in mitochondria

在一个不寻常的位置一般在线粒体tRNA修饰:丫苷生物合成的发现

Establishment of the early genetic code likely required strategies to ensure translational accuracy and inevitably involved tRNA post-transcriptional modifications. One such modification, wybutosine/wyosine is crucial for translational fidelity in Archaea and Eukarya; yet it does not occur in Bacteria and has never been described in mitochondria. Here, we present genetic, molecular and mass spectromery data demonstrating the first example of wyosine in mitochondria, a situation thus far unique to kinetoplastids. We also show that these modifications are important for mitochondrial function, underscoring their biological significance. This work focuses on TyW1, the enzyme required for the most critical step of wyosine biosynthesis. Based on molecular phylogeny, we suggest that the kinetoplastids pathways evolved via gene duplication and acquisition of an FMN-binding domain now prevalent in TyW1 of most eukaryotes. These findings are discussed in the context of the extensive U-insertion RNA editing in trypanosome mitochondria, which may have provided selective pressure for maintenance of mitochondrial wyosine in this lineage.

[详细]

  • Nucleic Acids Research
  • 10年前
  • RNA

Distribution of DNA-condensing protein complexes in the adenovirus core

DNA浓缩蛋白质复合物在腺病毒核心分布

Genome packing in adenovirus has long evaded precise description, since the viral dsDNA molecule condensed by proteins (core) lacks icosahedral order characteristic of the virus protein coating (capsid). We show that useful insights regarding the organization of the core can be inferred from the analysis of spatial distributions of the DNA and condensing protein units (adenosomes). These were obtained from the inspection of cryo-electron tomography reconstructions of individual human adenovirus particles. Our analysis shows that the core lacks symmetry and strict order, yet the adenosome distribution is not entirely random. The features of the distribution can be explained by modeling the condensing proteins and the part of the genome in each adenosome as very soft spheres, interacting repulsively with each other and with the capsid, producing a minimum outward pressure of ~0.06 atm. Although the condensing proteins are connected by DNA in disrupted virion cores, in our models a backbone of DNA linking the adenosomes is not required to explain the experimental results in the confined state. In conclusion, the interior of an adenovirus infectious particle is a strongly confined and dense phase of soft particles (adenosomes) without a strictly defined DNA backbone.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Structural Biology

The hexameric structure of the human mitochondrial replicative helicase Twinkle

人类的线粒体复制解旋酶闪烁的六聚体结构

The mitochondrial replicative helicase Twinkle is involved in strand separation at the replication fork of mitochondrial DNA (mtDNA). Twinkle malfunction is associated with rare diseases that include late onset mitochondrial myopathies, neuromuscular disorders and fatal infantile mtDNA depletion syndrome. We examined its 3D structure by electron microscopy (EM) and small angle X-ray scattering (SAXS) and built the corresponding atomic models, which gave insight into the first molecular architecture of a full-length SF4 helicase that includes an N-terminal zinc-binding domain (ZBD), an intermediate RNA polymerase domain (RPD) and a RecA-like hexamerization C-terminal domain (CTD). The EM model of Twinkle reveals a hexameric two-layered ring comprising the ZBDs and RPDs in one layer and the CTDs in another. In the hexamer, contacts in trans with adjacent subunits occur between ZBDs and RPDs, and between RPDs and CTDs. The ZBDs show important structural heterogeneity. In solution, the scattering data are compatible with a mixture of extended hexa- and heptameric models in variable conformations. Overall, our structural data show a complex network of dynamic interactions that reconciles with the structural flexibility required for helicase activity.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Structural Biology

Structures of Escherichia coli DNA adenine methyltransferase (Dam) in complex with a non-GATC sequence: potential implications for methylation-independent transcriptional repression

大肠杆菌DNA腺嘌呤甲基转移酶结构(坝)与非GATC序列复杂:独立的转录抑制甲基化的潜在影响

DNA adenine methyltransferase (Dam) is widespread and conserved among the -proteobacteria. Methylation of the Ade in GATC sequences regulates diverse bacterial cell functions, including gene expression, mismatch repair and chromosome replication. Dam also controls virulence in many pathogenic Gram-negative bacteria. An unexplained and perplexing observation about Escherichia coli Dam (EcoDam) is that there is no obvious relationship between the genes that are transcriptionally responsive to Dam and the promoter-proximal presence of GATC sequences. Here, we demonstrate that EcoDam interacts with a 5-base pair non-cognate sequence distinct from GATC. The crystal structure of a non-cognate complex allowed us to identify a DNA binding element, GTYTA/TARAC (where Y = C/T and R = A/G). This element immediately flanks GATC sites in some Dam-regulated promoters, including the Pap operon which specifies pyelonephritis-associated pili. In addition, Dam interacts with near-cognate GATC sequences (i.e. 3/4-site ATC and GAT). Taken together, these results imply that Dam, in addition to being responsible for GATC methylation, could also function as a methylation-independent transcriptional repressor.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Structural Biology

The structural impact of DNA mismatches

DNA错配的结构性影响

The structure and dynamics of all the transversion and transition mismatches in three different DNA environments have been characterized by molecular dynamics simulations and NMR spectroscopy. We found that the presence of mismatches produced significant local structural alterations, especially in the case of purine transversions. Mismatched pairs often show promiscuous hydrogen bonding patterns, which interchange among each other in the nanosecond time scale. This therefore defines flexible base pairs, where breathing is frequent, and where distortions in helical parameters are strong, resulting in significant alterations in groove dimension. Even if the DNA structure is plastic enough to absorb the structural impact of the mismatch, local structural changes can be propagated far from the mismatch site, following the expected through-backbone and a previously unknown through-space mechanism. The structural changes related to the presence of mismatches help to understand the different susceptibility of mismatches to the action of repairing proteins.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Structural Biology

Heterogeneous dynamics in DNA site discrimination by the structurally homologous DNA-binding domains of ETS-family transcription factors

由ETS家族转录因子的结构同源的DNA结合结构域的DNA位点鉴别异构动力学

The ETS family of transcription factors exemplifies current uncertainty in how eukaryotic genetic regulators with overlapping DNA sequence preferences achieve target site specificity. PU.1 and Ets-1 represent archetypes for studying site discrimination by ETS proteins because their DNA-binding domains are the most divergent in sequence, yet they share remarkably superimposable DNA-bound structures. To gain insight into the contrasting thermodynamics and kinetics of DNA recognition by these two proteins, we investigated the structure and dynamics of site discrimination by their DNA-binding domains. Electrophoretic mobilities of complexes formed by the two homologs with circularly permuted binding sites showed significant dynamic differences only for DNA complexes of PU.1. Free solution measurements by dynamic light scattering showed PU.1 to be more dynamic than Ets-1; moreover, dynamic changes are strongly coupled to site discrimination by PU.1, but not Ets-1. Interrogation of the protein/DNA interface by DNA footprinting showed similar accessibility to dimethyl sulfate for PU.1/DNA and Ets-1/DNA complexes, indicating that the dynamics of PU.1/DNA complexes reside primarily outside that interface. An information-based analysis of the two homologs’ binding motifs suggests a role for dynamic coupling in PU.1's ability to enforce a more stringent sequence preference than Ets-1 and its proximal sequence homologs.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Structural Biology

Effects of Hfq on the conformation and compaction of DNA

HFQ对压实的DNA构象的影响

Hfq is a bacterial pleiotropic regulator that mediates several aspects of nucleic acids metabolism. The protein notably influences translation and turnover of cellular RNAs. Although most previous contributions concentrated on Hfq's interaction with RNA, its association to DNA has also been observed in vitro and in vivo. Here, we focus on DNA-compacting properties of Hfq. Various experimental technologies, including fluorescence microscopy imaging of single DNA molecules confined inside nanofluidic channels, atomic force microscopy and small angle neutron scattering have been used to follow the assembly of Hfq on DNA. Our results show that Hfq forms a nucleoprotein complex, changes the mechanical properties of the double helix and compacts DNA into a condensed form. We propose a compaction mechanism based on protein-mediated bridging of DNA segments. The propensity for bridging is presumably related to multi-arm functionality of the Hfq hexamer, resulting from binding of the C-terminal domains to the duplex. Results are discussed in regard to previous results obtained for H-NS, with important implications for protein binding related gene regulation.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Structural Biology

Selective inhibition of miR-21 by phage display screened peptide

利用噬菌体展示技术筛选选择性抑制miR-21的肽

miRNAs are nodal regulators of gene expression and deregulation of miRNAs is causally associated with different diseases, including cancer. Modulation of miRNA expression is thus of therapeutic importance. Small molecules are currently being explored for their potential to downregulate miRNAs. Peptides have shown to have better potency and selectivity toward their targets but their potential in targeting and modulating miRNAs remain unexplored. Herein, using phage display we found a very selective peptide against pre-miR-21. Interestingly, the peptide has the potential to downregulate miR-21, by binding to pre-miR-21 and hindering Dicer processing. It is selective towards miR-21 inside the cell. By antagonising miR-21 function, the peptide is able to increase the expression of its target proteins and thereby increase apoptosis and suppress cell proliferation, invasion and migration. This peptide can further be explored for its anti-cancer activity in vivo and may be even extended to clinical studies.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Synthetic Biology and Bioengineering

A universal strategy for regulating mRNA translation in prokaryotic and eukaryotic cells

一个通用的战略调整在原核和真核细胞mRNA的翻译

We describe a simple strategy to control mRNA translation in both prokaryotic and eukaryotic cells which relies on a unique protein–RNA interaction. Specifically, we used the Pumilio/FBF (PUF) protein to repress translation by binding in between the ribosome binding site (RBS) and the start codon (in Escherichia coli), or by binding to the 5' untranslated region of target mRNAs (in mammalian cells). The design principle is straightforward, the extent of translational repression can be tuned and the regulator is genetically encoded, enabling the construction of artificial signal cascades. We demonstrate that this approach can also be used to regulate polycistronic mRNAs; such regulation has rarely been achieved in previous reports. Since the regulator used in this study is a modular RNA-binding protein, which can be engineered to target different 8-nucleotide RNA sequences, our strategy could be used in the future to target endogenous mRNAs for regulating metabolic flows and signaling pathways in both prokaryotic and eukaryotic cells.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Synthetic Biology and Bioengineering

Modeling the relationship of epigenetic modifications to transcription factor binding

对转录因子结合的表观遗传修饰的关系建模

Transcription factors (TFs) and epigenetic modifications play crucial roles in the regulation of gene expression, and correlations between the two types of factors have been discovered. However, methods for quantitatively studying the correlations remain limited. Here, we present a computational approach to systematically investigating how epigenetic changes in chromatin architectures or DNA sequences relate to TF binding. We implemented statistical analyses to illustrate that epigenetic modifications are predictive of TF binding affinities, without the need of sequence information. Intriguingly, by considering genome locations relative to transcription start sites (TSSs) or enhancer midpoints, our analyses show that different locations display various relationship patterns. For instance, H3K4me3, H3k9ac and H3k27ac contribute more in the regions near TSSs, whereas H3K4me1 and H3k79me2 dominate in the regions far from TSSs. DNA methylation plays relatively important roles when close to TSSs than in other regions. In addition, the results show that epigenetic modification models for the predictions of TF binding affinities are cell line-specific. Taken together, our study elucidates highly coordinated, but location- and cell type-specific relationships between epigenetic modifications and binding affinities of TFs.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Computational Biology

SubmiRine: assessing variants in microRNA targets using clinical genomic data sets

submirine:评估microRNA靶采用临床基因组数据集的变种

MicroRNAs (miRNAs) regulate gene expression by binding to partially complementary sequences on target mRNA transcripts, thereby causing their degradation, deadenylation, or inhibiting their translation. Genomic variants can alter miRNA regulation by modifying miRNA target sites, and multiple human disease phenotypes have been linked to such miRNA target site variants (miR-TSVs). However, systematic genome-wide identification of functional miR-TSVs is difficult due to high false positive rates; functional miRNA recognition sequences can be as short as six nucleotides, with the human genome encoding thousands of miRNAs. Furthermore, while large-scale clinical genomic data sets are becoming increasingly commonplace, existing miR-TSV prediction methods are not designed to analyze these data. Here, we present an open-source tool called SubmiRine that is designed to perform efficient miR-TSV prediction systematically on variants identified in novel clinical genomic data sets. Most importantly, SubmiRine allows for the prioritization of predicted miR-TSVs according to their relative probability of being functional. We present the results of SubmiRine using integrated clinical genomic data from a large-scale cohort study on chronic obstructive pulmonary disease (COPD), making a number of high-scoring, novel miR-TSV predictions. We also demonstrate SubmiRine's ability to predict and prioritize known miR-TSVs that have undergone experimental validation in previous studies.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Computational Biology

Reconstructing genome-scale metabolic models with merlin

梅林重建的基因组规模的代谢模型

The Metabolic Models Reconstruction Using Genome-Scale Information (merlin) tool is a user-friendly Java application that aids the reconstruction of genome-scale metabolic models for any organism that has its genome sequenced. It performs the major steps of the reconstruction process, including the functional genomic annotation of the whole genome and subsequent construction of the portfolio of reactions. Moreover, merlin includes tools for the identification and annotation of genes encoding transport proteins, generating the transport reactions for those carriers. It also performs the compartmentalisation of the model, predicting the organelle localisation of the proteins encoded in the genome and thus the localisation of the metabolites involved in the reactions promoted by such enzymes. The gene-proteins-reactions (GPR) associations are automatically generated and included in the model. Finally, merlin expedites the transition from genomic data to draft metabolic models reconstructions exported in the SBML standard format, allowing the user to have a preliminary view of the biochemical network, which can be manually curated within the environment provided by merlin.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Computational Biology

Epigenetic regulation of intragenic transposable elements impacts gene transcription in Arabidopsis thaliana

基因转座因子影响基因转录的表观遗传调控拟南芥

Genomes of higher eukaryotes, including plants, contain numerous transposable elements (TEs), that are often silenced by epigenetic mechanisms, such as histone modifications and DNA methylation. Although TE silencing adversely affects expression of nearby genes, recent studies reveal the presence of intragenic TEs marked by repressive heterochromatic epigenetic marks within transcribed genes. However, even for the well-studied plant model Arabidopsis thaliana, the abundance of intragenic TEs, how they are epigenetically regulated, and their potential impacts on host gene expression, remain unexplored. In this study, we comprehensively analyzed genome-wide distribution and epigenetic regulation of intragenic TEs in A. thaliana. Our analysis revealed that about 3% of TEs are located within gene bodies, dominantly at intronic regions. Most of them are shorter and less methylated than intergenic TEs, but they are still targeted by RNA-directed DNA methylation-dependent and independent pathways. Surprisingly, the heterochromatic epigenetic marks at TEs are maintained within actively transcribed genes. Moreover, the heterochromatic state of intronic TEs is critical for proper transcription of associated genes. Our study provides the first insight into how intragenic TEs affect the transcriptional landscape of the A. thaliana genome, and suggests the importance of epigenetic mechanisms for regulation of TEs within transcriptional gene units.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Gene regulation, Chromatin and Epigenetics

The interaction of mycobacterial protein Rv2966c with host chromatin is mediated through non-CpG methylation and histone H3/H4 binding

与宿主染色体rv2966c分枝杆菌蛋白质的相互作用是介导通过非CpG甲基化和组蛋白H3、H4结合

To effectively modulate the gene expression within an infected mammalian cell, the pathogen Mycobacterium tuberculosis would need to bring about epigenetic modifications at appropriate genomic loci. Working on this hypothesis, we show in this study that the mycobacterial protein Rv2966c is a 5-methylcytosine-specific DNA methyltransferase that is secreted out from the mycobacterium and gets localized to the nucleus in addition to the cytoplasm inside the host cell. Importantly, Rv2966c binds to specific DNA sequences, methylates cytosines predominantly in a non-CpG context and its methylation activity is positively influenced by phosphorylation. Interestingly, like the mammalian DNA methyltransferase, DNMT3L, Rv2966c can also interact with histone proteins. Ours is the first study that identifies a protein from a pathogenic bacteria with potential to influence host DNA methylation in a non-canonical manner providing the pathogen with a novel mechanism to alter the host epigenetic machinery. This contention is supported by repression of host genes upon M. tuberculosis infection correlated with Rv2966c binding and non-CpG methylation.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Gene regulation, Chromatin and Epigenetics

RNA polymerase II pausing can be retained or acquired during activation of genes involved in the epithelial to mesenchymal transition

RNA聚合酶II暂停可以保留或参与上皮间质转化的基因活化过程中获得

Promoter-proximal RNA polymerase II (Pol II) pausing is implicated in the regulation of gene transcription. However, the mechanisms of pausing including its dynamics during transcriptional responses remain to be fully understood. We performed global analysis of short capped RNAs and Pol II Chromatin Immunoprecipitation sequencing in MCF-7 breast cancer cells to map Pol II pausing across the genome, and used permanganate footprinting to specifically follow pausing during transcriptional activation of several genes involved in the epithelial to mesenchymal transition (EMT). We find that the gene for EMT master regulator Snail (SNAI1), but not Slug (SNAI2), shows evidence of Pol II pausing before activation. Transcriptional activation of the paused SNAI1 gene is accompanied by a further increase in Pol II pausing signal, whereas activation of non-paused SNAI2 gene results in the acquisition of a typical pausing signature. The increase in pausing signal reflects increased transcription initiation without changes in Pol II pausing. Activation of the heat shock HSP70 gene involves pausing release that speeds up Pol II turnover, but does not change pausing location. We suggest that Pol II pausing is retained during transcriptional activation and can further undergo regulated release in a signal-specific manner.

[详细]

  • Nucleic Acids Research
  • 10年前
  • Gene regulation, Chromatin and Epigenetics