NetworkAnalyst for statistical, visual and network-based meta-analysis of gene expression data

This protocol describes NetworkAnalyst, a web-based tool for performing network analysis and visualization from gene lists and meta-analysis of gene expression datasets, and for displaying results as protein-protein interaction networks, heatmaps and chord diagrams.

  • Nature Protocols 10, 823 (2015)
  • 10 years ago
  • Protocol

The Phyre2 web portal for protein modeling, prediction and analysis

Phyre2 is a web-based tool for predicting and analyzing protein structure and function. Phyre2 uses advanced remote homology detection methods to build 3D models, predict ligand binding sites, and analyze amino acid variants in a protein sequence.

  • Nature Protocols 10, 845 (2015)
  • 10 years ago
  • Protocol

Accurate prediction of RNA nucleotide interactions with backbone k-tree model

Motivation: Given the importance of non-coding RNAs to cellular regulatory functions, it would be highly desirable to have accurate computational prediction of RNA 3D structure, a task which remains challenging. Even for a short RNA sequence, the space of tertiary conformations is immense; existing methods to identify native-like conformations mostly resort to random sampling of conformations to achieve computational feasibility. However, native conformations may not be examined and prediction accuracy may be compromised due to sampling. State-of-the-art methods have yet to deliver satisfactory predictions for RNAs of length beyond 50 nucleotides.

Results: This paper presents a method to tackle a key step in the RNA 3D structure prediction problem, the prediction of the nucleotide interactions that constitute the desired 3D structure. The research is based on a novel graph model, called a backbone k-tree, to tightly constrain the nucleotide interaction relationships considered for RNA 3D structures. It is shown that the new model makes it possible to efficiently predict the optimal set of nucleotide interactions (including the non-canonical interactions in all recently revealed families) from the query sequence along with known or predicted canonical basepairs. The preliminary results indicate that in most cases the new method can predict with a high accuracy the nucleotide interactions that constitute the 3D structure of the query sequence. It thus provides a useful tool for the accurate prediction of RNA 3D structure.

Availability and Implementation: The source package for BkTree is available at http://rna-informatics.uga.edu/index.php?f=software&p=BkTree.

Contact: lding@uga.edu or cai@cs.uga.edu

Supplementary information: Supplementary data are available at Bioinformatics online.

  • Bioinformatics
  • 10 years ago
  • ORIGINAL PAPER

PDIviz: analysis and visualization of protein-DNA binding interfaces

Summary: Specific recognition of DNA by proteins is a crucial step of many biological processes. PDIviz is a plugin for the PyMOL molecular visualization system that analyzes protein–DNA binding interfaces by comparing the solvent accessible surface area of the complex against the free protein and free DNA. The plugin provides three distinct three-dimensional visualization modes to highlight interactions with DNA bases and backbone, major and minor groove, and with atoms of different pharmacophoric type (hydrogen bond donors/acceptors, hydrophobic and thymine methyl). Each mode comes in three styles to focus the visual analysis on the protein or DNA side of the interface, or on the nucleotide sequence. PDIviz allows for the generation of publication quality images, all calculated data can be written to disk, and a command line interface is provided for automating tasks. The plugin may be helpful for the detailed identification of regions involved in DNA base and shape readout, and can be particularly useful in rapidly pinpointing the overall mode of interaction.

Availability and implementation: Freely available at http://melolab.org/pdiviz/ as a PyMOL plugin. Tested with incentive, educational, and open source versions of PyMOL on Windows, Mac and Linux systems.

Contact: aschueller@bio.puc.cl

Supplementary Information: Supplementary data are available at Bioinformatics online.
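
The interface analysis described in the summary rests on a simple difference: an atom contributes to the interface if its solvent accessible surface area (SASA) is smaller in the complex than in the free molecule. The following is a minimal Python sketch of that arithmetic on made-up per-atom values. The function name `buried_area`, the atom labels, and the 0.01 Å² cutoff are assumptions for illustration and are not taken from PDIviz.

```python
def buried_area(sasa_free, sasa_complex, cutoff=0.01):
    """Return the per-atom buried area (free minus complex) above a cutoff.

    sasa_free and sasa_complex map an atom label to its SASA in square
    angstroms. Names and cutoff are hypothetical, not PDIviz's own API.
    """
    buried = {}
    for atom, free in sasa_free.items():
        delta = free - sasa_complex[atom]
        if delta > cutoff:
            buried[atom] = delta
    return buried


# Made-up SASA values for three protein atoms, free and bound to DNA.
free = {"ARG10:NH1": 35.0, "LYS22:NZ": 40.0, "ALA50:CB": 12.0}
bound = {"ARG10:NH1": 5.0, "LYS22:NZ": 40.0, "ALA50:CB": 2.5}

interface = buried_area(free, bound)   # atoms that lose surface on binding
total = sum(interface.values())        # total buried area on the protein side
```

In practice the per-atom SASA values would come from a structure analysis tool; only the subtraction step is shown here.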

  • Bioinformatics
  • 10 years ago
  • APPLICATIONS NOTE

GeneTIER: prioritization of candidate disease genes using tissue-specific gene expression profiles

Motivation: In attempts to determine the genetic causes of human disease, researchers are often faced with a large number of candidate genes. Linkage studies can point to a genomic region containing hundreds of genes, while the high-throughput sequencing approach will often identify a great number of non-synonymous genetic variants. Since systematic experimental verification of each such candidate gene is not feasible, a method is needed to decide which genes are worth investigating further. Computational gene prioritization presents itself as a solution to this problem, systematically analyzing and sorting each gene from the most to least likely to be the disease-causing gene, in a fraction of the time it would take a researcher to perform such queries manually.

Results: Here, we present Gene TIssue Expression Ranker (GeneTIER), a new web-based application for candidate gene prioritization. GeneTIER replaces the knowledge-based inference traditionally used in candidate disease gene prioritization applications with experimental data from tissue-specific gene expression datasets and thus largely overcomes the bias toward the better characterized genes/diseases that commonly afflicts other methods. We show that our approach is capable of accurate candidate gene prioritization and illustrate its strengths and weaknesses using case study examples.

Availability and Implementation: Freely available on the web at http://dna.leeds.ac.uk/GeneTIER/.

Contact: umaan@leeds.ac.uk

Supplementary information: Supplementary data are available at Bioinformatics online.

  • Bioinformatics
  • 10 years ago
  • ORIGINAL PAPER

HAT3-mediated acetylation of PCNA precedes PCNA monoubiquitination following exposure to UV radiation in Leishmania donovani

Histone modifications impact various processes. In examining histone acetyltransferase HAT3 of Leishmania donovani, we find that elimination of HAT3 causes decreased cell viability due to defects in histone deposition, and an aberrant cell cycle progression pattern. HAT3 associates with proliferating cell nuclear antigen (PCNA), helping load PCNA onto chromatin in proliferating cells. HAT3-nulls show heightened sensitivity to UV radiation. Following UV exposure, PCNA cycles off/on chromatin only in cells expressing HAT3. Inhibition of the ubiquitin-proteasome pathway prior to UV exposure allows accumulation of chromatin-bound PCNA, and reveals that HAT3-nulls are deficient in PCNA monoubiquitination as well as polyubiquitination. While poor monoubiquitination of PCNA may adversely affect translesion DNA synthesis-based repair processes, polyubiquitination deficiencies may result in continued retention of chromatin-bound PCNA, leading to genomic instability. On suppressing the proteasome pathway we also find that HAT3 mediates PCNA acetylation in response to UV. HAT3-mediated PCNA acetylation may serve as a flag for PCNA ubiquitination, thus aiding DNA repair. While PCNA acetylation has previously been linked to its degradation following UV exposure, this is the first report linking a HAT-mediated PCNA acetylation to PCNA monoubiquitination. These findings add a new dimension to our knowledge of the mechanisms regulating PCNA ubiquitination post-UV exposure in eukaryotes.

  • Nucleic Acids Research
  • 10 years ago
  • Gene regulation, Chromatin and Epigenetics

PACCMIT/PACCMIT-CDS: identifying microRNA targets in 3' UTRs and coding sequences

The purpose of the proposed web server, publicly available at http://paccmit.epfl.ch, is to provide a user-friendly interface to two algorithms for predicting messenger RNA (mRNA) molecules regulated by microRNAs: (i) PACCMIT (Prediction of ACcessible and/or Conserved MIcroRNA Targets), which identifies primarily mRNA transcripts targeted in their 3' untranslated regions (3' UTRs), and (ii) PACCMIT-CDS, designed to find mRNAs targeted within their coding sequences (CDSs). While PACCMIT belongs among the accurate algorithms for predicting conserved microRNA targets in the 3' UTRs, the main contribution of the web server is 2-fold: PACCMIT provides an accurate tool for predicting targets also of weakly conserved or non-conserved microRNAs, whereas PACCMIT-CDS addresses the lack of similar portals adapted specifically for targets in CDS. The web server asks the user for microRNAs and mRNAs to be analyzed, accesses the precomputed P-values for all microRNA–mRNA pairs from a database for all mRNAs and microRNAs in a given species, ranks the predicted microRNA–mRNA pairs, evaluates their significance according to the false discovery rate and finally displays the predictions in a tabular form. The results are also available for download in several standard formats.
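
The ranking and false-discovery-rate step described above can be illustrated with a short Python sketch. The abstract does not say which FDR procedure the server uses, so the Benjamini-Hochberg adjustment below is an assumption, and the microRNA and mRNA names, the P-values, and the 0.1 threshold are made up for illustration.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg q-values in the same order as the input."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    qvalues = [0.0] * m
    running_min = 1.0
    # Walk from the largest P-value down so q-values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        qvalues[i] = running_min
    return qvalues


# Hypothetical microRNA-mRNA pairs with precomputed P-values.
pairs = [("miR-A", "mRNA-1"), ("miR-A", "mRNA-2"),
         ("miR-B", "mRNA-1"), ("miR-B", "mRNA-3")]
pvals = [0.01, 0.04, 0.03, 0.5]

qvals = benjamini_hochberg(pvals)
# Rank pairs by P-value and keep those passing an (assumed) FDR of 0.1.
ranked = sorted(zip(pairs, pvals, qvals), key=lambda row: row[1])
significant = [pair for pair, _, q in ranked if q <= 0.1]
```

Ranking by P-value first and thresholding on the adjusted value afterwards keeps the displayed order independent of the chosen FDR cutoff.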

  • Nucleic Acids Research
  • 10 years ago
  • Web Server Issue

antiSMASH 3.0--a comprehensive resource for the genome mining of biosynthetic gene clusters

Microbial secondary metabolism constitutes a rich source of antibiotics, chemotherapeutics, insecticides and other high-value chemicals. Genome mining of gene clusters that encode the biosynthetic pathways for these metabolites has become a key methodology for novel compound discovery. In 2011, we introduced antiSMASH, a web server and stand-alone tool for the automatic genomic identification and analysis of biosynthetic gene clusters, available at http://antismash.secondarymetabolites.org. Here, we present version 3.0 of antiSMASH, which has undergone major improvements. A full integration of the recently published ClusterFinder algorithm now allows using this probabilistic algorithm to detect putative gene clusters of unknown types. Also, a new dereplication variant of the ClusterBlast module now identifies similarities of identified clusters to any of 1172 clusters with known end products. At the enzyme level, active sites of key biosynthetic enzymes are now pinpointed through a curated pattern-matching procedure and Enzyme Commission numbers are assigned to functionally classify all enzyme-coding genes. Additionally, chemical structure prediction has been improved by incorporating polyketide reduction states. Finally, in order for users to be able to organize and analyze multiple antiSMASH outputs in a private setting, a new XML output module allows offline editing of antiSMASH annotations within the Geneious software.

  • Nucleic Acids Research
  • 10 years ago
  • Web Server Issue

CDK-mediated RNF4 phosphorylation regulates homologous recombination in S-phase

Two major pathways are responsible for the repair of DNA double-strand breaks (DSBs): non-homologous end-joining (NHEJ) and homologous recombination (HR). NHEJ operates throughout the cell cycle, while HR is primarily active in the S/G2 phases, suggesting that cell cycle-specific mechanisms regulate the balance between NHEJ and HR. Here we report that CDK2 can phosphorylate RNF4 on T26 and T112 and enhance RNF4 E3 ligase activity, which is important for MDC1 degradation and proper HR repair during S phase. Mutation of the RNF4 phosphorylation sites results in MDC1 stabilization, which in turn compromises HR during S phase. These results suggest that, in addition to driving cell cycle progression, CDK also targets RNF4, which is involved in the regulatory network of DSB repair.

  • Nucleic Acids Research
  • 10 years ago
  • Genome integrity, repair and replication

Cyclic di-GMP acts as a cell cycle oscillator to drive chromosome replication

Fundamental to all living organisms is the capacity to coordinate cell division and cell differentiation to generate appropriate numbers of specialized cells. Whereas eukaryotes use cyclins and cyclin-dependent kinases to balance division with cell fate decisions, equivalent regulatory systems have not been described in bacteria. Moreover, the mechanisms used by bacteria to tune division in line with developmental programs are poorly understood. Here we show that Caulobacter crescentus, a bacterium with an asymmetric division cycle, uses oscillating levels of the second messenger cyclic diguanylate (c-di-GMP) to drive its cell cycle. We demonstrate that c-di-GMP directly binds to the essential cell cycle kinase CckA to inhibit kinase activity and stimulate phosphatase activity. An upshift of c-di-GMP during the G1–S transition switches CckA from the kinase to the phosphatase mode, thereby allowing replication initiation and cell cycle progression. Finally, we show that during division, c-di-GMP imposes spatial control on CckA to install the replication asymmetry of future daughter cells. These studies reveal c-di-GMP to be a cyclin-like molecule in bacteria that coordinates chromosome replication with cell morphogenesis in Caulobacter. The observation that c-di-GMP-mediated control is conserved in the plant pathogen Agrobacterium tumefaciens suggests a general mechanism through which this global regulator of bacterial virulence and persistence coordinates behaviour and cell proliferation.

  • Nature
  • 10 years ago
  • Letter

GS-align for glycan structure alignment and similarity measurement

Motivation: Glycans play critical roles in many biological processes, and their structural diversity is key for specific protein-glycan recognition. Comparative structural studies of biological molecules provide useful insight into their biological relationships. However, most computational tools are designed for protein structure, and despite their importance, there is no currently available tool for comparing glycan structures in a sequence order- and size-independent manner.

Results: A novel method, GS-align, is developed for glycan structure alignment and similarity measurement. GS-align generates possible alignments between two glycan structures through iterative maximum clique search and fragment superposition. The optimal alignment is then determined by the maximum structural similarity score, GS-score, which is size-independent. Benchmark tests against the Protein Data Bank (PDB) N-linked glycan library and PDB homologous/non-homologous N-glycoprotein sets indicate that GS-align is a robust computational tool to align glycan structures and quantify their structural similarity. GS-align is also applied to template-based glycan structure prediction and monosaccharide substitution matrix generation to illustrate its utility.

Availability and implementation: http://www.glycanstructure.org/gsalign.

Contact: wonpil@ku.edu

Supplementary information: Supplementary data are available at Bioinformatics online.

  • Bioinformatics
  • 10 years ago
  • ORIGINAL PAPER

(PS)2: protein structure prediction server version 3.0

Protein complexes are involved in many biological processes. Examining the coupling between subunits of a complex helps to understand the molecular basis of protein function. Here, our updated (PS)2 web server predicts the three-dimensional structures of protein complexes based on comparative modeling; furthermore, the server examines the coupling between subunits of the predicted complex by combining structural and evolutionary considerations. The predicted complex structure can be visualized with Java-based 3D graphics viewers, and the structural and evolutionary profiles are shown and compared chain by chain. For each subunit, including or excluding the packing contribution of the other subunits changes the similarity between the structural and evolutionary profiles, and these differences suggest which form, complex or monomeric, the subunit prefers under biological conditions. We believe that the (PS)2 server will be a useful tool for biologists who are interested not only in the structures of protein complexes but also in the coupling between subunits of the complexes. (PS)2 is freely available at http://ps2v3.life.nctu.edu.tw/.

  • Nucleic Acids Research
  • 10 years ago
  • Web Server Issue

CCTOP: a Consensus Constrained TOPology prediction web server

The Consensus Constrained TOPology prediction (CCTOP; http://cctop.enzim.ttk.mta.hu) server is a web-based application providing transmembrane topology prediction. In addition to utilizing 10 different state-of-the-art topology prediction methods, the CCTOP server incorporates topology information from existing experimental and computational sources available in the PDBTM, TOPDB and TOPDOM databases using the probabilistic framework of a hidden Markov model. The server provides the option to precede the topology prediction with signal peptide prediction and transmembrane-globular protein discrimination. The initial result can be recalculated by (de)selecting any of the prediction methods or mapped experiments or by adding user-specified constraints. CCTOP showed superior performance to existing approaches. The reliability of each prediction is also calculated, which correlates with the accuracy of the per-protein topology prediction. The prediction results and the collected experimental information are visualized on the CCTOP home page and can be downloaded in XML format. Programmatic access to the CCTOP server is also available, and an example client-side script is provided.

  • Nucleic Acids Research
  • 10 years ago
  • Web Server Issue

HMMER web server: 2015 update

The HMMER website, available at http://www.ebi.ac.uk/Tools/hmmer/, provides access to the protein homology search algorithms found in the HMMER software suite. Since the first release of the website in 2011, the search repertoire has been expanded to include the iterative search algorithm, jackhmmer. The continued growth of the target sequence databases means that traditional tabular representations of significant sequence hits can be overwhelming to the user. Consequently, additional ways of presenting homology search results have been developed, allowing them to be summarised according to taxonomic distribution or domain architecture. The taxonomy and domain architecture representations can be used in combination to filter the results according to the needs of a user. Searches can also be restricted prior to submission using a new taxonomic filter, which not only ensures that the results are specific to the requested taxonomic group, but also improves search performance. The repertoire of profile hidden Markov model libraries, which are used for annotation of query sequences with protein families and domains, has been expanded to include the libraries from CATH-Gene3D, PIRSF, Superfamily and TIGRFAMs. Finally, we discuss the relocation of the HMMER webserver to the European Bioinformatics Institute and the potential impact that this will have.

  • Nucleic Acids Research
  • 10 years ago
  • Web Server Issue

Cosmetics-triggered percutaneous remote control of transgene expression in mice

Synthetic biology has significantly advanced the rational design of trigger-inducible gene switches that program cellular behavior in a reliable and predictable manner. Capitalizing on genetic componentry, including the repressor PmeR and its cognate operator OPmeR, that has evolved in Pseudomonas syringae pathovar tomato DC3000 to sense and resist plant-defence metabolites of the paraben class, we have designed a set of inducible and repressible mammalian transcription-control devices that could dose-dependently fine-tune transgene expression in mammalian cells and mice in response to paraben derivatives. With a track record of over 60 years as licensed preservatives in the cosmetics industry, paraben derivatives have become a commonplace ingredient of most skin-care products including shower gels, cleansing toners and hand creams. As parabens can rapidly reach the bloodstream of mice following topical application, we used this feature to percutaneously program transgene expression of subcutaneous designer cell implants using off-the-shelf commercial paraben-containing skin-care cosmetics. The combination of non-invasive, transdermal and orthogonal trigger-inducible remote control of transgene expression may provide novel opportunities for dynamic interventions in future gene and cell-based therapies.

  • Nucleic Acids Research
  • 10 years ago
  • Methods Online

CABS-dock web server for the flexible docking of peptides to proteins without prior knowledge of the binding site

Protein–peptide interactions play a key role in cell functions. Their structural characterization, though challenging, is important for the discovery of new drugs. The CABS-dock web server provides an interface for modeling protein–peptide interactions using a highly efficient protocol for the flexible docking of peptides to proteins. While other docking algorithms require pre-defined localization of the binding site, CABS-dock does not require such knowledge. Given a protein receptor structure and a peptide sequence (and starting from random conformations and positions of the peptide), CABS-dock performs a simulation search for the binding site, allowing for full flexibility of the peptide and small fluctuations of the receptor backbone. This protocol was extensively tested over the largest dataset of non-redundant protein–peptide interactions available to date (including bound and unbound docking cases). For over 80% of bound and unbound dataset cases, we obtained models with high or medium accuracy (sufficient for practical applications). Additionally, as optional features, CABS-dock can exclude user-selected binding modes from the docking search or increase the level of flexibility for chosen receptor fragments. CABS-dock is freely available as a web server at http://biocomp.chem.uw.edu.pl/CABSdock.

  • Nucleic Acids Research
  • 10 years ago
  • Web Server Issue

FlyNet: a versatile network prioritization server for the Drosophila community

Drosophila melanogaster (fruit fly) has been a popular model organism in animal genetics due to the high accessibility of reverse-genetics tools. In addition, the close relationship between the Drosophila and human genomes rationalizes the use of Drosophila as an invertebrate model for human neurobiology and disease research. A platform technology for predicting candidate genes or functions would further enhance the usefulness of this long-established model organism for gene-to-phenotype mapping. Recently, the power of network prioritization for gene-to-phenotype mapping has been demonstrated in many organisms. Here we present a network prioritization server dedicated to Drosophila that covers ~95% of the coding genome. This server, dubbed FlyNet, has several distinctive features, including (i) prioritization for both genes and functions; (ii) two complementary network algorithms: direct neighborhood and network diffusion; (iii) spatiotemporal-specific networks as an additional prioritization strategy for traits associated with a specific developmental stage or tissue and (iv) prioritization for human disease genes. FlyNet is expected to serve as a versatile hypothesis-generation platform for genes and functions in the study of basic animal genetics, developmental biology and human disease. FlyNet is available for free at http://www.inetbio.org/flynet.
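
Of the two network algorithms named above, direct neighborhood is the simpler to illustrate. The abstract does not give FlyNet's scoring formula, so the Python sketch below uses one common formulation as an assumption: each candidate gene is scored by the summed weight of its edges to the seed genes. The gene names, edge weights, and function name are hypothetical.

```python
from collections import defaultdict


def direct_neighborhood_scores(edges, seeds):
    """Score each non-seed gene by the summed weight of its edges to seeds."""
    seeds = set(seeds)
    scores = defaultdict(float)
    for gene_a, gene_b, weight in edges:
        if gene_a in seeds and gene_b not in seeds:
            scores[gene_b] += weight
        elif gene_b in seeds and gene_a not in seeds:
            scores[gene_a] += weight
    # Highest score first; ties broken alphabetically for a stable order.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical weighted functional network (gene, gene, edge weight).
edges = [
    ("geneA", "geneX", 2.0),
    ("geneB", "geneX", 1.5),
    ("geneA", "geneY", 1.0),
    ("geneY", "geneZ", 3.0),
    ("geneA", "geneB", 4.0),
]
ranking = direct_neighborhood_scores(edges, seeds=["geneA", "geneB"])
```

Genes with no direct edge to a seed, such as `geneZ` here, receive no score under this scheme; reaching them is what a diffusion-based method is for.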

  • Nucleic Acids Research
  • 10 years ago
  • Web Server Issue

TFmiR: a web server for constructing and analyzing disease-specific transcription factor and miRNA co-regulatory networks

TFmiR is a freely available web server for deep and integrative analysis of combinatorial regulatory interactions between transcription factors, microRNAs and target genes that are involved in disease pathogenesis. Since the inner workings of cells rely on the correct functioning of an enormously complex system of activating and repressing interactions that can be perturbed in many ways, TFmiR helps to better elucidate cellular mechanisms at the molecular level from a network perspective. The provided topological and functional analyses promote TFmiR as a reliable systems biology tool for researchers across the life science communities. The TFmiR web server is accessible at http://service.bioinformatik.uni-saarland.de/tfmir.

  • Nucleic Acids Research
  • 10 years ago
  • Web Server Issue

edgeRun: an R package for sensitive, functionally relevant differential expression discovery using an unconditional exact test

Summary: Next-generation sequencing platforms for measuring digital expression such as RNA-Seq are displacing traditional microarray-based methods in biological experiments. The detection of differentially expressed genes between groups of biological conditions has led to the development of numerous bioinformatics tools, but so far, few exploit the expanded dynamic range afforded by the new technologies. We present edgeRun, an R package that implements an unconditional exact test that is a more powerful version of the exact test in edgeR. This increase in power is especially pronounced for experiments with as few as two replicates per condition, for genes with low total expression and with large biological coefficient of variation. In comparison with a panel of other tools, edgeRun consistently captures functionally similar differentially expressed genes.

Availability and implementation: The package is freely available under the MIT license from CRAN (http://cran.r-project.org/web/packages/edgeRun).

Contact: edimont@mail.harvard.edu

Supplementary information: Supplementary data are available at Bioinformatics online.

  • Bioinformatics
  • 10 years ago
  • APPLICATIONS NOTE

GREGOR: evaluating global enrichment of trait-associated variants in epigenomic features using a systematic, data-driven approach

Motivation: The majority of variation identified by genome wide association studies falls in non-coding genomic regions and is hypothesized to impact regulatory elements that modulate gene expression. Here we present a statistically rigorous software tool GREGOR (Genomic Regulatory Elements and Gwas Overlap algoRithm) for evaluating enrichment of any set of genetic variants with any set of regulatory features. Using variants from five phenotypes, we describe a data-driven approach to determine the tissue and cell types most relevant to a trait of interest and to identify the subset of regulatory features likely impacted by these variants. Last, we experimentally evaluate six predicted functional variants at six lipid-associated loci and demonstrate significant evidence for allele-specific impact on expression levels. GREGOR systematically evaluates enrichment of genetic variation with the vast collection of regulatory data available to explore novel biological mechanisms of disease and guide us toward the functional variant at trait-associated loci.

Availability and implementation: GREGOR, including source code, documentation, examples, and executables, is available at http://genome.sph.umich.edu/wiki/GREGOR.

Contact: cristen@umich.edu

Supplementary information: Supplementary data are available at Bioinformatics online.

  • Bioinformatics
  • 10 years ago
  • ORIGINAL PAPER

chipPCR: an R package to pre-process raw data of amplification curves

Motivation: Both the quantitative real-time polymerase chain reaction (qPCR) and quantitative isothermal amplification (qIA) are standard methods for nucleic acid quantification. Numerous real-time read-out technologies have been developed. Despite the continuous interest in amplification-based techniques, there are only a few tools for pre-processing of amplification data. However, a transparent tool for precise control of raw data is indispensable in several scenarios, for example, during the development of new instruments.

Results: chipPCR is an R package for the pre-processing and quality analysis of raw data of amplification curves. The package takes advantage of R’s S4 object model and offers an extensible environment. chipPCR contains tools for raw data exploration: normalization, baselining, imputation of missing values, a powerful wrapper for amplification curve smoothing and a function to detect the start and end of an amplification curve. The capabilities of the software are enhanced by the implementation of algorithms unavailable in R, such as a 5-point stencil for derivative interpolation. Simulation tools, statistical tests, plots for data quality management, amplification efficiency/quantification cycle calculation, and datasets from qPCR and qIA experiments are part of the package. Core functionalities are integrated in GUIs (web-based and standalone shiny applications), thus streamlining analysis and report generation.

Availability and implementation: http://cran.r-project.org/web/packages/chipPCR. Source code: https://github.com/michbur/chipPCR.

Contact: stefan.roediger@b-tu.de

Supplementary information: Supplementary data are available at Bioinformatics online.
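
The Results paragraph mentions a 5-point stencil for derivative interpolation. How chipPCR applies it is not described here, so the Python sketch below shows only the standard five-point finite-difference formula for a first derivative on evenly spaced data. The function name and the toy curve are assumptions for illustration and are not part of the chipPCR package.

```python
def five_point_derivative(y, h=1.0):
    """Approximate dy/dx at interior points with the five-point stencil.

    Returns a list as long as y. The two points at each end, where the
    stencil does not fit, are left as None.
    """
    n = len(y)
    dy = [None] * n
    for i in range(2, n - 2):
        dy[i] = (y[i - 2] - 8.0 * y[i - 1]
                 + 8.0 * y[i + 1] - y[i + 2]) / (12.0 * h)
    return dy


# Toy curve: signal = cycle**2, so the true slope at each cycle is 2 * cycle.
cycles = list(range(8))
signal = [float(c) ** 2 for c in cycles]
slope = five_point_derivative(signal, h=1.0)
```

The stencil is exact for polynomials up to degree four, which is why the toy quadratic reproduces the true slope at every interior cycle.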

  • Bioinformatics
  • 10 years ago
  • APPLICATIONS NOTE

Accurate identification of centromere locations in yeast genomes using Hi-C

Centromeres are essential for proper chromosome segregation. Despite extensive research, centromere locations in yeast genomes remain difficult to infer, and in most species they are still unknown. Recently, the chromatin conformation capture assay, Hi-C, has been re-purposed for diverse applications, including de novo genome assembly, deconvolution of metagenomic samples and inference of centromere locations. We describe a method, Centurion, that jointly infers the locations of all centromeres in a single genome from Hi-C data by exploiting the centromeres’ tendency to cluster in three-dimensional space. We first demonstrate the accuracy of Centurion in identifying known centromere locations from high coverage Hi-C data of budding yeast and a human malaria parasite. We then use Centurion to infer centromere locations in 14 yeast species. Across all microbes that we consider, Centurion predicts 89% of centromeres within 5 kb of their known locations. We also demonstrate the robustness of the approach in datasets with low sequencing depth. Finally, we predict centromere coordinates for six yeast species that currently lack centromere annotations. These results show that Centurion can be used for centromere identification for diverse species of yeast and possibly other microorganisms.

  • Nucleic Acids Research
  • 10 years ago
  • Computational Biology

MapMyFlu: visualizing spatio-temporal relationships between related influenza sequences

Understanding the molecular dynamics of viral spreading is crucial for anticipating the epidemiological implications of disease outbreaks. In the case of influenza, reassortments or point mutations affect the adaptation to new hosts or resistance to anti-viral drugs and can determine whether a new strain will result in a pandemic infection or a less severe progression. To this end, tools integrating molecular information with epidemiological parameters are important for understanding how molecular characteristics are reflected in the infection dynamics. We present a new web tool, MapMyFlu, which displays influenza viruses related to a query sequence in space and time on a Google Map, based on BLAST results against the NCBI Influenza Database. Temporal and geographical trends appear clearly and may help in reconstructing the evolutionary history of a particular sequence. The tool is accessible through a web server and therefore needs no local installation. The website has an intuitive design, provides an easy-to-use service, and is available at http://mapmyflu.ipmb.uni-heidelberg.de.

  • Nucleic Acids Research
  • 10 years ago
  • Web Server Issue

The telomerase essential N-terminal domain promotes DNA synthesis by stabilizing short RNA-DNA hybrids

Telomerase is an enzyme that adds repetitive DNA sequences to the ends of chromosomes and consists of two main subunits: the telomerase reverse transcriptase (TERT) protein and an associated telomerase RNA (TER). The telomerase essential N-terminal (TEN) domain is a conserved region of TERT proposed to mediate DNA substrate interactions. Here, we have employed single molecule telomerase binding assays to investigate the function of the TEN domain. Our results reveal telomeric DNA substrates bound to telomerase exhibit a dynamic equilibrium between two states: a docked conformation and an alternative conformation. The relative stabilities of the docked and alternative states correlate with the number of basepairs that can be formed between the DNA substrate and the RNA template, with more basepairing favoring the docked state. The docked state is further buttressed by the TEN domain and mutations within the TEN domain substantially alter the DNA substrate structural equilibrium. We propose a model in which the TEN domain stabilizes short RNA–DNA duplexes in the active site of the enzyme, promoting the docked state to augment telomerase processivity.

  • Nucleic Acids Research
  • 10 years ago
  • Nucleic Acid Enzymes