Bioinformatics

atSNP: transcription factor binding affinity testing for regulatory SNP detection

Zuo, C., Shin, S., Keles, S..

Motivation: Genome-wide association studies revealed that most disease-associated single nucleotide polymoprhisms (SNPs) are located in regulatory regions within introns or in regions between genes. Regulatory SNPs (rSNPs) are such SNPs that affect gene regulation by changing transcription factor (TF) binding affinities to genomic sequences. Identifying potential rSNPs is crucial for understanding disease mechanisms. In silico methods that evaluate the impact of SNPs on TF binding affinities are not scalable for large-scale analysis.

Results: We describe atSNP (affinity testing for regulatory SNPs), a computationally efficient R package for identifying rSNPs in silico. atSNP implements an importance sampling algorithm coupled with a first-order Markov model for the background nucleotide sequences to test the significance of affinity scores and SNP-driven changes in these scores. Application of atSNP with >20K SNPs indicates that atSNP is the only available tool for such a large-scale task. atSNP provides user-friendly output in the form of both tables and composite logo plots for visualizing SNP-motif interactions. Evaluations of atSNP with known rSNP-TF interactions indicates that rSNP is able to prioritize motifs for a given set of SNPs with high accuracy.

Availability: http://github.com/chandlerzuo/atsnp.

Contact: keles@stat.wisc.edu

Supplementary information: Supplementary data are available at Bioinformatics online.