Bioinformatics

KeBABS: an R package for kernel-based analysis of biological sequences

Palme, J., Hochreiter, S., Bodenhofer, U.

Summary: KeBABS provides a powerful, flexible and easy-to-use framework for kernel-based analysis of biological sequences in R. It includes efficient implementations of the most important sequence kernels, including variants that take sequence annotations and positional information into account. KeBABS seamlessly integrates three common support vector machine (SVM) implementations behind a unified interface. It supports hyperparameter selection by cross validation and nested cross validation, and also features grouped cross validation. The biological interpretation of SVM models is supported by (1) the computation of weights of sequence patterns and (2) prediction profiles that highlight the contributions of individual sequence positions or sections.
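
To illustrate what a sequence kernel computes, the following minimal Python sketch implements an unnormalized spectrum kernel, i.e. the dot product of the k-mer count vectors of two sequences. It is an illustration only and not the KeBABS implementation, which is written for R; the function names `kmer_counts` and `spectrum_kernel` are hypothetical and chosen for this example.

```python
from collections import Counter


def kmer_counts(seq, k):
    """Count all overlapping substrings of length k in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(s, t, k=3):
    """Unnormalized spectrum kernel: dot product of the k-mer count vectors.

    Illustrative sketch only; not the KeBABS implementation.
    """
    cs, ct = kmer_counts(s, k), kmer_counts(t, k)
    # Only k-mers present in both sequences contribute to the sum;
    # a Counter returns 0 for k-mers it has not seen.
    return sum(n * ct[kmer] for kmer, n in cs.items())


if __name__ == "__main__":
    print(spectrum_kernel("ACGTACGT", "ACGT", k=2))
```

In such a feature space, a linear SVM assigns one weight per k-mer, which is the kind of quantity that the pattern weights and prediction profiles mentioned above build on.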

Availability and implementation: The R package kebabs is available via the Bioconductor project: http://bioconductor.org/packages/release/bioc/html/kebabs.html. Further information and the R code of the example in this paper are available at http://www.bioinf.jku.at/software/kebabs/.

Contact: kebabs@bioinf.jku.at or bodenhofer@bioinf.jku.at