A computational framework to discover the content and location of long novel sequence insertions using paired-end sequencing data