Saturday, June 24, 2017 - 09:00 to 15:45
Overlap detection for a genome assembly based on genomic signal processing
Although the genome sequences of most studied organisms, like human, E. coli, and others are already known, de novo genome sequencing remains popular as a majority of genomes remains unknown. Unfortunately, sequencing machines are able to read only short fragments of DNA. Therefore, one of the basic steps in reconstructing novel genomes lies in putting these pieces of DNA, called ‘reads’, together into complete genome sequences using a process known as genome assembly. Reads joining, however, requires efficient detection of their overlaps. This is commonly performed by comparing the particular characters (A, C, G, T) of the reads using string processing techniques. In this paper, we present an alternative way of detecting overlaps using genomic signal processing. Unlike string comparison, numerical phase signals reflect the complementarity of double stranded DNA making the signal ideal for effective strand independent overlap detection using covariance with high accuracy.
Robin Jugas's picture
Robin Jugas
Brno University of Technology (CZ)
Karel Sedlar's picture
Karel Sedlar
Helena Skutkova's picture
Helena Skutkova
Martin Vitek's picture
Martin Vitek