Mapping uniquely occurring short sequences derived from high throughput technologies to a reference genome

Pavlos Antoniou*, Jackie W. Daykin, Costas S. Iliopoulos, Derrick Kourie, Laurent Mouchard, Solon P. Pissis

*Corresponding author for this work

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

Novel high throughput sequencing technology methods have redefined the way genome sequencing is performed. They are able to produce tens of millions of short sequences (reads) in a single experiment and with a much lower cost than previous sequencing methods. Due to this massive amount of data generated by the above systems, efficient algorithms for mapping short sequences to a reference genome are in great demand. In this paper, we present a practical algorithm for addressing the problem of efficiently mapping uniquely occuring short reads to a reference genome. This requires the classification of these short reads into unique and duplicate matches. In particular, we define and solve the Massive Exact Unique Pattern Matching problem in genomes.

Original languageEnglish
Title of host publicationFinal Program and Abstract Book - 9th International Conference on Information Technology and Applications in Biomedicine, ITAB 2009
DOIs
Publication statusPublished - 1 Dec 2009
Externally publishedYes
Event9th International Conference on Information Technology and Applications in Biomedicine, ITAB 2009 - Larnaca, Cyprus
Duration: 4 Nov 20097 Nov 2009

Publication series

NameFinal Program and Abstract Book - 9th International Conference on Information Technology and Applications in Biomedicine, ITAB 2009

Conference

Conference9th International Conference on Information Technology and Applications in Biomedicine, ITAB 2009
Country/TerritoryCyprus
CityLarnaca
Period4/11/097/11/09

Keywords

  • High throughput
  • Mapping
  • Pattern matching
  • Sequencing
  • Short reads

Fingerprint

Dive into the research topics of 'Mapping uniquely occurring short sequences derived from high throughput technologies to a reference genome'. Together they form a unique fingerprint.

Cite this