Linear-time algorithm for long LCF with κ mismatches

Panagiotis Charalampopoulos, Maxime Crochemore, Costas S. Iliopoulos, Tomasz Kociumaka, Solon P. Pissis, Jakub Radoszewski, Wojciech Rytter, Tomasz Waleń

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

In the Longest Common Factor with κ Mismatches (LCFκ) problem, we are given two strings X and Y of total length n, and we are asked to find a pair of maximal-length factors, one of X and the other of Y , such that their Hamming distance is at most κ. Thankachan et al. [27] show that this problem can be solved in O(n logκ n) time and O(n) space for constant k. We consider the LCFκ (ℓ) problem in which we assume that the sought factors have length at least ℓ. We use difference covers to reduce the LCFκ (ℓ) problem with ℓ = Ω(log2κ+2 n) to a task involving m = O(n/logκ+1 n) synchronized factors. The latter can be solved in O(mlogκ+1 m) time, which results in a linear-time algorithm for LCFκ (ℓ) with ℓ = Ω(log2κ+2 n). In general, our solution to the LCFκ(ℓ) problem for arbitrary ℓ takes O(n + n logκ+1 n/√ℓ) time.

Original languageEnglish
Title of host publication29th Annual Symposium on Combinatorial Pattern Matching, CPM 2018
EditorsBinhai Zhu, Gonzalo Navarro, David Sankoff
PublisherSchloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
Pages231-2316
Number of pages2086
ISBN (Electronic)9783959770743
DOIs
Publication statusPublished - 1 May 2018
Externally publishedYes
Event29th Annual Symposium on Combinatorial Pattern Matching, CPM 2018 - Qingdao, China
Duration: 2 Jul 20184 Jul 2018

Publication series

NameLeibniz International Proceedings in Informatics, LIPIcs
Volume105
ISSN (Print)1868-8969

Conference

Conference29th Annual Symposium on Combinatorial Pattern Matching, CPM 2018
Country/TerritoryChina
CityQingdao
Period2/07/184/07/18

Funding

Funding Jakub Radoszewski is supported by the “Algorithms for text processing with errors and uncertainties” project carried out within the HOMING programme of the Foundation for Polish Science co-financed by the European Union under the European Regional Development Fund. Wojciech Rytter and Tomasz Waleń are supported by the Polish National Science Center, grant no. 2014/13/B/ST6/00770.

FundersFunder number
Polish National Science Center2014/13/B/ST6/00770
European Chiropractors' Union
European Commission
Fundacja na rzecz Nauki Polskiej
European Regional Development Fund

    Keywords

    • Difference cover
    • Hamming distance
    • Heavy-light decomposition
    • Longest common factor
    • Longest common substring

    Fingerprint

    Dive into the research topics of 'Linear-time algorithm for long LCF with κ mismatches'. Together they form a unique fingerprint.

    Cite this