Solving the minimum string cover problem

Stefan Canzar, Tobias Marschall, Sven Rahmann, Chris Schwiegelshohn

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

A string cover C of a set of strings S is a set of substrings from S such that every string in S can be written as a concatenation of the strings in C. Given costs assigned to each substring from S, the MINIMUM STRING COVER (MSC) problem asks for a cover of minimum total cost. This NP-hard problem has so far only been approached from a purely theoretical perspective. A previous integer linear programming (ILP) formulation was designed for a special case, in which each string in S must be generated by a (small) constant number of substrings. If this restriction is removed, the ILP has an exponential number of variables, for which we show the pricing problem to be NP-hard. We propose an alternative flow-based ILP formulation of polynomial size, whose structure is particularly favorable for a Lagrangian relaxation approach. By making use of the strong bounds obtained through a repeated shortest path computation in a branch-and-bound manner, we show for the first time that non-trivial MSC instances can be solved to provable optimality in reasonable time. We also provide and solve real-world instances derived from the classic text "Alice in Wonderland". On almost all instances, our Lagrangian relaxation approach outperforms a CPLEX-based implementation by an order of magnitude. Our software is available under the terms of the GNU general public license.

Original languageEnglish (US)
Title of host publicationProceedings of the Meeting on Algorithm Engineering and Experiments, ALENEX 2012
PublisherSociety for Industrial and Applied Mathematics Publications
Pages75-83
Number of pages9
ISBN (Print)9781611972122
DOIs
StatePublished - 2012
Event14th Annual Meeting on Algorithm Engineering and Experiments, ALENEX 2012 - Kyoto, Japan
Duration: Jan 16 2012Jan 16 2012

Publication series

NameProceedings of the Workshop on Algorithm Engineering and Experiments
ISSN (Print)2164-0300

Conference

Conference14th Annual Meeting on Algorithm Engineering and Experiments, ALENEX 2012
Country/TerritoryJapan
CityKyoto
Period1/16/121/16/12

All Science Journal Classification (ASJC) codes

  • General Engineering
  • Applied Mathematics

Fingerprint

Dive into the research topics of 'Solving the minimum string cover problem'. Together they form a unique fingerprint.

Cite this