TY - GEN
T1 - Approximate Trace Reconstruction via Median String (In Average-Case)
AU - Chakraborty, Diptarka
AU - Das, Debarati
AU - Krauthgamer, Robert
N1 - Publisher Copyright:
© Diptarka Chakraborty, Debarati Das, and Robert Krauthgamer.
PY - 2021/12/1
Y1 - 2021/12/1
N2 - We consider an approximate version of the trace reconstruction problem, where the goal is to recover an unknown string s ∈ {0, 1}n from m traces (each trace is generated independently by passing s through a probabilistic insertion-deletion channel with rate p). We present a deterministic near-linear time algorithm for the average-case model, where s is random, that uses only three traces. It runs in near-linear time Õ(n) and with high probability reports a string within edit distance Õ(p2n) from s, which significantly improves over the straightforward bound of O(pn). Technically, our algorithm computes a (1 + ϵ)-approximate median of the three input traces. To prove its correctness, our probabilistic analysis shows that an approximate median is indeed close to the unknown s. To achieve a near-linear time bound, we have to bypass the well-known dynamic programming algorithm that computes an optimal median in time O(n3).
AB - We consider an approximate version of the trace reconstruction problem, where the goal is to recover an unknown string s ∈ {0, 1}n from m traces (each trace is generated independently by passing s through a probabilistic insertion-deletion channel with rate p). We present a deterministic near-linear time algorithm for the average-case model, where s is random, that uses only three traces. It runs in near-linear time Õ(n) and with high probability reports a string within edit distance Õ(p2n) from s, which significantly improves over the straightforward bound of O(pn). Technically, our algorithm computes a (1 + ϵ)-approximate median of the three input traces. To prove its correctness, our probabilistic analysis shows that an approximate median is indeed close to the unknown s. To achieve a near-linear time bound, we have to bypass the well-known dynamic programming algorithm that computes an optimal median in time O(n3).
UR - http://www.scopus.com/inward/record.url?scp=85122467233&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85122467233&partnerID=8YFLogxK
U2 - 10.4230/LIPIcs.FSTTCS.2021.11
DO - 10.4230/LIPIcs.FSTTCS.2021.11
M3 - Conference contribution
AN - SCOPUS:85122467233
T3 - Leibniz International Proceedings in Informatics, LIPIcs
BT - 41st IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science, FSTTCS 2021
A2 - Bojanczyk, Mikolaj
A2 - Chekuri, Chandra
PB - Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
T2 - 41st IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science, FSTTCS 2021
Y2 - 15 December 2021 through 17 December 2021
ER -