Shift Finding in Sub-linear Time

We study the following basic pattern matching problem. Consider a "code" sequence c consisting of n bits chosen uniformly at random, and a "signal" sequence x obtained by shifting c (modulo n) and adding noise. The goal is to efficiently recover the shift with high probability. T...

Full description

Bibliographic Details
Main Authors: Andoni, Alexandr (Author), Hassanieh, Haitham (Contributor), Indyk, Piotr (Contributor), Katabi, Dina (Contributor)
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory (Contributor), Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science (Contributor)
Format: Article
Language:English
Published: Society for Industrial and Applied Mathematics, 2014-05-15T14:02:22Z.
Subjects:
Online Access:Get fulltext
LEADER 02404 am a22002893u 4500
001 86973
042 |a dc 
100 1 0 |a Andoni, Alexandr  |e author 
100 1 0 |a Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory  |e contributor 
100 1 0 |a Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science  |e contributor 
100 1 0 |a Hassanieh, Haitham  |e contributor 
100 1 0 |a Indyk, Piotr  |e contributor 
100 1 0 |a Katabi, Dina  |e contributor 
700 1 0 |a Hassanieh, Haitham  |e author 
700 1 0 |a Indyk, Piotr  |e author 
700 1 0 |a Katabi, Dina  |e author 
245 0 0 |a Shift Finding in Sub-linear Time 
260 |b Society for Industrial and Applied Mathematics,   |c 2014-05-15T14:02:22Z. 
856 |z Get fulltext  |u http://hdl.handle.net/1721.1/86973 
520 |a We study the following basic pattern matching problem. Consider a "code" sequence c consisting of n bits chosen uniformly at random, and a "signal" sequence x obtained by shifting c (modulo n) and adding noise. The goal is to efficiently recover the shift with high probability. The problem models tasks of interest in several applications, including GPS synchronization and motion estimation. We present an algorithm that solves the problem in time Õ(n[superscript (f/(1+f)]), where Õ(N[superscript f]) is the running time of the best algorithm for finding the closest pair among N "random" sequences of length O(log N). A trivial bound of f = 2 leads to a simple algorithm with a running time of Õ(n[superscript 2/3]). The asymptotic running time can be further improved by plugging in recent more efficient algorithms for the closest pair problem. Our results also yield a sub-linear time algorithm for approximate pattern matching algorithm for a random signal (text), even for the case when the error between the signal and the code (pattern) is asymptotically as large as the code size. This is the first sublinear time algorithm for such error rates. 
520 |a David & Lucile Packard Foundation 
520 |a National Science Foundation (U.S.) (NSF grant CCF 1065125) 
520 |a National Science Foundation (U.S.) (NSF grant CCF 1012042) 
520 |a National Science Foundation (U.S.) (NSF grant CNS 0831664) 
546 |a en_US 
655 7 |a Article 
773 |t Proceedings of the Twenty-Fourth Annual ACM-SIAM Symposium on Discrete Algorithms