Controlling size when aligning multiple genomic sequences with duplications

Minmei Hou, Piotr Berman, Louxin Zhang, Webb Miller

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    8 Scopus citations


    For a genomic region containing a tandem gene cluster, a proper set of alignments needs to align only orthologous segments, i.e., those separated by a speciation event. Otherwise, methods for finding regions under evolutionary selection will not perform properly. Conversely, the alignments should indicate every orthologous pair of genes or genomic segments. Attaining this goal in practice requires a technique for avoiding a combinatorial explosion in the number of local alignments. To better understand this process, we model it as a graph problem of finding a minimum cardinality set of cliques that contain all edges. We provide an upper bound for an important class of graphs (the problem is 7 NP-hard and very difficult to approximate in the general case), and use the bound and computer simulations to evaluate two heuristic solutions. An implementation of one of them is evaluated on mammalian sequences from the α-globin gene cluster.

    Original languageEnglish (US)
    Title of host publicationAlgorithms in Bioinformatics - 6th International Workshop, WABI 2006, Proceedings
    PublisherSpringer Verlag
    Number of pages12
    ISBN (Print)3540395830, 9783540395836
    StatePublished - 2006
    Event6th International Workshop on Algorithms in Bioinformatics, WABI 2006 - Zurich, Switzerland
    Duration: Sep 11 2006Sep 13 2006

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume4175 LNBI
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349


    Other6th International Workshop on Algorithms in Bioinformatics, WABI 2006

    All Science Journal Classification (ASJC) codes

    • Theoretical Computer Science
    • General Computer Science


    Dive into the research topics of 'Controlling size when aligning multiple genomic sequences with duplications'. Together they form a unique fingerprint.

    Cite this