TY - GEN
T1 - A new approach for tree alignment based on local re-optimization
AU - Yue, Feng
AU - Tang, Jijun
PY - 2008
Y1 - 2008
N2 - Multiple sequence alignment is the most fundamental task in bioinformatics and computational biology. In this paper, we present a new algorithm to conduct multiple sequences alignment based on phylogenetic trees. It is widely accepted that a good phylogenetic tree can help produce high quality alignment, but the direct dynamic programming solution grows exponentially [13]. To overcome this problem, we first devise a procedure that can produce optimal alignment for three sequences and infer their common ancestor. We then extend the above procedure to compute the alignment of a given tree with more sequences by iteratively relabeling the internal nodes. We have implemented our algorithm as a C program package, which can handle both DNA and protein data and can take simple cost model as well as complex substitution matrices, such as PAM or BLOSUM series. We test our new method with biological and simulated datasets, and compare its performance with those of other popular multiple sequence alignment tools, including the widely used programs such as Clustalw and T-Coffee.
AB - Multiple sequence alignment is the most fundamental task in bioinformatics and computational biology. In this paper, we present a new algorithm to conduct multiple sequences alignment based on phylogenetic trees. It is widely accepted that a good phylogenetic tree can help produce high quality alignment, but the direct dynamic programming solution grows exponentially [13]. To overcome this problem, we first devise a procedure that can produce optimal alignment for three sequences and infer their common ancestor. We then extend the above procedure to compute the alignment of a given tree with more sequences by iteratively relabeling the internal nodes. We have implemented our algorithm as a C program package, which can handle both DNA and protein data and can take simple cost model as well as complex substitution matrices, such as PAM or BLOSUM series. We test our new method with biological and simulated datasets, and compare its performance with those of other popular multiple sequence alignment tools, including the widely used programs such as Clustalw and T-Coffee.
UR - http://www.scopus.com/inward/record.url?scp=51549094759&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=51549094759&partnerID=8YFLogxK
U2 - 10.1109/BMEI.2008.290
DO - 10.1109/BMEI.2008.290
M3 - Conference contribution
AN - SCOPUS:51549094759
SN - 9780769531182
T3 - BioMedical Engineering and Informatics: New Development and the Future - Proceedings of the 1st International Conference on BioMedical Engineering and Informatics, BMEI 2008
SP - 34
EP - 38
BT - BioMedical Engineering and Informatics
T2 - BioMedical Engineering and Informatics: New Development and the Future - 1st International Conference on BioMedical Engineering and Informatics, BMEI 2008
Y2 - 27 May 2008 through 30 May 2008
ER -