TY - GEN
T1 - Multi-task text segmentation and alignment based on weighted mutual information
AU - Sun, Bingjun
AU - Zhou, Ding
AU - Zha, Hongyuan
AU - Yen, John
PY - 2006
Y1 - 2006
AB - Text segmentation is important for text analysis, while text alignment determines shared sub-topics among similar documents. Multi-task text segmentation and alignment extends single-task segmentation to exploit information from multiple source documents. In this paper we introduce a novel domain-independent unsupervised method for multi-task segmentation and alignment, based on the idea that the optimal segmentation and alignment maximizes weighted mutual information, that is, mutual information with term weights. The experimental results show that our approach works well.
UR - http://www.scopus.com/inward/record.url?scp=34547617165&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=34547617165&partnerID=8YFLogxK
U2 - 10.1145/1183614.1183760
DO - 10.1145/1183614.1183760
M3 - Conference contribution
AN - SCOPUS:34547617165
SN - 1595934332
SN - 9781595934338
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 846
EP - 847
BT - Proceedings of the 15th ACM Conference on Information and Knowledge Management, CIKM 2006
T2 - 15th ACM Conference on Information and Knowledge Management, CIKM 2006
Y2 - 6 November 2006 through 11 November 2006
ER -