TY - GEN
T1 - Exploring high-D spaces with Multiform matrices and small multiples
AU - MacEachren, Alan
AU - Dai, Xiping
AU - Hardisty, Frank
AU - Guo, Diansheng
AU - Lengerich, Gene
PY - 2003
Y1 - 2003
N2 - We introduce an approach to visual analysis of multivariate data that integrates several methods from information visualization, exploratory data analysis (EDA), and geovisualization. The approach leverages the component-based architecture implemented in GeoVISTA Studio to construct a flexible, multiview, tightly (but generically) coordinated, EDA toolkit. This toolkit builds upon traditional ideas behind both small multiples and scatterplot matrices in three fundamental ways. First, we develop a general, MultiForm, Bivariate Matrix and a complementary MultiForm, Bivariate Small Multiple plot in which different bivariate representation forms can be used in combination. We demonstrate the flexibility of this approach with matrices and small multiples that depict multivariate data through combinations of: scatterplots, bivariate maps, and space-filling displays. Second, we apply a measure of conditional entropy to (a) identify variables from a high-dimensional data set that are likely to display interesting relationships and (b) generate a default order of these variables in the matrix or small multiple display. Third, we add conditioning, a kind of dynamic query/filtering in which supplementary (un-displayed) variables are used to constrain the view onto variables that are displayed. Conditioning allows the effects of one or more well understood variables to be removed from the analysis, making relationships among remaining variables easier to explore. We illustrate the individual and combined functionality enabled by this approach through application to analysis of cancer diagnosis and mortality data and their associated covariates and risk factors.
AB - We introduce an approach to visual analysis of multivariate data that integrates several methods from information visualization, exploratory data analysis (EDA), and geovisualization. The approach leverages the component-based architecture implemented in GeoVISTA Studio to construct a flexible, multiview, tightly (but generically) coordinated, EDA toolkit. This toolkit builds upon traditional ideas behind both small multiples and scatterplot matrices in three fundamental ways. First, we develop a general, MultiForm, Bivariate Matrix and a complementary MultiForm, Bivariate Small Multiple plot in which different bivariate representation forms can be used in combination. We demonstrate the flexibility of this approach with matrices and small multiples that depict multivariate data through combinations of: scatterplots, bivariate maps, and space-filling displays. Second, we apply a measure of conditional entropy to (a) identify variables from a high-dimensional data set that are likely to display interesting relationships and (b) generate a default order of these variables in the matrix or small multiple display. Third, we add conditioning, a kind of dynamic query/filtering in which supplementary (un-displayed) variables are used to constrain the view onto variables that are displayed. Conditioning allows the effects of one or more well understood variables to be removed from the analysis, making relationships among remaining variables easier to explore. We illustrate the individual and combined functionality enabled by this approach through application to analysis of cancer diagnosis and mortality data and their associated covariates and risk factors.
UR - http://www.scopus.com/inward/record.url?scp=78649532416&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78649532416&partnerID=8YFLogxK
U2 - 10.1109/INFVIS.2003.1249006
DO - 10.1109/INFVIS.2003.1249006
M3 - Conference contribution
C2 - 21947129
AN - SCOPUS:78649532416
SN - 0780381548
SN - 9780780381544
T3 - Proceedings - IEEE Symposium on Information Visualization, INFO VIS
SP - 31
EP - 40
BT - IEEE Symposium on Information Visualization 2003, InfoVis 2003
T2 - 9th Annual IEEE Symposium on Information Visualization, InfoVis 2003
Y2 - 19 October 2003 through 21 October 2003
ER -