Sunday, November 30, 2008

Star Plots


http://start1.jpl.nasa.gov/images/MERStarPlot.gif
"In this star plot, the center represents the most desirable results. The red line represents the handcrafted MER IDD, and is used for comparison and validation of the approach."
http://start1.jpl.nasa.gov/caseStudies/autoTool.cfm
Star plots use a design with "spines" radiating from the center, with one spine for each variable.
This star plot is from the NASA studies related to automated engineering design studies for the Mars Exploration Rover (MER) Instrument Deployment Device (IDD). Looking at the "Mass" spine, Design 1 was the least desireable for that variable. Design 1 fared better for Link Deflection.

Correlation Matrix


A correlation matrix such as the above provides the statistical correlation values for variables down a row and across a column. The diagonal down the middle will be the correlation of 1, since that is the same variable on each side. The two triangles formed by this red correlation line are mirrors of one another. The user is able to compare correlation values between the variable in the squares that are not on the diagonal.
This correlation matrix is from a study on gene expression patterns and potentially developing a model that relates to humans and cancer. "From Genome to Phenome:Here is Looking at You and Your Cancers", Antonio Reverter, Wes Barris, Sean McWilliam, Greg Harper and Brian Dalrymple. It is a "Correlation matrix for a subset of the top 100 cancer genes. Thick lines indicate blocks: A for extracellular matrix; B for nucleus and cell progression; C for actin cytoskeleton; D for fatty acid metabolism and E for glutamine/glutathioine/oxidative. stress (png file)." The grouping of the seven extracellular matrix genes in the top left corner all have a high degree of positive correlation to one another, and four of the nucleus & cell progression genes do.

Similarity Matrix


http://www.fxpal.com/systems/MediaAnalysis/sim10000.gif

This similarity matrix illustrates photo similarity in a study related to organization of digital photo collections.
The authors of the paper "Temporal Even Clustering for Digital Photo Collections" were attempting to address the growing use of digital photography and develop methods for developing user friendly software for organization and retrieval of photos. This matrix is from their paper. (Matthew Cooper, Jonathan Foote, Andreas Girgensohn, and Lynn Wilcox). http://www.fxpal.com/publications/FXPAL-PR-03-215.pdf
"The similarity matrix above visualizes the temporal similarity of a collection of 512 photos. To compute the matrix, K = 10000. For each matrix, a photo-indexed novelty score is computed by correlating a checkerboard kernel along the main diagonal of the similarity matrix as described in this presentation. Starting with the coarsest scale (largest K), peaks in this score are detected and a hierarchical set of event boundaries is generated. A confidence measure trading off intra-event similarity and inter-event dissimilarity is used to select a single level of the hierarchy for presentation to the user. The similarity matrix above visualizes the temporal similarity of a collection of 512 photos. To compute the matrix, K = 10000. " http://www.fxpal.com/?p=eventDetector


Stem and Leaf Plot


Stem and leaf plots are somewhat similar to histograms: they illustrate the frequency of occurance of data in a range. However, stem and leaf plots show the exact numerics for the range, and would be like a "histogram-on-it's side". They can be used to see a distribution pattern in the data.
The "stem" is the first digit (or "tens" digit in this case) of the number, and the leaf is the last digit (or the "ones" digit in this case). The ages for this group are 1,8,9,32,34,37,45,51,55,and 81.