Sunday, July 10, 2011

Population Profile





Population profiles are charts that show the number of people as a function of their ages. They can also contain other variables such as gender, race and ethnicity. The above population profile is a profile of the persons within the District of Columbia. The vertical column represents ages and the horizontal row represents the population of a certain age. Also, the chart is comprised of two sides: male and female. This profile shows the number of males and females within the District of Columbia in relation to their ages in a bar graph.

Histogram




Histograms are summary graphs that shows a count of data points that fall in various ranges. It displays a rough approximation of the frequency of distribution of the data. The frequencies are displayed as adjacent rectangles, built on specific intervals, in which the area corresponds to the frequency of the observations in the interval. The histogram above measures the frequency of a range of heights in a 25 student statistical representation. The frequency is the number of students that fell into a specific height class. This histogram is showing the frequency, number of occurrences, of certain ranges of heights between 25 students.

Box Plot



Box plots is a method of vonviently depicting groups of numerical data graphically and they are non-parametric. The tops of the boxes represent the 75th percentile or upper quartile and the bottom represents the 25th percentile or lower quartile. The data is divided into five summaries within the plots: smallest observation, lower quartile, median, upper quartile, and largest observation. The horizontal line within the box represents the median value. In this box plot the mileages of car travel is displayed according to the corresponding country. 

Similarity Matrix



Similarity matrices measure the pairwise similarities of objects or data. Here, the larger the assigned value given to the measured data, the greater the similarity. Low values indicate a greater dissimilarity. This type of matrix is used in sequence alignment. Here, an example of how a similarity matrix would be displayed is given. The example is not actually measuring anything, it is just an example. However, if we were comparing the similarities between Microsoft Office 2007 and 2010 they would have a close similarity giving it a larger value. Whereas if we compared Macintosh word processors to that of Microsoft's they would have dissimilar values.

Correlation Matrix




A correlation matrix is a graphical data analysis approach that shows the correlations between all pairs of data sets and is used when a variety of variables is being measured. Correlation matrices shows the computed or measured correlation between intersecting columns and rows. The above matrix shows asset returns from investments for a period of business days 

Star Plot




Star plots are a form of graphical data analysis which examins the behavior of all variables in a multivariate set. Each "star" displayed can represent a single measured observation. The spokes in star plots represents a unique variable in the data set.  Where the data points meet on the spoke determine to measured data's value. They use a specified subset of data to examine the behavior of variables. The above example compares different car brands and models with specified variables. These range from price to length. The spokes are numbered correlating to a specific data set, ie, "1" is the price and "9" is the length. The longer the intersection point on the spoke the more expensive the vehicle is when looking at spoke "1" and longer if the length of intersection on spoke "9" is greater. This star plot is comparing 7 different characteristics between various makes and models of automobiles.