What are the two ways of representing data?
Numerical - measures of central tendency and dispersion
Bar charts
Pictorial - charts and graphs
Line graphs
What are the measures of central tendency?
Mean
Standard deviation
Mode
Median
Range
Inter-quartile range
What is the mean?
The arithmetic average that is calculated through adding all of the values and then dividing by how many there are.
The middle number after all numbers have been ordered
The most frequent number in a set
What are the advantages of the mean?
• Contributed to by all values
• Preferred when data is skewed
• Useful in statistics
• Used within continuous data
• Preferred when data is normally distributed
• Easiest to understand and calculate
What are the disadvantages of the mean?
• More difficult to understand
• Time consuming
• Affected by extreme values
• Not appropriate with ordinal and nominal level data
• Might not be an actual value in the set
What is the median
The sum of all values divided by the number of values
What are the advantages of the median?
• Easier to understand
• Unaffected by extreme values
• Used with ordinal and continuous data
• An actual value
What are the disadvantages of the median
What is the mode?
What are the advantages of the mode?
• Used with categorical data
What are the disadvantages of the mode?
• Sometimes there is more than one
What are the different measures of distribution or the spread of data?
Interquartile range
What is the range?
How much on average each score deviates from the mean score.
The difference between the largest and the smallest.
What are the advantages of the range?
• Easy to understand
• Related to the actual values in the set
• Usual used with ordinal and continuous data
What are the disadvantages of the range?
What is the inter-quartile range?
The difference between the upper quartile (75% data point) and the lower quartile (25% data point).
What are the advantages of the interquartile range?
• Used with continuous data
• Usually used with ordinal and continuous data
What are the disadvantages of the inter-quartile range?
• Unlikely to be a value in the set
What is standard deviation?
What are the advantages of standard deviation?
• Uses all the values in the set
What are the disadvantages of standard deviation?
What is a normal distribution?
This is normal because the mean, median and mode lies in the same place, the mean is the preferred method.
The mean is dragged in the direction of the skew so the median is the preferred measure (an easy test – ceiling effect).
What is a negatively skewed distribution?
Left
Right
What is a positively skewed distribution?
What is the preferred measure for a normal distribution?
What is the preferred method for a skewed distribution?
What are box and whisker plots?
They are helpful for indicating whether a distribution is skewed and if there any unusual observations (outliers) in the data set.
They are useful when large numbers of observations are involved or comparing two or more data sets.
What are the different ways of representing data?
Bar chart
Line graph
Scatter diagram
Pie chart
What are the levels of measurement
Nominal data
Continuous data
Ordinal data
Ratio data
Categorical data
Interval data
What is nominal data?
When quantities that are ordered y criteria such as positions in a rate. However it does not allow you to state whether the intervals between each value are equal.
categorically discrete data such as the name of your school or the car you drive.
This is like like ordinal data except the intervals between each value is equally split, such as the time of day on a clock.
What is ordinal data?
like ordinal data except the intervals between each value is equally split, such as the time of day on a clock.
interval data with a natural zero point (ruler in cm).
The median could be a measure of central tendency for this.
What is interval data?
this is like ordinal data except the intervals between each value is equally split, such as the time of day on a clock.
What is ratio data?
What is categorical data?
data which is divided into groups (types of animals). This data is nominal and discrete.
The mode would be the measure of central tendency as it cannot be used with arithmetic.
Representations could include a bar chart with gaps between the bars (grouped discrete data)
data restricted to a limited set of discrete values (shoes size and money)
What is continuous data?
data which can take any value within a range (height). The mean could be a measure of central tendency, this is because it would give you the arithmetic average for the large range of data which can then be used in statistics.
The median could be used if the data was skewed because this would be more representative of the data.
Representations could include a line graph or a histogram (grouped continuous)
What is discrete data?
data which is divided into groups (types of animals). This data is nominal and discrete. The mode would be the measure of central tendency as it cannot be used with arithmetic.
What types of data is the mean used for?
Continuous
What types of data is the median used for?
Skewed data
What types of data would be the mode be used for?