|
|
| (225 intermediate revisions by the same user not shown) |
| Line 1: |
Line 1: |
| − | ====Standard Deviation====
| |
| − | https://statistics.laerd.com/statistical-guides/measures-of-spread-standard-deviation.php
| |
| | | | |
| − | The standard deviation is a measure of the spread of scores within a set of data. Usually, we are interested in the standard deviation of a population. However, as we are often presented with data from a sample only, we can estimate the population standard deviation from a sample standard deviation. These two standard deviations - sample and population standard deviations - are calculated differently. In statistics, we are usually presented with having to calculate sample standard deviations, and so this is what this article will focus on, although the formula for a population standard deviation will also be shown.
| |
| − |
| |
| − |
| |
| − | <br />
| |
| − | : '''Population standard deviation''' (<math>\sigma</math>)
| |
| − | <blockquote>
| |
| − | <math>\sigma = \sqrt{\frac{\sum_{i=1}^{N}(x_{i} - \mu)^2}{N}}</math>
| |
| − |
| |
| − | <math>\mu: \text{population mean};\ \ \ N: \text{Number of scores in the population}</math>
| |
| − | </blockquote>
| |
| − |
| |
| − |
| |
| − | <br />
| |
| − | : '''Sample standard deviation formula''' (<math>s</math>)
| |
| − | <blockquote>
| |
| − | Sometimes our data is only a sample of the whole population. In this case, we can still estimate the Standard deviation; but when we use a sample as an estimate of the whole population, the Standard deviation formula changes to this:
| |
| − |
| |
| − | <math>s = \sqrt{\frac{\sum_{i=1}^{n}(x_{i} - \bar{x})^2}{n -1}}</math>
| |
| − |
| |
| − |
| |
| − | <math>\bar{x}: \text{Sample mean};\ \ \ n: \text{Number of scores in the sample}</math>
| |
| − | </blockquote>
| |
| − |
| |
| − |
| |
| − | <br />
| |
| − | *The Standard Deviation is the square root of the variance.
| |
| − | *This measure is the most widely used to express deviation from the mean in a variable.
| |
| − | *The higher the value the more widely distributed are the variable data values around the mean.
| |
| − | *Assuming the frequency distributions approximately normal, about 68% of all observations are within +/- 1 standard deviation.
| |
| − | *Approximately 95% of all observations fall within two standard deviations of the mean (if data is normally distributed).
| |
| − |
| |
| − |
| |
| − | <br />
| |