Standard Deviation and Standard Error of the Mean

The Standard Error of the Mean (SEM) quantifies the uncertainty in an estimate of the mean. The Standard Deviation (SD) indicates the dispersion of the data around the mean.

Distribution of Data

Studies are often done on samples of a population, and the data is summarised using descriptive statistics. These findings are then generalised to the larger unstudied population using inferential statistics.

Data follow a normal distribution when the values are distributed symmetrically around one central value in a bell-shaped curve, with most values close to the centre and progressively fewer further away. Approximate normality is an assumption of many parametric statistical analyses. This article discusses the use of the mean, standard deviation, and standard error of the mean.

Standard Deviation

Standard Deviation is the dispersion of individual observations about the mean. It indicates the variability of the data (the variance is the square of the SD).

The findings of a sample with a normal distribution can be described by the mean and SD. The sample mean is denoted by X̄ and is the centre of the distribution. The SD is the typical distance of an observation from the distribution centre. A low SD means less variability, while a high SD means more variability, with the data being more spread out.

In a normal distribution, approximately 68.3% of the observed values lie within one SD of the mean, approximately 95.4% lie within two SD of the mean, and approximately 99.7% lie within three SD of the mean.
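The coverage figures for a normal distribution can be checked numerically. The following is a minimal sketch using Python's standard library; the helper name coverage_within is an illustrative choice, not something taken from the cited sources.

```python
from statistics import NormalDist

# Standard normal distribution (mean 0, SD 1); any normal distribution
# gives the same coverage once distances are expressed in SD units.
std_normal = NormalDist(mu=0.0, sigma=1.0)


def coverage_within(k: float) -> float:
    """Fraction of a normal distribution lying within k SD of the mean."""
    return std_normal.cdf(k) - std_normal.cdf(-k)


for k in (1, 2, 3):
    print(f"within {k} SD: {coverage_within(k):.1%}")
# within 1 SD: 68.3%
# within 2 SD: 95.4%
# within 3 SD: 99.7%
```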

The sample results are used to make inferences regarding the population. The sample mean (X̄) estimates the true but unknown population mean (μ), while the sample SD (s) estimates the population SD (σ). Different samples may give different estimates of the population mean.
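As a small illustration of these estimates, the sketch below computes the sample mean and sample SD with Python's standard library. The data values are hypothetical and chosen only to keep the arithmetic easy to follow.

```python
from statistics import mean, stdev

# Hypothetical sample of eight measurements (illustrative values only).
sample = [4.0, 5.0, 5.0, 6.0, 7.0, 7.0, 8.0, 6.0]

x_bar = mean(sample)  # sample mean, the estimate of the population mean (mu)
s = stdev(sample)     # sample SD with an n - 1 denominator, the estimate of sigma

print(f"sample mean = {x_bar:.2f}, sample SD = {s:.2f}")
```

Note that statistics.stdev divides by n - 1, which is the usual choice when the SD of a sample is used to estimate the SD of the population; statistics.pstdev divides by n and would apply only if the data were the whole population.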

SD helps to visualise the effect size, i.e. the size of the difference between groups expressed in SD units. This helps to differentiate between statistical significance and clinical significance. Effect size is calculated as the difference between the group means divided by the pooled or average SD of the two groups. An effect size of 0.8 or more is a large effect, meaning that the two group means are separated by at least 0.8 SD. An effect size of 0.5 is moderate, and 0.2 is small.
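The effect size calculation described above can be sketched as follows, using the pooled SD as the denominator. The function name cohens_d and both groups of values are assumptions made for illustration; they do not come from the cited sources.

```python
from math import sqrt
from statistics import mean, variance


def cohens_d(group_a, group_b):
    """Difference between group means divided by the pooled SD."""
    n_a, n_b = len(group_a), len(group_b)
    # Pooled variance: sample variances weighted by their degrees of freedom.
    pooled_var = (
        (n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)
    ) / (n_a + n_b - 2)
    return (mean(group_a) - mean(group_b)) / sqrt(pooled_var)


# Hypothetical outcome scores for two groups (illustrative values only).
treatment = [6.0, 7.0, 8.0, 9.0, 10.0]
control = [4.0, 5.0, 6.0, 7.0, 8.0]

d = cohens_d(treatment, control)
print(f"effect size = {d:.2f}")
```

Here the means differ by 2 units and the pooled SD is about 1.58, so the groups are separated by roughly 1.26 SD, which would count as a large effect on the scale given above.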

Using SD as a measure of variability is not appropriate when the data are not normally distributed. In skewed data, or data with outliers, the SD can give an inflated measure of variability. Other measures of variability are more appropriate here, such as the mean absolute deviation and the interquartile range. The data can also be transformed with various techniques.
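A short sketch of how one extreme value affects these measures, using hypothetical data with a single large outlier. The quartiles come from statistics.quantiles with its default method; other quartile conventions would give slightly different numbers.

```python
from statistics import mean, quantiles, stdev

# Hypothetical skewed data: nine small values and one extreme value.
data = [1, 2, 2, 3, 3, 3, 4, 4, 5, 40]

s = stdev(data)

# Mean absolute deviation: average distance of each value from the mean.
centre = mean(data)
mean_abs_dev = mean(abs(x - centre) for x in data)

# Interquartile range: distance between the first and third quartiles.
q1, _median, q3 = quantiles(data, n=4)
iqr = q3 - q1

print(f"SD = {s:.2f}, mean absolute deviation = {mean_abs_dev:.2f}, IQR = {iqr:.2f}")
```

In this example the SD is pulled upwards by the single outlier far more than the interquartile range is, which is why the latter describes the spread of the bulk of the data better.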

Standard Error of Mean

The Standard Error of the Mean is the standard deviation of the means of random samples drawn from the original population. It indicates how close the sample mean is likely to be to the population mean.

If you take many different samples of the same size and plot the mean of each sample, you find that these sample means are themselves approximately normally distributed. The mean of these sample means is the mean of the original population. The standard deviation of these sample means is the SEM.
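The idea of repeated sampling can be illustrated with a small simulation. This is only a sketch: the population parameters, sample size, number of samples, and random seed are all arbitrary assumptions.

```python
import random
from statistics import mean, stdev

rng = random.Random(42)  # fixed seed so the sketch is repeatable

POP_MEAN, POP_SD = 100.0, 15.0  # hypothetical population parameters
SAMPLE_SIZE = 25
NUM_SAMPLES = 2000

# Draw many samples and record the mean of each one.
sample_means = []
for _ in range(NUM_SAMPLES):
    sample = [rng.gauss(POP_MEAN, POP_SD) for _ in range(SAMPLE_SIZE)]
    sample_means.append(mean(sample))

mean_of_means = mean(sample_means)  # should be close to POP_MEAN
sd_of_means = stdev(sample_means)   # this is the SEM, seen empirically
theoretical_sem = POP_SD / SAMPLE_SIZE ** 0.5  # 15 / 5 = 3.0

print(f"mean of sample means = {mean_of_means:.2f}")
print(f"SD of sample means   = {sd_of_means:.2f} (theory: {theoretical_sem:.2f})")
```

The SD of the simulated sample means should land close to the population SD divided by the square root of the sample size.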

SEM is a measure of the precision with which the sample mean (X̄) estimates the population mean (μ); it quantifies the uncertainty in the estimate of the mean. The precision increases with increasing sample size.

In reality, only one sample is usually drawn from the population. The SEM is therefore estimated from the sample SD (s) and the sample size (n): estimated SEM = s / √n.
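Estimating the SEM from a single sample amounts to dividing the sample SD by the square root of the sample size. A minimal sketch, with a hypothetical function name and hypothetical data:

```python
from math import sqrt
from statistics import stdev


def estimated_sem(sample):
    """Estimated SEM: sample SD divided by the square root of the sample size."""
    return stdev(sample) / sqrt(len(sample))


# Hypothetical sample of nine measurements (illustrative values only).
sample = [4.0, 5.0, 5.0, 6.0, 6.0, 6.0, 7.0, 7.0, 8.0]

sem = estimated_sem(sample)
print(f"n = {len(sample)}, SD = {stdev(sample):.3f}, estimated SEM = {sem:.3f}")
```

With nine observations the SD is divided by 3, so the estimated SEM is one third of the sample SD.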

The main purpose of SEM is to construct confidence intervals (CI) of the mean. A CI is the range of values believed to contain the true population value. This value usually isn't known but can be estimated from an appropriate sample. Wider CIs mean less precision; narrower CIs mean greater precision. SEM should not be used for purposes other than calculating confidence intervals.

95% confidence interval of the mean = X̄ ± 1.96 × SEM
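Applying this formula to a single sample might look like the sketch below. The function name and data are hypothetical, and the 1.96 multiplier is taken directly from the formula above; it relies on the normal approximation, so for small samples the interval it gives is likely to be somewhat too narrow.

```python
from math import sqrt
from statistics import mean, stdev


def ci95_of_mean(sample):
    """Return (lower, upper) bounds of the mean +/- 1.96 x estimated SEM."""
    x_bar = mean(sample)
    sem = stdev(sample) / sqrt(len(sample))  # estimated SEM = s / sqrt(n)
    margin = 1.96 * sem
    return x_bar - margin, x_bar + margin


# Hypothetical sample of nine measurements (illustrative values only).
sample = [4.0, 5.0, 5.0, 6.0, 6.0, 6.0, 7.0, 7.0, 8.0]

lower, upper = ci95_of_mean(sample)
print(f"mean = {mean(sample):.2f}, 95% CI = ({lower:.2f}, {upper:.2f})")
```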

Comparison

SD estimates the variability of observations.
SEM estimates the variability of possible values of means of samples.
SEM is smaller than SD, because the means of samples vary less than the individual observations do.

Many published articles misuse these terms by using SEM instead of SD to express the variability of data, because it is typically a smaller number and so "looks better". The SEM can also be made smaller simply by increasing the sample size, because as more observations are included in the sample, the sample mean tends to move closer to the population mean.
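The effect of sample size on the two measures can be seen in a short simulation. This is a sketch under assumed values: the population mean of 100, SD of 15, the sample sizes, and the seed are all arbitrary.

```python
import random
from math import sqrt
from statistics import stdev

rng = random.Random(0)  # fixed seed so the sketch is repeatable


def sd_and_sem(n):
    """Draw one sample of size n from a hypothetical normal population."""
    sample = [rng.gauss(100.0, 15.0) for _ in range(n)]
    sd = stdev(sample)
    return sd, sd / sqrt(n)


results = {n: sd_and_sem(n) for n in (10, 100, 1000)}
for n, (sd, sem) in results.items():
    print(f"n = {n:>4}   SD = {sd:5.1f}   SEM = {sem:5.2f}")
```

The SD stays in the neighbourhood of the population value at every sample size, while the SEM keeps shrinking as n grows, which is why a small SEM says little about how variable the individual observations are.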

Readers are usually more interested in knowing the variability within the sample, and not the proximity of the mean to the population mean. Therefore data should be summarised with SD and not SEM.

Readers can interpret reported SD values knowing that, in a normal distribution, about 95% of observations fall within 2 standard deviations of the mean.

The mean ± 2 SEM contains about 95% of the means of samples, while the mean ± 2 SD contains about 95% of the observations on individuals.

Readers can also use SD values to determine effect sizes. SEM values cannot help with this.

Resources

Calibrating Effect Sizes - Bogduk 2022.pdf

See the open access articles by Lee and colleagues^[1] and Barde and Barde.^[2]

References

[1] Lee DK, In J, Lee S. Standard deviation and standard error of the mean. Korean J Anesthesiol. 2015 Jun;68(3):220-3. doi: 10.4097/kjae.2015.68.3.220. Epub 2015 May 28. PMID: 26045923; PMCID: PMC4452664.
[2] Barde MP, Barde PJ. What to use to express the variability of data: Standard deviation or standard error of mean? Perspect Clin Res. 2012 Jul;3(3):113-6. doi: 10.4103/2229-3485.100662. PMID: 23125963; PMCID: PMC3487226.