Pseudoreplication is the process of artificially inflating the number of samples or replicates. As a result, statistical tests performed on the data are rendered invalid. Pseudoreplication was originally defined in 1984 by Stuart H. Hurlbert as a special case of inadequate specification of random factors where both random and fixed factors are present. The problem of inadequate specification arises when treatments are assigned to units that are subsampled and the treatment F-ratio in an analysis of variance table is formed with respect to the residual mean square rather than with respect to the among unit mean square. The F-ratio relative to the within unit mean square is vulnerable to the confounding of treatment and unit effects, especially when experimental unit number is small. The problem is eliminated by forming the F-ratio relative to the correct mean square in the ANOVA table, where this is possible. The problem is addressed by the use of mixed models. Hurlbert reported "pseudoreplication" in 48% of the studies he examined, that used inferential statistics. Several studies examining scientific paperspublished up to 2016 similarly found about half of the papers were suspected of pseudoreplication. When time and resources limit the number of experimental units, and unit effects cannot be eliminated statistically by testing over the unit variance, it is important to use other sources of information to evaluate the degree to which an F-ratio is confounded by unit effects.
Replication
increases the precision of an estimate, while randomization addresses the broader applicability of a sample to a population. Replication must be appropriate: replication at the experimental unit level must be considered, in addition to replication within units.
Hypothesis testing
rely on appropriate replication to estimate statistical significance. Tests based on the t and F distributions assume homogeneous, normal, and independent errors. Correlated errors can lead to false precision and p-values that are too small.
Types
Hurlbert defined four types of pseudoreplication.
Simple pseudoreplication occurs when there is one experimental unit per treatment. Inferential statistics cannot separate variability due to treatment from variability due to experimental units when there is only one measurement per unit.
Temporal pseudoreplication occurs when experimental units differ enough in time that temporal effects among units are likely, and treatment effects are correlated with temporal effects. Inferential statistics cannot separate variability due to treatment from variability due to experimental units when there is only one measurement per unit.
Sacrificial pseudoreplication occurs when means within a treatment are used in an analysis, and these means are tested over the within unit variance. In Figure 5b the erroneous F-ratio will have 1 df in the numerator mean square and 4 df in the denominator mean square. The correct F-ratio will have 1 df in the numerator and 2 df in the denominator. The correct F-ratio controls for effects of experimental units but with 2 df in the denominator it will have little power to detect treatment differences.
Implicit pseudoreplication occurs when standard errors are estimated within experimental units. As with other sources of pseudoreplication, treatment effects cannot be statistically separated from effects due to variation among experimental units.