Skip to content

Commit

Permalink
Merge pull request #183 from florianhartig/182-outlier
Browse files Browse the repository at this point in the history
182 outlier
  • Loading branch information
florianhartig authored Jun 7, 2020
2 parents a937ca4 + 20a4e9a commit c35deca
Show file tree
Hide file tree
Showing 6 changed files with 203 additions and 137 deletions.
45 changes: 45 additions & 0 deletions Code/DHARMaManualTests/Calibration.Rmd
Original file line number Diff line number Diff line change
@@ -0,0 +1,45 @@
---
title: "Calibration of p-values"
author: "Florian Hartig"
date: "6/3/2020"
output: html_document
---

```{r setup, include=FALSE}
knitr::opts_chunk$set(echo = TRUE)
```


```{r}
reps = 200
pVals = matrix(ncol = 6, nrow = reps)
for(i in 1:reps){
testData = createData(sampleSize = 10000, overdispersion = 0, pZeroInflation = 0, randomEffectVariance = 0, family = gaussian())
fittedModel <- lm(observedResponse ~ Environment1 , data = testData)
simulationOutput <- simulateResiduals(fittedModel = fittedModel, n = 100)
pVals[i,1] = testOutliers(simulationOutput, plot = F, alternative = "two.sided")$p.value
pVals[i,2] = testOutliers(simulationOutput, plot = F, alternative = "greater")$p.value
pVals[i,3] = testOutliers(simulationOutput, plot = F, alternative = "less")$p.value
pVals[i,4] = testOutliers(simulationOutput, plot = F, alternative = "two.sided", margin = "upper")$p.value
pVals[i,5] = testOutliers(simulationOutput, plot = F, alternative = "greater", margin = "upper")$p.value
pVals[i,6] = testOutliers(simulationOutput, plot = F, alternative = "less", margin = "upper")$p.value
}
```



```{r}
par(mfrow = c(2,3))
for(i in 1:6) hist(pVals[,i], breaks = 50)
```



2 changes: 1 addition & 1 deletion DHARMa/DESCRIPTION
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
Package: DHARMa
Title: Residual Diagnostics for Hierarchical (Multi-Level / Mixed) Regression Models
Version: 0.3.1
Version: 0.3.1.1
Date: 2020-05-10
Authors@R: c(person("Florian", "Hartig", email = "[email protected]", role = c("aut", "cre"), comment=c(ORCID="0000-0002-6255-9059")))
Description: The 'DHARMa' package uses a simulation-based approach to create
Expand Down
63 changes: 35 additions & 28 deletions DHARMa/R/tests.R
Original file line number Diff line number Diff line change
Expand Up @@ -145,54 +145,61 @@ testQuantiles <- function(simulationOutput, predictor = NULL, quantiles = c(0.25

#' Test for outliers
#'
#' This function tests if the number of observations that are strictly greater / smaller than all simulations are larger than expected
#' This function tests if the number of observations outside the simulatio envelope are larger or smaller than expected
#'
#' @param simulationOutput an object of class DHARMa with simulated quantile residuals, either created via \code{\link{simulateResiduals}} or by \code{\link{createDHARMa}} for simulations created outside DHARMa
#' @param alternative a character string specifying whether the test should test if observations are "greater", "less" or "two.sided" compared to the simulated null hypothesis
#' @param alternative a character string specifying whether the test should test if observations are "greater", "less" or "two.sided" (default) compared to the simulated null hypothesis
#' @param margin whether to test for outliers only at the lower, only at the upper, or both sides (default) of the simulated data distribution
#' @param plot if T, the function will create an additional plot
#' @details DHARMa residuals are created by simulating from the fitted model, and comparing the simulated values to the observed data. It can occur that all simulated values are higher or smaller than the observed data, in which case they get the residual value of 0 and 1, respectively. I refer to these values as simulation outliers, or simply outliers.
#'
#' Because no data was simulated in the range of the observed value, we actually don't know "how much" these values deviate from the model expecation, so the term "outlier" should be used with a grain of salt - it's not a judgement about the probability of a deviation from an expectation, but denotes that we are outside the simulated range. The number of outliers would usually decrease if the number of DHARMa simulations is increased.
#' Because no data was simulated in the range of the observed value, we don't know "how strongly" these values deviate from the model expectation, so the term "outlier" should be used with a grain of salt - it's not a judgment about the magnitude of a deviation from an expectation, but simply that we are outside the simulated range, and thus cannot say anything more about the location of the residual.
#'
#' Note also that the number of outliers will decrease as we increase the number of simulations. Under the null hypothesis that the model is correct, we expect nData /(nSim +1) outliers at each margin of the distribution. For a reason, consider that if the data and the model distribution are identical, the probability that a given observation is higher than all simulations is 1/(nSim +1).
#'
#' The probability of an outlier depends on the number of simulations (in fact, it is 1/(nSim +1) for each side), so whether the existence of outliers is a reason for concern depends also on the number of simulations. The expected number of outliers is therefore binomially distributed, and we can calculate a p-value from that
#' Based on this null expectation, we can test for an excess or lack of outliers. Per default, testOutliers() looks for both, so if you get a significant p-value, you have to check if you have to many or too few outliers. An excess of outliers is to be interpreted as too many values outside the simulation envelope. This could be caused by overdispersion, or by what we classically call outliers. A lack of outliers would be caused, for example, by underdispersion.
#'
#'
#' @author Florian Hartig
#' @seealso \code{\link{testResiduals}}, \code{\link{testUniformity}}, \code{\link{testDispersion}}, \code{\link{testZeroInflation}}, \code{\link{testGeneric}}, \code{\link{testTemporalAutocorrelation}}, \code{\link{testSpatialAutocorrelation}}, \code{\link{testQuantiles}}
#' @example inst/examples/testsHelp.R
#' @example inst/examples/testOutliersHelp.R
#' @export
testOutliers <- function(simulationOutput, alternative = c("two.sided", "greater", "less"), plot = T){

out = list()
out$data.name = deparse(substitute(simulationOutput))
testOutliers <- function(simulationOutput, alternative = c("two.sided", "greater", "less"), margin = c("both", "upper", "lower"), plot = T){

# check inputs
alternative = match.arg(alternative)
margin = match.arg(margin)
data.name = deparse(substitute(simulationOutput)) # remember: needs to be called before ensureDHARMa
simulationOutput = ensureDHARMa(simulationOutput, convert = "Model")

alternative <- match.arg(alternative)

out$alternative = alternative
out$method = "DHARMa outlier test based on exact binomial test"
# calculation of outliers
if(margin == "both") outliers = sum(simulationOutput$scaledResiduals %in% c(0, 1))
if(margin == "upper") outliers = sum(simulationOutput$scaledResiduals == 1)
if(margin == "lower") outliers = sum(simulationOutput$scaledResiduals == 0)

outLow = sum(simulationOutput$scaledResiduals == 0)
outHigh = sum(simulationOutput$scaledResiduals == 1)
# calculations of trials and H0
outFreqH0 = 1/(simulationOutput$nSim +1) * ifelse(margin == "both", 2, 1)
trials = simulationOutput$nObs
outFreqH0 = 1/(simulationOutput$nSim +1)

out$testlow = binom.test(outLow, trials, p = outFreqH0, alternative = "less")
out$testhigh = binom.test(outHigh, trials, p = outFreqH0, alternative = "greater")
out = binom.test(outliers, trials, p = outFreqH0, alternative = alternative)

out$statistic = c(outLow = outLow, outHigh = outHigh, nobs = trials, freqH0 = outFreqH0)
# overwrite information in binom.test

if(alternative == "greater") p = out$testlow$p.value
if(alternative == "less") p = out$testhigh$p.value
if(alternative == "two.sided") p = min(min(out$testlow$p.value, out$testhigh$p.value) * 2,1)
out$data.name = data.name
out$margin = margin
out$method = "DHARMa outlier test based on exact binomial test"

out$p.value = p
class(out) = "htest"
names(out$statistic) = paste("outliers at", margin, "margin(s)")
names(out$parameter) = "simulations"
names(out$estimate) = paste("frequency of outliers (expected:", out$null.value,")")

out$interval = "tst"

out$interval =

if(plot == T) {

hist(simulationOutput, main = "", max(round(simulationOutput$nSim / 5), 20))
hist(simulationOutput, main = "")

main = ifelse(out$p.value <= 0.05,
"Outlier test significant",
Expand Down Expand Up @@ -467,12 +474,12 @@ testTemporalAutocorrelation <- function(simulationOutput, time = NULL , alternat
#' @export
testSpatialAutocorrelation <- function(simulationOutput, x = NULL, y = NULL, distMat = NULL, alternative = c("two.sided", "greater", "less"), plot = T){

alternative <- match.arg(alternative)
data.name = deparse(substitute(simulationOutput)) # needs to be before ensureDHARMa
simulationOutput = ensureDHARMa(simulationOutput, convert = T)

if(any(duplicated(cbind(x,y)))) stop("testing for spatial autocorrelation requires unique x,y values - if you have several observations per location, either use the recalculateResiduals function to aggregate residuals per location, or extract the residuals from the fitted object, and plot / test each of them independently for spatially repeated subgroups (a typical scenario would repeated spatial observation, in which case one could plot / test each time step separately for temporal autocorrelation). Note that the latter must be done by hand, outside testSpatialAutocorrelation.")

alternative <- match.arg(alternative)

if( (!is.null(x) | !is.null(y)) & !is.null(distMat) ) message("both coordinates and distMat provided, calculations will be done based on the distance matrix, coordinates will only be used for plotting")
# if not provided, fill x and y with random numbers (Null model)
if(is.null(x)){
Expand All @@ -498,7 +505,7 @@ testSpatialAutocorrelation <- function(simulationOutput, x = NULL, y = NULL, di
out$method = "DHARMa Moran's I test for spatial autocorrelation"
out$alternative = "Spatial autocorrelation"
out$p.value = MI$p.value
# out$data.name = deparse(substitute(simulationOutput))
out$data.name = data.name

class(out) = "htest"

Expand Down
24 changes: 24 additions & 0 deletions DHARMa/inst/examples/testOutliersHelp.R
Original file line number Diff line number Diff line change
@@ -0,0 +1,24 @@
set.seed(123)

testData = createData(sampleSize = 200, overdispersion = 1, randomEffectVariance = 0)
fittedModel <- glm(observedResponse ~ Environment1 , family = "poisson", data = testData)
simulationOutput <- simulateResiduals(fittedModel = fittedModel)

# default outlier test (with plot)
testOutliers(simulationOutput)

# note that default is to test outliers at both margins for both an excess and a lack
# of outliers. Here we see that we mostly have an excess of outliers at the upper
# margin. You see that it is an exces because the frequency of outliers is 0.055,
# while expected is 0.008

# Let's see what would have happened if we would just have checked the lower margin
testOutliers(simulationOutput, margin = "lower", plot = FALSE)

# OK, now the frequency of outliers is 0, so we have too few, but this is n.s. against
# the expectation

# just for completeness, what would have happened if we would have checked both
# margins, but just for a lack of outliers (i.e. underdispersion)

testOutliers(simulationOutput, alternative = "less", plot = FALSE)
71 changes: 25 additions & 46 deletions DHARMa/man/testOutliers.Rd

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Loading

0 comments on commit c35deca

Please sign in to comment.