The function fit fits two exponential models to incidence data, of the
form: \(log(y) = r * t + b\) 
 where 'y' is the incidence, 't' is time
(in days), 'r' is the growth rate, and 'b' is the origin. The function fit
will fit one model by default, but will fit two models on either side of a
splitting date (typically the peak of the epidemic) if the argument split
is provided. When groups are present, these are included in the model as main
effects and interactions with dates. The function fit_optim_split() can be
used to find the optimal 'splitting' date, defined as the one for which the
best average R2 of the two models is obtained. Plotting can be done using
plot, or added to an existing incidence plot by the piping-friendly
function add_incidence_fit().
An incidence object, generated by the function
incidence(). For the plotting function, an incidence_fit
object.
An optional time point identifying the separation between the two models. If NULL, a single model is fitted. If provided, two models would be fitted on the time periods on either side of the split.
The confidence interval to be used for predictions; defaults to 95%.
A logical indicating if warnings from fit should be
hidden; FALSE by default. Warnings typically indicate some zero incidence,
which are removed before performing the log-linear regression.
The size, in days, of the time window either side of the split.
A logical indicating whether a plot should be added to the
output (TRUE, default), showing the mean R2 for various splits.
If groups are present, should separate split dates be
determined for each group? Defaults to TRUE, in which separate split dates
and thus, separate models will be constructed for each group. When FALSE,
the split date will be determined from the pooled data and modelled with the
groups as main effects and interactions with date.
currently unused.
For fit(), a list with the class incidence_fit (for a
single model), or a list containing two incidence_fit objects (when
fitting two models). incidence_fit objects contain:
$model: the fitted linear model
$info: a list containing various information extracted from the model
(detailed further)
$origin: the date corresponding to day '0'
The $info item is a list containing:
r: the growth rate
r.conf: the confidence interval of 'r'
pred: a data.frame containing predictions of the model,
including the true dates (dates), their numeric version used in the
model (dates.x), the predicted value (fit), and the lower
(lwr) and upper (upr) bounds of the associated confidence
interval.
doubling: the predicted doubling time in days; exists only if 'r' is
positive
doubling.conf: the confidence interval of the doubling time
halving: the predicted halving time in days; exists only if 'r' is
negative
halving.conf: the confidence interval of the halving time
For fit_optim_split, a list containing:
df: a data.frame of dates that were used in the optimization
procedure, and the corresponding average R2 of the resulting models.
split: the optimal splitting date
fit: an incidence_fit_list object containing the fit for each split.
If the separate_split = TRUE, then the incidence_fit_list object will
contain these splits nested within each group. All of the incidence_fit
objects can be retrieved with get_fit().
plot: a plot showing the content of df (ggplot2 object)
the incidence() function to generate the 'incidence'
objects. The get_fit() function to flatten incidence_fit_list objects to
a list of incidence_fit objects.
if (require(outbreaks)) { withAutoprint({
 dat <- ebola_sim$linelist$date_of_onset
 ## EXAMPLE WITH A SINGLE MODEL
 ## compute weekly incidence
 i.7 <- incidence(dat, interval=7)
 plot(i.7)
 plot(i.7[1:20])
 ## fit a model on the first 20 weeks
 f <- fit(i.7[1:20])
 f
 names(f)
 head(get_info(f, "pred"))
 ## plot model alone (not recommended)
 plot(f)
 ## plot data and model (recommended)
 plot(i.7, fit = f)
 plot(i.7[1:25], fit = f)
## piping versions
if (require(magrittr)) { withAutoprint({
  plot(i.7) %>% add_incidence_fit(f)
  ## EXAMPLE WITH 2 PHASES
  ## specifying the peak manually
  f2 <- fit(i.7, split = as.Date("2014-10-15"))
  f2
  plot(i.7) %>% add_incidence_fit(f2)
  ## finding the best 'peak' date
  f3 <- fit_optim_split(i.7)
  f3
  plot(i.7) %>% add_incidence_fit(f3$fit)
})}
})}
#> > dat <- ebola_sim$linelist$date_of_onset
#> > i.7 <- incidence(dat, interval = 7)
#> > plot(i.7)
 #> > plot(i.7[1:20])
#> > plot(i.7[1:20])
 #> > f <- fit(i.7[1:20])
#> > f
#> <incidence_fit object>
#> 
#> $model: regression of log-incidence over time
#> 
#> $info: list containing the following items:
#>   $r (daily growth rate):
#> [1] 0.03175771
#> 
#>   $r.conf (confidence interval):
#>           2.5 %     97.5 %
#> [1,] 0.02596229 0.03755314
#> 
#>   $doubling (doubling time in days):
#> [1] 21.8261
#> 
#>   $doubling.conf (confidence interval):
#>         2.5 %   97.5 %
#> [1,] 18.45777 26.69823
#> 
#>   $pred: data.frame of incidence predictions (20 rows, 5 columns)
#> > names(f)
#> [1] "model"  "info"   "origin"
#> > head(get_info(f, "pred"))
#>        dates dates.x      fit      lwr       upr
#> 1 2014-04-10     3.5 2.528815 1.611099  3.969283
#> 2 2014-04-17    10.5 3.158367 2.082082  4.791013
#> 3 2014-04-24    17.5 3.944645 2.687383  5.790104
#> 4 2014-05-01    24.5 4.926668 3.463102  7.008763
#> 5 2014-05-08    31.5 6.153167 4.453513  8.501484
#> 6 2014-05-15    38.5 7.685004 5.711842 10.339797
#> > plot(f)
#> > f <- fit(i.7[1:20])
#> > f
#> <incidence_fit object>
#> 
#> $model: regression of log-incidence over time
#> 
#> $info: list containing the following items:
#>   $r (daily growth rate):
#> [1] 0.03175771
#> 
#>   $r.conf (confidence interval):
#>           2.5 %     97.5 %
#> [1,] 0.02596229 0.03755314
#> 
#>   $doubling (doubling time in days):
#> [1] 21.8261
#> 
#>   $doubling.conf (confidence interval):
#>         2.5 %   97.5 %
#> [1,] 18.45777 26.69823
#> 
#>   $pred: data.frame of incidence predictions (20 rows, 5 columns)
#> > names(f)
#> [1] "model"  "info"   "origin"
#> > head(get_info(f, "pred"))
#>        dates dates.x      fit      lwr       upr
#> 1 2014-04-10     3.5 2.528815 1.611099  3.969283
#> 2 2014-04-17    10.5 3.158367 2.082082  4.791013
#> 3 2014-04-24    17.5 3.944645 2.687383  5.790104
#> 4 2014-05-01    24.5 4.926668 3.463102  7.008763
#> 5 2014-05-08    31.5 6.153167 4.453513  8.501484
#> 6 2014-05-15    38.5 7.685004 5.711842 10.339797
#> > plot(f)
 #> > plot(i.7, fit = f)
#> > plot(i.7, fit = f)
 #> > plot(i.7[1:25], fit = f)
#> > plot(i.7[1:25], fit = f)
 #> > if (require(magrittr)) {
#> +     withAutoprint({
#> +         plot(i.7) %>% add_incidence_fit(f)
#> +         f2 <- fit(i.7, split = as.Date("2014-10-15"))
#> +         f2
#> +         plot(i.7) %>% add_incidence_fit(f2)
#> +         f3 <- fit_optim_split(i.7)
#> +         f3
#> +         plot(i.7) %>% add_incidence_fit(f3$fit)
#> +     })
#> + }
#> Loading required package: magrittr
#> > plot(i.7) %>% add_incidence_fit(f)
#> > if (require(magrittr)) {
#> +     withAutoprint({
#> +         plot(i.7) %>% add_incidence_fit(f)
#> +         f2 <- fit(i.7, split = as.Date("2014-10-15"))
#> +         f2
#> +         plot(i.7) %>% add_incidence_fit(f2)
#> +         f3 <- fit_optim_split(i.7)
#> +         f3
#> +         plot(i.7) %>% add_incidence_fit(f3$fit)
#> +     })
#> + }
#> Loading required package: magrittr
#> > plot(i.7) %>% add_incidence_fit(f)
 #> > f2 <- fit(i.7, split = as.Date("2014-10-15"))
#> > f2
#> <list of incidence_fit objects>
#> 
#> attr(x, 'locations'): list of vectors with the locations of each incidence_fit object
#> 
#> 'before'
#> 'after'
#> 
#> $model: regression of log-incidence over time
#> 
#> $info: list containing the following items:
#>   $r (daily growth rate):
#>      before       after 
#>  0.02741985 -0.01014465 
#> 
#>   $r.conf (confidence interval):
#>              2.5 %       97.5 %
#> before  0.02407933  0.030760379
#> after  -0.01127733 -0.009011981
#> 
#>   $doubling (doubling time in days):
#>   before 
#> 25.27902 
#> 
#>   $doubling.conf (confidence interval):
#>           2.5 %   97.5 %
#> before 22.53377 28.78598
#> 
#>   $halving (halving time in days):
#>    after 
#> 68.32636 
#> 
#>   $halving.conf (confidence interval):
#>          2.5 %   97.5 %
#> after 61.46379 76.91397
#> 
#>   $pred: data.frame of incidence predictions (56 rows, 6 columns)
#> > plot(i.7) %>% add_incidence_fit(f2)
#> > f2 <- fit(i.7, split = as.Date("2014-10-15"))
#> > f2
#> <list of incidence_fit objects>
#> 
#> attr(x, 'locations'): list of vectors with the locations of each incidence_fit object
#> 
#> 'before'
#> 'after'
#> 
#> $model: regression of log-incidence over time
#> 
#> $info: list containing the following items:
#>   $r (daily growth rate):
#>      before       after 
#>  0.02741985 -0.01014465 
#> 
#>   $r.conf (confidence interval):
#>              2.5 %       97.5 %
#> before  0.02407933  0.030760379
#> after  -0.01127733 -0.009011981
#> 
#>   $doubling (doubling time in days):
#>   before 
#> 25.27902 
#> 
#>   $doubling.conf (confidence interval):
#>           2.5 %   97.5 %
#> before 22.53377 28.78598
#> 
#>   $halving (halving time in days):
#>    after 
#> 68.32636 
#> 
#>   $halving.conf (confidence interval):
#>          2.5 %   97.5 %
#> after 61.46379 76.91397
#> 
#>   $pred: data.frame of incidence predictions (56 rows, 6 columns)
#> > plot(i.7) %>% add_incidence_fit(f2)
 #> > f3 <- fit_optim_split(i.7)
#> > f3
#> $df
#>         dates   mean.R2
#> 1  2014-08-04 0.7650406
#> 2  2014-08-11 0.8203351
#> 3  2014-08-18 0.8598316
#> 4  2014-08-25 0.8882682
#> 5  2014-09-01 0.9120857
#> 6  2014-09-08 0.9246023
#> 7  2014-09-15 0.9338797
#> 8  2014-09-22 0.9339813
#> 9  2014-09-29 0.9333246
#> 10 2014-10-06 0.9291131
#> 11 2014-10-13 0.9232523
#> 12 2014-10-20 0.9160439
#> 13 2014-10-27 0.9071665
#> 
#> $split
#> [1] "2014-09-22"
#> 
#> $fit
#> <list of incidence_fit objects>
#> 
#> attr(x, 'locations'): list of vectors with the locations of each incidence_fit object
#> 
#> 'before'
#> 'after'
#> 
#> $model: regression of log-incidence over time
#> 
#> $info: list containing the following items:
#>   $r (daily growth rate):
#>      before       after 
#>  0.02982209 -0.01016191 
#> 
#>   $r.conf (confidence interval):
#>              2.5 %       97.5 %
#> before  0.02608945  0.033554736
#> after  -0.01102526 -0.009298561
#> 
#>   $doubling (doubling time in days):
#>   before 
#> 23.24274 
#> 
#>   $doubling.conf (confidence interval):
#>           2.5 %  97.5 %
#> before 20.65721 26.5681
#> 
#>   $halving (halving time in days):
#>    after 
#> 68.21031 
#> 
#>   $halving.conf (confidence interval):
#>          2.5 %   97.5 %
#> after 62.86899 74.54349
#> 
#>   $pred: data.frame of incidence predictions (57 rows, 6 columns)
#> 
#> $plot
#> > f3 <- fit_optim_split(i.7)
#> > f3
#> $df
#>         dates   mean.R2
#> 1  2014-08-04 0.7650406
#> 2  2014-08-11 0.8203351
#> 3  2014-08-18 0.8598316
#> 4  2014-08-25 0.8882682
#> 5  2014-09-01 0.9120857
#> 6  2014-09-08 0.9246023
#> 7  2014-09-15 0.9338797
#> 8  2014-09-22 0.9339813
#> 9  2014-09-29 0.9333246
#> 10 2014-10-06 0.9291131
#> 11 2014-10-13 0.9232523
#> 12 2014-10-20 0.9160439
#> 13 2014-10-27 0.9071665
#> 
#> $split
#> [1] "2014-09-22"
#> 
#> $fit
#> <list of incidence_fit objects>
#> 
#> attr(x, 'locations'): list of vectors with the locations of each incidence_fit object
#> 
#> 'before'
#> 'after'
#> 
#> $model: regression of log-incidence over time
#> 
#> $info: list containing the following items:
#>   $r (daily growth rate):
#>      before       after 
#>  0.02982209 -0.01016191 
#> 
#>   $r.conf (confidence interval):
#>              2.5 %       97.5 %
#> before  0.02608945  0.033554736
#> after  -0.01102526 -0.009298561
#> 
#>   $doubling (doubling time in days):
#>   before 
#> 23.24274 
#> 
#>   $doubling.conf (confidence interval):
#>           2.5 %  97.5 %
#> before 20.65721 26.5681
#> 
#>   $halving (halving time in days):
#>    after 
#> 68.21031 
#> 
#>   $halving.conf (confidence interval):
#>          2.5 %   97.5 %
#> after 62.86899 74.54349
#> 
#>   $pred: data.frame of incidence predictions (57 rows, 6 columns)
#> 
#> $plot
 #> 
#> > plot(i.7) %>% add_incidence_fit(f3$fit)
#> 
#> > plot(i.7) %>% add_incidence_fit(f3$fit)
