Model comparison on a real REM dataset

This vignette walks through the canonical “candidate-specifications → AIC ranking” workflow used in REM literature (Juozaitienė & Wit 2024, JRSS-A 188(4)) — with amorem’s bundled real-world dataset and the one-call compare_models() helper.

1. Load a bundled REM dataset

The package ships four real-world REM datasets directly. We use the McFarland (2001) high-school classroom session: 691 directed interactions among 20 students and one instructor, recorded on 16 October 1996.

data(classroom_events)
data(classroom_actors)
nrow(classroom_events)
#> [1] 691
head(classroom_events, 3)
#>    time sender receiver interaction_type weight
#> 1 0.125     14       12           social      1
#> 2 0.250     12       14           social      1
#> 3 0.375     18       12         sanction      1
table(classroom_actors$role)
#> 
#> instructor   grade_11   grade_12 
#>          2         13          5

classroom_events follows the package’s (sender, receiver, time, ...) contract — the same contract every downstream helper expects.

2. Build candidate specifications

compare_models() accepts a named list of character vectors. Each entry is one candidate specification; the vector contents are the endogenous statistic names. Here we compare three minimal specifications:

specs <- list(
  count        = c("reciprocity_count",
                   "transitivity_count"),
  continuous   = c("reciprocity_time_recent",
                   "transitivity_time_recent"),
  interrupted  = c("reciprocity_time_recent_interrupted",
                   "transitivity_time_recent_interrupted"))

These are the paper’s definitions r^(1c)/t(1c) (count), r^(4ac)/t(7ac) (continuous timing), and r^(4ai)/t(7ai) (interrupted timing).

3. Compare by AIC

res <- compare_models(classroom_events, specs, seed = 11)
res
#>         model n_terms n_obs   log_lik      AIC delta_AIC
#> 1       count       2   691 -305.5233 615.0466    0.0000
#> 2  continuous       2   691 -421.1231 846.2462  231.1996
#> 3 interrupted       2   691 -439.6904 883.3809  268.3343

The helper draws one case-control sample (default n_controls = 1) shared across every specification, computes the union of all requested statistics with endogenous_features(), builds case-minus-control differences, and fits one no-intercept binomial GLM per spec. Returned rows are sorted by AIC; the winning spec has delta_AIC = 0.

On Classroom the count-based specification wins this stripped-down comparison — recall that this is a no-smooth, no-random-effect baseline; richer fits (thin-plate splines on the differences, sender/receiver random effects) typically reweight the ranking in favour of the temporal definitions, matching the empirical findings of the paper.

Multiple controls per case

For 1:1 matching the helper fits a no-intercept binomial GLM on case-minus-control differences. Set n_controls > 1 to switch to stratified conditional logistic regression via survival::coxph — the right tool when you want more controls per case for tighter inference:

compare_models(classroom_events, specs,
               n_controls = 3, seed = 11)
#>         model n_terms n_obs   log_lik      AIC delta_AIC
#> 1       count       2   691 -5248.242 11880.48    0.0000
#> 2  continuous       2   691 -5368.088 12120.18  239.6934
#> 3 interrupted       2   691 -5449.816 12283.63  403.1494

The n_obs column now reports the number of strata (one per case), and survival is in the package’s Suggests — required only when n_controls > 1. AIC values across specs remain comparable because every spec sees the same shared case-control sample.

4. Inspect coefficients of a chosen specification

compare_models() returns AIC summaries. To inspect coefficients for a single spec, build the case-control sample once and fit directly:

stat_set <- specs$interrupted
cc <- sample_non_events(classroom_events, n_controls = 1,
                        scope = "all", mode = "one", seed = 11)
cc_feat <- endogenous_features(cc, stats = stat_set)
for (st in stat_set) cc_feat[[st]][is.na(cc_feat[[st]])] <- 0

cases <- cc_feat[cc_feat$event == 1L, ]
ctrls <- cc_feat[cc_feat$event == 0L, ]
cases <- cases[order(cases$stratum), ]
ctrls <- ctrls[order(ctrls$stratum), ]

df <- data.frame(
  one      = rep(1, nrow(cases)),
  d_rec    = cases[[stat_set[1]]] - ctrls[[stat_set[1]]],
  d_trans  = cases[[stat_set[2]]] - ctrls[[stat_set[2]]])

fit <- glm(one ~ d_rec + d_trans - 1, family = "binomial", data = df)
summary(fit)$coefficients
#>           Estimate Std. Error   z value     Pr(>|z|)
#> d_rec   -0.1008825 0.01912790 -5.274104 1.334064e-07
#> d_trans -0.1319044 0.02943224 -4.481630 7.407511e-06

The same recipe scales to any subset of the statistics in the catalogue. Use ?endogenous_features to see the full list.

5. Cross-implementation guarantee

Every statistic the post-hoc engine computes is also generated by simulate_relational_events() using the same paper definitions. The package ships a parity test (test-sim-vs-posthoc-parity.R) that runs the simulator on every shared stat and verifies that re-running endogenous_features() on the resulting event log reproduces the simulator’s columns row-for-row. This means: if you want to test a model selection pipeline against ground-truth coefficients, you can simulate data with simulate_relational_events() using known effects and use compare_models() to confirm the ranking.

set.seed(2026)
sim <- simulate_relational_events(
  n_events = 600,
  senders   = LETTERS[1:8],
  receivers = LETTERS[1:8],
  baseline_rate = 1,
  allow_loops = FALSE,
  endogenous_stats   = c("reciprocity_count", "transitivity_count"),
  endogenous_effects = c(reciprocity_count = 0.4, transitivity_count = 0.0))

# Among these three specs, the "count" spec is the true generative
# process. compare_models() should rank it first.
res2 <- compare_models(sim, specs, seed = 7)
#> Warning: glm.fit: fitted probabilities numerically 0 or 1 occurred
res2
#>         model n_terms n_obs    log_lik      AIC delta_AIC
#> 1       count       2   600  -60.15295 124.3059    0.0000
#> 2  continuous       2   600 -378.42664 760.8533  636.5474
#> 3 interrupted       2   600 -407.21051 818.4210  694.1151