Abstract
The multilevel hidden Markov model (MHMM) is a promising method to investigate intense longitudinal data obtained within the social and behavioral sciences. The MHMM quantifies information on the latent dynamics of behavior over time. In addition, heterogeneity between individuals is accommodated with the inclusion of individual-specific random effects, facilitating the study of individual differences in dynamics. However, the performance of the MHMM has not been sufficiently explored. We performed an extensive simulation to assess the effect of the number of dependent variables (1–8), number of individuals (5–90), and number of observations per individual (100–1600) on the estimation performance of a Bayesian MHMM with categorical data including various levels of state distinctiveness and separation. We found that using multivariate data generally alleviates the sample size needed and improves the stability of the results. Moreover, including variables only consisting of random noise was generally not detrimental to model performance. Regarding the estimation of group-level parameters, the number of individuals and observations largely compensate for each other. However, only the former drives the estimation of between-individual variability. We conclude with guidelines on the sample size necessary based on the level of state distinctiveness and separation and study objectives of the researcher.
| Original language | English |
|---|---|
| Pages (from-to) | 17-45 |
| Number of pages | 29 |
| Journal | Multivariate Behavioral Research |
| Volume | 59 |
| Issue number | 1 |
| Early online date | 17 May 2023 |
| DOIs | |
| Publication status | Published - 2024 |
Bibliographical note
Publisher Copyright:© 2023 The Author(s). Published with license by Taylor & Francis Group, LLC.
Funding
Funding: This work made use of the Dutch national e-infrastructure Snellius with the support of the SURF Cooperative using grant no. EINF-2570, which is (partly) financed by the Dutch Research Council (NWO).
| Funders | Funder number |
|---|---|
| SURF | EINF-2570 |
| Nederlandse Organisatie voor Wetenschappelijk Onderzoek |
Keywords
- Bayesian statistics
- Hidden Markov models
- Monte Carlo studies
- categorical data
- individual random effects
- intensive longitudinal data
- multilevel modeling