Abstract

Data saturation is the most commonly employed concept for estimating sample sizes in qualitative research. Over the past 20 years, scholars using both empirical research and mathematical/statistical models have made significant contributions to the question: How many qualitative interviews are enough? This body of work has advanced the evidence base for sample size estimation in qualitative inquiry during the design phase of a study, prior to data collection, but it does not provide qualitative researchers with a simple and reliable way to determine the adequacy of sample sizes during and/or after data collection. Using the principle of saturation as a foundation, we describe and validate a simple-to-apply method for assessing and reporting on saturation in the context of inductive thematic analyses. Following a review of the empirical research on data saturation and sample size estimation in qualitative research, we propose an alternative way to evaluate saturation that overcomes the shortcomings and challenges associated with existing methods identified in our review. Our approach includes three primary elements in its calculation and assessment: Base Size, Run Length, and New Information Threshold. We additionally propose a more flexible approach to reporting saturation. To validate our method, we use a bootstrapping technique on three existing thematically coded qualitative datasets generated from in-depth interviews. Results from this analysis indicate the method we propose to assess and report on saturation is feasible and congruent with findings from earlier studies.

Keywords

Qualitative researchComputer scienceSample size determinationData collectionSaturation (graph theory)Data scienceSample (material)Context (archaeology)Qualitative propertyData miningManagement scienceStatisticsMachine learningMathematicsEngineeringSociology

MeSH Terms

Data CollectionHumansQualitative ResearchResearch DesignSample Size

Affiliated Institutions

Related Publications

Publication Info

Year
2020
Type
article
Volume
15
Issue
5
Pages
e0232076-e0232076
Citations
2120
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

2120
OpenAlex
29
Influential
1803
CrossRef

Cite This

Greg Guest, Emily Namey, Mario Chen (2020). A simple method to assess and report thematic saturation in qualitative research. PLoS ONE , 15 (5) , e0232076-e0232076. https://doi.org/10.1371/journal.pone.0232076

Identifiers

DOI
10.1371/journal.pone.0232076
PMID
32369511
PMCID
PMC7200005

Data Quality

Data completeness: 86%