Data

My research draws on a range of original and archival sources compiled through fieldwork and from statistical repositories and libraries in the United States, Zambia, and Zimbabwe. All datasets will be made publicly available following publication, in accordance with IRB and informant agreements.

1. Census Classification Dataset (1955–2024)

This dataset compiles racial, ethnic, and tribal classification schemes from over 235 census forms. It includes original and translated versions of classification labels, the structure of identity questions, and year-by-year coding from 1955 to 2024 across 57 African countries and territories.
Source repository: IPUMS International.

2. Oral Histories and Interview Materials

This dataset includes anonymized transcripts, selected audio excerpts, and field notes from 44 oral history interviews conducted with statisticians, bureaucrats, and community members in Zambia and Zimbabwe. It also includes interview guides and contextual photographs, made available in accordance with informant permissions and IRB guidelines.

3. Historical Census Reports and Statistical Tables (1881–2024)

This collection comprises full-text census reports, summary tables, and classification figures covering the African continent. The materials include scanned documents from paper archives as well as digital records obtained from national statistical offices and libraries.
Source repositories: Princeton University Library, Library of Congress, Zambia Statistics Agency, Zimbabwe National Statistics Agency, National Archives of Zambia, National Archives of Zimbabwe, East View, and various online sources.