This archive contains the shareable data and code used in the reanalysis of Duflo (2001), "Schooling and Labor Market Consequences of School Construction in Indonesia: Evidence from an Unusual Policy Experiment."
The main data file, from the 1995 Intercensal survey (SUPAS), is not contained in this repository. It can however be found here.
The reanalysis also uses the 2005 SUPAS and 2011-12 SUSENAS survey data sets. The 2005 SUPAS data come from IPUMS. The 2011-12 SUSENAS data were obtained through the Harvard library system.
The "Regency-level vars" files contain figures on population, school attendance, planned school construction, and water and sanitation spending. The Duflo (2001) versions of the variables, which have been used in many studies, are here copied from the public data archive of Ashraf et al. (2020). The new versions carry the suffix "new". Images of the government documents they were reconstructed from are in the "Printed sources" folder.
Regency and municipality boundaries in Indonesia have changed over time, mostly through subdivision, occasionally through merger. This complicates linking regency-level data from the 1971 census and mid-1970s presidential directives to the follow-ups in 1995, 2005, 2010, and 2013-14. IPUMS helpfully provides shapefiles that modern database and GIS software can use to make the linkages. The concordances folder contains concordances linking the 1995 coding to the 2005 and 2010-14 codings. The 1970s data are manually coded with respect to 1995. Notes in "Baseline variable reconstruction.xlsx" in the "Regency-level vars" folder document complications in this coding, including a few cases where the original and new differ.
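As a rough illustration of how such a concordance is applied, the Python sketch below attaches a 1995-coded baseline variable to 2005 regency codes. It is not taken from the archive's code: the regency codes, the values, and the names concordance_2005_to_1995, planned_schools_1995, and attach_baseline are all hypothetical, and the actual concordance files may be laid out differently.

```python
# Hypothetical concordance: each 2005 regency code maps to the 1995 code
# it belonged to. Codes and values here are invented for illustration.
concordance_2005_to_1995 = {
    1101: 1101,  # regency unchanged between 1995 and 2005
    1102: 1101,  # new regency carved out of 1101 after 1995
    1201: 1201,
}

# Hypothetical baseline variable keyed by 1995 regency code.
planned_schools_1995 = {1101: 2.5, 1201: 1.0}


def attach_baseline(codes_2005, concordance, baseline):
    """Return {2005 code: baseline value of its parent 1995 regency}.

    Codes missing from the concordance get None so that unmatched
    regencies stay visible instead of being silently dropped.
    """
    linked = {}
    for code in codes_2005:
        parent = concordance.get(code)
        linked[code] = baseline.get(parent) if parent is not None else None
    return linked


linked = attach_baseline(
    [1101, 1102, 1201, 9999], concordance_2005_to_1995, planned_schools_1995
)
print(linked)
```

This sketch only covers subdivision, where several later codes share one 1995 parent. A merger would map one later code to several 1995 codes and would need an explicit rule for combining the baseline values.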
"Duflo 2001.do" is a Stata do-file that generates all results.