Code to reproduce the experiments of the paper Questioning the Survey Responses of Large Language Models.
We survey language models with the American Community Survey. The experiment results can be downloaded from here.
Use the following Jupyter notebooks:
- Main text: figures.ipynb
- Appendix: appendix.ipynb
- Prompt ablations: prompt-ablations/
- Survey ablations: survey-ablations/
The relevant files to reproduce the experiments are:
- script_fill_individual.py: obtain language models' responses to individual survey questions for language models from HugginFace's model hub.
- script_fill_openai.py: obtain GPT-3's responses to individual survey questions.
- script_fill_sequential.py: sample language models' responses to entire survey. questionnaires, where questions are presented sequentially while keeping previous answers in-context.
- discriminator_test.py: perform the discriminator test on the model generated data.