Setup

This repository accompanies our EMNLP 2020 paper “Are ‘Undocumented Workers’ the Same as ‘Illegal Aliens’? Disentangling Denotation and Connotation in Vector Spaces”. Paper. Recorded talk.

Setup

(Python 3.7 or higher required.)

git clone https://github.com/awebson/congressional_adversary
cd congressional_adversary
python3 -m venv congressional_env  # or your preferred virtual environment solution
source congressional_env/bin/activate
pip install -r requirements.txt
pip install -e . 
mkdir data

Then, download the training and evaluation data from this Google Drive link, extract with your favorite tar command, and copy them to the data directory you just made.

Note that this data is already fully preprocessed, so you don’t need to actually run any preprocessing script included in this repository. If you do, the raw corpus of Congressional Record is available from Gentzkow et al. (2019). The raw corpus of Partisan News is available from Kiesel et al. (2019).

Training

Run src/models/ideal_grounding.py for the CR bill and CR topic models, or src/models/proxy_grounding.py for the CR proxy and PN proxy models. (See paper Sections 3 and 4 for more details.) Each model source file contains a config dataclass where you can change the default parameters as well as the command line arguments. Hyperparameters with reproducible results are documented in paper Appendix A. Run tensorboard --logdir . to see the results of your experiments.

Why is this repository codenamed “Congressional Adversary”?

In typical lame academic humor, I thought it's funny that I implemented an adversarial neural net for Members of Congress, who are often adversarial, if not acrimonious, to each other.

We were also deciding between “Adversarial Congress” and “Congressional Adversary”. The former sounds like a political science book that refutes the deliberative theory of democracy, whereas the latter just sounds like someone’s evil archenemy. Since this was a paper submitted to Empirical Methods in Natural Language Processing, not the American Journal of Political Science, we went with “Congressional Adversary”.

Name		Name	Last commit message	Last commit date
Latest commit History 119 Commits
src		src
.gitignore		.gitignore
license.txt		license.txt
readme.md		readme.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Setup

Training

Why is this repository codenamed “Congressional Adversary”?

About

Languages

License

awebson/congressional_adversary

Folders and files

Latest commit

History

Repository files navigation

Setup

Training

Why is this repository codenamed “Congressional Adversary”?

About

Resources

License

Stars

Watchers

Forks

Languages