
Module Overview

Reference

Jensenius, Francesca R., and Gilles Verniers. "Studying Indian politics with large-scale data: Indian election data 1961–today." Studies in Indian Politics 5, no. 2 (2017): 269-275.

Prakash, Nishith, Marc Rockmore, and Yogesh Uppal. "Do criminally accused politicians affect economic outcomes? Evidence from India." Journal of Development Economics 141 (2019): 102370.

Asher, Sam, Tobias Lunt, Ryu Matsuura, and Paul Novosad. "Development research at high geographic resolution: an analysis of night-lights, firms, and poverty in India using the shrug open data platform." The World Bank Economic Review 35, no. 4 (2021): 845-871.

@article{jv2017,
  title={Studying Indian politics with large-scale data: Indian election data 1961--today},
  author={Jensenius, Francesca R. and Verniers, Gilles},
  journal={Studies in Indian Politics},
  volume={5},
  number={2},
  pages={269--275},
  year={2017}
}

@article{pru2019,
  title={Do criminally accused politicians affect economic outcomes? Evidence from India},
  author={Prakash, Nishith and Rockmore, Marc and Uppal, Yogesh},
  journal={Journal of Development Economics},
  volume={141},
  pages={102370},
  year={2019}
}

@article{almn2021,
  title={Development research at high geographic resolution: an analysis of night-lights, firms, and poverty in India using the shrug open data platform},
  author={Asher, Sam and Lunt, Tobias and Matsuura, Ryu and Novosad, Paul},
  journal={The World Bank Economic Review},
  volume={35},
  number={4},
  pages={845--871},
  year={2021},
  publisher={Oxford University Press}
}
About This module includes electoral data from the Trivedi Center for Political Data, covering turnout, vote totals, and other information on each candidate and party for elections from 1974 to 2022. It also includes data from candidate affidavits describing assets, liabilities, and open criminal charges, collected by ADR and Prakash et al. (2019) (see the Reference section).
Geographic Coverage National
Aggregations Assembly Constituency 2007, Assembly Constituency 2008
Producer Trivedi Center for Political Data (TCPD)
Source URL https://lokdhaba.ashoka.edu.in/
Notes
  • Source for vote results: Trivedi Center for Political Data (TCPD) at Ashoka University. Users of these data should cite Jensenius and Verniers (2017); see the Reference section.
  • Source for candidate assets/crime: Prakash et al. (2019); see the Reference section.
  • These data link to other SHRUG data using the standard SHRUG constituency identifiers (ac07_id and ac08_id), which identify constituencies consistently over time. Trivedi identifiers (tr_ac_id) follow the ECI's internal notation but are inconsistent over time. ac07_id identifies pre-2007 delimitation constituencies, and ac08_id identifies post-delimitation constituencies.
  • The variable sh_election_id uniquely identifies an election and takes the form ss-nn-aaa[-p], where ss is the state code, nn is the assembly number (i.e. 1st assembly, 2nd assembly, etc.), aaa is the SHRUG constituency identifier, and p is the optional poll number, used only for bye-elections. The variable sh_cand_id uniquely identifies a candidate in the dataset; it takes the value of the candidate vote rank appended to the sh_election_id. These fields link the ADR asset/criminality data and the ECI vote result data. Some candidate IDs in the ADR data are currently missing but can be matched on names relatively easily.
  • When constituency names are non-unique within states, the SHRUG names (ac07_name and ac08_name) have been prefixed with either the 3-digit ECI code (3rd delimitation) or the first four letters of the district name (4th delimitation). For example, ac08_name is rajk-jetpur and vado-jetpur for the two different jetpur constituencies in Gujarat. The field tr_ac_name contains the name as described in the ECI data.
  • Data on candidate characteristics are from affidavits filed with the Election Commission of India (ECI). These were hand-entered from scanned PDFs by the Association for Democratic Reforms (ADR) and re-entered by Prakash et al. (2019) for winners and runners-up from 2004 to 2007. The re-entered data correspond very closely with the original data on the presence of any criminal charges (less than 5% error), but Prakash et al. (2019) code a much higher number of criminal accusations against most candidates.
  • Where both data sources are available, we use the Prakash et al. (2019) re-entry.
  • If you link additional candidates between ADR and ECI, please let us know so we can improve the breadth of these links. If you update these data with new ADR data, please let us know.
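
The ss-nn-aaa[-p] layout of sh_election_id described in the notes above can be unpacked with a few lines of Python. This is only a minimal sketch, not part of the SHRUG tooling: the names ElectionId and parse_sh_election_id are hypothetical, the example IDs are made up, and it assumes the fields are separated by single hyphens and contain no hyphens themselves. Fields are kept as strings so that any zero-padding is preserved. The notes do not say how the vote rank is joined to sh_election_id to form sh_cand_id, so that identifier is not parsed here.

```python
from typing import NamedTuple, Optional


class ElectionId(NamedTuple):
    """Parts of an sh_election_id (hypothetical helper type)."""
    state: str           # ss: state code
    assembly: str        # nn: assembly number
    constituency: str    # aaa: SHRUG constituency identifier
    poll: Optional[str]  # p: poll number, present only for bye-elections


def parse_sh_election_id(value: str) -> ElectionId:
    """Split an ID of the assumed form ss-nn-aaa[-p] into its parts."""
    parts = value.strip().split("-")
    # Expect three fields, plus an optional fourth for bye-elections.
    if len(parts) not in (3, 4) or not all(parts):
        raise ValueError(f"expected ss-nn-aaa[-p], got {value!r}")
    state, assembly, constituency = parts[:3]
    poll = parts[3] if len(parts) == 4 else None
    return ElectionId(state, assembly, constituency, poll)


# Made-up example values, for illustration only.
general = parse_sh_election_id("24-12-074")
bye = parse_sh_election_id("24-12-074-1")
print(general.state, general.assembly, general.constituency)
print("bye-election:", bye.poll is not None)
```

A check such as `poll is not None` gives a quick way to separate bye-elections from general elections before merging the vote results with the affidavit data.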

Release Details

Release Number 2.1
Release Name pakora
Last updated October 30, 2024
