This is a cleaned version of the replication data from Clark and Golder (2006).
The data are election level, so that each row of the data set represents one election.
# load packages
library(tidyverse)
# load data
cg <- read_rds("parties.rds") %>%
glimpse()
## Observations: 555
## Variables: 10
## $ country <chr> "Albania", "Albania", "Albania", "Argenti...
## $ year <dbl> 1992, 1996, 1997, 1946, 1951, 1954, 1958,...
## $ average_magnitude <dbl> 1.00, 1.00, 1.00, 10.53, 10.53, 4.56, 8.1...
## $ eneg <dbl> 1.106929, 1.106929, 1.106929, 1.342102, 1...
## $ enep <dbl> 2.190, 2.785, 2.870, 5.750, 1.970, 1.930,...
## $ upper_tier <dbl> 28.57, 17.86, 25.80, 0.00, 0.00, 0.00, 0....
## $ en_pres <dbl> 0.00, 0.00, 0.00, 2.09, 1.96, 1.96, 2.65,...
## $ proximity <dbl> 0.00, 0.00, 0.00, 1.00, 1.00, 0.20, 1.00,...
## $ social_heterogeneity <fct> Bottom 3rd of ENEG, Bottom 3rd of ENEG, B...
## $ electoral_system <fct> Single-Member District, Single-Member Dis...
country: Country Nameyear: Yearaverage_magnitude: Average District Magnitude# histogram
qplot(average_magnitude, data = cg)
## `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
enep: The Effective Number of Electoral Parties# histogram
qplot(enep, data = cg)
## `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
eneg: The Effective Number of Ethnic Groups# histogram
qplot(eneg, data = cg)
## `stat_bin()` using `bins = 30`. Pick better value with `binwidth`.
electoral_system: The Type of Electoral Systemaverage_magnitude."Single-Member Distict": when average_magnitude = 1."Small-Magnitude PR": when 1 < average_magnitude \(\leq\) 7."Large-Magnitude PR": when average_magnitude > 7.# bar plot
qplot(electoral_system, data = cg)
See Clark and Golder (2006) for the definitions of upper_tier, en_pres, and proximity.
social_heterogeneity: Terciles of ENEGeneg."Bottom 3rd of ENEG""Middle 3rd of ENEG""Top 3rd of ENEG"