This is a replica to the awesome data.fivethirtyeight.com website. Read more here

FiveThirtyEightFiveThirtyEight replica

/pollster-ratings

pollster-ratings

FiveThirtyEight's Pollster Ratings

This directory contains the data behind FiveThirtyEight's pollster ratings.

See also:

Past data:

pollster-stats-full.xlsx contains a spreadsheet with all of the summary data and calculations involved in determining the pollster ratings as well as descriptions for each column.

pollster-ratings.csv has ratings and calculations for each pollster. A copy of this data and descriptions for each column can also be found in pollster-stats-full.xlsx.

raw-polls.csv contains all of the polls analyzed to give each pollster a grade. Descriptions for each column are in the table below.

HeaderDefinition
pollnoFiveThirtyEight poll ID number
raceElection polled
yearYear of election (not year of poll)
locationLocation (state or Congressional district, or "US" for national polls)
type_simpleType of election (5 categories)
type_detailDetailed type of election (this distinguishes between Republican and Democratic primaries, for example, whereas type_simple does not)
pollsterPollster name
methodologyMethodology used to conduct this poll. One or more of the following values: <ul><li>Live Phone — Live telephone interviews, may or may not include calls to cell phones</li><li>IVR — Interactive voice response, otherwise known as automated polls or "robopolls"</li><li>Mail— By U.S. mail or other “snail mail” service</li><li>Online— Poll conducted by Internet; generally this mean by web browser, or application-based polling of mobile phones</li><li>Text — Poll conducted by text message</li><li>Face-to-face — Poll conducted in person</ul> Mixed method polls list the methods involved in the poll separated by slashes.
partisanFlag for internal/partisan poll. "D" indicates Democratic poll, "R" indicates Republican poll, "I" indicates poll put out by independent candidate's campaign. Note that different sources define these categories differently and our categorization will often reflect the original source's definition. In other words, these definitions may be inconsistent and should be used carefully.
polldateMedian field date of the poll
samplesizeSample size of the poll. Where missing, this is estimated from the poll's margin of error, or similar polls conducted by the same polling firm. A sample size of 600 is used if no better estimate is available.
cand1_nameName of Candidate #1. Candidates #1 and #2 are defined as the top two finishers in the election (regardless of whether or not they were the top two candidates in the poll). In races where a Democrat and a Republican were the top two finishers, Candidate #1 is the Democrat.
cand1_partyName of Candidate #1. Candidates #1 and #2 are defined as the top two finishers in the election (regardless of whether or not they were the top two candidates in the poll). In races where a Democrat and a Republican were the top two finishers, Candidate #1 is the Democrat.
cand1_pctCandidate #1's share of the vote in the poll.
cand2_nameName of Candidate #2. Candidates #1 and #2 are defined as the top two finishers in the election (regardless of whether or not they were the top two candidates in the poll). In races where a Democrat and a Republican were the top two finishers, Candidate #2 is the Republican.
cand2_partyParty of Candidate #2.
cand2_pctCandidate #2's share of the vote in the poll.
cand3_pctShare of the vote for the top candidate listed in the poll, other than Candidate #1 and Candidate #2.
margin_pollProjected margin of victory (defeat) for Candidate #1. This is calculated as cand1_pct - cand2_pct. In races between a Democrat and a Republican, positive values indicate a Democratic lead; negative values a Repubican lead.
electiondateDate of election
cand1_actualActual share of vote for Candidate #1
cand2_actualActual share of vote for Candidate #2
margin_actualActual margin in the election. This is calculated as cand1_actual - cand2_actual. In races between a Democrat and a Republican, positive values indicate a Democratic win; negative values a Republican win.
errorAbsolute value of the difference between the actual and polled result. This is calculated as abs(margin_poll - margin_actual)
biasStatistical bias of the poll. This is calculated only for races in which the top two finishers were a Democrat and a Republican. It is calculated as margin_poll - margin_actual. Positive values indicate a Democratic bias (the Democrat did better in the poll than the election). Negative values indicate a Republican bias.
rightcallFlag to indicate whether the pollster called the outcome correctly, i.e. whether the candidate they had listed in 1st place won the election. A 1 indicates a correct call and a 0 an incorrect call; 0.5 indicates that the pollster had two or more candidates tied for the lead and one of the tied candidates won.
commentAdditional information, such as alternate names for the poll.

Files

NameDownload