Voter File

Building a Voter File: Improving the Data

Of course, once you have a file for a given state, the work is not over. Even keeping the file at its current state, let alone improving it, will be an ongoing process.

First of all, from the day it goes up, the data gets staler and staler. Ideally, every state in the union will be undergo the entire process frequently--especially for states with hot races in election years.

Aside from regular rejuvenation, however, a file can be improved upon through contact with reality. No matter how diligent the Secretary of State, certain problems on a file will slip through the cracks. People will move, die, or get convicted; phone numbers will change, or go bad; party registrations will be altered. All of these changes can be captured by a well-tuned field organization, and appended to the file, so that as election day gets closer the file can asymptotically approach perfection (this is a somewhat idealized picture; bear with me).

When volunteers go out and canvass neighborhoods or phone bank, they can verify if an address is attached to the right name or whether or not a phone number is good; they can also gather information that is simply unavailable from other sources, like a person's top issue priorities. All of this information is gathered, centralized, and scanned in, so that the state voter file is as up-to-the-minute as possible. In the past, this was done by hand (when it was done at all); now, the use of new technologies like computers, palm pilots and bar coding of responses has greatly increased the efficiency of doorknocking and other field techniques.

Overdetermined Roundup

Your weekly roundup of what's been going on at Overdetermined.net. Enjoy!

  • Dirty D has a follow-up to his post last week about Howard Dean, this time reacting to the appointment of Tim Kaine as DNC Chair

Building a Voter File Part 3: Using the Data (An Overview)

Once you've gone through this process, you should have a list with millions of entries, each containing personal and consumer information--ideally for every registered voter, and all non-registered adults.  So what can you do with it? Plenty.

Once it's compiled, the data has to be accessed.  Various people can be granted different levels of access--making the whole file available to any volunteer would raise serious privacy concerns, not to mention possibly giving access to rival campaigns or, god forbid, the other party.  For low-level volunteers, this access can be extremely limited, while higher-level operatives can be granted more generous permissions.  Broader access can be granted through a web interface like the DNC's Votebuilder, RNC's Voter Vault, or Catalist's Q-tool.  Using some relatively simple Boolean logic, you can create lists of all the people in a state, district or precinct who share certain characteristics--for example, you might want to find all registered black voters under the age of 40.  With a certain (ever-diminishing) amount of inaccuracy, this is a trivial list to pull.

As you can imagine, this is extremely useful.  You can use these tools to do everything from create walk lists for your volunteers to pull samples for polls or blanket a state with direct mail.  Which is why these files are considered so valuable, and why making them is big business--with big consequences.

Building a Voter File Part 2: Appending Overview

Cross-posted from Overdetermined.net. Find the latest entries in the series there!

Once the data is (yes, is, prescriptivists--I went there) in a standardized format, we move from the realm of "interesting" into "faintly creepy".  The information from Secretaries of State or state parties is generally pretty innocuous--name, address, maybe phone number or age.  The appended consumer data, on the other hand, is more unsettling.  There's nothing on there that would do real damage if anyone knew it--no credit card numbers, nothing that people could use to steal your identity--but it can be kind of strange to think who realizes that you own two dogs and a cat.

Most of this consumer data is gathered by for-profit companies, who then retail it to both the state parties and the for-profit companies that are creating these files (if you take a look at our resources page, InfoUSA is one such vendor).  They get their information anywhere they can--state licensing agencies (think it might be worthwhile for the McCain campaign to know who has a gun license?), magazine subscription lists, grocery store value card memberships...basically, if you have to fill out a form for it, somebody wants it, and will get it unless prevented by law. 

Moreover, based on this consumer information, it's possible to predict other characteristics (within limits, which I'll go into in a later post).  For example, the RNC might conduct a truly massive poll that measured all kinds of behavior--TV habits, income, type of location, and lots of other things besides.  Based on that poll, they might determine that there's a high correlation between a given cluster of characteristics and certain behaviors.  For instance: only a survey can tell you how much radio someone listens to.  But it's possible to know for everyone where they live, their age, and whether or not they own a boat.  If all males 54-65 who have boat licenses listen to Rush Limbaugh, it can be a good predictor. 

This use of consumer data is at least a partial definition of the oft-abused term "microtargeting" (this WaPo article, although overwrought, is a good introduction).  Rest assured I'll have more to say on the topic in the future; but this is the overview.  Stick around; tomorrow, I'll go into how this data gets used.

Overdetermined Roundup

A quick roundup of what's been going on at Overdetermined.net. Enjoy!

  • The latest installment of Building a Voter File discusses matching between lists without a persistent and unique ID. Hint: read the earlier entries first, it will make things much more comprehensible.

Building a Voter File: From the Raw Data Up

As I mentioned yesterday, this is the first entry in another series from Overdetermined.net. Later entries in this series can be found there. Enjoy!

Karl Rove got your mom's phone number the other day.  Not directly, but through a mutual acquaintance--probably a secretary of state.  Both parties--and plenty of other organizations besides--are engaged in creating massive stores of information on voters all across the country.  This is their story.

In order to run any sort of political campaign, it’s crucial to know something about your voters.  Who in a given neighborhood is registered to vote? Who isn’t, but should be? Who falls into what demographic group? Polling can tell you something about people—even a small group can contain useful information about an entire demographic, thanks to the magic of sampling—but to get bulk information about who lives where and what they’re like, you need a full-fledged voter file. 

In a very basic outline, the process goes something like this: data is collected, combined with other sources, and put into a useful format.  This data can be analyzed, improved, added to, and eventually used to run a successful campaign (Nate Silver has a useful primer).  Today, we’re going to focus on the very beginning of this process.

One symptom of America’s unupgraded beta of democracy is our insistence on federalism, even when it makes our lives difficult, and the way we collect voter data is no exception.  Every state’s voter file contains different information, arranged in different ways, with different formats for presenting the information.  Better than nothing, but not very useful if you want to run a national campaign, or even append information to everyone in the country (as opposed to a particular state).  Before any of the interesting stuff can begin, the information needs to be in one single format.

First of all, the file needs to contain a unique identifier for each person.  The states are required to assign these IDs—thanks to the Help America Vote Act, or HAVA—but their identifiers are only unique statewide, and sometimes not even that (for example, it might have to be combined with the county a voter is registered in).  Only an ID assigned by the organization creating the file can be guaranteed to be unique (and if the file doesn’t contain a unique identifier, it’s extremely difficult and unreliable to verify people’s identities without tracking them via unique ID, thanks to inconsistencies in reporting various types of data).  If your name is John on one source of information, Johnny on another, and Jhon on yet a third, it’s a challenge to ensure that all this really refers to the same person. 

In addition to a simple list of who lives in the state, the voter file will usually contain other useful information--address, previous vote history, registration information, date of birth.  Of course, not all of these are available from every state.  Not every state keeps track of party registration, for instance.  Some states have detailed vote history, others record only a few elections.  But at the end of this process, a raw info-dump has been turned into something that can be combined with information from any other state.  Tomorrow: appending!

Build a Voter File with FaceBook

Body: 

Why Build a Voter File?

[[http://en.wikipedia.org/wiki/Voter_file|Voter Files]] are used to identify potential supporters in your local area, sift out voters more likely to support your opponent, and maximize the effectiveness of Get Out The Vote (GOTV) and awareness raising campaigns.

Building and maintaining and an accurate voter file will increase the effectiveness off your campaign, save you time and energy, and help you identify and build support for future campaigns.

Problems with Voter Files and Campus Activism

College kids are a transitory bunch. We change addresses just about every year, we may be registered in our home state rather than the state in which we attend school, and frequently we don’t have landlines. It makes it hard for local campaigns, state party activists, and even our own campus activists, to efficiently rally us around a cause or turn us out during an election. It also means that the state party’s voter file for young voters is basically useless (that is, if you can even get your hands on it). Clearly, if we’re going to GOTV our peers on campus, we need a new method.

The Solution: FaceBook, Your Campus Registry, Some Elbowgrease

With a FaceBook account, access to your student registry, and some time, you can build your own voter file that will blow away anything the state party can give you. Here’s how you do it:

  1. Sign up for FaceBook and join your campus network.
  2. Perform an advanced search within your campus network to identify your fellow students and categorize them by their political persuasions (Very liberal, liberal, moderate, conservative, very conservative, apathetic).
  3. Copy their name and political viewpoints into Excel.
  4. Get a copy of your campus registry - online, if possible. This will contain the names, addresses, and phone numbers of all the students at your university.
  5. If you have an electronic version of the registry, dump this into your Excel document, making sure that the first and last names from the registry occupy the same columns as your data from FaceBook (ie all last names from both data sources in column A, all first names in column B) . If you don’t have an electronic version, you’ll have to enter the registry data by hand - a daunting task that will take a lot of man-hours. If this is the case, start by looking up your most hardcore supporters first (those who self identify as “Very Liberal,”) and work your way down the list to less politically intense students.
  6. Sort your data by last name (alphabetical).
  7. The name/address data from the registry should now match up with the name/political persuasion data from FaceBook. Merge the data as best you can. You can remove Republicans (cut and paste them into a new Excel doc. You never know when this data might come in handy).
  8. Prioritize the names. People who are very liberal are your “1’s” in political speak - your most hardcore supporters. Next are those who identify as liberals, followed by moderates, then those who are apathetic.
  9. You will most likely be left with a lot of names that have no political correlation. What can I say - not everyone is on the FaceBook. These are you “undetermineds.” Hopefully you will have removed a lot of Republicans from this list (this is why we searched for data on people of all political persuasions. Undetermined students are of a lower priority in your activities than those who self identify as Very Liberal or Liberal (or even Moderate), but they are another pool of potential supporters.

And there you go - an accurate, and easily updated campus voter file. Now it’s time to start reaching out to those potential supporters.

Syndicate content