Punchscan is an invented by David Chaum. Punchscan is designed to offer integrity, privacy, and transparency. The system is voter-verifiable, provides an mechanism, and issues a to each voter. The system won grand prize at the 2007 .
The which Punchscan incorporates is open source; the was released on 2 November 2006 under a revised . However, Punchscan is software independent; it draws its security from functions instead of relying on like . For this reason, Punchscan can be run on , like , and still maintain unconditional integrity.
The Punchscan team, with additional contributors, has since developed Scantegrity.
A Punchscan has two layers of paper. On the top layer, the are listed with a or beside their name. Below the candidate list, there are a series of round holes in the top layer of the ballot. Inside the holes on the bottom layer, the corresponding symbols are printed.
To cast a vote for a candidate, the voter must locate the hole with the symbol corresponding to the symbol beside the candidate’s name. This hole is marked with a -style ink dauber, which is purposely larger than the hole. The voter then separates the ballot, chooses either the top or the bottom layer to keep as a , and the other layer. The receipt is at the polling station for .
The order of the symbols beside the candidate names is generated for each ballot, and thus differs from ballot to ballot. Likewise for the order of the symbols in the holes. For this reason, the receipt does not contain enough information to determine which candidate the vote was cast for. If the top layer is kept, the order of the symbols through the holes is unknown. If the bottom layer is kept, the order of the symbols beside the candidates name is unknown. Therefore, the voter cannot prove to someone else how they voted, which prevents or voter intimidation.
As an example, consider a two candidate election between and , as illustrated in the preceding diagram. The order of the letters beside the candidates’ names could be A and then B, or B and then A. We will call this ordering , and let =0 for the former ordering and =1 for the latter. Therefore,
: order of symbols beside candidate list,
Likewise we can generalize for other parts of a ballot:
: order of symbols through the holes,
: which hole is marked,
: result of the ballot,
Note that the order of the candidates’ names are fixed across all ballots. The result of a ballot can be calculated directly as,
- (Equation 1)
However, when one layer of the ballot is shredded, either or is destroyed. Therefore, there is insufficient information to calculate from the receipt (which is scanned). In order to calculate the election results, an electronic is used.
Before the election, the database is created with a series of columns as such. Each row in the database represents a ballot, and the order that the ballots are stored in the database is (using a that each candidate can ). The first column, , has the shuffled order of the serial numbers. contains a pseudorandom generated from the key, and it will act as a . will store an intermediate result. contains a bit such that:
The result of each ballot will be stored in a separate column, , where the order of the ballots will be reshuffled again. Thus contains the row number in the column where the result will be placed.
After the election is run and the values have been scanned in, is calculated as:
And the result is calculated as,
This is equivalent to equation 1,
The result column is published and given the ballots have been shuffled (twice), the order of the results column does not indicate which result is from which ballot number. Thus the election authority cannot trace votes to serial numbers.
For an election with candidates, the above procedure is followed using -n equations.
Basic auditing procedures
The voter’s ballot receipt does not indicate which candidate the voter cast their ballot for, and therefore it is not secret information. After an election, the election authority will post an image of each receipt online. The voter can look up her ballot by typing in the serial number and she can check that information held by the election authority matches her ballot. This way, the voter can be confident that her ballot was cast as intended.
Any voter or interested party can also inspect part of the database to ensure the results were calculated correctly. They cannot inspect the whole database, otherwise they could link votes to ballot serial numbers. However, half of the database can be safely inspected without breaking privacy. A random choice is made between opening or (this choice can be derived from the secret key or from a source, such as or the ). This procedure allows the voter to be confident that the set of all ballots were counted as cast.
If all ballots are counted as cast and cast as intended, then all ballots are counted as intended. Therefore, the integrity of the election can be proven to a very high probability.
To further increase the integrity of a Punchscan election, several further steps can be taken to protect against a completely corrupt election authority.
Since , , and in the database are all generated pseudorandomly, multiple databases can be created with different random values for these columns. Each database is independent of the others, allowing the first half of some of the databases to be opened and inspected and the second half of others. Each database must produce the same final tally. Thus if an election authority were to tamper with the database to skew the final tally, they would have to tamper with each of the databases. The probability of the tampering being uncovered in the audit increases exponentially with the number of independent databases. With even a modest number of databases, the integrity of the election is probabilistically certain.
Prior to an election, the election authority prints the ballots and creates the database(s). Part of this creation process involves to the unique information contained on each ballot and in the databases. This is accomplished by applying a cryptographic to the information. Though the result of this function, the commitment, is made public, the actual information being committed to remains sealed. Because the function is one-way, it is computationally infeasible to determine the information on the sealed ballot given only its publicly posted commitment.
Prior to an election, twice as many ballots are produced as the number intended to use in the election. Half of these ballots are selected randomly (or each candidate could choose a fraction of the ballots) and opened. The rows in the database corresponding to these selected ballots can be checked to ensure the calculations are correct and not tampered with. Since the election authority does not know a priori which ballots will be selected, passing this audit means the database is well formed with a very high probability. Furthermore, the ballots can be checked against their commitments to ensure with high probability that the ballot commitments are correct.