
An Explanation of Estimating Vote Counts on
reddit
Adam J. O’Neil
July 26, 2018
The first rule of reddit: never acknowledge that you are a redditor to anyone in
real life. Ever.
Here’s a fun mathematical experiment for the day: on the social network reddit,
users can decide which stories gain traction by upvoting or downvoting posts. An
upvote counts as +1 vote, a downvote -1 vote, and no action at all indicates a net
change of 0. As recently as a few years ago, reddit would show the total upvotes
and downvotes, as well as a percentage indicating the ratio of these numbers to
the total of votes. They removed that a while ago, but they kept two important
numbers that we could use to calculate a close estimate of the upvote/downvote
totals, as well as the total number of votes overall.
These two numbers are the net votes and percentage of users that voted upvote,
which are displayed for every post. The net votes here would be the upvotes +
downvotes. The percentage of upvotes (percent liked) would therefore be the ratio
of upvotes divided by (upvotes + downvotes). We can use these two equations to set
up a system of equations to solve for each individual variable. In addition, I will
add a third equation that ties these two variables to the total vote count, which is
upvotes + downvotes = total votes.
Equations
First, let V
T
= total votes, V
N
= net votes, P = percent liked (as a probability, not
percent), U = number of users that upvoted, D = number of users that downvoted.
V
N
will be an integer, P will be a probability greater than 0.5 and less than or equal
to 1. We only know P and V
N
at the moment. Here is the foundation:
U − D = V
N
(1)
U
U + D
= P (2)
U + D = V
T
(3)
So, practically, we can solve for either U or D easily through substitution – I am
not going to bother with adding or subtracting equations. Let’s start by rewriting
1