I need expert help on the math behind the following voting mechanism, any comment towards solutions are greatly appreciated!
--
A country is holding a poll to determine the top 100 restaurants out of its 100 thousand domestic restaurants.
Assuming each restaurant is unique, no chains or franchises, all of them share equal and fair chance of exposure to all customers nationwide.
In the end there were 1 million valid voters, with each voter named 5 restaurants. Say each voter entered 5 non-repeat and valid entries.
Is there a way to estimate: 1. the minimum votes a restaurant needs to be in top 100; 2. how many votes does #1 get?
--
I'm not a math expert so my educated guess is each restaurant has a probability of 0.1% chance to be voted but that's as far as I could go, I have absolutely 0 idea how to proceed next…
Look forward to your help, thanks!
--
[UPDATE 07/17/2019]
Thank you guys for all the insightful inputs! I’m confident that we’re on the right track to finding out the best answer and we’re close.
If we are to put our solutions to test in a real world scenario however, I think it’s highly unlikely that a mere 2 digit votes will render a lucky restaurant Top 100. I assume we can all agree on this?
So now the question is: how can we apply common sense to the equation? How do we identify the distribution model in real world?
The Gaussian distribution seems to be an adequate one to describe the world we live in, where heavenly restaurants and unholy ones are extremely rare, and passable/half-decent ones constitute the majority.
Let's add more conditions to define the scenario:
- Each restaurant has a Twitter account
- The voting result is positively correlated with Twitter follower counts
- The #1 restaurant has 10,000,000 followers whereas #100 has 100K
Does this make more sense?
It is incredibly unlikely but possible to have a massive tie: each restaurant gets $10$ votes. So, you could be the joint winner with just $10$ votes. If you want to be the unique winner then you need just one voter to switch. You will then have a clear winner with $11$ votes, lots will have $10$, and one poor unique loser will have just $9$.
You cannot be a winner, joint or not, with less than $10$ as if you have less than $10$ then at least one other will have more than $10$.
The other extreme, and again incredibly unlikely, is that one restaurant gets all of the votes and the others none.
Very, very unlikely but possible. If you want to exclude those cases then it gets more tricky. First, you would have to define what you accepted.
Update: I missed that each voter had $5$ votes. This makes a difference to the numbers but not my general point. Thanks to Henning for spotting this. The text above applies to the $1$ vote each variant.
The massive tie could still occur but each restaurant will have $50$ votes.
Similarly, some of the other numbers need to be adjusted.