If there is an online document with 100,000 completely unique words and every time you download it, 5% of the words are randomly deleted, how many times do you have to download it before you get all 100,000 words?
2026-02-24 13:49:03.1771940943
How many times do you have to download a 100,000 word document if a random 5% of the document is deleted every time you download it?
Asked by Bumbble Comm (https://math.techqa.club/user/bumbble-comm/detail), 56 views
Take a look at the first word. There is a 5% chance it is deleted the first time. The probability it is deleted on both the first and second download is $0.05^2$. The probability it is deleted on every one of $N$ downloads of the document is $0.05^N$. That is never equal to 0, no matter how big $N$ is. Thus, you can never be certain that even the first word of the document has been downloaded, let alone all the words in the document.
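Although $0.05^N$ never reaches 0, it shrinks quickly. As a rough illustration, the sketch below finds the smallest $N$ for which the probability of still missing one fixed word drops below a chosen tolerance. The function name `downloads_for_word` and the tolerance values are made up for this example, and it assumes each download deletes its 5% independently of the previous downloads.

```python
import math

def downloads_for_word(eps: float, p_deleted: float = 0.05) -> int:
    """Smallest N with p_deleted**N <= eps for one fixed word.

    Assumes the deletions in different downloads are independent.
    """
    return math.ceil(math.log(eps) / math.log(p_deleted))

for eps in (1e-3, 1e-6, 1e-9):
    n_downloads = downloads_for_word(eps)
    print(f"eps={eps:g}: N={n_downloads}, "
          f"miss probability={0.05 ** n_downloads:.3g}")
```

This only concerns a single word; the probability of having all 100,000 words at once is smaller.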
There is a formula in the answer to this question.
I implemented it two different ways in Mathematica. The first way is taken literally from the formula as written. The second way is an attempt to make it more numerically stable. I assume $n$ is large and $m$ is a fraction (between 0 and 1) of $n$, so that $m=f \times n$. Then I take the log of the terms with the binomial coefficients, because those numbers are huge in this case: do the calculations on the log scale and then exponentiate back to get the correct term. I checked that both give almost the same answer in the case of that question, where $m$ and $n$ are relatively small ($n=100$ and $m=10$, which means $f=0.1$). The first function won't even run with numbers as large as in this problem, i.e. $n=100000$.
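The Mathematica code is not reproduced here, and neither is the linked formula. The Python sketch below is therefore an assumption about what is being computed: the standard inclusion-exclusion expression for the probability that $k$ independent, uniformly random $m$-word subsets of an $n$-word document together cover every word, $\sum_j (-1)^j \binom{n}{j}\left[\binom{n-j}{m}/\binom{n}{m}\right]^k$. Instead of taking logs, it builds each term from the previous one by a simple ratio, which is a different route to the same goal of never forming a huge binomial coefficient. The name `prob_complete` and its argument order are hypothetical.

```python
import math

def prob_complete(k: int, n: int, m: int) -> float:
    """P(k independent uniform m-subsets of n words cover all n words).

    Assumed formula (inclusion-exclusion over the set of missed words):
        sum_j (-1)^j C(n,j) * (C(n-j,m) / C(n,m))^k,  j = 0 .. n-m
    Each term is derived from the previous one, so no large binomial
    coefficient is ever evaluated.
    """
    terms = [1.0]  # the j = 0 term
    term = 1.0
    for j in range(1, n - m + 1):
        # C(n,j) / C(n,j-1)           = (n-j+1) / j
        # C(n-j,m) / C(n-j+1,m)       = (n-m-j+1) / (n-j+1)
        term *= (n - j + 1) / j * ((n - m - j + 1) / (n - j + 1)) ** k
        terms.append(-term if j % 2 else term)
    # fsum reduces rounding error when adding the alternating terms.
    return math.fsum(terms)

for k in (3, 4, 5, 8):
    print(k, prob_complete(k, 100_000, 95_000))
```

Here `m` is passed as a word count (95,000) rather than as the fraction 0.95. The alternating sum can still lose precision when individual terms become very large (for example, very small $k$), so this is a sketch for the regime discussed in this answer rather than a general-purpose routine.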
Next, I found the following values for your question:
P2[8,100000,0.95] is 0.999996
P2[5,100000,0.95] is 0.969233
P2[4,100000,0.95] is 0.535181
P2[3,100000,0.95] is 0.00000356
That is, you will almost never have the whole document after 3 download attempts; there is a 53.5% chance of success after 4 download attempts and a 96.9% chance after 5 download attempts.
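These probabilities can be sanity-checked with a small Monte Carlo simulation. The sketch below assumes each download deletes exactly 5,000 of the 100,000 words, chosen uniformly at random and independently of earlier downloads; the function name, the seed, and the number of trials are arbitrary choices for illustration.

```python
import random

def downloads_until_complete(n=100_000, deleted=5_000, rng=random):
    """Simulate downloads until every word has been seen at least once."""
    # Words still missing after the first download.
    missing = set(rng.sample(range(n), deleted))
    downloads = 1
    while missing:
        # Each new download deletes a fresh random subset of words.
        gone = set(rng.sample(range(n), deleted))
        missing &= gone  # a word stays missing only if deleted again
        downloads += 1
    return downloads

rng = random.Random(0)  # fixed seed so the run is repeatable
trials = 200
counts = [downloads_until_complete(rng=rng) for _ in range(trials)]
for k in (3, 4, 5):
    share = sum(c <= k for c in counts) / trials
    print(f"complete within {k} downloads: {share:.3f}")
```

With only 200 trials the estimates are noisy (a few percentage points), so they should land near the computed values rather than reproduce them exactly.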
Intuition: on average, each download captures 95% of the missing words. So after two downloads you have only about 250 missing words ($100000 \times 0.05^2$), and after 3 downloads about 12 missing words. The probability that you capture all 12 of those words in the fourth download is $0.95^{12} \approx 0.54$. If you don't capture all of them, you will most likely have only 1 or 2 still missing, and you are practically guaranteed to catch them in the next one or two downloads.
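The arithmetic behind this intuition can be tabulated directly. The sketch below prints the expected number of missing words after $k$ downloads, $n \times 0.05^k$, together with the approximation $(1-0.05^k)^n$ for the probability that nothing is missing. That approximation treats the words as independent, which is an assumption: when exactly 5% is deleted per download the words are slightly dependent, so the values come out close to, but not identical to, the P2 values above. The helper names are made up for this example.

```python
import math

N_WORDS = 100_000
P_DELETED = 0.05

def expected_missing(k: int, n: int = N_WORDS, p: float = P_DELETED) -> float:
    """Expected number of words never seen in k downloads."""
    return n * p ** k

def approx_complete(k: int, n: int = N_WORDS, p: float = P_DELETED) -> float:
    """(1 - p**k)**n, treating words as independent (an approximation)."""
    # log1p keeps precision when p**k is tiny.
    return math.exp(n * math.log1p(-p ** k))

for k in range(1, 9):
    print(f"k={k}: expected missing {expected_missing(k):10.4g}, "
          f"P(all words) ~ {approx_complete(k):.6f}")
```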