The golden ratio in statistics of literature

865 Views Asked by At

Let a book, for example, or a poem... It consists in words and letters and symbols like : ;,!... Let $W_b$=the number of words of the book. Let $L_b$=the number of letters of the book. The number $N_b$ is called : $\frac{L_b}{W_b}=N_b$. Let now a dictionary of scrabble. Let $W_d$=the number of words of the dictionary. Let $L_d$=the number of the letters of the dictionary. The number $N_d$ is called : $\frac{L_d}{W_d}=N_d$. I calculated $\frac{N_d}{N_b}=j$, it is always near to $j=1.618$. Is this result known in the literature ? Are there analytical proofs of this result ? The statistics are they the only way to prove this result ? A talk is also a sequence of words and we can calculate the $j$ of a talk, if it is far from $1.618$, there is a problem, perhaps a sickness. Can we consider this method as a good test of normality and good health ? Can we prove it ?

1

There are 1 best solutions below

0
On

If those observations are correct, they are the same as saying that average word length ($N_t$) does not vary so much between different texts of type $t$.

The 4-digit accuracy 1.618... is meaningless, but it is possible that $t=$ dictionaries is a category with higher average word length than $t=$ novels, which have higher average word length than $t=$ children's books, and lower wordlength than $t=$ organic chemistry handbooks.