According to my findings, we can use Anova analysis to compare a set of mean values. ANOVA depends on 3 main assumptions; Normality, Homogeneity of variance, Independent observations.
According to central limit theorem, when the sample size is large, mean(x) has a normal distribution, even though the distribution of x is not normal.
My question is, can we use ANOVA analysis to compare means, even if the original distributions of each data set is not normal and size of each data set is greater than 1000?
Shifted-exponential data: Here is a demonstration for particular datasets, showing that ANOVA can have relatively poor power distinguishing among samples of size 1000 from slightly-shifted exponential distributions, all with population SD $\sigma = 1.$
I'm not saying an ANOVA never works on exponential data; I am saying that there is good reason for the normality assumption. (The distribution of the F-statistic under $H_0$ is not as expected unless data are normal.)
Sample means are slightly different:
ANOVA not significant:
Kruskal-Wallis detects different shifts:
The boxplots at the left below shows the three shifted-exponential samples, each of size 1000.
Shifted-normal data: By contrast both tests detect similar shifts in normal populations.
Boxplots at right below show the three normal samples.