I learned WSR (Wilcoxon signed rank test) several years ago, and today one of my friends suggest different test statistics on WSR. At first, I thought he was wrong; however, I did found some sources which were citing the positive rank sum as W test statistic. Such as these links suggests: http://courses.wcupa.edu/rbove/Berenson/CD-ROM%20Topics/topice-10_5.pdf http://www.stat.umn.edu/geyer/5601/notes/wilcox.pdf
According to my undergrads stat, we choose the min(W+,W-) as the test statistic. for example: http://sphweb.bumc.bu.edu/otlt/MPH-Modules/BS/BS704_Nonparametric/BS704_Nonparametric6.html
On wiki, it suggest that W = abs(W+ minus W-). https://en.wikipedia.org/wiki/Wilcoxon_signed-rank_test#Test_procedure
Why does WSR test has so many "alternative" test statistics? Is there many versions of WSR? When to use which test statistics? I am confused now. Please share any insights with me, many thanks.
#I would like to edit this question. I just tried to write functions to calculate z scores in r. using two equations below, I was able to get same absolute z-scores using Negative rank sum, Positive rank sum, or absolute rank sum:
these data were replicates from an online source https://www.youtube.com/watch?v=TqCg2tb4wJ0
n =non-zero ranking sample size
w+ = 75
w- = 16
|w| = 75-16=59
n = 13
for neg or pos
z <- function(w,n){ (w-(n*(n+1)/4))/(sqrt((n*(n+1)*(2*n+1)/24)))
}
for absolute
z1 <- function(w,n){ (w)/sqrt((n*(n+1)*(2*n+1))/6) }
z(75,13) [1] 2.061627
z(16,13) [1] -2.061627
z1(59,13) [1] 2.061627
But I'm still confused with it... Should it considered to be irresponsible for articles and papers to use positive rank sum as w test statistics? Usually people use critical values sheet of w to determine the result right? http://users.stat.ufl.edu/~winner/tables/wilcox_signrank.pdf
On the case above(the youtube link), if we use positive rank sum, the result would be completely opposite.
Please let me know if I was thinking in the right direction.
Example of 2-sample tests in R and in Minitab (software at hand at the moment):
R (simulate fake data):
Output
Minitab (input same data used in R):
Output
Notice that the "W-statistics" differ, but the P-values are 0.0084 in both software packages. The observation
113.73happens to occur in both samples. This causes minor difficulties with the distribution theory, handled slightly differently by the two programs.