I currently am tasked with building a multiple regression model with two predictor variables to consider. That means there are potentially three terms in the model, Predictor A (PA), Predictor B (PB) and PA*PB.
In one instance, I made a LS model containing all three terms, and did simple t-tests. I divided the parameter estimates by their standard errors to calculate t-statistics, and determined that only the intercept and PA*PB coefficients were significantly different from zero.
In another instance, I did stepwise regression by first creating a model with only PA, and then fit a model to PA and PB, and did an F-test based on the Sum of Squares between the two models. The F-test concluded that PB was a significant predictor to include in the model, and when I repeated the procedure, the PA*PB coefficient was found to reduce SSE significantly as well.
So in summary, the t-test approach tells me that only the cross-product term PA*PB has a significant regression coefficient when all terms are included in the model, but the stepwise approach tells me to include all terms in the model.
Based on these conflicting results, what course of action would you recommend?
Removing variables just because they don't have marginal significance is bad. If you want to use a significance-based approach, the stepwise method is much better. The glaring problem with just whacking variables a bunch of variables at once cause they're individually insignificant is that they may well be jointly significant. The stepwise approach at least doesn't have this problem.
There's usually no good reason to use a significance based approach. If your goal is prediction, the best thing to do is to test each model for out of sample performance (according to some assorted metrics) and see which one does the best. There are also information-criteria (cp, aic, etc) that are supposed to evaluate out-of-sample performance based on in-sample performance and a model complexity penalty, but again, why use these if you have enough data to test out of sample performance directly? (As with most one-size-fits-all advice, this is a bit strong. These things and even stepwise regression have their place and can be good solutions sometimes. I'm just saying what I think is usually best in a generic situation, if there is such a thing.)