The question is for 1 $\leq$ p,q $\leq$ $\infty$
I can prove that HL Maximal function is weak (1,1) using the method of finite collection of bounded balls. I can also prove that it is strong (p,p) as it is a standard proof. However, what can be said about other cases?
Terence Tao's lecture notes make this comment without proof:
Dimensional analysis (analysing how $f$ and $Mf$ react under dilation of the domain $\mathbb R^d$ by a scaling parameter λ) shows that no weak or strong type (p, q) estimates are available in the off-diagonal case p $\neq$ q.
Could this be elaborated upon, perhaps in a mathematically rigorous way? Exactly how to do this dimensional analysis and how does it prove there are no estimates for off-diagonal case?
The definition is $$Mf(x)=\sup_{r>0} \frac{1}{|B(x,r)|}\int_{B(x,r)}|f(t)|dt$$ where $B(x,r)=\{y \in \Bbb R^n: ||x-y||<r\}$
Then dilate your $f$ by some $\delta>0$, $f^{\delta}(x):=f(\delta x)$
Now take $L^p$ norms, $||f^{\delta}||_p=(\int_{\Bbb R^n}|f^{\delta}(t)|^pdt)^{1/p}=(\int_{\Bbb R^n}|f(\delta t)|^pdt)^{1/p}$
Now a change of variable $\delta t=y$ gives, $||f^{\delta}||_p=(\int_{\Bbb R^n}|f(\delta t)|^pdt)^{1/p}=({\delta}^{-n}\int_{\Bbb R^n}|f(y)|^pdy)^{1/p}={\delta}^{-n/p}||f||_p \dots (*)$
Now $$Mf^{\delta}(x)=\sup_{r>0} \frac{1}{|B(x,r)|}\int_{B(x,r)}|f^{\delta}(t)|dt=\sup_{r>0} \frac{1}{|B(x,r)|}\int_{B(x,r)}|f(\delta t)|dt$$ $$=\sup_{r>0} \frac{{\delta}^{-n}}{|B(x,r)|}\int_{B(\delta x,\delta r)}|f(y)|dy \text{ ( same substitution as above)}$$ $$=\sup_{r>0} \frac{1}{|B(\delta x,\delta r)|}\int_{B(\delta x,\delta r)}|f(y)|dy$$ $$=\sup_{\lambda>0} \frac{1}{|B(\delta x,\lambda)|}\int_{B(\delta x,\lambda)}|f(y)|dy$$ $$=Mf(\delta x)=(Mf)^{\delta}(x)$$
Then $||Mf^{\delta}||_q=(\int_{\Bbb R^n} |Mf^{\delta}(x)|^q dx)^{1/q}= (\int_{\Bbb R^n} |Mf(\delta x)|^q dx)^{1/q}= {\delta}^{-n/q}||Mf||_q \le C \delta^{-n/p}||f||_p$
But then, ${\delta}^{-n(\frac{1}{q}-\frac{1}{p})}=1 \implies p=q$
Now for the weak off-diagonal case, it's a simple application of the Off diagonal version of Marcinkewicz interpolation combining with the fact that the Maximal function is not strong in the off-diagonal setup, which we just proved!