Consider a probability space $(\Omega,\mathcal{F},\mathbb{P})$ along with a random variable $X:\Omega\rightarrow\mathbb{R}$ such that $(\mathbb{R},\mathcal{B},\mu)$ is an induced probability space where $\mathcal{B}$ is to denote the Borel $\sigma$-algebra and $\mu$ is a pushforward measure such that $\mu=\mathbb{P}\circ X^{-1}$. Hence, $\mathbb{P}(X\in B)=\mu(B)$ for all $B\in\mathcal{B}$.
It then follows that $\mu$ - as a Lebesgue-Stieltjes measure $\mu(a,b]=F(b)-F(a)$ - is linked to a probability distribution function $F$ such that $F$ is monotonically increasing, right-continuous, and bounded to take values in $\,[0,1]$ with $F(-\infty)=0$ and $F(\infty)=1$.
I do not really feel comfortable with this result, but I was told that
$$\text{d}\mu=\mu(x,x+\text{d}x]=F(x+\text{d}x)-F(x)=\text{d}F.$$
Therefore, one can define the Lebesgue-Stieltjes integral of an arbitrary function $g$ as
$$\int_Bg\,\text{d}F=\int_{B}g\,\text{d}\mu\qquad\forall\quad B\in\mathcal{B}$$
where the righthandside is just the conventional Lebesgue integral w.r.t $\mu$.
Assuming $\mu\ll\lambda$ where $\lambda$ is the 1-dim Lebesgue measure, $F$ is differentiable such that $\text{d}F=F^\prime\,\text{d}x$. It thus follows that $$\int_Bg\,F^\prime\,\text{d}x=\int_{B}g\,\text{d}\mu\qquad\forall\quad B\in\mathcal{B}.$$ Moreover, since $\mu\ll\lambda$ the Radon-Nikodym theorem dictates that there exists a density function $f$ such that $$\int_{B}g\,\text{d}\mu=\int_{B}g\,f\,\text{d}\lambda$$
And finally, since $\lambda$ is a Lebesgue-Stieltjes measure with $F$ such that $x\mapsto x$ such that $\text{d}\lambda=\text{d}x$
$$\int_{B}g\,F^\prime\,\text{d}x=\int_{B}g\,f\,\text{d}\lambda=\int_Bg\,f\,\text{d}x$$ Since this statement holds true for an arbitrary function $g$ as well as for every set $B\in\mathcal{B}$ it would seem to follow that $F^\prime=f$ such that the pdf of a continuous random variable $X$ is simply the derivative of the corresponding cdf.
I would like to be able to transfer this line of reasoning to a bivariate case such that where a vector-valued measurable function $\boldsymbol{X}:\Omega\rightarrow\mathbb{R}^2$ gives a random variable that allows for the construction of an induced probability space $(\mathbb{R}^2,\mathcal{B}_2,\mu)$. At this point $\mathcal{B}_2=\mathcal{B}\otimes\mathcal{B}$ is the product $\sigma$-algebra of $\mathcal{B}$ and $\mu$ is a pushforward measure $\mu=\mathbb{P}\circ \boldsymbol{X}^{-1}$.
Starting from
$\mu(a,b]\times(c,d]=F(b,d)-F(a,d)-F(b,c)+F(a,c)$
(which is just my starting point since $\mathbb{P}(a<X\leq b,c<Y\leq d)=F(b,d)-F(a,d)-F(b,c)+F(a,c)$
and not because I would have read anywhere that the 2-dim Lebegue-Stieltjes measure is defined this way)
I would like to see why $$\text{d}\mu=\text{d}F$$
such that I can arrive at a statement to the effect that
$$\int_{B}g\,\dfrac{\partial^2 F}{\partial x\partial y}\,\text{d}x\,\text{dy}=\int_Bg\,f\,\text{d}x\,\text{d}y.$$
Thank you so much for taking the time to read my question. I would appreciate any pointers you might have for me - especially on why $\text{d}\mu=\text{d}F$ should be true in the bivariate case.
Best regards,
Jon
It seems to me that you are going in circles.
Let $\mu$ denote some probability measure on $(\mathbb R^2,\mathcal B_2)$.
Then $\mu$ induces a function $F:\mathbb R^2\to\mathbb R$ that is prescribed by: $$(b,d)\mapsto P((-\infty,b]\times(-\infty,d])$$
This function on its turn induces a (unique) measure $\mu_F$ that is characterized by: $$\mu_F((a,b]\times(c,d])=F(b,d)-F(a,d)-F(b,c)+F(a,c)$$ Observe that we could also write this as:$$\mu_F((a,b]\times(c,d])=\mu((a,b]\times(c,d])$$
So actually now we have two measures $\mu$ and $\mu_F$ that coincide on the collection: $$\mathcal A=\{(a,b]\times(c,d]\mid -\infty<a<b<\infty, -\infty<c<d<\infty\}$$
This collection is closed under intersection so $\mu$ and $\mu_F$ will also coincide on the sigma-algebra generated by $\mathcal A$ which is $\mathcal B_2$.
In a way this closes the circle: $\mu=\mu_F$.
The expression $\text{d}\mu=\text{d}F$ is just a notation for $\mu=\mu_F$.