Definition A: Let $(X,\mathcal{A},\mu)$ be a probability space. Let $T: X\rightarrow X$ is $\mu$-invariant ($\mu(T^{-1}E)=\mu(E)$ for all $E\in \mathcal A$). Then $T$ is ergodic if for every $E\in \mathcal{A}$ with $T^{-1}(E) = E$, we have $\mu(E)=0$ or $1$.
Definition B: Let $(X,\mathcal{A},\mu)$ be a probability space. Let $T: X\rightarrow X$ is $\mu$-invariant ($\mu(T^{-1}E)=\mu(E)$ for all $E\in \mathcal A$). Then $T$ is ergodic if for every $E\in \mathcal{A}$ with $T^{-1}(E) \subset E$, we have $\mu(E)=0$ or $1$.
The difference between two definitions is $T^{-1}(E) = E$ vs. $T^{-1}(E) \subset E$. Definition A comes from Definition 2.13 of the book by Einsiedler and Ward and Definition B is from the entry "Ergodicity" of Wikipedia. There are more things to check for the Definition B. So B is stronger than A.
Can we prove they are equivalent?
One can show that the first definition is equivalent to the following one:
This comes from the fact that for any such $A$ there is a set $B$ s.t. $T^{-1}(B)=B$ and $\mu(B\Delta A)=0$.
Now, in the second definition $E\supset T^{-1}(E)$, which implies that $\mu(E\Delta T^{-1}(E))=0$ because $\mu(E)=\mu(T^{-1}(E))$, and so it is equivalent to the first definition.