Before the $\text{AR}(1)$ model, first look at a simpler example
$$y_t=\rho^t y_0+\epsilon_t$$
where $0<\rho<1$ and $\epsilon_t\overset{\text{i.i.d.}}{\sim} \text{Bernoulli} \left(\frac{1}{2} \right)$ i.e. $\epsilon_t$ has $50\%$ chance of being $1$ and $50\%$ chance of being $0$. Similarly, $y_0$ also follows the distribution $\text{Bernoulli} \left(\frac{1}{2}\right)$.For finite $t$, $y_t$ has 4 outcomes with 25% chance for each: $0$, $1$, $\rho^t$ and $\rho^t+1$ and
\begin{align} P(y_t=1,y_0=1) &= P(y_0=1)P(y_t=1|y_0=1) \\ &= \frac{1}{2} \times 0 \\ &= 0 \\ &\neq P(y_0=1)P(y_t=1) \\ &= \frac{1}{2}\times\frac{1}{4} \\ &= \frac{1}{8} \end{align}
Obviously, $y_t$ converges to $\epsilon_t$ almost surely (since $P(\underset{t\to\infty}{\lim}\rho^ty_0=0)=1$), which is independent of $y_0$, and $y_t-\epsilon_t=O_{a.s.}(\rho^t)$. It seems true to me that
$$\lim_{t\to\infty}[P(y_t=1,y_0=1)-P(y_0=1)P(y_t=1)]=\frac{1}{4}-\frac{1}{4}=0$$
and
$$\lim_{t\to\infty}[P(y_t=1|y_0=1)-P(y_t=1)]=0$$
Hence the sequence looks mixing (strong and uniform). I am not sure about the size of mixing and the rate of convergence i.e
$$P(y_t=1,y_0=1) - P(y_0=1)P(y_t=1) = \mathcal{O}(?)$$
Now, an $\text{AR}(1)$ model with Bernoulli errors takes the form below:
$$y_t=\rho y_{t-1}+\epsilon_t$$
where $y_0=0$ and $\epsilon_t\overset{\text{i.i.d.}}{\sim}\text{Bernoulli} \left(\frac{1}{2}\right)$. For this model, one can have
$$y_{t+n}=\rho^n y_{t}+\sum_{i=1}^{n}\rho^{n-i}\epsilon_{t+i}$$
Similar to the first example, this sequence also looks uniform/strong mixing, though I am not sure how to show it. Could anyone shed some light for me?
The sequence is ergodic, but not strong mixing at leasts for $\rho \in (0, 1/2]$.
Ergodicity
To see it's ergodic, note that any strongly stationary causal solution of the $AR(1)$ equation $$x_t = \rho x_{t-1} + \epsilon_t$$ where $\epsilon_t \sim \mathrm{IID}(0, \sigma^2)$ is ergodic. To see this, we note that $$x_t = \sum_{k=0}^\infty \rho^k \epsilon_{t-k} = f(\epsilon_t, \epsilon_{t-1, \ldots})$$ is the measurable image of an $L^2$ IID (hence ergodic) sequence; since ergodicity is preserved under measurable maps (cf. Billingsley Theorem 36.4), $x_t$ is ergodic.
In your example, to get to your desired form, write your errors as $\gamma_t = \frac{1}{2} + \epsilon_t$ where $\epsilon_t \sim \mathrm{Unif}(\{\pm 1/2\})$, set $\mu = \frac{1}{2(1-\rho)}$, and let $x_t = y_t - \mu$. Then, you get that $x_t$ satisfies the $AR(1)$ dynamics $x_t = \rho x_{t-1} + \gamma_t$, implying it (and thus $y$) is ergodic.
Mixing
That $y_t$ is not strongly mixing for $\rho \in (0, 1/2]$ is precisely the content of this paper. They conjecture that the restriction on $\rho$ may not be necessary, but I don't know what happens for $\rho > 1/2$.
References
Billingsley. 1995. Probability and Measure.
Andrews, Donald WK. "Non-strong mixing autoregressive processes." Journal of Applied Probability 21.4 (1984): 930-934.