Let $T:(X, B, \mu) \rightarrow (X, B, \mu)$ be a measure preserving system. For a finite partition $\zeta$ of $X$ we define
$\zeta^n:=\zeta \vee T^{-1}\;\zeta \; \vee ... \vee \; T^{-(n-1)}\zeta$
Then the measure theoretic entropy of $T$ with respect to $\zeta$ is defined as
$h(T, \zeta ):=\displaystyle{\lim_{n\to \infty}}\frac{1}{n}H(\zeta^n)$, where $H(\eta)$ denotes the entropy of the partition $\eta$.
Brin and Stuck prove that
$h(T, \zeta) = \displaystyle{\lim_{n\to\infty}} H(\zeta | T^{-1}\zeta^n)$, where $H(\eta | \epsilon)$ is the conditional entropy of $\eta$ with respect to $\epsilon$.
They remark that this proposition shows that "$h(T,\zeta)$ is the average information added by the present state on the condition that all past states are known."
I'm unsure of how $T^{-1}\zeta^n$ can be interpreted as the past in the limit. Can anyone provide an explanation?
I would erase the word "added", not really appropriate there.
You should think as follows (although this is hard to make rigorous, if at all possible):
To be precise, they should really have written something like: "is the limit of the average information at the present state given that a growing number of past states are known". This latter formulation avoids a reference to $\sigma$-algebras (although it is really unavoidable to refer to them with the formulation in the book).