Given a subset $A \subset \mathbb{R}$ with the length of an open interval $\mu_L(I_k) = b_k -a_k : I \doteq [a_k,b_k]$
The lebesgue measure is defined as $$ \lambda^{\ast} (A) \doteq \inf \Big\{ \sum_{j \geq 1} \mu_L(I_j) : A \subset \bigcup_{j \geq 1} I_j \Big\} $$
Why is it not defined simply as $$ \lambda^{\ast} (A) \doteq \sum_{j \in J} \mu_L(I_j) : J \rightarrow A = \bigcup_{j \in J} I_j $$
Why is it necessary to have the infimum and why is it necessary for it to be a set? I'm still a little unclear as to why we need to take the infimum
The intuition for Lebesgue measure is geometric. Given a reasonable set $X\subset\mathbb{R}^n$, how would you approximate its volume? One way is to approximate its volume from the outside as a sum of volumes of cubes. This is the concept of outer measure.
Say in $\mathbb{R}^2$, to get the area of a disc one might first approximate its area by a square containing the disc. But this is clearly not a good approximation. So let's try chopping this square into 16 squares, 4 rows of 4 squares each, and look at just the squares that intersect the disc. The sum of the areas of these squares (which is well defined, by the way, since we know how to get area of squares) is a better approximation to the area of the circle than our first approximation. But even this is still not a great approximation, so we just keep chopping our rectangles finer and finer. Each approximation is smaller than the last, and is a better approximation to the area of the disc. This explains why we would take the infimum.
In $\mathbb{R}^1$, the intuition is the same, except the sets we are trying to measure are one-dimensional and the "rectangles" become "intervals." We know how to get lengths of intervals, so to get "lengths" of measurable sets we approximate them from the outside as sums of lengths of intervals whose union covers the set.