I am about to teach applied mathematics for engineering. So I will teach how to use Laplace Transform to solve differential equations. Some of these differential equations involve the Dirac delta "function" as a forcing term, for example: $$y''(t)+y'(t)-y(t)=3\delta(t-1).$$ The students have as prerequisite: Calculus II and Calculus III. I don't know how to introduce the Dirac delta function:
1) Should I first talk about the distributions as linear functionals acting on test functions. Then define Dirac delta function as $\langle\,\delta,\varphi\rangle=\varphi(0)$.
or
2) Define $\delta$ as "something" that has a Laplace Transform $\mathcal{L(\delta)}=1$, that is $\delta$ is a limit (in some sense) of a sequence of functions $f_n$ having a tall spike at the origin with $\lim_{n\to \infty}\mathcal{L(f_n)}=1$. That object can be viewed as a "function" defined by $$\delta(t)=\begin{cases} 0, & t\neq0\\ \infty, & t=0 \end{cases}$$

I'm not a professor nor teacher, but an engineering student, which might be interesting for you if I tell you how my mates and I dealt with the Dirac delta function.
Our professor had taught us just saying that the Dirac delta function was not a function and had showed us the properties that you mentioned above. However, I found myself kind of curious about it and wanted to know more since at the same semestre I was learning distributions.
Nevertheless, most of us did not understand quite well the Dirac delta, despite knowing its Laplace Transform (LT), some of its properties and also that the Dirac delta was related to the LT of the Heaviside step function. We just learnt by heart, which, for me, is not good. Probably, because he did not teach like your first option.
From a college student that had to deal with such a thing, I think that your first option is better, because it would show me what it really is and its purpose, which, in my opinion, makes easier to use the Dirac delta function.
Good luck!