We define the space of linear and continuous operators from $\mathcal{S}(\mathbb{R}^d)$ to $\mathcal{S}'(\mathbb{R}^d)$ as $\mathcal{L}(\mathcal{S}(\mathbb{R}^d), \mathcal{S}'(\mathbb{R}^d))$.
The Schwartz kernel theorem says that there is an isomorphism between $\mathcal{L}(\mathcal{S}(\mathbb{R}^d), \mathcal{S}'(\mathbb{R}^d))$ and $\mathcal{S}'(\mathbb{R^d}\times \mathbb{R}^d)$, thanks to the relation that associated to a "kernel" $ K \in \mathcal{S}'(\mathbb{R^d}\times \mathbb{R}^d)$ the operator $\mathcal{K}$ defined by $$\langle \mathcal{K}\{\varphi_1\} , \varphi_2 \rangle = \langle \varphi_1\otimes \varphi_2 , K \rangle $$ for any $\varphi_1, \varphi_2 \in \mathcal{S}(\mathbb{R}^d)$.
It is often only implicit that the adequate topology on the dual is the weak*-topology. Is there a good reason to consider this topology for the dual?
Is the result false if one endows $\mathcal{S}'(\mathbb{R}^d)$ with the strong topology?
I try to give an answer, but I think the question is "wide". We can also reason in terms of the space of distributions, then, essentially, these results are valid for restriction on the space of tempered distributions. First, there is a result that demonstrates the equivalence of the following properties
If $\Omega \subseteq \mathbb{R}^n$ is open and if $u : \mathcal{D}(\Omega) \longrightarrow \mathbb{C}$ is a linear map, then the following properties are equivalent
$u: \mathcal{D}(\Omega) \longrightarrow \mathbb{C}$ is continuous
$u: \mathcal{D}(\Omega) \longrightarrow \mathbb{C}$ is limited
If $\lbrace \varphi_k\rbrace \subset \mathcal{D}(\Omega)$ converges to $\varphi \in \mathcal{D}(\Omega)$ respect the topology of the test functions $\mathcal{T}$, then $u(\varphi_k) \rightarrow u(\varphi)$.
The restriction of $u: \mathcal{D}(\Omega) \longrightarrow \mathbb{C}$ to $\mathcal{D}_K(\Omega)$ is continuous for all compact $K \subset \Omega$
For all compact set $K \subset \Omega$ there are $N\in \mathbb{N}$ and $C_K > 0$ such that $|u(\varphi)| \leq C_K \left \| D^\alpha \varphi \right \|_{K_{N}}$, where $\left \| D^\alpha \varphi \right \|_{K_{N}} := \sup_{x \in K} |D^\alpha \varphi| , \forall \varphi \in \mathcal{D}_K(\Omega)$
From this fact follows a good definition of distribution. The couple $(\mathcal{D}(\Omega),\mathcal{D}'(\Omega))$ is a dual couple in the sense that $u(\varphi)=0$ $\forall u \in \mathcal{D}'(\Omega)$ $\Longrightarrow$ $\varphi=0$, ($L^1_{loc}(\Omega) \subset \mathcal{D}'(\Omega))$ then $\int_{\Omega} |\varphi|^2 dx = \langle\overline{\varphi}, \varphi \rangle=0$ and generalizing this situation if $u$ is a distribution $\varphi$ a test function, use this notation $\langle \varphi, u \rangle$=$\langle u, \varphi \rangle$= $u(\varphi)$. Furthermore in $\mathcal{D}'(\Omega)$ there is weak*-toplogy $\sigma(\mathcal{D}'(\Omega), \mathcal{D}(\Omega))$ defined through the separable family of seminorm $\mathcal{F}:=\lbrace p_\varphi(u)=|u(\varphi)| : \varphi \in \mathcal{D}(\Omega) \rbrace$ and $\mathcal{D}'(\Omega)$ is locally convex spacea. The sequential convergence $\langle \varphi, u_k \rangle \rightarrow \langle \varphi , u \rangle$ $\forall \varphi \in \mathcal{D}(\Omega)$ is the weak* convergence, i.e. the convergence with respect to weak*-topology. These last facts are also valid in the space of temperate distributions, it is the same.
Now, if $\Omega_1, \Omega_2 \subseteq \mathbb{R}^n$ are open sets. If $T:\mathcal{D}(\Omega_1) \longrightarrow \mathcal{D}(\Omega_2)$ is linear map such that $T'u=u \circ T \in \mathcal{D}'(\Omega_1)$ $\forall u \in \mathcal{D}'(\Omega_{2})$, then $T':\mathcal{D}'(\Omega_2) \longrightarrow \mathcal{D}'(\Omega_1)$ is dual or transponse operator of $T$, and it is characterized by the following relation \begin{align*} \displaystyle \langle \varphi, T'u \rangle = \langle T \varphi, u \rangle \end{align*} it occurs that $T$ and $T'$ are respectively weak and weak* continuous, and the following result exists
The result in 6. is widely used to define new distributions by duality, also to define the transpose kernel of a Schwartz kernel, and from estimates with seminorm on $\mathcal{D}$ or $\mathcal{S}$ in your case. Essentially the Schwartz kernel theorem says that
it's just to have the sequentially continuous map definition $\mu : \mathcal{D}(Y) \rightarrow \mathcal{D}'(X)$ is necessary to have a convergence with respect to weak*-topology. I believe that all these are good regions not to consider other types of topologies, on the other hand the whole theory of distributions is based on particular topologies in the spaces of the test functions that allow an extension by duality and considering the weak* topology.