In a paper I was reading, an ordered pair had a slightly different definition, $\langle a,b \rangle = \{a,\{a,b\}\}$, instead of the normal Kuratowski definition, which is $\langle a,b \rangle = \{\{a\},\{a,b\}\}$.
Notice that in the first one the $a$ is without braces.
Now, I know that it is wrong, but why is it wrong?
Is this definition not as good as the real one?
Related: a question that used the same alternative definition.
How about sitting down to write a proof and seeing where you get stuck? I am going to assume $\sf ZF$ as my set theory here, and the axiom of foundation is going to play a role, too.
Suppose that $\{a,\{a,b\}\}=\{c,\{c,d\}\}$. Note that $\in$ is a linear order on this set ($a\in\{a,b\}$ and $c\in\{c,d\}$), so $a=c$, since each is the minimal element. From this it follows that $\{a,b\}=\{c,d\}$, and since $a=c$ it follows that $(\{a,b\}\setminus\{a\})=(\{c,d\}\setminus\{c\})$. If $b\neq a$, this says $\{b\}=\{d\}$, and therefore $b=d$. If $b=a$, then $\{c,d\}=\{a,b\}=\{a\}$, so $d=a=b$ as well.
So we see that $a=c$ and $b=d$. And therefore this is a correct definition for ordered pairs, assuming the axiom of foundation (which was used to conclude that $\{a,b\}\notin\{a,b\}$ or $\{a,b\}\notin a$ and so on).
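As a sanity check on the argument above, here is a minimal Python sketch that brute-forces the claim on a small well-founded universe. It models sets as `frozenset`s (which cannot contain themselves, so foundation holds automatically) and tests both encodings on the 16 hereditarily finite sets of rank below 4. The names `universe`, `short_pair`, `kuratowski_pair` and `is_injective` are made up for this illustration, and a finite check like this is only evidence, not a replacement for the proof.

```python
from itertools import combinations, product


def powerset(s):
    """Return the set of all subsets of s, each as a frozenset."""
    items = list(s)
    return {
        frozenset(c)
        for r in range(len(items) + 1)
        for c in combinations(items, r)
    }


def universe(rank):
    """Build V_rank by iterating the power set, starting from the empty set."""
    v = set()
    for _ in range(rank):
        v = powerset(v)
    return v


def short_pair(a, b):
    """The alternative encoding {a, {a, b}}."""
    return frozenset({a, frozenset({a, b})})


def kuratowski_pair(a, b):
    """The Kuratowski encoding {{a}, {a, b}}."""
    return frozenset({frozenset({a}), frozenset({a, b})})


def is_injective(pair, elements):
    """Check that pair(a, b) determines (a, b) for all a, b in elements."""
    seen = {}
    for a, b in product(elements, repeat=2):
        code = pair(a, b)
        # setdefault returns the pair already stored for this code, if any.
        if seen.setdefault(code, (a, b)) != (a, b):
            return False
    return True


V4 = universe(4)  # the 16 hereditarily finite sets of rank below 4
print(is_injective(short_pair, V4))
print(is_injective(kuratowski_pair, V4))
```

Both checks should report that the encoding is injective on this universe, whereas the plain unordered pair `frozenset({a, b})` fails the same test because it cannot tell $\langle a,b\rangle$ from $\langle b,a\rangle$.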
So why does Kuratowski's definition prevail?
Tradition. As time goes by this becomes more and more engraved into the basics of set theory. I always find it important to explain to my students that we can find other definitions as well, but this is the canonical one that we use.
Because we have one, and frankly it doesn't matter that much. It is rarely the case that the choice of encoding ordered pairs into sets matters. You can think about most proofs as proof schemata that say "plug the definition of ordered pair here; plug the definition of function there; plug the definition of the real numbers here ... and combine these inference 'templates' to prove the theorem".
I'm not saying that there aren't situations where you care about the definition of ordered pairs. There are such times, but they are rare, and when you get there you usually understand the situation very well: the fact that you can essentially replace the definition by another, and why the Kuratowski definition fails.
There are probably other reasons, but these two seem like plenty.