I am reading Do Carmo's differential geometry book and the definition of a regular surface in the second chapter is given to be this: 
I have few doubts about this definition:
1) Why we need to find a neighbourhood of point $p$? Is it because we can't always define a map $X$ that will work for the whole surface and we are trying to find local maps for every point. And later in chapter 3, when author talks about this parametrization/map for a surface, he says surface parametrized at point $p$, what does "at point $p$" means, are we talking about local parametrizations?
2) $X$ is differentiable to infinite order....is this really necessary? What if map $X$ is differentiable to a large but finite order?
3) X is continuous by condition 1. But I don't understand the use of continuity as described in the book. In book it is given that we need condition 2 for one to oneness of the map so that we can have single tangent plane at each point (basically to avoid self-intersections). So why don't just make condition 2 to be $X$ being one to one function?
4) I couldn't understand why we need condition 3 very clearly. Could you please give a geometric intuition for this?
1) Because you usually can't map the whole surface to a planar set. Like the sphere, for example: every world map has to cut it somewhere.
2) Not really necessary. Much of the theory works for $C^k$ instead of $C^\infty$ as long as $k$ is large enough. But some constructions become more difficult and statements cumbersome, because one has to keep track of how many derivatives you took so far, and how many you can still take. So there's a nonzero cost for unclear benefit (especially at the textbook level).
3) Continuous and one-to-one is not the same as being a homeomorphism. (There are some situations when they are, but that is something one has to think through.) Stating the definition as it is allows us to say: locally, as far as topology is concerned, the surface is exactly like a plane. That's a nice thing to have. All local topological properties of the plane are immediately available to us, because they are preserved by homeomorphisms.
4) This is easier to illustrate with an example of a curve. The curve $y= x^{2/3} $ is visibly non-smooth - it has a cusp at $(0,0)$. But it admits a parametrization $(x,y)=(t^3,t^2)$ which is infinitely differentiable. There seems to be a disconnect between the geometric roughness of the curve and the smoothness of its parametrization. Imposing regularity ensures that we don't have this problem; by some form of the implicit function theorem, a surface with a smooth regular parametrization is indeed a smooth-looking geometric object.
A related surface example: $z=x^{2/3}$ with parametrization $(u,v)\mapsto (u^3,v,u^2)$. The differential map has rank $1$ at some points, and this is what allows the cusp to form.
Ultimately, the proof of the pudding is in the eating. Complicated as it is, this definition succeeds at matching a formal mathematical concept to an informal geometric notion of smooth surface. This is something you get convinced of when you actually use it.