What is the difference between $\vdash A $ and $\models A$?
I'm not asking about the general difference between syntactic entailment and semantic entailment.
I specifically don't understand the difference in the case where the antecedents are empty.
According to Wikipedia, $\vdash A $ is a theorem, whereas $\models A$ is a tautology. I think I understand the double turnstile being a tautology, but with the single turnstile is $A$ an axiom (which is supposedly a purely syntactic entity)?
I'm confused on how a formula could be derivable without premises.
There are in any system derivation rules which operate without premises. You can think of these as logical axioms: having a rule of the form "$\vdash A$ is a correct sequent" in our deduction system amounts to $A$ being a "starting sentence" which we're allowed for free.
However, $\vdash A$ may be a correct sequent without $\vdash A$ being one of our basic sequent rules. For example, one of the standard sequent rules in many systems is "$\vdash x=x$ is a correct sequent." From this we can get $\vdash (x=x)\vee (x=x)$ by applying further sequent rules, even though "$\vdash (x=x)\vee (x=x)$ is a valid sequent" is not explicitly one of our starting rules.
Note that in the above I'm talking about deriving sequents, not sentences or formulas. This is a useful shift: it's often best to think of the deductive apparatus of first-order logic as defining a set of correct sequents, expressions of the form "$\Gamma\vdash A$" for $\Gamma$ a set of formulas and $A$ a formula, via induction starting with some basic rules (e.g. "$\vdash x=x$ is correct" or "If $\Gamma\vdash A$ and $\Gamma\vdash B$ are correct, then $\Gamma\vdash A\wedge B$ is correct"). Talking about deducing one formula $A$ from a set of formulas $\Gamma$ is then equivalent to talking about deducing the correctness of the sequent $\Gamma\vdash A$.