While reading some old paper on the foundations of set theory, I came across a symbol $\mid$ that I eventually determined was the Sheffer stroke, which is a fancy word for NAND.
Wikipedia, and also the paper, has only this to say about working with the Sheffer stroke:
All [well-formed formulas] of the form $$((U \mid (V \mid W)) \mid ((Y \mid (Y \mid Y)) \mid ((X \mid V) \mid ((U \mid X) \mid (U \mid X)))))$$ are axioms. Instances of $$(U \mid (V \mid W)),\, U \vdash W$$ are inference rules.
This is a remarkable achievement in concise formulation: it leaves out everything that I understand about axiomatic proofs. My question could be summarized as "what the heck?" but for the sake of people trying to answer it, I'll try to be a little bit more precise.
How do you actually work with the thing above? What do the parts of a proof in language I understand become when you use the Sheffer stroke instead? If the only axiom is the disaster above, how do you ever conclude anything that makes sense?
I'm guessing that an example of a proof of something, anything using this formal system, with an explanation, would answer my question and address my confusion.
It turns out this is just one Sheffer stroke axiom. There's an old paper by Lukasiewicz, which got entitled Generalizing Deduction if I recall correctly, where he finds that Nicod's axiom, which could get translated to DDpDqrDDtDttDDsqDDpsDps in Polish notation has a substitution instance which is also a single axiom: DDpDqrDDsDssDDsqDDpsDps. Wajsberg found another single axiom in 1931 DDpDqrDDDsrDDpqDDpsDpDpq, and Lukasiewicz in 1931 DDpDqrDDpDrpDDsqDDpsDps. Over 60 more 23-letter single axioms in the Sheffer stroke got found by Ernst, Fitelson, and Harris. There also exist single axioms for calculi with more than one connective, and a theorem which ensures the existence of single axioms for non-classical propositional calculi which have certain theorems which hold.
The key observation consists in that Nicod's axiom has the same form as all of the assumptions in the argument "if DpDqr, and p, then r", where the second 'p' does not differ at all from the 'p' in 'DpDqr'. Or, if translated to infix, Nicod's axiom has form '(U|(V|W))', and has form 'U' also simultaneously.
More generally, all single axioms which operate under a single rule of inference work similarly, since if they didn't, the rule of inference couldn't get used to infer anything, since every formula before using a rule of inference initially has to get obtained from a substitution of the axiom.
So, the first step for a formal proof involving Nicod's axiom involves making two substitution instances of that axiom, one of the form 'DpDqr', the other of the form 'p', where 'p' has the same form as in 'DpDqr', and then infer 'r'. The easiest way to do this might be to use condensed (Sheffer stroke) detachment, where the axiom sort of suggests a substitution all by itself. Informally, that's a way of doing "as little" substitution as possible in the formulas to infer something, and thus what gets inferred is at least as general as anything else derivable in one detachment. To do condensed detachment, first make sure that the formulas have no variables in common. For Nicod's axiom:
DDpDqrDDtDttDDsqDDpsDps
DDaDbcDDdDddDDebDDaeDae
will do. The following diagram, which recasts 1. and 2. with some spacing, I think helps:
Now, and this case is fairly simple for condensed detachment, substitute p with DaDbc, q with DdDdd, and r with DDebDDaeDae and we get:
So, we have the required forms to use the rule of inference, since 3. has the form 'DxDyz', with 2. as the 'x', having the same for as in 3.
Or in other words, since the first part of 3. matches that of 2. we can now infer:
I'm not clear about your second question.
If this is taken to mean something that we might find more comprehensible to understand in natural language, we might note that just by applying a definition DpDpp turns into Cpp ("(p$\rightarrow$p)"). Uniform substitution in the axiom, and/or proven (object-language) theorems, using the rule of inference can prove formulas like that.
The following is a first-order proof via William McCune's Prover9 which has clues as to how to construct a propositional calculus proof of DxDxx: