Duncan Marsh in his book Applied Geometry for Computer Graphics and CAD defines the perspective projection in World Coordinates as: $$ M_{WC}=n^TV - ( n \cdot V )I_4 $$ where $V$ is the projection viewpoint and $n$ the projection viewplane $$ M_{WC} = \small\begin{bmatrix}-n_2v_2-n_3v_3-n_4v_4&n_1v_2&n_1v_3&n_1v_4\\n_2v_1&-n_1v_1-n_3v_3-n_4v_4&n_2v_3&n_2v_4\\n_3v_1&n_3v_2&-n_1v_1-n_2v_2-n_4v_4&n_3v_4\\n_4v_1&n_4v_2&n_4v_3&-n_1v_1-n_2v_2-n_3v_3\end{bmatrix} $$
Others authors define the same perspective projection in Camera Coordinates with $$ M_{CC} = \begin{bmatrix}1&0&0&0\\0&1&0&0\\0&0&1&f\\0&0&0&0\end{bmatrix} $$ where the projection viewpoint is origin and the projection viewplane is a plane parallel to the $x$-$y$ plane and intercept the $z$-axis in $-f$.
Both matrix a the same and $ M_{WC} = M_{CC}RT$ Where $R$ is a rotation that makes the view plane parallel to $x$-$y$ plane and $T$ is a translate the origin to the view point and $RT$ is the camera Extrinsic Matrix.
$$ T = \begin{bmatrix}1&0&0&0\\0&1&0&0\\0&0&1&0\\-v_1/v_4&-v_2/v_4&-v_3/v_4&1\end{bmatrix} $$
My problem is I can not figure $R$.
I’m still not quite sure what it is you’re trying to do, but I’ll point out some errors in your question that are likely preventing you from accomplishing it. I’ll follow the old computer-graphics convention used by Marsh: points are represented by row vectors and transformations are applied by right-multiplying by a matrix.
First, the definition of the camera-coordinate projection matrix $M_{CC}$ is garbled. There are two conventions in use for the direction of the camera axis and location of the viewplane, and you seem to be mixing them. The one I prefer because it doesn’t change orientation on the viewplane is to have the camera pointing in the negative $z$ direction so that the viewplane is $z=f$ with $f\lt0$. We want the point $(x,y,z)$ in Cartesian coordinates to be projected to $\left(\frac{fx}z,\frac{fy}z,f\right)$. In homogeneous coordinates, this is $x:y:z:1\mapsto x:y:z:\frac zf$ so $$M_{CC}=\begin{bmatrix}1&0&0&0\\0&1&0&0\\0&0&1&\frac1f\\0&0&0&0\end{bmatrix}.$$ We can check this against the formula Marsh gives for the projection matrix, which holds in any Cartesian coordinate system. In camera coordinates, $\mathbf V=(0,0,0,1)$ and $\mathscr n=(0,0,1,-f)$, hence $$\mathscr n^T\mathbf V-(\mathbf V\cdot\mathscr n)\,I_4=\begin{bmatrix}f&0&0&0\\0&f&0&0\\0&0&f&1\\0&0&0&0\end{bmatrix}$$ which, since a homogeneous transformation matrix can be multiplied by a non-zero scalar without affecting the transformation it represents, is clearly equivalent to $M_{CC}$ above.
A more serious error is that the equality $M_{WC} = M_{CC}R\,T$ doesn’t hold: $M_{CC}$ wants its input in camera coordinates, but $M_{WC}$ wants world coordinates, so multiplying the same coordinate tuple by the two sides of this equation doesn’t make sense. Let the transformation from world to camera coordinates be given by the product $TR$, per convention. A point expressed in world coordinates must first be converted to camera coordinates before applying $M_{CC}$, so the correct equality is $$M_{WC}=TRM_{CC}R^{-1}T^{-1}=(TR)M_{CC}(TR)^{-1},$$ that is, the two projection matrices are related by a change of basis, as one might expect.
Here a few other things that might help with your matrix manipulations: