I like pictures, and the relevant picture is:
![enter image description here](https://cdn.statically.io/img/i.sstatic.net/M6JI8Sqp.jpg)
You can ignore the formulae, they are just high school trig, tho, so nothing prohibitive. (The do look bad, though, I think it's because they are crammed in).
The only formula you need to grasp is the good ol Pythagorean Theorem:
$$ c^2 = a^2 + b^2 $$
which is invoked here as:
$$ (\rm Total\ Energy)^2 =
(\rm Rest\ Energy)^2 +
(\rm Momentum\ Energy)^2
$$
which is remarkably simple. Rest mass is the energy required to sit still and move forward in time. While energy from momentum is what it costs to move through space (always: in some frame).
The minimum energy is when you are motionless and just moving through time with the famous:
$$ E_0 = mc^2 $$
(I put the subscript on to avoid confusion, eventhough it is unsightly).
Meanwhile, the energy from motion is just:
$$ E_p = pc $$
and of course, it's all a photon ever has.
One thing that trips up our intuition (which is Newtonian), is that the concepts of momentum and kinetic energy were developed at low speed $v\ll c$, so we ignored the rest mass, and had this thing called kinetic energy (the purple part) which is just to the total energy minus the rest energy:
$$ T = E-mc^2 \rightarrow \frac{p^2}{2m} = \frac 1 2 mv^2 $$
where the RHS is in the low speed limit, and are the Newtonian formulae from Physics 1. Of course, when Newton did his thing, we were not splitting the atom and extracting kinetic energy from "mass" (not to be confused with "matter"), so the $mc^2$ was ignored, as it was effectively constant.
All we had to work with was kinetic energy, we did not know it was the difference between two relativistic energy concepts.
Kind of amazing the relativity has Newtonian physics as its low speed limit.
Also in the low speed limit we have:
$$ p = mv $$
which we intuitively associate with "inertia"--why being hit by a bus is worse than a smart car. Or a 400 pound lineman in a trot can level you as effectively as a 200 pound free safety at full speed (though the latter has more energy, and hurts more).
The problem is, we then associate momentum with mass in motion, but it's really energy in motion, and a photon is a pure chunk of energy, in maximal motion.
The irony is that if you look at momentum per total energy:
$$ \frac p E = \frac{\gamma mv}{\gamma mc^2} = v/c $$
you find that mass makes it smaller, and moving at the speed-of-light with no mass gives you the maximal momentum per unit energy.