What about ditching the text altogether, then in between clips have someone telling us about the next step and techniques before cutting to the clip?
So in effect you have a presenter to the video who can illustrate things clearly and carry it through. Plus you wouldn't have to worry about speelings.
Just a thought but might be a bit tricky as I don't know your resources.
