I have a 30 minute movie from a collegue of mine who held a speech. This speech was notorious for it's "uuuuuuuhm"-moments. Now I want to string all of these together so I get one big "uuuhmmm... hmmmm... uhhhmm" movie (get your mind out of the gutter

). I want to do this by cutting and deleting all the other pieces, so I only have a lot of 1 to 2 second clips left on the timeline.
I know how to ripple-delete these, but how can I make it so that the audio and video has a short (3 to 5 frames) cross-fade? I'd hate to add these to 100 clips by hand... :(