So I know you look at the timeline and see where the audio spikes peak. And that's where you are speaking in a clip.
However, is it possible to find myself speaking when in a loud noisy environment? Where the audio timeline has a lot of spikes in it and isn't a flat line (compared to a quiet enviroment)?
Here's an example screenshot of the type of loud environment I'm in:
Look at where the line is on the timeline. That's the start of an interaction with a girl until about 14:40. As you can see there's spaced blue gaps. However, I'm unsure on whether all of the interactions have these blue spaces in between them and I really don't want to miss any out.
As a fill in I'm basically picking up girls on the street in a loud environment recording constantly for hours non stop and don't remember the bits where I spoke and didn't speak. It would take FOREVER to just play the whole clip and find out where I start speaking. This speaking occurs 15+ times throughout the whole hour clips and takes up about one sixth of the total recording time all together.
I will around a hundred of hours of footage with hundreds of interactions.
Is there a way I can make it clearer to find where my interactions start?