Deep audio fingerprinting for word search

Question

Simply speaking, I'm trying to somehow search an audio clip for a list of words, and if found, I mark the time stamps. My use-case is profanity check with a list of pre-defined profane words.

Is there any successfull approaches, samples, tools or APIs, possibly through deep learning, to perform this? I'm new to audio processing.

score 1 · Answer 1 · answered Jan 14 '20 at 16:39

1

You would need to perform some kind of speech-to-text to get the audio transcription with the corresponding synchronization wrt the audio. Then search in the transcription.

You could use DSAlign by mozilla

answered Jan 14 '20 at 16:39

shoegazerstella

21
1

Deep audio fingerprinting for word search

1 Answers1