Aug-21-2017, 12:05 PM
(This post was last modified: Aug-21-2017, 12:05 PM by AceScottie.)
Update:
I have been looking at the source and found ares which match what im looking for
( file __init__.py, class Recognizer, line 444)
is there a way to lower these values without editing the source values. i dont mind having 2-3 words per process i just dont want to process an entire paragraph per loop as it takes too long.
I have been looking at the source and found ares which match what im looking for
self.pause_threshold = 0.8 # seconds of non-speaking audio before a phrase is considered complete self.phrase_threshold = 0.3 # minimum seconds of speaking audio before we consider the speaking audio a phrase - values below this are ignored (for filtering out clicks and pops) self.non_speaking_duration = 0.5 # seconds of non-speaking audio to keep on both sides of the recordingso means you have to talk for at least 0.3 seconds and the length of the audio has to be at least 1.3 seconds (0.5 + 0.3 + 0.5)
( file __init__.py, class Recognizer, line 444)
is there a way to lower these values without editing the source values. i dont mind having 2-3 words per process i just dont want to process an entire paragraph per loop as it takes too long.