Questions tagged [voice]
The voice tag has no summary.
113 questions
0 votes
0 answers
18 views
Question Isolating Speech From Clattering Noise - Similar Volume, Specific Issue
I have absolutely no experience with this field but have been trying to analyze some audio files where someone broke into my house (long explanation follows but can be skipped; more included because ...
0 votes
1 answer
42 views
How does the first step of vocoding actually compress the vocals?
I only recently learned of the use of the vocoder as a vocal signal compression technique. However, I'm having trouble understanding how it actually reduces the size of whatever goes in. If I think of ...
1 vote
0 answers
19 views
Recovering audio contained repeated data
I have some audio files affected by an issue during recording, apparently chunks of data were duplicated, like this. Doing a search I realised that I already asked about this, but I have got a very ...
0 votes
0 answers
109 views
How to detect known audio samples within a stream of audio
If I've got a bunch of short .wav files, 16 bit linear pcm, and an incoming stream of audio that's been run through a voice codec (g.722, g.711, or others) and then converted back to pcm, what's the ...
2 votes
2 answers
506 views
Anonymize / Obfuscate speech when doing audio classification
Let me preface that I am new to audio processing and audio analysis ;) (I asked the same question on reddit, I wanted to increase it's reach)) I am trying to classify specific events (like a gong or ...
1 vote
1 answer
823 views
How to generate human voice fft?
I want to generate human voice using fft. For this, I analyzed a fft of my mother saying "e" and the result was that the points are in a normal distribution. Then I created a fft using a ...
0 votes
1 answer
1k views
Voice activity detection (VAD) libraries 2023
I am trying to use (not implement VAD algorithm) voice activity detection to get timestamps for a given audio but facing hard time doing so. What I am trying to achieve ? Find an offline library for ...
4 votes
1 answer
9k views
As of 2023, is it possible to extract two human voice from single audio track?
Isolation of different human voices from audio Separate two voices from a speech signal Several years ago it was hard to extract voice from music and almost impossible to separate two human voices ...
3 votes
0 answers
172 views
How to extract human voice from cluttered signal?
I have a signal which has human speech, background voice and noise as it can be seen in below figure. I have calculated its power spectral density (PSD) using many different methods which can be seen ...
0 votes
1 answer
228 views
What's the difference between male and female voice? [duplicate]
If I record the voice of a man and a woman, what are the main differences I get in the various spectra and harmonics in Fourier analysis?
0 votes
0 answers
44 views
What happens if I register my entry?
Taking as an example that I want to record my voice. How does it appear in the frequency spectrum? Can I also view it on other spectra?
1 vote
1 answer
68 views
Similar voice features for imitated voice
What kind of audio signal features/properties are appropriate for signal similarity measurement invariant to imitation? Basically I would like to do the following: Having a template (e.g. a spoken ...
1 vote
1 answer
66 views
Did Dialup Modem use closed or open loop power/volume control? How did they determine Tx level?
I have been reading about dialup modems, but one thing I cannot seem to find out about is how implementers determined optimal transmit power/volume. Is this part of the echo cancelation framework? The ...
3 votes
0 answers
488 views
How timbre shifting is done?
I've recently came across two programs - Morphvox, VCSdiamond that are able to preform pitch shift, but also timbre shift. As far as I know the timbre is nothing but the amplitude of the harmonics in ...
0 votes
0 answers
35 views
Separating several kinds of information in a sound file
I have a 90min audio recording of a lecture using a handheld recorder. Since the recorder was in the shirt pocket of the speaker, I can clearly hear that the their voice is much louder than the ...