Code analysis PCM human voice frequency conversion processing in JAVA

Author：Eve Cole Update Time：2025-08-05 22:00:02

The project requires the use of voice-changing processing of PCM human voice audio data . After struggling for a week, I finally found a framework for pure Java implementation - TarsosDSP. Very powerful! Real-time audio processing! Of course I only used file processing. In fact, the logic is the same

TarsosDSP's GitHub address: https://github.com/JorenSix/TarsosDSP integrates it into its own project projects.

Specific Java tool class code:

 /** * Voice change* @param rawPcmInputStream Raw PCM data input stream* @param speedFactor Variation rate(0,2) greater than 1 is accelerating the speech speed, less than 1 is a slowing the speech speed* @param rateFactor Tone change rate(0,2) greater than 1 is a lowering the pitch (deep), less than 1 is a boosting the pitch (sharp) * @return PCM data input stream after voice change*/ public static InputStream speechPitchShift(final InputStream rawPcmInputStream,double speedFactor,double rateFactor) { TarsosDSPAudioFormat format = new TarsosDSPAudioFormat(16000,16,1,true,false); AudioInputStream inputStream = new AudioInputStream(rawPcmInputStream, JVMAudioInputStream.toAudioFormat(format),AudioSystem.NOT_SPECIFIED); JVMAudioInputStream stream = new JVMAudioInputStream(inputStream); WaveformSimilarityBasedOverlapAdd w = new WaveformSimilarityBasedOverlapAdd(WaveformSimilarityBasedOverlapAdd.Parameters.speechDefaults(speedFactor, 16000)); int inputBufferSize = w.getInputBufferSize(); int overlap = w.getOverlap(); AudioDispatcher dispatcher = new AudioDispatcher(stream, inputBufferSize, overlap); w.setDispatcher(dispatcher); AudioOutputToByteArray out = new AudioOutputToByteArray(); dispatcher.addAudioProcessor(w); dispatcher.addAudioProcessor(new RateTransposer(rateFactor)); dispatcher.addAudioProcessor(out); dispatcher.run(); return new ByteArrayInputStream(out.getData()); }

The data transcript (AudioOutputToByteArray) code is as follows:

 public class AudioOutputToByteArray implements AudioProcessor { private boolean isDone = false; private byte[] out = null; private ByteArrayOutputStream bos; public AudioOutputToByteArray() { bos = new ByteArrayOutputStream(); } public byte[] getData() { while (!isDone && out == null) { try { Thread.sleep(10); } catch (InterruptedException ignored) {} } return out; } @Override public boolean process(AudioEvent audioEvent) { bos.write(audioEvent.getByteBuffer(),0,audioEvent.getByteBuffer().length); return true; } @Override public void processingFinished() { out = bos.toByteArray().clone(); bos = null; isDone = true; }}

You can play audio through this tool:

 /** * Play PCM * * Do not call in non-desktop environments. . . What do you know what will happen* @param rawPcmInputStream Raw PCM data input stream* @throws LineUnavailableException */ public static void play(final InputStream rawPcmInputStream) throws LineUnavailableException { TarsosDSPAudioFormat format = new TarsosDSPAudioFormat(16000,16,1,true,false); AudioInputStream inputStream = new AudioInputStream(rawPcmInputStream, JVMAudioInputStream.toAudioFormat(format),AudioSystem.NOT_SPECIFIED); JVMAudioInputStream stream = new JVMAudioInputStream(inputStream); AudioDispatcher dispatcher = new AudioDispatcher(stream, 1024 ,0); dispatcher.addAudioProcessor(new AudioPlayer(format,1024)); dispatcher.run(); }