Citing Please cite our paper (s) if you find this repository useful. The first paper proposes the Audio Spectrogram Transformer while the second paper describes the training pipeline that we applied ...
AUDIO_DATABASE = "D:/Neuroscience/Forrest Gump/ad-av/av/audio/Seg99" # --> add path for the relevant folder FULLaudio = "D:/Neuroscience/Forrest Gump/ad-av/av/audio ...
Abstract: The increasing ability of deep learning models to produce realistic-sounding synthetic speech poses serious problems for privacy, public trust, and digital security. To counter this danger, ...
Abstract: In this paper, we propose a deep learning (DL)-based task-driven spectrum prediction framework, named DeepSPred. The DeepSPred comprises a feature encoder and a task predictor, where the ...