This repo is the implementation of a research project aimed at enhancing Acoustic Side-Channel Attacks (ASCAs) using a novel combination of Vision Transformers (VTs) and Large Language Models (LLMs).
This study proposes a novel heterogeneous stacking ensemble learning model for the fusion of phonocardiogram (PCG) spectrogram texture and deep features to detect heart failure with preserved ejection ...
(this project essentially demonestrates the use of MediaPipe Hands (Google) and Faster-Whisper (Int8 Quantization), along with WebRTCVAD.) Runs silently in the System Tray with a "Headless" option (no ...
Abstract: The rise of synthetic speech technologies has triggered growing concerns about the increasing difficulty in distinguishing between real and fake voices. In this context, we propose novel ...
Abstract: In this study, we explore the use of Vector Quantized Variational Autoencoders (VQ-VAE) for real-time audio spectrogram inpainting, with a focus on minimizing environmental impact. We ...