Get PII durations (start-end time) from an Audio file using Transcription/other techniques

17 Views Asked by Aakash Basu At 05 March 2024 at 06:56

I have a use-case where I want to:

Locate all PII data in any given Audio file (done: using GPT/similar models)
Transcribe the audio and then mask all those PII in the text file (done using whisper/similar models)
Also, in the original audio mask the PII portions with beeps. (Remaining)

The typical problem is, a transcription model isn't giving back the times (start/end time) of each word spoken. Hence, it becomes very difficult to locate back the PII basis the transcription output.

Anyone figured out any way to solve the same? On-prem models or API based services, anything is fine, some direction is what I am looking for.

Original Q&A

Get PII durations (start-end time) from an Audio file using Transcription/other techniques

There are 0 best solutions below

Related Questions in PYTHON

Related Questions in AUDIO

Related Questions in WAV

Related Questions in OPENAI-API

Related Questions in LARGE-LANGUAGE-MODEL

Trending Questions

Popular # Hahtags

Popular Questions