ABSTRACT
In recent years, semi-structured interviews have gained increasing importance in cyber security research. Transcribing audio recordings of such interviews is a crucial step in qualitative data analysis, but it is also labor-intensive and time-consuming. While outsourcing is a common option, maintaining research quality requires precise transcriptions, a requirement made harder to meet by the technical jargon and established expressions of the research field. In this study, we compare different transcription services and evaluate the quality of their output in the context of cyber security. Our findings help researchers navigate the complex landscape of transcription services and make informed choices that improve the accuracy and validity of qualitative data analysis.
Poster: From Hashes to Ashes - A Comparison of Transcription Services