DOI: 10.1515/lingvan-2022-0130 ISSN: 2199-174X

The Red Hen Audio Tagger

Sabyasachi Ghosal, Austin Bennett, Mark Turner
  • Linguistics and Language
  • Language and Linguistics

Abstract

The International Distributed Little Red Hen Lab, usually called “Red Hen Lab” or just “Red Hen”, is dedicated to research into multimodal communication. In this article, we introduce the Red Hen Audio Tagger (RHAT), a novel, publicly available open source platform developed by Red Hen Lab. RHAT employs deep learning models to tag audio elements frame by frame, generating metadata tags that can be utilized in various data formats for analysis. RHAT seamlessly integrates with widely used linguistic research tools like ELAN: the researcher can use RHAT to tag audio content automatically and display those tags alongside other ELAN annotation tiers. RHAT additionally complements existing Red Hen pipelines devoted to natural language processing, speech-to-text processing, body pose analysis, optical character recognition, named entity recognition, computer vision, semantic frame recognition, and so on. These cooperating Red Hen pipelines are research tools to advance the science of multimodal communication.