Using term clouds to represent segment-level semantic content of podcasts

Fuller, Marguerite, Tsagkias, Manos, Newman, Eamonn ORCID: 0000-0002-0310-0539, Besser, Jana, Larson, Martha, Jones, Gareth J.F. ORCID: 0000-0003-2923-8365 and de Rijke, Maarten (2008) Using term clouds to represent segment-level semantic content of podcasts. In: the Workshop on Searching Spontaneous Conversational Speech at Thirty-First Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2008), 24 July 2008, Singapore.

Abstract
Metadata
Downloads
Documents

[+][-]

Abstract

Spoken audio, like any time-continuous medium, is notoriously difficult to browse or skim without support of an interface providing semantically annotated jump points to signal the user where to listen in. Creation of time-aligned metadata by human annotators is prohibitively expensive, motivating the investigation of representations of segment-level semantic content based on transcripts generated by automatic speech recognition (ASR). This paper examines the feasibility of using term clouds to provide users with a structured representation of the semantic content of podcast episodes. Podcast episodes are visualized as a series of sub-episode segments, each represented by a term cloud derived from a transcript generated by automatic speech recognition (ASR). Quality of segment-level term clouds is measured quantitatively and their utility is investigated using a small-scale user study based on human labeled segment boundaries. Since the segment-level clouds generated from ASR-transcripts prove useful, we examine an adaptation of text tiling techniques to speech in order to be able to generate segments as part of a completely automated indexing and structuring system for browsing of spoken audio. Results demonstrate that the segments generated are comparable with human selected segment boundaries.

Metadata

Item Type:	Conference or Workshop Item (Paper)
Event Type:	Workshop
Refereed:	Yes
Uncontrolled Keywords:	Speech browsing; term clouds; TextTiling
Subjects:	Computer Science > Information technology
DCU Faculties and Centres:	Research Initiatives and Centres > Centre for Digital Video Processing (CDVP)
Use License:	This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. View License
ID Code:	16190
Deposited On:	05 Aug 2011 12:52 by Shane Harper . Last Modified 25 Oct 2018 11:57

Documents

Full text available as:

[thumbnail of Using_Term_Clouds_to_Represent_Segment-Level_Semantic_content_of_podcasts.pdf]

Preview

PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
328kB

Downloads

Downloads per month over past year

Archive Staff Only: edit this record