Efficient keyword spotting by capturing long-range interactions with temporal lambda networks

Tura Vecino, Biel; Escuder Folch, Santiago; Diego, Ferran; Segura Perales, Carlos; Luque Serrano, Jordi

Repository landing page

oai:upcommons.upc.edu:2117/366331

Efficient keyword spotting by capturing long-range interactions with temporal lambda networks

Authors: Biel Tura Vecino
Santiago Escuder Folch
Ferran Diego
Carlos Segura Perales
Jordi Luque Serrano
Publication date: 1 January 2021
Publisher
Doi

Abstract

Models based on attention mechanisms have shown unprecedented speech recognition performance. However, they are computationally expensive and unnecessarily complex for keyword spotting, a task targeted to small-footprint devices. This work explores the application of Lambda networks, an alternative framework for capturing long-range interactions without attention, for the keyword spotting task. We propose a novel ResNet-based model by swapping the residual blocks by temporal Lambda layers. Furthermore, the proposed architecture is built upon uni-dimensional temporal convolutions that further reduce its complexity. The presented model does not only reach state-of-the-art accuracies on the Google Speech Commands dataset, but it is 85% and 65% lighter than its Transformer-based (KWT) and convolutional (ResNet15) counterparts while being up to 100× faster. To the best of our knowledge, this is the first attempt to explore the Lambda framework within the speech domain and therefore, we unravel further research of new interfaces based on this architecture.Peer ReviewedPostprint (author's final draft

Similar works

Full text

Open in the Core reader

Download PDF

UPCommons. Portal del coneixement obert de la UPC

oai:upcommons.upc.edu:2117/366...

Last time updated on 17/05/2022

This paper was published in UPCommons. Portal del coneixement obert de la UPC.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.