<?xml version="1.0"?>
<metadata xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:dc="http://purl.org/dc/elements/1.1/"><dc:title>Hardware–software co-design of an audio feature extraction pipeline for machine learning applications</dc:title><dc:creator>Vreča,	Jure	(Avtor)
	</dc:creator><dc:creator>Pilipović,	Ratko	(Avtor)
	</dc:creator><dc:creator>Biasizzo,	Anton	(Avtor)
	</dc:creator><dc:subject>FPGA</dc:subject><dc:subject>MFCC</dc:subject><dc:subject>keyword spotting</dc:subject><dc:subject>chisel</dc:subject><dc:description>Keyword spotting is an important part of modern speech recognition pipelines. Typical contemporary keyword-spotting systems are based on Mel-Frequency Cepstral Coefficient (MFCC) audio features, which are relatively complex to compute. Considering the always-on nature of many keyword-spotting systems, it is prudent to optimize this part of the detection pipeline. We explore the simplifications of the MFCC audio features and derive a simplified version that can be more easily used in embedded applications. Additionally, we implement a hardware generator that generates an appropriate hardware pipeline for the simplified audio feature extraction. Using Chisel4ml framework, we integrate hardware generators into Python-based Keras framework, which facilitates the training process of the machine learning models using our simplified audio features.</dc:description><dc:publisher>MDPI</dc:publisher><dc:date>2024</dc:date><dc:date>2024-03-25 10:55:57</dc:date><dc:type>Neznano</dc:type><dc:identifier>18556</dc:identifier><dc:identifier>UDK: 004</dc:identifier><dc:identifier>ISSN pri članku: 2079-9292</dc:identifier><dc:identifier>DOI: 10.3390/electronics13050875</dc:identifier><dc:identifier>COBISS_ID: 186803203</dc:identifier><dc:source>Švica</dc:source><dc:language>sl</dc:language><dc:rights>© 2024 by the authors.</dc:rights></metadata>
