A Biologically Plausible Speech Recognition Framework Based on Spiking Neural Networks

Please use this identifier to cite or link to this item: https://oar.a-star.edu.sg/communities-collections/articles/16368

Title:

A Biologically Plausible Speech Recognition Framework Based on Spiking Neural Networks

Journal Title:

International Joint Conference on Neural Networks (IJCNN)

DOI:

10.1109/IJCNN.2018.8489535

Publication URL:

Authors:

Yansong Chua

Keywords:

Publication Date:

15 October 2018

Citation:

Abstract:

Humans recognize speech remarkably well using sparse, asynchronous events carried by electrical impulses. Motivated by the observations that the human brain learns features from environmental stimuli primarily in an unsupervised manner and consumes extremely little power on complex cognitive tasks, we propose a biologically plausible speech recognition mechanism that uses an unsupervised self-organizing map (SOM) for feature representation and an event-driven spiking neural network (SNN) for spatiotemporal pattern classification. We further improve the biological realism of the framework by using a mel-scaled filter bank as the front end, mimicking the human auditory system. In our experiments on the TIDIGITS dataset, the framework achieves speech recognition accuracy surpassing that of other bio-inspired systems. The proposed SOM-SNN framework can be implemented with an artificial silicon cochlea and a neuromorphic processor, so as to fully exploit the potential of event-based speech recognition systems.
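
The front-end and feature-representation stages named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the common mel formula mel = 2595 * log10(1 + f / 700), a one-dimensional SOM with a Gaussian neighborhood, and a simple encoding in which each frame emits one event at its best-matching unit. All function names and parameter values (`train_som`, `frames_to_events`, learning rate, epochs) are hypothetical, and the SNN classifier stage is omitted.

```python
import math
import random


def hz_to_mel(f_hz):
    """Convert Hz to mel (common 2595 * log10(1 + f/700) variant)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_center_frequencies(n_filters, f_min, f_max):
    """Center frequencies (Hz) spaced equally on the mel scale."""
    lo, hi = hz_to_mel(f_min), hz_to_mel(f_max)
    step = (hi - lo) / (n_filters + 1)
    return [mel_to_hz(lo + step * (i + 1)) for i in range(n_filters)]


def best_matching_unit(weights, x):
    """Index of the SOM unit whose weight vector is closest to x."""
    dists = [sum((w - v) ** 2 for w, v in zip(unit, x)) for unit in weights]
    return dists.index(min(dists))


def train_som(frames, n_units, epochs=20, lr0=0.5, seed=0):
    """Train a 1-D SOM (assumed topology) on a list of feature frames."""
    rng = random.Random(seed)
    dim = len(frames[0])
    weights = [[rng.random() for _ in range(dim)] for _ in range(n_units)]
    sigma0 = n_units / 2.0
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                  # decaying learning rate
        sigma = max(sigma0 * (1.0 - frac), 0.5)  # shrinking neighborhood
        for x in rng.sample(frames, len(frames)):
            bmu = best_matching_unit(weights, x)
            for j, unit in enumerate(weights):
                # Units near the BMU on the 1-D map move more strongly.
                h = math.exp(-((j - bmu) ** 2) / (2.0 * sigma ** 2))
                for k in range(dim):
                    unit[k] += lr * h * (x[k] - unit[k])
    return weights


def frames_to_events(weights, frames):
    """Encode each frame as a (frame_index, unit_index) event."""
    return [(t, best_matching_unit(weights, x)) for t, x in enumerate(frames)]


if __name__ == "__main__":
    # Toy data standing in for filter-bank frames (hypothetical values).
    toy_frames = [[0.0, 0.0]] * 10 + [[1.0, 1.0]] * 10
    som = train_som(toy_frames, n_units=4)
    print(mel_center_frequencies(5, 0.0, 8000.0))
    print(frames_to_events(som, toy_frames))
```

In this sketch, each frame activates a single unit, which gives a sparse, event-like sequence of the kind an event-driven classifier could consume; how the actual framework converts SOM activity into spikes is not specified in the abstract.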

License type:

Funding Info:

This research is supported by Programmatic grant no. A1687b0033 from the Singapore government's Research, Innovation and Enterprise 2020 plan (Advanced Manufacturing and Engineering domain).

Description:

URI:

https://oar.a-star.edu.sg/communities-collections/articles/16368

ISBN:

Collections:

Institute for Infocomm Research

Files uploaded:

Manuscripts in This Item:

There are no attached files.