JOURNAL ARTICLE
UPGRADING THE JANET NEURAL NETWORK BY INTRODUCING A NEW STORAGE BUFFER OF WORKING MEMORY.
Published In: Neural Network World, 2023, n. 6. P. 433 1 of 3
Database: The Belt and Road Initiative Reference Source 2 of 3
Authored By: Tolic, A.; Boshkoska, B. M.; Skansi, S. 3 of 3
Abstract
Recurrent neural networks (RNNs), along with long short-term memory networks (LSTMs), have been successfully used on a wide range of sequential data problems and have been entitled as extraordinarily powerful tools for learning and processing such data. However, the search for a new or derived architecture that would model very long-term dependencies is still an active area of research. In this paper, a relatively psychologically plausible architecture named event buffering JANET (EB-JANET) is proposed. The architecture is derived from the forgetgate-only version of the LSTM, which is also called just another network (JANET). The new architecture implements a new working memory mechanism that operates on information represented as dynamic events. The event buffer, as a container of events, is a reference to the state of the relevant pre-activation values on the basis of which historical candidate values were generated relative to the current timestep. The buffer is emptied as needed and depending on the context of information. The proposed architecture has achieved world-class results and it outperforms JANET on multiple benchmark datasets. Moreover, the new architecture is applicable to a wider class of problems and showed superior resilience when processing longer sequences, as opposed to JANET which experienced catastrophic failures on certain tasks. [ABSTRACT FROM AUTHOR]
Additional Information
- Source:Neural Network World. 2023/11, Issue 6, p433
- Document Type:Article
- Subject Area:Psychology
- Publication Date:2023
- ISSN:1210-0552
- DOI:10.14311/NNW.2023.33.024
- Accession Number:175737071
- Copyright Statement:Copyright of Neural Network World is the property of Czech Technical University in Prague, Faculty of Transportation Sciences and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Looking to go deeper into this topic? Look for more articles on EBSCOhost.