JOURNAL ARTICLE

FINB: a Japanese named entity recognition model based on multi-feature integration method.

  • Published In: Computer Journal, 2025, v. 68, n. 4. P. 419 1 of 3

  • Database: Academic Search Ultimate 2 of 3

  • Authored By: Wang, Yingjie; Zhang, Chengye; Bai, Fengbo; Wang, Zumin; Qin, Jing 3 of 3

Abstract

This article focuses on improving Japanese named entity recognition (NER) by proposing a model called FINB (Feature Integration Network with BERT) that integrates multiple linguistic features inherent to Japanese text. Unlike English, Japanese lacks clear word boundaries and uses three character types—hiragana, katakana, and kanji—posing challenges for NER. FINB extracts pronunciation features via romanization (Romaji) and glyph features by converting characters into images, combining these with character embeddings in a BERT-BiLSTM-CRF architecture to enhance semantic understanding. Experiments on the Kyoto University Web Document Leads Corpus (KWDLC) and Japanese Wikipedia datasets demonstrate that FINB outperforms existing models, effectively reducing errors caused by homophones and improving recognition accuracy across various entity types. The study suggests that multi-feature integration leveraging language-specific characteristics can advance NER performance in Japanese and potentially other languages with similar complexities.

Additional Information

  • Source:Computer Journal. 2025/04, Vol. 68, Issue 4, p419
  • Document Type:Article
  • Subject Area:Literature and Writing
  • Publication Date:2025
  • ISSN:0010-4620
  • DOI:10.1093/comjnl/bxae121
  • Accession Number:185320677
  • Copyright Statement:Copyright of Computer Journal is the property of Oxford University Press / USA and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)

Looking to go deeper into this topic? Look for more articles on EBSCOhost.