JOURNAL ARTICLE

NUMA-aware parallel sparse LU factorization for SPICE-based circuit simulators on ARM multi-core processors.

  • Published In: International Journal of High Performance Computing Applications, 2025, v. 39, n. 3. P. 405 1 of 3

  • Database: Academic Search Ultimate 2 of 3

  • Authored By: Zhou, Junsheng; Yang, Wangdong; Dong, Fengkun; Lin, Shengle; Cai, Qinyun; Li, Kenli 3 of 3

Abstract

The article focuses on the development and evaluation of HLU, a parallel sparse direct solver designed to improve the performance of sparse lower-upper (LU) factorization in circuit simulators resembling the Simulation Program with Integrated Circuit Emphasis (SPICE). HLU introduces a fine-grained scheduling method based on an elimination tree in pipeline mode to maximize task-level parallelism and reduce thread waiting time, alongside two NUMA (non-uniform memory access) node affinity strategies for optimized thread mapping and memory placement on multi-NUMA systems. Experimental results on Huawei Kunpeng 920 and Intel Xeon Gold 5120 servers demonstrate that HLU achieves significant speedups—up to 9.14× over KLU and 1.26× over NICSLU (both established sparse solvers) on circuit simulation matrices—and also shows improved performance on general sparse matrices. The solver addresses challenges related to the irregular sparsity and strong data dependencies of circuit matrices, ARM architecture memory consistency, and NUMA system memory access latency, making it applicable to both circuit simulation and broader scientific computing domains.

Additional Information

  • Source:International Journal of High Performance Computing Applications. 2025/05, Vol. 39, Issue 3, p405
  • Document Type:Article
  • Subject Area:Computer Science
  • Publication Date:2025
  • ISSN:1094-3420
  • DOI:10.1177/10943420241241491
  • Accession Number:184747394
  • Copyright Statement:Copyright of International Journal of High Performance Computing Applications is the property of Sage Publications Inc. and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)

Looking to go deeper into this topic? Look for more articles on EBSCOhost.