Logo do repositório
 
A carregar...
Miniatura
Publicação

Multithreaded and GPU-Based Implementations of a Modified Particle Swarm Optimization Algorithm with Application to Solving Large-Scale Systems of Nonlinear Equations

Utilize este identificador para referenciar este registo.

Orientador(es)

Resumo(s)

This paper presents a novel Graphics Processing Unit (GPU) accelerated implementation of a modified Particle Swarm Optimization (PSO) algorithm specifically designed to solve large-scale Systems of Nonlinear Equations (SNEs). The proposed GPU-based parallel version of the PSO algorithm uses the inherent parallelism of modern hardware architectures. Its performance is compared against both sequential and multithreaded Central Processing Unit (CPU) implementations. The primary objective is to evaluate the efficiency and scalability of PSO across different hardware platforms with a focus on solving large-scale SNEs involving thousands of equations and variables. The GPU-parallelized and multithreaded versions of the algorithm were implemented in the Julia programming language. Performance analyses were conducted on an NVIDIA A100 GPU and an AMD EPYC 7643 CPU. The tests utilized a set of challenging, scalable SNEs with dimensions ranging from 1000 to 5000. Results demonstrate that the GPU accelerated modified PSO substantially outperforms its CPU counterparts, achieving substantial speedups and consistently surpassing the highly optimized multithreaded CPU implementation in terms of computation time and scalability as the problem size increases. Therefore, this work evaluates the trade-offs between different hardware platforms and underscores the potential of GPU-based parallelism for accelerating SNE solvers.

Descrição

Funding information: This work was partially supported by IntellMax–Optimization, Artificial Intelligence and Data Science, Lda., Portugal. This work was also supported by the Interactive Technologies Institute (ITI/LARSyS), funded by the Portuguese Foundation for Science and Technology (FCT) through the projects 10.54499/LA/P/0083/2020, 10.54499/UIDP/50009/2020, and 10.54499/UIDB/50009/2020. The HPC resources used in this study werec provided by the Portuguese National Distributed Computing Infrastructure (INCD) through the FCT Advanced Computing Projects 2023.09611.CPCA.A1 and 2024.07086.CPCA.A1. Publisher Copyright: © 2025 by the authors.

Palavras-chave

metaheuristic optimization nonlinear equations systems parallel GPU algorithms swarm-based algorithms Control and Systems Engineering Signal Processing Hardware and Architecture Computer Networks and Communications Electrical and Electronic Engineering

Contexto Educativo

Citação

Projetos de investigação

Unidades organizacionais

Fascículo

Editora

Licença CC

Métricas Alternativas