001451955 000__ 05950cam\a2200649\i\4500 001451955 001__ 1451955 001451955 003__ OCoLC 001451955 005__ 20230310003334.0 001451955 006__ m\\\\\o\\d\\\\\\\\ 001451955 007__ cr\cn\nnnunnun 001451955 008__ 230101s2022\\\\sz\a\\\\o\\\\\101\0\eng\d 001451955 019__ $$a1356004876 001451955 020__ $$a9783031238215$$q(electronic bk.) 001451955 020__ $$a3031238214$$q(electronic bk.) 001451955 020__ $$z9783031238208 001451955 020__ $$z3031238206 001451955 0247_ $$a10.1007/978-3-031-23821-5$$2doi 001451955 035__ $$aSP(OCoLC)1355865871 001451955 040__ $$aYDX$$beng$$erda$$epn$$cYDX$$dGW5XE$$dEBLCP$$dOCLCQ 001451955 049__ $$aISEA 001451955 050_4 $$aQA76.88 001451955 08204 $$a004.1/1$$223/eng/20230106 001451955 1112_ $$aCARLA (Conference)$$n(9th :$$d2022 :$$cPorto Alegre, Brazil). 001451955 24510 $$aHigh performance computing :$$b9th Latin American conference, CARLA 2022, Porto Alegre, Brazil, September 26-30, 2022 : revised selected papers /$$cPhilippe Navaux, Carlos J. Barrios H., Carla Osthoff, Ginés Guerrero (eds.). 001451955 24630 $$aCARLA 2022 001451955 264_1 $$aCham :$$bSpringer,$$c[2022] 001451955 264_4 $$c©2022 001451955 300__ $$a1 online resource (xii, 233 pages) :$$billustrations (chiefly color). 001451955 336__ $$atext$$btxt$$2rdacontent 001451955 337__ $$acomputer$$bc$$2rdamedia 001451955 338__ $$aonline resource$$bcr$$2rdacarrier 001451955 4901_ $$aCommunications in computer and information science ;$$v1660 001451955 500__ $$aConference proceedings. 001451955 500__ $$aIncludes author index. 001451955 5050_ $$aIntro -- Preface -- Organization -- Contents -- A Comparative Evaluation of Parallel Programming Python Tools for Particle-in-Cell on Symmetric Multiprocessors -- 1 Introduction -- 2 Background -- 2.1 Particle-in-Cell -- 2.2 Python Parallel Programming -- 2.3 Related Work -- 3 Implementation -- 3.1 Profiling -- 3.2 Code Transformation -- 4 Experimental Results -- 4.1 Setup -- 4.2 Experiments -- 5 Discussion -- 6 Final Remarks -- References -- Accelerating GNN Training on CPU+Multi-FPGA Heterogeneous Platform -- 1 Introduction -- 2 Background -- 2.1 GNN Models -- 2.2 Mini-Batch GNN Training 001451955 5058_ $$a2.3 Related Work -- 3 GNN Training on CPU+Multi-FPGA Platform -- 4 Optimizations -- 4.1 Graph Partitioning and Workload Balancing -- 4.2 Optimized GNN Kernels -- 5 Experiments -- 5.1 Experimental Setup -- 5.2 Hardware Parameter Selection and Resource Utilization -- 5.3 Performance Metrics -- 5.4 Comparison with Multi-GPU Platform -- 5.5 Scalability -- 5.6 Impact of Optimizations -- 6 Conclusion -- References -- Implementing a GPU-Portable Field Line Tracing Application with OpenMP Offload -- 1 Introduction -- 2 Background -- 2.1 Directive-Based Programming for Accelerators with OpenMP 001451955 5058_ $$a2.2 Simulating Plasma Confinement in Stellarator Devices -- 2.3 Related Work -- 3 Directive-Based GPU Offloading Implementation -- 3.1 Breakdown of the Execution Flow -- 3.2 Data Management for Offloading -- 3.3 Parallelism Implementation -- 4 Results -- 4.1 Experimental Setup -- 4.2 Baseline Comparison: Single CPU Node Versus Single GPU -- 4.3 Multi-GPU Scalability -- 4.4 Economic Analysis -- 5 Conclusions -- References -- Quantitative Characterization of Scientific Computing Clusters -- 1 Introduction -- 2 Related Work -- 3 Background -- 3.1 Cluster Overhead and Coupling 001451955 5058_ $$a3.2 Cluster Performance Profile -- 4 Performance Evaluation -- 4.1 Experimental Setup -- 4.2 Threats to Validity -- 4.3 Results -- 4.4 Clusters Performance Profiles -- 5 Discussion -- 6 Conclusion -- References -- Towards Parameter-Based Profiling for MARE2DEM Performance Modeling -- 1 Introduction -- 2 Dataset and Application Background -- 2.1 CSEM Data -- 2.2 MARE2DEM -- 2.3 Refinement Groups -- 3 Methodology and Experimental Context -- 4 Results -- 4.1 Performance Characterization of the Microkernels -- 4.2 Iterations and Refinement Groups -- 5 Conclusion -- References 001451955 5058_ $$aTime-Power-Energy Balance of BLAS Kernels in Modern FPGAs -- 1 Introduction -- 2 FPGAs and NLA -- 2.1 BLAS -- 2.2 FPGAs -- 3 Evaluated Kernels -- 3.1 Vitis Libraries -- 3.2 Matrix-Matrix Multiplication (MMM) -- 4 Experimental Evaluation -- 4.1 Setup -- 4.2 Experimental Results and Discussion -- 5 Conclusions -- References -- Improving Boundary Layer Predictions Using Parametric Physics-Aware Neural Networks -- 1 Introduction -- 2 Related Work -- 3 Methodology -- 3.1 Boundary Layer Problem -- 3.2 Architecture Design -- 4 Experimental Results -- 4.1 First Setting: Reaction-Diffusion Problem 001451955 506__ $$aAccess limited to authorized users. 001451955 520__ $$aThis book constitutes the proceedings of the 9th Latin American Conference on High Performance Computing, CARLA 2022, held in Porto Alegre, Brazil, in September 2022. The 16 full papers presented in this volume were carefully reviewed and selected from 56 submissions. CARLA, the Latin American High Performance Computing Conference, is an international academic meeting aimed at providing a forum to foster the growth and strength of the High Performance Computing (HPC) community in Latin America and the Caribbean through the exchange and dissemination of new ideas, techniques, and research in HPC and its application areas. 001451955 588__ $$aOnline resource; title from PDF title page (SpringerLink, viewed January 6, 2023). 001451955 650_0 $$aHigh performance computing$$vCongresses. 001451955 655_7 $$aConference papers and proceedings.$$2lcgft 001451955 655_0 $$aElectronic books. 001451955 7001_ $$aNavaux, Philippe O. A.,$$eeditor. 001451955 7001_ $$aBarrios H., Carlos J.,$$eeditor. 001451955 7001_ $$aOsthoff, Carla,$$eeditor. 001451955 7001_ $$aGuerrero, Ginés,$$eeditor. 001451955 77608 $$iPrint version: $$z3031238206$$z9783031238208$$w(OCoLC)1354511405 001451955 830_0 $$aCommunications in computer and information science ;$$v1660. 001451955 852__ $$bebk 001451955 85640 $$3Springer Nature$$uhttps://univsouthin.idm.oclc.org/login?url=https://link.springer.com/10.1007/978-3-031-23821-5$$zOnline Access$$91397441.1 001451955 909CO $$ooai:library.usi.edu:1451955$$pGLOBAL_SET 001451955 980__ $$aBIB 001451955 980__ $$aEBOOK 001451955 982__ $$aEbook 001451955 983__ $$aOnline 001451955 994__ $$a92$$bISE