default search action
30th Euro-Par 2024: Madrid, Spain - Part III
- Jesús Carretero, Sameer Shende, Javier García-Blas, Ivona Brandic, Katzalin Olcoz, Martin Schreiber:
Euro-Par 2024: Parallel Processing - 30th European Conference on Parallel and Distributed Processing, Madrid, Spain, August 26-30, 2024, Proceedings, Part III. Lecture Notes in Computer Science 14803, Springer 2024, ISBN 978-3-031-69582-7
Theory and Algorithms
- Eunji Lee, Yoonsang Han, Gordon Euhyun Moon:
Accelerated Block-Sparsity-Aware Matrix Reordering for Leveraging Tensor Cores in Sparse Matrix-Multivector Multiplication. 3-16 - Stef Graillat, Fabienne Jézéquel, Théo Mary, Roméo Molina, Daichi Mukunoki:
Reduced-Precision and Reduced-Exponent Formats for Accelerating Adaptive Precision Sparse Matrix-Vector Product. 17-30 - Marc Baboulin, Simplice Donfack, Oguz Kaya, Théo Mary, Matthieu Robeyns:
Mixed Precision Randomized Low-Rank Approximation with GPU Tensor Cores. 31-44 - Andrzej Lingas:
Boolean Matrix Multiplication for Highly Clustered Data on the Congested Clique. 45-58 - Roy Nissim, Oded Schwartz, Yuval Spiizer:
Minimizing I/O in Toom-Cook Algorithms. 59-73 - Filippo Ziche, Nicola Bombieri, Federico Busato, Rosalba Giugno:
GPU-Accelerated BFS for Dynamic Networks. 74-87 - Qasim Abbas, Mohsen Koohi Esfahani, Ian M. Overton, Hans Vandierendonck:
QClique: Optimizing Performance and Accuracy in Maximum Weighted Clique. 88-102 - Ivo Gabe de Wolff, Daniel Anderson, Gabriele K. Keller, Aleksei Seletskiy:
A Fast Wait-Free Solution to Read-Reclaim Races in Reference Counting. 103-118 - Kåre von Geijer, Philippas Tsigas:
How to Relax Instantly: Elastic Relaxation of Concurrent Data Structures. 119-133 - Sharon Boddu, Maleq Khan:
ALZI: An Improved Parallel Algorithm for Finding Connected Components in Large Graphs. 134-147
Multidisciplinary, Domain-Specific and Applied Parallel and Distributed Computing
- Dian-Lun Lin, Umit Ogras, Joshua San Miguel, Tsung-Wei Huang:
TaroRTL: Accelerating RTL Simulation Using Coroutine-Based Heterogeneous Task Graph Scheduling. 151-166 - Thiago Maltempi, Sandro Rigo, Márcio Machado Pereira, Hervé Yviquel, Jessé Costa, Guido Araujo:
Combining Compression and Prefetching to Improve Checkpointing for Inverse Seismic Problems in GPUs. 167-181 - Guofeng Feng, Hongyu Wang, Zhuoqiang Guo, Mingzhen Li, Tong Zhao, Zhou Jin, Weile Jia, Guangming Tan, Ninghui Sun:
Accelerating Large-Scale Sparse LU Factorization for RF Circuit Simulation. 182-195 - Jiajun Song, Jiajun Luo, Rongwei Lu, Shuzhao Xie, Bin Chen, Zhi Wang:
A Joint Approach to Local Updating and Gradient Compression for Efficient Asynchronous Federated Learning. 196-211 - Xianlong Zhou, Pei Li, Jiageng Chen, Shixiong Yao:
Accelerating Stencil Computation with Fully Homomorphic Encryption Using GPU. 212-224 - Cristian Catalin Tatu, Javier Conejero, Fernando Vázquez-Novoa, Rosa M. Badia:
GPU Cache System for COMPSs: A Task-Based Distributed Computing Framework. 225-239 - Richard Angersbach, Sebastian Kuckuk, Harald Köstler:
Code Generation for Octree-Based Multigrid Solvers with Fused Higher-Order Interpolation and Communication. 240-254 - Zhuoyao Huang, Nan Zhang, Jingran Shen, Georgios Diamantopoulos, Zhengchang Hua, Nikos Tziritas, Georgios Theodoropoulos:
Distributed Simulation for Digital Twins of Large-Scale Real-World DiffServ-Based Networks. 255-269 - F. Romero, Marcos Lupión, Nicolás C. Cruz, Luis F. Romero, Pilar Martínez Ortigosa:
On the Use of GPU Computing for Accelerating EEG Preprocessing. 270-282 - Dazheng Liu, Xiaoli Ren, Jianping Wu, Wenjuan Liu, Juan Zhao, Shaoliang Peng:
Pipe-AGCM: A Fine-Grain Pipelining Scheme for Optimizing the Parallel Atmospheric General Circulation Model. 283-297 - Helena S. Silva, Maria Clicia Stelling de Castro, Fabrício A. B. da Silva, Alba C. M. A. Melo:
A Framework for Automated Parallel Execution of Scientific Multi-workflow Applications in the Cloud with Work Stealing. 298-311 - Subhajit Sahu, Kishore Kothapalli, Hemalatha Eedi, Sathya Peri:
DF* PageRank: Incrementally Expanding Approaches for Updating PageRank on Dynamic Graphs. 312-326 - Jie Jia, Yi Liu, Yanke Liu, Yifan Chen, Fang Lin:
AdapCK: Optimizing I/O for Checkpointing on Large-Scale High Performance Computing Systems. 342-355 - YuAng Chen, Jeffery Xu Yu:
Efficient SpMV for Graph Matrices Through Vectoring and Caching. 356-370 - Xiaokang Fan, Zhen Ge, Sifan Long, Tao Tang, Chun Huang, Lin Peng, Canqun Yang:
VLASPH: Smoothed Particle Hydrodynamics on VLA SIMD Architectures. 371-385 - Tiago Carneiro, Engin Kayraklioglu, Guillaume Helbecque, Nouredine Melab:
Investigating Portability in Chapel for Tree-Based Optimization on GPU-Powered Clusters. 386-399 - Júnior Löff, Dalvan Griebler, Luiz Gustavo Fernandes, Walter Binder:
MPR: An MPI Framework for Distributed Self-adaptive Stream Processing. 400-414
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.