default search action
P. Sadayappan
Person information
- affiliation: University of Utah, Salt Lake City, UT, USA
- affiliation (former): Ohio State University, Columbus, USA
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j66]Dhabaleswar K. Panda, Vipin Chaudhary, Eric Fosler-Lussier, Raghu Machiraju, Amit Majumdar, Beth Plale, Rajiv Ramnath, Ponnuswamy Sadayappan, Neelima Savardekar, Karen Tomko:
Creating intelligent cyberinfrastructure for democratizing AI. AI Mag. 45(1): 22-28 (2024) - [c281]Chendi Li, Yufan Xu, Sina Mahdipour Saravani, Ponnuswamy Sadayappan:
Accelerated Auto-Tuning of GPU Kernels for Tensor Computations. ICS 2024: 549-561 - [e8]Gabriel Rodríguez, P. Sadayappan, Aravind Sukumaran-Rajam:
Proceedings of the 33rd ACM SIGPLAN International Conference on Compiler Construction, CC 2024, Edinburgh, United Kingdom, March 2-3, 2024. ACM 2024 [contents] - [i16]Tripti Agarwal, Harvey Dam, Dorra Ben Khalifa, Matthieu Martel, P. Sadayappan, Ganesh Gopalakrishnan:
What Operations can be Performed Directly on Compressed Arrays, and with What Error? CoRR abs/2406.11209 (2024) - [i15]Ashim Gupta, Sina Mahdipour Saravani, P. Sadayappan, Vivek Srikumar:
An Empirical Investigation of Matrix Factorization Methods for Pre-trained Transformers. CoRR abs/2406.11307 (2024) - 2023
- [j65]Eric Heisler, Aadesh Deshmukh, Sandip Mazumder, Ponnuswamy Sadayappan, Hari Sundar:
Multi-discretization domain specific language and code generation for differential equations. J. Comput. Sci. 68: 101981 (2023) - [j64]Nicolas Tollenaere, Guillaume Iooss, Stéphane Pouget, Hugo Brunie, Christophe Guillon, Albert Cohen, P. Sadayappan, Fabrice Rastello:
Autotuning Convolutions Is Easier Than You Think. ACM Trans. Archit. Code Optim. 20(2): 20:1-20:24 (2023) - [c280]Jon Roose, Miheer Vaidya, Ponnuswamy Sadayappan, Sivasankaran Rajamanickam:
TenSQL: An SQL Database Built on GraphBLAS. HPEC 2023: 1-8 - [c279]Han D. Tran, Siddharth Saurav, P. Sadayappan, Sandip Mazumder, Hari Sundar:
Scalable parallelization for the solution of phonon Boltzmann Transport Equation. ICS 2023: 215-226 - [c278]Süreyya Emre Kurt, Jinghua Yan, Aravind Sukumaran-Rajam, Prashant Pandey, P. Sadayappan:
Communication Optimization for Distributed Execution of Graph Neural Networks. IPDPS 2023: 512-523 - [c277]M. Emin Ozturk, Omid Asudeh, Gerald Sabin, P. Sadayappan, Aravind Sukumaran-Rajam:
A Performance Portability Study Using Tensor Contraction Benchmarks. IPDPS Workshops 2023: 591-600 - [c276]Lizhi Xiang, Miao Yin, Chengming Zhang, Aravind Sukumaran-Rajam, P. Sadayappan, Bo Yuan, Dingwen Tao:
TDC: Towards Extremely Efficient CNNs on GPUs via Hardware-Aware Tucker Decomposition. PPoPP 2023: 260-273 - [c275]Martin Kong, Raneem Abu Yosef, Atanas Rountev, P. Sadayappan:
Automatic Generation of Distributed-Memory Mappings for Tensor Computations. SC 2023: 64:1-64:13 - [c274]Tripti Agarwal, Harvey Dam, Ponnuswamy Sadayappan, Ganesh Gopalakrishnan, Dorra Ben Khalifa, Matthieu Martel:
What Operations can be Performed Directly on Compressed Arrays, and with What Error? SC Workshops 2023: 252-262 - [i14]Eric Heisler, Siddharth Saurav, Aadesh Deshmukh, Sandip Mazumder, Ponnuswamy Sadayappan, Hari Sundar:
Automating GPU Scalability for Complex Scientific Models: Phonon Boltzman Transport Equation. CoRR abs/2305.19400 (2023) - 2022
- [c273]Yufan Xu, Qiwei Yuan, Erik Curtis Barton, Rui Li, P. Sadayappan, Aravind Sukumaran-Rajam:
Effective Performance Modeling and Domain-Specific Compiler Optimization of CNNs for GPUs. PACT 2022: 252-264 - [c272]Lizhi Xiang, P. Sadayappan, Aravind Sukumaran-Rajam:
High-Performance Architecture Aware Sparse Convolutional Neural Networks for GPUs. PACT 2022: 265-278 - [c271]Yufan Xu, Saurabh Raje, Atanas Rountev, Gerald Sabin, Aravind Sukumaran-Rajam, P. Sadayappan:
Training of deep learning pipelines on memory-constrained GPUs via segmented fused-tiled execution. CC 2022: 104-116 - [c270]Miheer Vaidya, Aravind Sukumaran-Rajam, Atanas Rountev, P. Sadayappan:
Comprehensive Accelerator-Dataflow Co-design Optimization for Convolutional Neural Networks. CGO 2022: 325-335 - [c269]Süreyya Emre Kurt, Saurabh Raje, Aravind Sukumaran-Rajam, P. Sadayappan:
Sparsity-Aware Tensor Decomposition. IPDPS 2022: 952-962 - [c268]Philip Munksgaard, Troels Henriksen, Ponnuswamy Sadayappan, Cosmin E. Oancea:
Memory Optimizations in an Array Language. SC 2022: 31:1-31:15 - [d4]Philip Munksgaard, Troels Henriksen, Ponnuswamy Sadayappan, Cosmin E. Oancea:
futhark-mem-sc22. Version v0.1.8. Zenodo, 2022 [all versions] - [d3]Philip Munksgaard, Troels Henriksen, Ponnuswamy Sadayappan, Cosmin E. Oancea:
futhark-mem-sc22. Version v1.0.0. Zenodo, 2022 [all versions] - [d2]Philip Munksgaard, Troels Henriksen, Ponnuswamy Sadayappan, Cosmin E. Oancea:
futhark-mem-sc22. Version v1.1.0. Zenodo, 2022 [all versions] - [d1]Philip Munksgaard, Troels Henriksen, Ponnuswamy Sadayappan, Cosmin E. Oancea:
futhark-mem-sc22. Version v1.1.1. Zenodo, 2022 [all versions] - [i13]Lizhi Xiang, Miao Yin, Chengming Zhang, Aravind Sukumaran-Rajam, P. Sadayappan, Bo Yuan, Dingwen Tao:
TDC: Towards Extremely Efficient CNNs on GPUs via Hardware-Aware Tucker Decomposition. CoRR abs/2211.03715 (2022) - [i12]Paolo Bientinesi, David A. Ham, Furong Huang, Paul H. J. Kelly, P. Sadayappan, Edward Stow:
Tensor Computations: Applications and Optimization (Dagstuhl Seminar 22101). Dagstuhl Reports 12(3): 1-14 (2022) - 2021
- [c267]Rui Li, Yufan Xu, Aravind Sukumaran-Rajam, Atanas Rountev, P. Sadayappan:
Analytical characterization and design space exploration for optimization of CNNs. ASPLOS 2021: 928-942 - [c266]Auguste Olivry, Guillaume Iooss, Nicolas Tollenaere, Atanas Rountev, P. Sadayappan, Fabrice Rastello:
IOOpt: automatic derivation of I/O complexity bounds for affine programs. PLDI 2021: 1187-1202 - [c265]Rui Li, Yufan Xu, Aravind Sukumaran-Rajam, Atanas Rountev, P. Sadayappan:
Efficient Distributed Algorithms for Convolutional Neural Networks. SPAA 2021: 439-442 - [i11]Rui Li, Yufan Xu, Aravind Sukumaran-Rajam, Atanas Rountev, P. Sadayappan:
Analytical Characterization and Design Space Exploration for Optimization of CNNs. CoRR abs/2101.09808 (2021) - [i10]Rui Li, Yufan Xu, Aravind Sukumaran-Rajam, Atanas Rountev, P. Sadayappan:
Efficient distributed algorithms for Convolutional Neural Networks. CoRR abs/2105.13480 (2021) - 2020
- [c264]Gordon Euhyun Moon, J. Austin Ellis, Aravind Sukumaran-Rajam, Srinivasan Parthasarathy, P. Sadayappan:
ALO-NMF: Accelerated Locality-Optimized Non-negative Matrix Factorization. KDD 2020: 1758-1767 - [c263]Auguste Olivry, Julien Langou, Louis-Noël Pouchet, P. Sadayappan, Fabrice Rastello:
Automated derivation of parametric data movement lower bounds for affine programs. PLDI 2020: 808-822 - [c262]Jinsung Kim, Ajay Panyala, Bo Peng, Karol Kowalski, P. Sadayappan, Sriram Krishnamoorthy:
Scalable heterogeneous execution of a coupled-cluster model with perturbative triples. SC 2020: 79 - [c261]Süreyya Emre Kurt, Aravind Sukumaran-Rajam, Fabrice Rastello, P. Sadayappan:
Efficient tiled sparse matrix multiplication through matrix signatures. SC 2020: 87 - [c260]Troels Henriksen, Sune Hellfritzsch, Ponnuswamy Sadayappan, Cosmin E. Oancea:
Compiling generalized histograms for GPU. SC 2020: 97 - [e7]Ponnuswamy Sadayappan, Bradford L. Chamberlain, Guido Juckeland, Hatem Ltaief:
High Performance Computing - 35th International Conference, ISC High Performance 2020, Frankfurt/Main, Germany, June 22-25, 2020, Proceedings. Lecture Notes in Computer Science 12151, Springer 2020, ISBN 978-3-030-50742-8 [contents]
2010 – 2019
- 2019
- [c259]Jiankai Sun, Bortik Bandyopadhyay, Armin Bashizade, Jiongqian Liang, P. Sadayappan, Srinivasan Parthasarathy:
ATP: Directed Graph Embedding with Asymmetric Transitivity Preservation. AAAI 2019: 265-272 - [c258]Jinsung Kim, Aravind Sukumaran-Rajam, Vineeth Thumma, Sriram Krishnamoorthy, Ajay Panyala, Louis-Noël Pouchet, Atanas Rountev, P. Sadayappan:
A Code Generator for High-Performance Tensor Contractions on GPUs. CGO 2019: 85-95 - [c257]Israt Nisa, Jiajia Li, Aravind Sukumaran-Rajam, Richard W. Vuduc, P. Sadayappan:
Load-Balanced Sparse MTTKRP on GPUs. IPDPS 2019: 123-133 - [c256]Prashant Singh Rawat, Miheer Vaidya, Aravind Sukumaran-Rajam, Atanas Rountev, Louis-Noël Pouchet, P. Sadayappan:
On Optimizing Complex Stencils on GPUs. IPDPS 2019: 641-652 - [c255]Changwan Hong, Aravind Sukumaran-Rajam, Israt Nisa, Kunal Singh, P. Sadayappan:
Adaptive sparse tiling for sparse matrix multiplication. PPoPP 2019: 300-314 - [c254]Gordon Euhyun Moon, Denis Newman-Griffis, Jinsung Kim, Aravind Sukumaran-Rajam, Eric Fosler-Lussier, P. Sadayappan:
Parallel Data-Local Training for Optimizing Word2Vec Embeddings for Word and Graph Embeddings. MLHPC@SC 2019: 44-55 - [c253]Israt Nisa, Jiajia Li, Aravind Sukumaran-Rajam, Prashant Singh Rawat, Sriram Krishnamoorthy, P. Sadayappan:
An efficient mixed-mode representation of sparse tensors. SC 2019: 49:1-49:25 - [c252]Rui Li, Aravind Sukumaran-Rajam, Richard Veras, Tze Meng Low, Fabrice Rastello, Atanas Rountev, P. Sadayappan:
Analytical cache modeling and tilesize optimization for tensor contractions. SC 2019: 74:1-74:13 - [e6]Michèle Weiland, Guido Juckeland, Carsten Trinitis, Ponnuswamy Sadayappan:
High Performance Computing - 34th International Conference, ISC High Performance 2019, Frankfurt/Main, Germany, June 16-20, 2019, Proceedings. Lecture Notes in Computer Science 11501, Springer 2019, ISBN 978-3-030-20655-0 [contents] - [i9]Israt Nisa, Jiajia Li, Aravind Sukumaran-Rajam, Richard W. Vuduc, P. Sadayappan:
Load-Balanced Sparse MTTKRP on GPUs. CoRR abs/1904.03329 (2019) - [i8]Gordon Euhyun Moon, Aravind Sukumaran-Rajam, Srinivasan Parthasarathy, P. Sadayappan:
PL-NMF: Parallel Locality-Optimized Non-negative Matrix Factorization. CoRR abs/1904.07935 (2019) - [i7]Auguste Olivry, Julien Langou, Louis-Noël Pouchet, P. Sadayappan, Fabrice Rastello:
Automated Derivation of Parametric Data Movement Lower Bounds for Affine Programs. CoRR abs/1911.06664 (2019) - 2018
- [j63]Wenlei Bao, Sriram Krishnamoorthy, Louis-Noël Pouchet, P. Sadayappan:
Analytical modeling of cache behavior for affine programs. Proc. ACM Program. Lang. 2(POPL): 32:1-32:26 (2018) - [j62]Prashant Singh Rawat, Miheer Vaidya, Aravind Sukumaran-Rajam, Mahesh Ravishankar, Vinod Grover, Atanas Rountev, Louis-Noël Pouchet, P. Sadayappan:
Domain-Specific Optimization and Generation of High-Performance GPU Code for Stencil Computations. Proc. IEEE 106(11): 1902-1920 (2018) - [c251]Israt Nisa, Aravind Sukumaran-Rajam, Süreyya Emre Kurt, Changwan Hong, P. Sadayappan:
Sampled Dense Matrix Multiplication for High-Performance Machine Learning. HiPC 2018: 32-41 - [c250]Changwan Hong, Aravind Sukumaran-Rajam, Bortik Bandyopadhyay, Jinsung Kim, Süreyya Emre Kurt, Israt Nisa, Shivani Sabhlok, Ümit V. Çatalyürek, Srinivasan Parthasarathy, P. Sadayappan:
Efficient sparse-matrix multi-vector product on GPUs. HPDC 2018: 66-79 - [c249]Gordon Euhyun Moon, Israt Nisa, Aravind Sukumaran-Rajam, Bortik Bandyopadhyay, Srinivasan Parthasarathy, P. Sadayappan:
Parallel Latent Dirichlet Allocation on GPUs. ICCS (2) 2018: 259-272 - [c248]Jinsung Kim, Aravind Sukumaran-Rajam, Changwan Hong, Ajay Panyala, Rohit Kumar Srivastava, Sriram Krishnamoorthy, P. Sadayappan:
Optimizing Tensor Contractions in CCSD(T) for Efficient Execution on GPUs. ICS 2018: 96-106 - [c247]Jyothi Vedurada, Arjun Suresh, Aravind Sukumaran-Rajam, Jinsung Kim, Changwan Hong, Ajay Panyala, Sriram Krishnamoorthy, V. Krishna Nandivada, Rohit Kumar Srivastava, P. Sadayappan:
TTLG - An Efficient Tensor Transposition Library for GPUs. IPDPS 2018: 578-588 - [c246]Israt Nisa, Charles Siegel, Aravind Sukumaran-Rajam, Abhinav Vishnu, P. Sadayappan:
Effective Machine Learning Based Format Selection and Performance Modeling for SpMV on GPUs. IPDPS Workshops 2018: 1056-1065 - [c245]Changwan Hong, Aravind Sukumaran-Rajam, Jinsung Kim, Prashant Singh Rawat, Sriram Krishnamoorthy, Louis-Noël Pouchet, Fabrice Rastello, P. Sadayappan:
GPU code optimization using abstract kernel emulation and sensitivity analysis. PLDI 2018: 736-751 - [c244]Prashant Singh Rawat, Fabrice Rastello, Aravind Sukumaran-Rajam, Louis-Noël Pouchet, Atanas Rountev, P. Sadayappan:
Register optimizations for stencils on GPUs. PPoPP 2018: 168-182 - [c243]Changwan Hong, Aravind Sukumaran-Rajam, Jinsung Kim, Prashant Singh Rawat, Sriram Krishnamoorthy, Louis-Noël Pouchet, Fabrice Rastello, P. Sadayappan:
Performance modeling for GPUs using abstract kernel emulation. PPoPP 2018: 397-398 - [c242]Prashant Singh Rawat, Aravind Sukumaran-Rajam, Atanas Rountev, Fabrice Rastello, Louis-Noël Pouchet, P. Sadayappan:
Associative instruction reordering to alleviate register pressure. SC 2018: 46:1-46:13 - [i6]Jiankai Sun, Bortik Bandyopadhyay, Armin Bashizade, Jiongqian Liang, P. Sadayappan, Srinivasan Parthasarathy:
ATP: Directed Graph Embedding with Asymmetric Transitivity Preservation. CoRR abs/1811.00839 (2018) - 2017
- [c241]Changwan Hong, Aravind Sukumaran-Rajam, Jinsung Kim, P. Sadayappan:
MultiGraph: Efficient Graph Processing on GPUs. PACT 2017: 27-40 - [c240]Prashant Singh Rawat, Aravind Sukumaran-Rajam, Atanas Rountev, Fabrice Rastello, Louis-Noël Pouchet, P. Sadayappan:
POSTER: Statement Reordering to Alleviate Register Pressure for Stencils on GPUs. PACT 2017: 158-159 - [c239]Gordon Euhyun Moon, Aravind Sukumaran-Rajam, P. Sadayappan:
Parallel LDA with Over-Decomposition. HiPC Workshops 2017: 25-31 - [c238]Süreyya Emre Kurt, Vineeth Thumma, Changwan Hong, Aravind Sukumaran-Rajam, P. Sadayappan:
Characterization of Data Movement Requirements for Sparse Matrix Computations on GPUs. HiPC 2017: 283-293 - [c237]Rakshith Kunchum, Ankur Chaudhry, Aravind Sukumaran-Rajam, Qingpeng Niu, Israt Nisa, P. Sadayappan:
On improving performance of sparse matrix-matrix multiplication on GPUs. ICS 2017: 14:1-14:11 - [c236]Wenlei Bao, Prashant Singh Rawat, Martin Kong, Sriram Krishnamoorthy, Louis-Noël Pouchet, P. Sadayappan:
Efficient Cache Simulation for Affine Computations. LCPC 2017: 65-85 - [c235]Israt Nisa, Aravind Sukumaran-Rajam, Rakshith Kunchum, P. Sadayappan:
Parallel CCD++ on GPU for Matrix Factorization. GPGPU@PPoPP 2017: 73-83 - [c234]Samyam Rajbhandari, Fabrice Rastello, Karol Kowalski, Sriram Krishnamoorthy, P. Sadayappan:
Optimizing the Four-Index Integral Transform Using Data Movement Lower Bounds Analysis. PPoPP 2017: 327-340 - 2016
- [j61]Humayun Arafat, James Dinan, Sriram Krishnamoorthy, Pavan Balaji, P. Sadayappan:
Work stealing for GPU-accelerated parallel programs in a global address space framework. Concurr. Comput. Pract. Exp. 28(13): 3637-3654 (2016) - [j60]Qingpeng Niu, James Dinan, Sravya Tirukkovalur, Anouar Benali, Jeongnim Kim, Lubos Mitas, Lucas K. Wagner, P. Sadayappan:
Global-view coefficients: a data management solution for parallel quantum Monte Carlo applications. Concurr. Comput. Pract. Exp. 28(13): 3655-3671 (2016) - [j59]Wenlei Bao, Changwan Hong, Sudheer Chunduri, Sriram Krishnamoorthy, Louis-Noël Pouchet, Fabrice Rastello, P. Sadayappan:
Static and Dynamic Frequency Scaling on Multicore CPUs. ACM Trans. Archit. Code Optim. 13(4): 51:1-51:26 (2016) - [c233]Prashant Singh Rawat, Changwan Hong, Mahesh Ravishankar, Vinod Grover, Louis-Noël Pouchet, Atanas Rountev, P. Sadayappan:
Resource Conscious Reuse-Driven Tiling for GPUs. PACT 2016: 99-111 - [c232]Lukasz Domagala, Duco van Amstel, Fabrice Rastello, P. Sadayappan:
Register allocation and promotion through combined instruction scheduling and loop unrolling. CC 2016: 143-151 - [c231]Samyam Rajbhandari, Jinsung Kim, Sriram Krishnamoorthy, Louis-Noël Pouchet, Fabrice Rastello, Robert J. Harrison, P. Sadayappan:
On fusing recursive traversals of K-d trees. CC 2016: 152-162 - [c230]Sanket Tavarageri, Wooil Kim, Josep Torrellas, P. Sadayappan:
Compiler Support for Software Cache Coherence. HiPC 2016: 341-350 - [c229]Wooil Kim, Sanket Tavarageri, P. Sadayappan, Josep Torrellas:
Architecting and Programming a Hardware-Incoherent Multiprocessor Cache Hierarchy. IPDPS 2016: 555-565 - [c228]Rajkumar Kettimuthu, Gagan Agrawal, P. Sadayappan, Ian T. Foster:
Differentiated Scheduling of Response-Critical and Best-Effort Wide-Area Data Transfers. IPDPS 2016: 1113-1122 - [c227]Changwan Hong, Wenlei Bao, Albert Cohen, Sriram Krishnamoorthy, Louis-Noël Pouchet, Fabrice Rastello, J. Ramanujam, P. Sadayappan:
Effective padding of multidimensional arrays to avoid cache conflict misses. PLDI 2016: 129-144 - [c226]Wenlei Bao, Sriram Krishnamoorthy, Louis-Noël Pouchet, Fabrice Rastello, P. Sadayappan:
PolyCheck: dynamic verification of iteration space transformations on affine programs. POPL 2016: 539-554 - [c225]Prashant Singh Rawat, Changwan Hong, Mahesh Ravishankar, Vinod Grover, Louis-Noël Pouchet, P. Sadayappan:
Effective resource management for enhancing performance of 2D and 3D stencils on GPUs. GPGPU@PPoPP 2016: 92-102 - [c224]Martin Kong, Louis-Noël Pouchet, P. Sadayappan, Vivek Sarkar:
PIPES: a language and compiler for task-based programming on distributed-memory clusters. SC 2016: 456-467 - [c223]Samyam Rajbhandari, Jinsung Kim, Sriram Krishnamoorthy, Louis-Noël Pouchet, Fabrice Rastello, Robert J. Harrison, P. Sadayappan:
A domain-specific compiler for a parallel multiresolution adaptive numerical simulation environment. SC 2016: 468-479 - [c222]Timothy Carpenter, Fabrice Rastello, P. Sadayappan, Anastasios Sidiropoulos:
Brief Announcement: Approximating the I/O Complexity of One-Shot Red-Blue Pebbling. SPAA 2016: 161-163 - 2015
- [j58]Arash Ashari, Naser Sedaghati, John Eisenlohr, P. Sadayappan:
A model-driven blocking strategy for load balanced sparse matrix-vector multiplication on GPUs. J. Parallel Distributed Comput. 76: 3-15 (2015) - [j57]Keshav Pingali, J. Ramanujam, P. Sadayappan:
Introduction to the Special Issue on PPoPP'12. ACM Trans. Parallel Comput. 1(2): 9:1-9:2 (2015) - [c221]Naznin Fauzia, Louis-Noël Pouchet, P. Sadayappan:
Characterizing and enhancing global memory data coalescing on GPUs. CGO 2015: 12-22 - [c220]Naser Sedaghati, Te Mu, Louis-Noël Pouchet, Srinivasan Parthasarathy, P. Sadayappan:
Automatic Selection of Sparse Matrix Representation on GPUs. ICS 2015: 99-108 - [c219]Tobias Grosser, Jagannathan Ramanujam, Louis-Noël Pouchet, P. Sadayappan, Sebastian Pop:
Optimistic Delinearization of Parametrically Sized Arrays. ICS 2015: 351-360 - [c218]Ponnuswamy Sadayappan, Ray-Bing Chen:
iWAPT Invited Talks. IPDPS Workshops 2015: 1202-1203 - [c217]Martin Kong, Louis-Noël Pouchet, Ponnuswamy Sadayappan:
A Roofline-Based Performance Estimator for Distributed Matrix-Multiply on Intel CnC. IPDPS Workshops 2015: 1241-1250 - [c216]Venmugil Elango, Fabrice Rastello, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan:
On Characterizing the Data Access Complexity of Programs. POPL 2015: 567-580 - [c215]Mahesh Ravishankar, Roshan Dathathri, Venmugil Elango, Louis-Noël Pouchet, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Distributed memory code generation for mixed Irregular/Regular computations. PPoPP 2015: 65-75 - [c214]Arash Ashari, Shirish Tatikonda, Matthias Boehm, Berthold Reinwald, Keith Campbell, John Keenleyside, P. Sadayappan:
On optimizing machine learning workloads via kernel fusion. PPoPP 2015: 173-182 - [c213]Prashant Singh Rawat, Martin Kong, Thomas Henretty, Justin Holewinski, Kevin Stock, Louis-Noël Pouchet, J. Ramanujam, Atanas Rountev, P. Sadayappan:
SDSLc: a multi-target domain-specific compiler for stencil computations. WOLFHPC@SC 2015: 6:1-6:10 - [c212]Rajkumar Kettimuthu, Gayane Vardoyan, Gagan Agrawal, P. Sadayappan, Ian T. Foster:
An elegant sufficiency: load-aware differentiated scheduling of data transfers. SC 2015: 46:1-46:12 - 2014
- [j56]Sriram Krishnamoorthy, J. Ramanujam, P. Sadayappan:
Introduction to the JPDC Special Issue on Domain-Specific Languages and High-Level Frameworks for High-Performance Computing. J. Parallel Distributed Comput. 74(12): 3175 (2014) - [j55]Tobias Grosser, Sven Verdoolaege, Albert Cohen, P. Sadayappan:
The Relation Between Diamond Tiling and Hexagonal Tiling. Parallel Process. Lett. 24(3) (2014) - [j54]Martin Kong, Antoniu Pop, Louis-Noël Pouchet, R. Govindarajan, Albert Cohen, P. Sadayappan:
Compiler/Runtime Framework for Dynamic Dataflow Parallelization of Tiled Programs. ACM Trans. Archit. Code Optim. 11(4): 61:1-61:30 (2014) - [j53]Venmugil Elango, Naser Sedaghati, Fabrice Rastello, Louis-Noël Pouchet, J. Ramanujam, Radu Teodorescu, P. Sadayappan:
On Using the Roofline Model with Lower Bounds on Data Movement. ACM Trans. Archit. Code Optim. 11(4): 67:1-67:23 (2014) - [j52]Mahesh Ravishankar, John Eisenlohr, Louis-Noël Pouchet, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Automatic parallelization of a class of irregular loops for distributed memory systems. ACM Trans. Parallel Comput. 1(1): 7:1-7:37 (2014) - [c211]S. M. Faisal, Srinivasan Parthasarathy, P. Sadayappan:
Global graphs: A middleware for large scale graph processing. IEEE BigData 2014: 33-40 - [c210]Rajkumar Kettimuthu, Gayane Vardoyan, Gagan Agrawal, P. Sadayappan:
Modeling and Optimizing Large-Scale Wide-Area Data Transfers. CCGRID 2014: 196-205 - [c209]Tobias Grosser, Albert Cohen, Justin Holewinski, P. Sadayappan, Sven Verdoolaege:
Hybrid Hexagonal/Classical Tiling for GPUs. CGO 2014: 66 - [c208]Qingpeng Niu, Pai-Wei Lai, S. M. Faisal, Srinivasan Parthasarathy, P. Sadayappan:
A fast implementation of MLR-MCL algorithm on multi-core processors. HiPC 2014: 1-10 - [c207]Samyam Rajbhandari, Akshay Nikam, Pai-Wei Lai, Kevin Stock, Sriram Krishnamoorthy, P. Sadayappan:
CAST: Contraction Algorithm for Symmetric Tensors. ICPP 2014: 261-272 - [c206]Humayun Arafat, Sriram Krishnamoorthy, P. Sadayappan:
Checksumming Strategies for Data in Volatile Memories. ICPP Workshops 2014: 245-254 - [c205]Wenlei Bao, Sanket Tavarageri, Füsun Özgüner, P. Sadayappan:
PWCET: Power-Aware Worst Case Execution Time Analysis. ICPP Workshops 2014: 439-447 - [c204]Arash Ashari, Naser Sedaghati, John Eisenlohr, P. Sadayappan:
An efficient two-dimensional blocking strategy for sparse matrix-vector multiplication on GPUs. ICS 2014: 273-282 - [c203]Shoaib Kamil, Saman P. Amarasinghe, P. Sadayappan:
WOSC 2014: second workshop on optimizing stencil computations. SPLASH (Companion Volume) 2014: 89-90 - [c202]Kevin Stock, Martin Kong, Tobias Grosser, Louis-Noël Pouchet, Fabrice Rastello, J. Ramanujam, P. Sadayappan:
A framework for enhancing data reuse via associative reordering. PLDI 2014: 65-76 - [c201]Sanket Tavarageri, Sriram Krishnamoorthy, P. Sadayappan:
Compiler-assisted detection of transient memory errors. PLDI 2014: 204-215 - [c200]Samyam Rajbhandari, Akshay Nikam, Pai-Wei Lai, Kevin Stock, Sriram Krishnamoorthy, P. Sadayappan:
A Communication-Optimal Framework for Contracting Distributed Tensors. SC 2014: 375-386 - [c199]Arash Ashari, Naser Sedaghati, John Eisenlohr, Srinivasan Parthasarathy, P. Sadayappan:
Fast Sparse Matrix-Vector Multiplication on GPUs for Graph Applications. SC 2014: 781-792 - [c198]Venmugil Elango, Fabrice Rastello, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan:
On characterizing the data movement complexity of computational DAGs for parallel execution. SPAA 2014: 296-306 - [i5]Naznin Fauzia, Venmugil Elango, Mahesh Ravishankar, J. Ramanujam, Fabrice Rastello, Atanas Rountev, Louis-Noël Pouchet, P. Sadayappan:
Beyond Reuse Distance Analysis: Dynamic Analysis for Characterization of Data Locality Potential. CoRR abs/1401.5024 (2014) - [i4]Venmugil Elango, Fabrice Rastello, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan:
On Characterizing the Data Movement Complexity of Computational DAGs for Parallel Execution. CoRR abs/1404.4767 (2014) - [i3]Lukasz Domagala, Fabrice Rastello, P. Sadayappan, Duco van Amstel:
A Tiling Perspective for Register Optimization. CoRR abs/1406.0582 (2014) - [i2]Venmugil Elango, Fabrice Rastello, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan:
On Characterizing the Data Access Complexity of Programs. CoRR abs/1411.2286 (2014) - 2013
- [j51]Sanket Tavarageri, J. Ramanujam, P. Sadayappan:
Adaptive parallel tiled code generation and accelerated auto-tuning. Int. J. High Perform. Comput. Appl. 27(4): 412-425 (2013) - [j50]Eunjung Park, John Cavazos, Louis-Noël Pouchet, Cédric Bastoul, Albert Cohen, P. Sadayappan:
Predictive Modeling in a Polyhedral Optimization Space. Int. J. Parallel Program. 41(5): 704-750 (2013) - [j49]Daniel Lowell, Jeswin Godwin, Justin Holewinski, Deepan Karthik, Chekuri Choudary, Azamat Mametjanov, Boyana Norris, Gerald Sabin, P. Sadayappan, Jason Sarich:
Stencil-Aware GPU Optimization of Iterative Solvers. SIAM J. Sci. Comput. 35(5) (2013) - [j48]Naznin Fauzia, Venmugil Elango, Mahesh Ravishankar, J. Ramanujam, Fabrice Rastello, Atanas Rountev, Louis-Noël Pouchet, P. Sadayappan:
Beyond reuse distance analysis: Dynamic analysis for characterization of data locality potential. ACM Trans. Archit. Code Optim. 10(4): 53:1-53:29 (2013) - [c197]Tobias Grosser, Albert Cohen, Paul H. J. Kelly, J. Ramanujam, P. Sadayappan, Sven Verdoolaege:
Split tiling for GPUs: automatic parallelization using trapezoidal tiles. GPGPU@ASPLOS 2013: 24-31 - [c196]Louis-Noël Pouchet, Peng Zhang, P. Sadayappan, Jason Cong:
Polyhedral-based data reuse optimization for configurable computing. FPGA 2013: 29-38 - [c195]Pai-Wei Lai, Humayun Arafat, Venmugil Elango, Ponnuswamy Sadayappan:
Accelerating Strassen-Winograd's matrix multiplication algorithm on GPUs. HiPC 2013: 139-148 - [c194]Ye Wang, Srinivasan Parthasarathy, P. Sadayappan:
Stratification driven placement of complex data: A framework for distributed data analytics. ICDE 2013: 709-720 - [c193]Thomas Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan:
A stencil compiler for short-vector SIMD architectures. ICS 2013: 13-24 - [c192]Sanket Tavarageri, P. Sadayappan:
A Compiler Analysis to Determine Useful Cache Size for Energy Efficiency. IPDPS Workshops 2013: 923-930 - [c191]Athanasios Konstantinidis, Paul H. J. Kelly, J. Ramanujam, P. Sadayappan:
Parametric GPU Code Generation for Affine Loop Programs. LCPC 2013: 136-151 - [c190]Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet, P. Sadayappan:
When polyhedral transformations meet SIMD code generation. PLDI 2013: 127-138 - [c189]Pai-Wei Lai, Kevin Stock, Samyam Rajbhandari, Sriram Krishnamoorthy, P. Sadayappan:
A framework for load balancing of tensor contraction expressions via dynamic task partitioning. SC 2013: 13:1-13:10 - 2012
- [j47]Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Gerald Baumgartner, J. Ramanujam, P. Sadayappan:
Empirical performance model-driven data layout optimization and library call selection for tensor contraction expressions. J. Parallel Distributed Comput. 72(3): 338-352 (2012) - [j46]Kevin Stock, Louis-Noël Pouchet, P. Sadayappan:
Using machine learning to improve automatic vectorization. ACM Trans. Archit. Code Optim. 8(4): 50:1-50:23 (2012) - [c188]Jeswin Godwin, Justin Holewinski, P. Sadayappan:
High-performance sparse matrix-vector multiplication on GPUs for structured grid computations. GPGPU@ASPLOS 2012: 47-56 - [c187]Jun Shirako, Kamal Sharma, Naznin Fauzia, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan, Vivek Sarkar:
Analytical Bounds for Optimal Tile Size Selection. CC 2012: 101-121 - [c186]Qingpeng Niu, James Dinan, Sravya Tirukkovalur, Lubos Mitas, Lucas K. Wagner, P. Sadayappan:
A global address space approach to automated data management for parallel Quantum Monte Carlo applications. HiPC 2012: 1-10 - [c185]Justin Holewinski, Louis-Noël Pouchet, P. Sadayappan:
High-performance code generation for stencil computations on GPU architectures. ICS 2012: 311-320 - [c184]Humayun Arafat, P. Sadayappan, James Dinan, Sriram Krishnamoorthy, Theresa L. Windus:
Load Balancing of Dynamical Nucleation Theory Monte Carlo Simulations through Resource Sharing Barriers. IPDPS 2012: 285-295 - [c183]Qingpeng Niu, James Dinan, Qingda Lu, P. Sadayappan:
PARDA: A Fast Parallel Reuse Distance Analysis Algorithm. IPDPS 2012: 1284-1294 - [c182]Justin Holewinski, Ragavendar Ramamurthi, Mahesh Ravishankar, Naznin Fauzia, Louis-Noël Pouchet, Atanas Rountev, P. Sadayappan:
Dynamic trace-based analysis of vectorization potential of applications. PLDI 2012: 371-382 - [c181]Mahesh Ravishankar, John Eisenlohr, Louis-Noël Pouchet, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Code generation for parallel execution of a class of irregular loops on distributed memory systems. SC 2012: 72 - [c180]Tyler Clemons, Srinivasan Parthasarathy, P. Sadayappan:
GADBMS: A Framework for Scalable Array Analytics. SC Companion 2012: 1322-1325 - [c179]Pai-Wei Lai, Huaijian Zhang, Samyam Rajbhandari, Edward F. Valeev, Karol Kowalski, P. Sadayappan:
International Conference on Computational Science, ICCS 2012. ICCS 2012: 412-421 - [e5]J. Ramanujam, P. Sadayappan:
Proceedings of the 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2012, New Orleans, LA, USA, February 25-29, 2012. ACM 2012, ISBN 978-1-4503-1160-1 [contents] - 2011
- [j45]Chi-Chung Lam, Thomas Rauber, Gerald Baumgartner, Daniel Cociorva, P. Sadayappan:
Memory-optimal evaluation of expression trees involving large objects. Comput. Lang. Syst. Struct. 37(2): 63-75 (2011) - [j44]Naga Vydyanathan, Ümit V. Çatalyürek, Tahsin M. Kurç, Ponnuswamy Sadayappan, Joel H. Saltz:
Optimizing latency and throughput of application workflows on clusters. Parallel Comput. 37(10-11): 694-712 (2011) - [j43]Xintian Yang, Srinivasan Parthasarathy, P. Sadayappan:
Fast Sparse Matrix-Vector Multiplication on GPUs: Implications for Graph Mining. Proc. VLDB Endow. 4(4): 231-242 (2011) - [c178]Naser Sedaghati, Renji Thomas, Louis-Noël Pouchet, Radu Teodorescu, P. Sadayappan:
StVEC: A Vector Instruction Extension for High Performance Stencil Computation. PACT 2011: 276-287 - [c177]Thomas Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam, P. Sadayappan:
Data Layout Transformation for Stencil Computations on Short-Vector SIMD Architectures. CC 2011: 225-245 - [c176]Eunjung Park, Louis-Noël Pouchet, John Cavazos, Albert Cohen, P. Sadayappan:
Predictive modeling in a polyhedral optimization space. CGO 2011: 119-129 - [c175]Nawab Ali, Sriram Krishnamoorthy, Niranjan Govind, Karol Kowalski, Ponnuswamy Sadayappan:
Application-Specific Fault Tolerance via Data Access Characterization. Euro-Par (2) 2011: 340-352 - [c174]Sanket Tavarageri, Louis-Noël Pouchet, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Dynamic selection of tile sizes. HiPC 2011: 1-10 - [c173]Kevin Stock, Thomas Henretty, Iyyappa Murugandi, P. Sadayappan, Robert J. Harrison:
Model-Driven SIMD Code Generation for a Multi-resolution Tensor Kernel. IPDPS 2011: 1058-1067 - [c172]Louis-Noël Pouchet, Uday Bondhugula, Cédric Bastoul, Albert Cohen, J. Ramanujam, P. Sadayappan, Nicolas Vasilache:
Loop transformations: convexity, pruning and optimization. POPL 2011: 549-562 - [c171]Ronald G. Minnich, Curtis L. Janssen, Sriram Krishnamoorthy, Andres Marquez, Wenjing Ma, Maya B. Gokhale, Ponnuswamy Sadayappan, Eric Van Hensbergen, Jonathan Appavoo, Jim McKie:
Poster: FOX: a fault-oblivious extreme scale execution environment. SC Companion 2011: 91-92 - [i1]Xintian Yang, Srinivasan Parthasarathy, Ponnuswamy Sadayappan:
Fast Sparse Matrix-Vector Multiplication on GPUs: Implications for Graph Mining. CoRR abs/1103.2405 (2011) - 2010
- [j42]Vijay S. Kumar, Tahsin M. Kurç, Varun Ratnakar, Jihie Kim, Gaurang Mehta, Karan Vahi, Yoon-Ju Lee Nelson, P. Sadayappan, Ewa Deelman, Yolanda Gil, Mary W. Hall, Joel H. Saltz:
Parameterized specification, configuration and execution of data-intensive scientific workflows. Clust. Comput. 13(3): 315-333 (2010) - [c170]Muthu Manikandan Baskaran, J. Ramanujam, P. Sadayappan:
Automatic C-to-CUDA Code Generation for Affine Programs. CC 2010: 244-263 - [c169]James Dinan, Arjun Singri, P. Sadayappan, Sriram Krishnamoorthy:
Selective Recovery from Failures in a Task Parallel Programming Model. CCGRID 2010: 709-714 - [c168]James Dinan, Pavan Balaji, Ewing L. Lusk, P. Sadayappan, Rajeev Thakur:
Hybrid parallel programming with MPI and unified parallel C. Conf. Computing Frontiers 2010: 177-186 - [c167]Muthu Manikandan Baskaran, Albert Hartono, Sanket Tavarageri, Thomas Henretty, J. Ramanujam, P. Sadayappan:
Parameterized tiling revisited. CGO 2010: 200-209 - [c166]Vitus J. Leung, Gerald Sabin, P. Sadayappan:
Parallel Job Scheduling Policies to Improve Fairness: A Case Study. ICPP Workshops 2010: 346-353 - [c165]Atanas Rountev, Kevin Van Valkenburgh, Dacong Yan, P. Sadayappan:
Understanding parallelism-inhibiting dependences in sequential Java programs. ICSM 2010: 1-9 - [c164]Albert Hartono, Muthu Manikandan Baskaran, J. Ramanujam, Ponnuswamy Sadayappan:
DynTile: Parametric tiled loop generation for parallel execution on multicore processors. IPDPS 2010: 1-12 - [c163]Giridhar Sreenivasa Murthy, Mahesh Ravishankar, Muthu Manikandan Baskaran, Ponnuswamy Sadayappan:
Optimal loop unrolling for GPGPU programs. IPDPS 2010: 1-11 - [c162]Louis-Noël Pouchet, Uday Bondhugula, Cédric Bastoul, Albert Cohen, J. Ramanujam, P. Sadayappan:
Combined Iterative and Model-driven Optimization in an Automatic Parallelization Framework. SC 2010: 1-11
2000 – 2009
- 2009
- [j41]Nagavijayalakshmi Vydyanathan, Sriram Krishnamoorthy, Gerald M. Sabin, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz:
An Integrated Approach to Locality-Conscious Processor Allocation and Scheduling of Mixed-Parallel Applications. IEEE Trans. Parallel Distributed Syst. 20(8): 1158-1172 (2009) - [c161]Qingda Lu, Jiang Lin, Xiaoning Ding, Zhao Zhang, Xiaodong Zhang, P. Sadayappan:
Soft-OLP: Improving Hardware Cache Performance through Software-Controlled Object-Level Partitioning. PACT 2009: 246-257 - [c160]Qingda Lu, Christophe Alias, Uday Bondhugula, Thomas Henretty, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan, Yongjian Chen, Haibo Lin, Tin-Fook Ngai:
Data Layout Transformation for Enhancing Data Locality on NUCA Chip Multiprocessors. PACT 2009: 348-357 - [c159]Nawab Ali, Philip H. Carns, Kamil Iskra, Dries Kimpe, Samuel Lang, Robert Latham, Robert B. Ross, Lee Ward, P. Sadayappan:
Scalable I/O forwarding framework for high-performance computing systems. CLUSTER 2009: 1-10 - [c158]Vijay S. Kumar, P. Sadayappan, Gaurang Mehta, Karan Vahi, Ewa Deelman, Varun Ratnakar, Jihie Kim, Yolanda Gil, Mary W. Hall, Tahsin M. Kurç, Joel H. Saltz:
An integrated framework for performance-based optimization of scientific workflows. HPDC 2009: 177-186 - [c157]Albert Hartono, Muthu Manikandan Baskaran, Cédric Bastoul, Albert Cohen, Sriram Krishnamoorthy, Boyana Norris, J. Ramanujam, P. Sadayappan:
Parametric multi-level tiling of imperfectly nested loops. ICS 2009: 147-157 - [c156]Albert Hartono, Boyana Norris, Ponnuswamy Sadayappan:
Annotation-based empirical performance tuning using Orio. IPDPS 2009: 1-11 - [c155]Muthu Manikandan Baskaran, Nagavijayalakshmi Vydyanathan, Uday Bondhugula, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors. PPoPP 2009: 219-228 - [c154]James Dinan, D. Brian Larkins, P. Sadayappan, Sriram Krishnamoorthy, Jarek Nieplocha:
Scalable work stealing. SC 2009 - [c153]Jiang Lin, Qingda Lu, Xiaoning Ding, Zhao Zhang, Xiaodong Zhang, P. Sadayappan:
Enabling software management for multicore caches with a lightweight hardware support. SC 2009 - 2008
- [j40]Aniruddha G. Shet, P. Sadayappan, David E. Bernholdt, Jarek Nieplocha, Vinod Tipparaju:
A framework for characterizing overlap of communication and computation in parallel applications. Clust. Comput. 11(1): 75-90 (2008) - [j39]James Dinan, Stephen Olivier, Gerald Sabin, Jan F. Prins, P. Sadayappan, Chau-Wen Tseng:
A message passing benchmark for unbalanced applications. Simul. Model. Pract. Theory 16(9): 1177-1189 (2008) - [c152]Uday Bondhugula, Muthu Manikandan Baskaran, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model. CC 2008: 132-146 - [c151]Narayan Desai, Pavan Balaji, P. Sadayappan, Mohammad Islam:
Are nonblocking networks really needed for high-end-computing workloads? CLUSTER 2008: 152-159 - [c150]Nawab Ali, Ananth Devulapalli, Dennis Dalessandro, Pete Wyckoff, P. Sadayappan:
An OSD-based approach to managing directory operations in parallel file systems. CLUSTER 2008: 175-184 - [c149]Jiang Lin, Qingda Lu, Xiaoning Ding, Zhao Zhang, Xiaodong Zhang, P. Sadayappan:
Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems. HPCA 2008: 367-378 - [c148]Gaurav Khanna, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz, Rajkumar Kettimuthu, Ian T. Foster:
Multi-hop path splitting and multi-pathing optimizations for data transfers over shared wide-area networks using gridFTP. HPDC 2008: 225-226 - [c147]Jarek Nieplocha, Sriram Krishnamoorthy, Marat Valiev, Manojkumar Krishnan, Bruce J. Palmer, P. Sadayappan:
Integrated Data and Task Management for Scientific Applications. ICCS (1) 2008: 20-31 - [c146]Nagavijayalakshmi Vydyanathan, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz:
A Duplication Based Algorithm for Optimizing Latency Under Throughput Constraints for Streaming Workflows. ICPP 2008: 254-261 - [c145]James Dinan, Sriram Krishnamoorthy, D. Brian Larkins, Jarek Nieplocha, P. Sadayappan:
Scioto: A Framework for Global-View Task Parallelism. ICPP 2008: 586-593 - [c144]Muthu Manikandan Baskaran, Uday Bondhugula, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan:
A compiler framework for optimization of affine loop nests for gpgpus. ICS 2008: 225-234 - [c143]Uday Bondhugula, Muthu Manikandan Baskaran, Albert Hartono, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Towards effective automatic parallelization for multicore systems. IPDPS 2008: 1-5 - [c142]Gaurav Khanna, Ümit V. Çatalyürek, Tahsin M. Kurç, Rajkumar Kettimuthu, P. Sadayappan, Joel H. Saltz:
A dynamic scheduling approach for coordinated wide-area data transfers using GridFTP. IPDPS 2008: 1-12 - [c141]Uday Bondhugula, Albert Hartono, J. Ramanujam, P. Sadayappan:
A practical automatic polyhedral parallelizer and locality optimizer. PLDI 2008: 101-113 - [c140]Muthu Manikandan Baskaran, Uday Bondhugula, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories. PPoPP 2008: 1-10 - [c139]Gaurav Khanna, Ümit V. Çatalyürek, Tahsin M. Kurç, Rajkumar Kettimuthu, P. Sadayappan, Ian T. Foster, Joel H. Saltz:
Using overlays for efficient data transfer over shared wide-area networks. SC 2008: 47 - [c138]D. Brian Larkins, James Dinan, Sriram Krishnamoorthy, Srinivasan Parthasarathy, Atanas Rountev, P. Sadayappan:
Global trees: a framework for linked data structures on distributed memory parallel systems. SC 2008: 57 - [e4]P. Sadayappan, Manish Parashar, Ramamurthy Badrinath, Viktor K. Prasanna:
High Performance Computing - HiPC 2008, 15th International Conference, Bangalore, India, December 17-20, 2008. Proceedings. Lecture Notes in Computer Science 5374, Springer 2008, ISBN 978-3-540-89893-1 [contents] - 2007
- [j38]Xiaoyang Gao, Sriram Krishnamoorthy, Swarup Kumar Sahoo, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan:
Efficient search-space pruning for integrated fusion and tiling transformations. Concurr. Comput. Pract. Exp. 19(18): 2425-2443 (2007) - [c137]Sriram Krishnamoorthy, Juan Piernas, Vinod Tipparaju, Jarek Nieplocha, P. Sadayappan:
Non-collective parallel I/O for global address space programming models. CLUSTER 2007: 41-49 - [c136]Nagavijayalakshmi Vydyanathan, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz:
Toward Optimizing Latency Under Throughput Constraints for Application Workflows on Clusters. Euro-Par 2007: 173-183 - [c135]Gaurav Khanna, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz:
Scheduling File Transfers for Data-Intensive Jobs on Heterogeneous Clusters. Euro-Par 2007: 214-223 - [c134]Mohammad Islam, Pavan Balaji, Gerald Sabin, P. Sadayappan:
Analyzing and Minimizing the Impact of Opportunity Cost in QoS-aware Job Scheduling. ICPP 2007: 42 - [c133]James Dinan, Stephen Olivier, Gerald Sabin, Jan F. Prins, P. Sadayappan, Chau-Wen Tseng:
Dynamic Load Balancing of Unbalanced Computations Using Message Passing. IPDPS 2007: 1-8 - [c132]Sriram Krishnamoorthy, Ümit V. Çatalyürek, Jarek Nieplocha, Atanas Rountev, P. Sadayappan:
A global address space framework for locality aware scheduling of block-sparse computations. IPDPS 2007: 1-8 - [c131]Sriram Krishnamoorthy, Muthu Manikandan Baskaran, Uday Bondhugula, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Effective automatic parallelization of stencil computations. PLDI 2007: 235-244 - [c130]Uday Bondhugula, J. Ramanujam, P. Sadayappan:
Automatic mapping of nested loops to FPGAS. PPoPP 2007: 101-111 - [c129]Ananth Devulapalli, Dennis Dalessandro, Pete Wyckoff, Nawab Ali, P. Sadayappan:
Integrating parallel file systems with object-based storage devices. SC 2007: 27 - 2006
- [j37]Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, J. Ramanujam, P. Sadayappan, Venkatesh Choppella:
Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver. J. Parallel Distributed Comput. 66(5): 659-673 (2006) - [j36]Christian Engelmann, Stephen L. Scott, David E. Bernholdt, Narasimha Raju Gottumukkala, Chokchai Leangsuksun, Jyothish Varma, Chao Wang, Frank Mueller, Aniruddha G. Shet, P. Sadayappan:
MOLAR: adaptive runtime support for high-end computing operating and runtime systems. ACM SIGOPS Oper. Syst. Rev. 40(2): 63-72 (2006) - [j35]Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, Jarek Nieplocha, P. Sadayappan:
Layout transformation support for the disk resident arrays framework. J. Supercomput. 36(2): 153-170 (2006) - [c128]Qingda Lu, Sriram Krishnamoorthy, P. Sadayappan:
Combining analytical and empirical approaches in tuning matrix transposition. PACT 2006: 233-242 - [c127]Aniruddha G. Shet, P. Sadayappan, David E. Bernholdt, Jarek Nieplocha, Vinod Tipparaju:
A Performance Instrumentation Framework to Characterize Computation-Communication Overlap in Message-Passing Systems. CLUSTER 2006 - [c126]Nagavijayalakshmi Vydyanathan, Sriram Krishnamoorthy, Gerald Sabin, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz:
Locality Conscious Processor Allocation and Scheduling for Mixed Parallel Applications. CLUSTER 2006 - [c125]Uday Bondhugula, Ananth Devulapalli, James Dinan, Joseph Fernando, Pete Wyckoff, Eric Stahlberg, P. Sadayappan:
Hardware/Software Integration for FPGA-based All-Pairs Shortest-Paths. FCCM 2006: 152-164 - [c124]Gaurav Khanna, Nagavijayalakshmi Vydyanathan, Ümit V. Çatalyürek, Tahsin M. Kurç, Sriram Krishnamoorthy, P. Sadayappan, Joel H. Saltz:
Task Scheduling and File Replication for Data-Intensive Jobs with Batch-shared I/O. HPDC 2006: 241-252 - [c123]Albert Hartono, Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Marcel Nooijen, Gerald Baumgartner, David E. Bernholdt, Venkatesh Choppella, Russell M. Pitzer, J. Ramanujam, Atanas Rountev, P. Sadayappan:
Identifying Cost-Effective Common Subexpressions to Reduce Operation Count in Tensor Contraction Evaluations. International Conference on Computational Science (1) 2006: 267-275 - [c122]Nagavijayalakshmi Vydyanathan, Sriram Krishnamoorthy, Gerald Sabin, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz:
An Integrated Approach for Processor Allocation and Scheduling of Mixed-Parallel Applications. ICPP 2006: 443-450 - [c121]A. Allam, J. Ramanujam, Gerald Baumgartner, P. Sadayappan:
Memory minimization for tensor contractions using integer linear programming. IPDPS 2006 - [c120]Uday Bondhugula, Ananth Devulapalli, Joseph Fernando, Pete Wyckoff, P. Sadayappan:
Parallel FPGA-based all-pairs shortest-paths in a directed graph. IPDPS 2006 - [c119]Sriram Krishnamoorthy, Ümit V. Çatalyürek, Jarek Nieplocha, Atanas Rountev, P. Sadayappan:
An extensible global address space framework with decoupled task and data abstractions. IPDPS 2006 - [c118]Sriram Krishnamoorthy, Ümit V. Çatalyürek, Jarek Nieplocha, P. Sadayappan:
An approach to locality-conscious load balancing and transparent memory hierarchy management with a global-address-space parallel programming model. IPDPS 2006 - [c117]Gerald Sabin, Matthew Lang, P. Sadayappan:
Moldable Parallel Job Scheduling Using Job Efficiency: An Iterative Approach. JSSPP 2006: 94-114 - [c116]Gaurav Khanna, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz:
A Data Locality Aware Online Scheduling Approach for I/O-Intensive Jobs with File Sharing. JSSPP 2006: 141-160 - [c115]Stephen Olivier, Jun Huan, Jinze Liu, Jan F. Prins, James Dinan, P. Sadayappan, Chau-Wen Tseng:
UTS: An Unbalanced Tree Search Benchmark. LCPC 2006: 235-250 - [c114]Sriram Krishnamoorthy, Ümit V. Çatalyürek, Jarek Nieplocha, Atanas Rountev, P. Sadayappan:
Data management and query - Hypergraph partitioning for automatic memory hierarchy management. SC 2006: 98 - [c113]Jarek Nieplocha, Bruce J. Palmer, Manojkumar Krishnan, P. Sadayappan:
M12 - Overview of the global arrays parallel software development toolkit. SC 2006: 226 - [e3]Eduard Ayguadé, Gerald Baumgartner, J. Ramanujam, P. Sadayappan:
Languages and Compilers for Parallel Computing, 18th International Workshop, LCPC 2005, Hawthorne, NY, USA, October 20-22, 2005, Revised Selected Papers. Lecture Notes in Computer Science 4339, Springer 2006, ISBN 978-3-540-69329-1 [contents] - 2005
- [j34]Rajkumar Kettimuthu, Vijay Subramani, Srividya Srinivasan, Thiagaraja Gopalsamy, Dhabaleswar K. Panda, P. Sadayappan:
Selective preemption strategies for parallel job scheduling. Int. J. High Perform. Comput. Netw. 3(2/3): 122-152 (2005) - [j33]Gerald Baumgartner, Alexander A. Auer, David E. Bernholdt, Alina Bibireata, Venkatesh Choppella, Daniel Cociorva, Xiaoyang Gao, Robert J. Harrison, So Hirata, Sriram Krishnamoorthy, Sandhya Krishnan, Chi-Chung Lam, Qingda Lu, Marcel Nooijen, Russell M. Pitzer, J. Ramanujam, P. Sadayappan, Alexander Sibiryakov:
Synthesis of High-Performance Parallel Programs for a Class of ab Initio Quantum Chemistry Models. Proc. IEEE 93(2): 276-292 (2005) - [c112]Gaurav Khanna, Nagavijayalakshmi Vydyanathan, Tahsin M. Kurç, Ümit V. Çatalyürek, Pete Wyckoff, Joel H. Saltz, P. Sadayappan:
A hypergraph partitioning based approach for scheduling of tasks with batch-shared I/O. CCGRID 2005: 792-799 - [c111]Sriram Krishnamoorthy, Jarek Nieplocha, P. Sadayappan:
Data and Computation Abstractions for Dynamic and Irregular Computations. HiPC 2005: 258-269 - [c110]Gerald Sabin, Vishvesh Sahasrabudhe, P. Sadayappan:
Assessment and enhancement of meta-schedulers for multi-site job sharing. HPDC 2005: 144-153 - [c109]Albert Hartono, Alexander Sibiryakov, Marcel Nooijen, Gerald Baumgartner, David E. Bernholdt, So Hirata, Chi-Chung Lam, Russell M. Pitzer, J. Ramanujam, P. Sadayappan:
Automated Operation Minimization of Tensor Contraction Expressions in Electronic Structure Calculations. International Conference on Computational Science (1) 2005: 155-164 - [c108]Vipin Chaudhary, P. Sadayappan:
Message from the Chairs. ICPP Workshops 2005: 282 - [c107]Swarup Kumar Sahoo, Rajkiran Panuganti, Sriram Krishnamoorthy, P. Sadayappan:
Cache Miss Characterization and Data Locality Optimization for Imperfectly Nested Loops on Shared Memory Multiprocessors. IPDPS 2005 - [c106]Gerald Sabin, P. Sadayappan:
Unfairness Metrics for Space-Sharing Parallel Job Schedulers. JSSPP 2005: 238-256 - [c105]Xiaoyang Gao, Sriram Krishnamoorthy, Swarup Kumar Sahoo, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan:
Efficient Search-Space Pruning for Integrated Fusion and Tiling Transformations. LCPC 2005: 215-229 - [c104]Xiaoyang Gao, Swarup Kumar Sahoo, Chi-Chung Lam, J. Ramanujam, Qingda Lu, Gerald Baumgartner, P. Sadayappan:
Performance modeling and optimization of parallel out-of-core tensor contractions. PPoPP 2005: 266-276 - [c103]Swarup Kumar Sahoo, Sriram Krishnamoorthy, Rajkiran Panuganti, P. Sadayappan:
Integrated Loop Optimizations for Data Locality Enhancement of Tensor Contraction Expressions. SC 2005: 13 - 2004
- [j32]Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan:
Efficient parallel out-of-core matrix transposition. Int. J. High Perform. Comput. Netw. 2(2/3/4): 110-119 (2004) - [j31]Sudha Srinivasan, Savitha Krishnamoorthy, P. Sadayappan:
Robust scheduling of moldable parallel jobs. Int. J. High Perform. Comput. Netw. 2(2/3/4): 120-132 (2004) - [c102]Gerald Sabin, Vishvesh Sahasrabudhe, P. Sadayappan:
On fairness in distributed job scheduling across multiple sites. CLUSTER 2004: 35-44 - [c101]Mohammad Islam, Pavan Balaji, P. Sadayappan, Dhabaleswar K. Panda:
Towards provision of quality of service guarantees in job scheduling. CLUSTER 2004: 245-254 - [c100]Nagavijayalakshmi Vydyanathan, Gaurav Khanna, Tahsin M. Kurç, Ümit V. Çatalyürek, Pete Wyckoff, Joel H. Saltz, P. Sadayappan:
Use of PVFS for Efficient Execution of Jobs with Pipeline-Shared I/O. GRID 2004: 235-242 - [c99]Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, Jarek Nieplocha, P. Sadayappan:
Efficient Layout Transformation for Disk-Based Multidimensional Arrays. HiPC 2004: 386-398 - [c98]Gerald Sabin, Garima Kochhar, P. Sadayappan:
Job Fairness in Non-Preemptive Job Scheduling. ICPP 2004: 186-194 - [c97]Vipin Chaudhary, P. Sadayappan:
Message from the Chairs: International Workshop on Compile and Run Time Techniques for Parallel Computing. ICPP Workshops 2004: 497 - [c96]Qingda Lu, Jiesheng Wu, Dhabaleswar K. Panda, P. Sadayappan:
Applying MPI Derived Datatypes to the NAS Benchmarks: A Case Study. ICPP Workshops 2004: 538-545 - [c95]Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, J. Ramanujam, P. Sadayappan, Venkatesh Choppella:
Efficient Synthesis of Out-of-Core Algorithms Using a Nonlinear Optimization Solver. IPDPS 2004 - [c94]Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Gerald Baumgartner, J. Ramanujam, P. Sadayappan:
Empirical Performance-Model Driven Data Layout Optimization. LCPC 2004: 72-86 - 2003
- [c93]Sudha Srinivasan, Sriram Krishnamoorthy, P. Sadayappan:
A Robust Scheduling Strategy for Moldable Scheduling of Parallel Jobs. CLUSTER 2003: 92-99 - [c92]Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan:
Efficient Parallel Out-of-Core Matrix Transposition. CLUSTER 2003: 300-307 - [c91]Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, David E. Bernholdt, Venkatesh Choppella:
Data Locality Optimization for Synthesis of Efficient Out-of-Core Algorithms. HiPC 2003: 406-417 - [c90]Daniel Cociorva, Xiaoyang Gao, Sandhya Krishnan, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam:
Global Communication Optimization for Tensor Contraction Expressions under Memory Constraints. IPDPS 2003: 37 - [c89]Gerald Sabin, Rajkumar Kettimuthu, Arun Rajan, P. Sadayappan:
Scheduling of Parallel Jobs in a Heterogeneous Multi-site Environement. JSSPP 2003: 87-104 - [c88]Mohammad Islam, Pavan Balaji, P. Sadayappan, Dhabaleswar K. Panda:
QoPS: A QoS Based Scheme for Parallel Job Scheduling. JSSPP 2003: 252-268 - [c87]Alina Bibireata, Sandhya Krishnan, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, David E. Bernholdt, Venkatesh Choppella:
Memory-Constrained Data Locality Optimization for Tensor Contractions. LCPC 2003: 93-108 - [c86]Konstantin Berlin, Jun Huan, Mary Jacob, Garima Kochhar, Jan F. Prins, William W. Pugh, P. Sadayappan, Jaime Spacco, Chau-Wen Tseng:
Evaluating the Impact of Programming Language Features on the Performance of Parallel Applications on Cluster Architectures. LCPC 2003: 194-208 - 2002
- [c85]Vijay Subramani, Rajkumar Kettimuthu, Srividya Srinivasan, Jeanette Johnston, P. Sadayappan:
Selective Buddy Allocation for Scheduling Parallel Jobs on Clusters. CLUSTER 2002: 107- - [c84]Srividya Srinivasan, Vijay Subramani, Rajkumar Kettimuthu, Praveen Holenarsipur, P. Sadayappan:
Effective Selection of Partition Sizes for Moldable Scheduling of Parallel Jobs. HiPC 2002: 174-183 - [c83]Vijay Subramani, Rajkumar Kettimuthu, Srividya Srinivasan, P. Sadayappan:
Distributed Job Scheduling on Computational Grids Using Multiple Simultaneous Requests. HPDC 2002: 359- - [c82]Thiagaraja Gopalsamy, Mukesh Singhal, Dhabaleswar K. Panda, P. Sadayappan:
A Reliable Multicast Algorithm for Mobile Ad Hoc Networks. ICDCS 2002: 563-570 - [c81]P. Sadayappan:
Message from the Chair. ICPP Workshops 2002: 495-498 - [c80]Srividya Srinivasan, Rajkumar Kettimuthu, Vijay Subramani, P. Sadayappan:
Characterization of Backfilling Strategies for Parallel Job Scheduling. ICPP Workshops 2002: 514-522 - [c79]Vipin Chaudhary, P. Sadayappan:
Message from the Co-Chairs. ICPP Workshops 2002: 547-550 - [c78]Gerald Baumgartner, David E. Bernholdt, Daniel Cociorva, Chi-Chung Lam, J. Ramanujam, Robert J. Harrison, Marcel Nooijen, P. Sadayappan:
A Performance Optimization Framework for Compilation of Tensor Contraction Expressions into Parallel Programs. IPDPS 2002 - [c77]Srividya Srinivasan, Rajkumar Kettimuthu, Vijay Subramani, P. Sadayappan:
Selective Reservation Strategies for Backfill Job Scheduling. JSSPP 2002: 55-71 - [c76]Daniel Cociorva, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam:
Memory-Constrained Communication Minimization for a Class of Array Computations. LCPC 2002: 1-15 - [c75]Daniel Cociorva, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, Marcel Nooijen, David E. Bernholdt, Robert J. Harrison:
Space-Time Trade-Off Optimization for a Class of Electronic Structure Calculations. PLDI 2002: 177-186 - [c74]Gerald Baumgartner, David E. Bernholdt, Daniel Cociorva, Robert J. Harrison, So Hirata, Chi-Chung Lam, Marcel Nooijen, Russell M. Pitzer, J. Ramanujam, P. Sadayappan:
A high-level approach to synthesis of high-performance codes for quantum chemistry. SC 2002: 33:1-33:10 - 2001
- [c73]Daniel Cociorva, J. W. Wilkins, Gerald Baumgartner, P. Sadayappan, J. Ramanujam, Marcel Nooijen, David E. Bernholdt, Robert J. Harrison:
Towards Automatic Synthesis of High-Performance Codes for Electronic Structure Calculations: Data Locality Optimization. HiPC 2001: 237-248 - [c72]Mohammad Banikazemi, Jiuxing Liu, Dhabaleswar K. Panda, P. Sadayappan:
Implementing TreadMarksover VIA on Myrinet and Gigabit Ethernet: Challenges, Design Experience, and Performance Evaluation. ICPP 2001: 167-174 - [c71]Abhishek Gulati, Dhabaleswar K. Panda, P. Sadayappan, Pete Wyckoff:
NIC-Based Rate Control for Proportional Bandwidth Allocation in Myrinet Clusters. ICPP 2001: 305-312 - [c70]Daniel Cociorva, J. W. Wilkins, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan:
Loop optimization for a class of memory-constrained computations. ICS 2001: 103-113 - [c69]Mohammad Banikazemi, Jiuxing Liu, S. Kutlug, P. Sadayappan, H. Shah, Dhabaleswar K. Panda:
VIBe: A Micro-benchmark Suite for Evaluating Virtual Interface Architecture (VIA) Implementations. IPDPS 2001: 24 - [c68]Darius Buntinas, Dhabaleswar K. Panda, P. Sadayappan:
Fast NIC-Based Barrier over Myrinet/GM. IPDPS 2001: 52 - [c67]Amit Singhal, Mohammad Banikazemi, P. Sadayappan, Dhabaleswar K. Panda:
Efficient Multicast Algorithms for Heterogeneous Switch-based Irregular Networks of Workstations. IPDPS 2001: 71 - [c66]Darius Buntinas, Dhabaleswar K. Panda, P. Sadayappan:
Performance Benefits of NIC-Based Barrier on Myrinet/GM. IPDPS 2001: 166 - 2000
- [c65]Vijay Moorthy, Dhabaleswar K. Panda, P. Sadayappan:
Fast Collective Communication Algorithms for Reflective Memory Network Clusters. CANPC 2000: 100-114 - [c64]Darius Buntinas, Dhabaleswar K. Panda, José Duato, P. Sadayappan:
Broadcast/Multicast over Myrinet Using NIC-Assisted Multidestination Messages. CANPC 2000: 115-129 - [c63]Praveen Holenarsipur, Vladimir Yarmolenko, José Duato, Dhabaleswar K. Panda, P. Sadayappan:
Characterization and enhancement of Static Mapping Heuristics for Heterogeneous Systems. HiPC 2000: 37-48 - [c62]P. Sadayappan:
Message from the Chair. ICPP Workshops 2000: 391-394 - [c61]Vladimir Yarmolenko, José Duato, Dhabaleswar K. Panda, P. Sadayappan:
Characterization and Enhancement of Dynamic Mapping Heuristics for Heterogeneous Systems. ICPP Workshops 2000: 437-446 - [c60]Arindam Paul, Wu-chi Feng, Dhabaleswar K. Panda, P. Sadayappan:
Balancing Web Server Load for Adaptable Video Distribution. ICPP Workshops 2000: 469-478
1990 – 1999
- 1999
- [c59]Matthew G. Jacunski, Vijay Moorthy, Peter P. Ware, Manoj Pillai, Dhabaleswar K. Panda, P. Sadayappan:
Low Latency Message-Passing for Reflective Memory Networks. CANPC 1999: 211-224 - [c58]Mohammad Banikazemi, Jayanthi Sampathkumar, Sandeep Prabhu, Dhabaleswar K. Panda, P. Sadayappan:
Communication Modeling of Heterogeneous Networks of Workstations for Performance Characterization of Collective Operations. Heterogeneous Computing Workshop 1999: 125- - [c57]Chi-Chung Lam, Daniel Cociorva, Gerald Baumgartner, P. Sadayappan:
Memory-Optimal Evaluation of Expression Trees Involving Large Objects. HiPC 1999: 103-110 - [c56]N. S. Sundar, S. Jayanthi, P. Sadayappan, Miguel Visbal:
An Incremental Methodology for Parallelizing Legacy Stencil Codes on Message-Passing Computers. ICPP 1999: 302-310 - [c55]Vijay Moorthy, Matthew G. Jacunski, Manoj Pillai, Peter P. Ware, Dhabaleswar K. Panda, Thomas W. Page Jr., P. Sadayappan, V. Nagarajan, Johns Daniel:
Low-Latency Message Passing on Workstation Clusters using SCRAMNet. IPPS/SPDP 1999: 148-152 - [c54]Matthew G. Jacunski, P. Sadayappan, Dhabaleswar K. Panda:
All-to-All Broadcast on Switch-Based Clusters of Workstations. IPPS/SPDP 1999: 325-329 - [c53]Chi-Chung Lam, Daniel Cociorva, Gerald Baumgartner, P. Sadayappan:
Optimization of Memory Usage Requirement for a Class of Loops Implementing Multi-dimensional Integrals. LCPC 1999: 350-364 - [c52]Chi-Chung Lam, P. Sadayappan, Daniel Cociorva, Mebarek Alouani, John Wilkins:
Performance Optimization of a Class of Loops Involving Sums of Products of Sparse Arrays. PP 1999 - 1998
- [j30]Sandeep K. S. Gupta, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson:
A technique for overlapping computation and communication for block recursive algorithms. Concurr. Pract. Exp. 10(2): 73-90 (1998) - [j29]P. Sadayappan, Fikret Erçal, J. Ramanujam:
Partitioning Graphs on Message-Passing Machines by Pairwise Mincut. Inf. Sci. 111(1-4): 223-237 (1998) - [e2]Zhiyuan Li, Pen-Chung Yew, Siddhartha Chatterjee, Chua-Huang Huang, P. Sadayappan, David C. Sehr:
Languages and Compilers for Parallel Computing, 10th International Workshop, LCPC'97, Minneapolis, Minnesota, USA, August 7-9, 1997, Proceedings. Lecture Notes in Computer Science 1366, Springer 1998, ISBN 3-540-64472-5 [contents] - 1997
- [j28]Chi-Chung Lam, Chua-Huang Huang, P. Sadayappan:
Optimal Algorithms for All-to-All Personalized Communication on Rings and Two Dimensional Tori. J. Parallel Distributed Comput. 43(1): 3-13 (1997) - [j27]Chi-Chung Lam, P. Sadayappan, Rephael Wenger:
On Optimizing a Class of Multi-Dimensional Loops with Reductions for Parallel Execution. Parallel Process. Lett. 7(2): 157-168 (1997) - [c51]James Buford White III, Ponnuswamy Sadayappan:
On improving the performance of sparse matrix-vector multiplication. HiPC 1997: 66-71 - [c50]Chi-Chung Lam, P. Sadayappan, Rephael Wenger:
Optimization of a Class of Multi-Dimensional Integrals on Parallel Machines. PP 1997 - 1996
- [j26]Chua-Huang Huang, P. Sadayappan:
Introduction. Int. J. Parallel Program. 24(6): 479-480 (1996) - [j25]S. D. Kaushik, Sanjay Sharma, Chua-Huang Huang, John R. Johnson, Rodney W. Johnson, P. Sadayappan:
An Algebraic Theory for Modeling Directt Interconnection Networks. J. Inf. Sci. Eng. 12(1): 25-49 (1996) - [j24]Sandeep K. S. Gupta, S. D. Kaushik, Chua-Huang Huang, P. Sadayappan:
Compiling Array Expressions for Efficient Execution on Distributed-Memory Machines. J. Parallel Distributed Comput. 32(2): 155-172 (1996) - [j23]Sandeep K. S. Gupta, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson:
A Framework for Generating Distributed-Memory Parallel Programs for Block Recursive Algorithms. J. Parallel Distributed Comput. 34(2): 137-153 (1996) - [j22]S. D. Kaushik, Chua-Huang Huang, P. Sadayappan:
Efficient Index Set Generation for Compiling HPF Array Statements on Distributed-Memory Machines. J. Parallel Distributed Comput. 38(2): 237-247 (1996) - [j21]Himanshu Gupta, P. Sadayappan:
Communication-Efficient Matrix Multiplication on Hypercubes. Parallel Comput. 22(1): 75-99 (1996) - [c49]N. S. Sundar, Doddaballapur Narasimha-Murthy Jayasimha, Dhabaleswar K. Panda, P. Sadayappan:
Hybrid Algorithms for Complete Exchange in 2D Meshes. International Conference on Supercomputing 1996: 181-188 - [c48]Chi-Chung Lam, P. Sadayappan, Rephael Wenger:
Optimal Reordering and Mapping of a Class of Nested-Loops for Parallel Execution. LCPC 1996: 315-329 - [e1]Chua-Huang Huang, P. Sadayappan, Utpal Banerjee, David Gelernter, Alexandru Nicolau, David A. Padua:
Languages and Compilers for Parallel Computing, 8th International Workshop, LCPC'95, Columbus, Ohio, USA, August 10-12, 1995, Proceedings. Lecture Notes in Computer Science 1033, Springer 1996, ISBN 3-540-60765-X [contents] - 1995
- [j20]J. Ramanujam, P. Sadayappan:
Mapping combinatorial optimization problems onto neural networks. Inf. Sci. 82(3-4): 239-255 (1995) - [j19]Ashok K. Goel, John R. Josephson, Olivier Fischer, P. Sadayappan:
Practical abduction: characterization, decomposition and concurrency. J. Exp. Theor. Artif. Intell. 7(4): 429-450 (1995) - [j18]Bharat Kumar, Kalluri Eswar, P. Sadayappan, Chua-Huang Huang:
A Clustering Algorithm for Parallel Sparse Cholesky Factorization. Parallel Process. Lett. 5: 685-696 (1995) - [j17]Bharat Kumar, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson:
A Tensor Product Formulation of Strassen's Matrix Multiplication Algorithm with Memory Reduction. Sci. Program. 4(4): 275-289 (1995) - [c47]S. D. Kaushik, Chua-Huang Huang, J. Ramanujam, P. Sadayappan:
Multi-phase array redistribution: modeling and evaluation. IPPS 1995: 441-445 - [c46]S. D. Kaushik, Chua-Huang Huang, P. Sadayappan:
Compiling Array Statements for Efficient Execution on Distributed-Memory Machines: Two-Level Mappings. LCPC 1995: 209-223 - 1994
- [j16]Sandeep K. S. Gupta, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson:
Implementing Fast Fourier Transforms on Distributed-Memory Multiprocessors Using Data Redistributions. Parallel Process. Lett. 4: 477-488 (1994) - [j15]Scott McMillan, P. Sadayappan, David E. Orin:
Efficient Dynamic Simulation of Multiple Manipulator Systems with Singular Configurations. IEEE Trans. Syst. Man Cybern. Syst. 24(2): 306-313 (1994) - [j14]Scott McMillan, P. Sadayappan, David E. Orin:
Parallel Dynamic Simulation of Multiple Manipulator Systems: Temporal Versus Spatial Methods. IEEE Trans. Syst. Man Cybern. Syst. 24(7): 982-990 (1994) - [c45]Sandeep K. S. Gupta, Chua-Huang Huang, Rodney W. Johnson, P. Sadayappan:
Communication-Efficient Implementation of Block Recursive Algorithms on Distributed-Memory Machines. ICPADS 1994: 113-119 - [c44]S. D. Kaushik, Chua-Huang Huang, Rodney W. Johnson, P. Sadayappan:
An approach to communication-efficient data redistribution. International Conference on Supercomputing 1994: 364-373 - [c43]Bharat Kumar, P. Sadayappan, Chua-Huang Huang:
On sparse matrix reordering for parallel factorization. International Conference on Supercomputing 1994: 431-438 - [c42]Alpesh Amin, P. Sadayappan, Murali Gudavalli:
A Clustered Reduced Communication Element by Element Preconditioned Conjugate Gradient Algorithm for Finite Element Computations. IPPS 1994: 509-516 - [c41]S. D. Kaushik, Chua-Huang Huang, P. Sadayappan:
Incremental Generation of Index Sets for Array Statement Execution on Distributed-Memory Machines. LCPC 1994: 251-265 - [c40]Donglai Dai, Sandeep K. S. Gupta, S. D. Kaushik, J. H. Lu, Raj Verdhan Singh, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson:
EXTENT: a portable programming environment for designing and implementing high-performance block recursive algorithms. SC 1994: 49-58 - [c39]Himanshu Gupta, P. Sadayappan:
Communication Efficient Matrix Multiplication on Hypercubes. SPAA 1994: 320-329 - 1993
- [j13]Chua-Huang Huang, P. Sadayappan:
Communication-Free Hyperplane Partitioning of Nested Loops. J. Parallel Distributed Comput. 19(2): 90-102 (1993) - [c38]Debabrata Ghosh, S. K. Nandy, P. Sadayappan, K. Parthasarathy:
Architectural Synthesis of Performance-Driven Multipliers with Accumulator Interleaving. DAC 1993: 303-307 - [c37]Kalluri Eswar, P. Sadayappan, Chua-Huang Huang, V. Visvanathan:
Supernodal Sparse Cholesky Facotrization on Distributed-Memory Multiprocessors. ICPP (3) 1993: 18-22 - [c36]S. K. Nandy, Ranjani Narayan, V. Visvanathan, P. Sadayappan, Prashant S. Chauhan:
A Parallel Progressive Refinement Image Rendering Algorithm on a Scalable Multithreaded VLSI Processor Array. ICPP (3) 1993: 94-97 - [c35]Kalluri Eswar, P. Sadayappan, Chua-Huang Huang:
Compile-Time Characterization of Recurrent Patterns in Irregular Computations. ICPP (2) 1993: 148-155 - [c34]Sandeep K. S. Gupta, S. D. Kaushik, S. Mufti, Sanjay Sharma, Chua-Huang Huang, P. Sadayappan:
On Compiling Array Expressions for Efficient Execution on Distributed-Memory Machines. ICPP (2) 1993: 301-305 - [c33]Bharat Kumar, Chua-Huang Huang, Rodney W. Johnson, P. Sadayappan:
A Tensor Product Formulation of Strassen's Matrix Multiplication Algorithm with Memory Reduction. IPPS 1993: 582-588 - [c32]S. D. Kaushik, Chua-Huang Huang, Rodney W. Johnson, P. Sadayappan:
A Methodology for Generating Efficient Disk-Based Algorithms from Tensor Product Formulas. LCPC 1993: 358-373 - [c31]S. D. Kaushik, Chua-Huang Huang, John R. Johnson, Rodney W. Johnson, P. Sadayappan:
Efficient transposition algorithms for large matrices. SC 1993: 656-665 - 1992
- [j12]Sailesh K. Rao, P. Sadayappan, Frank K. Hwang, Peter W. Shor:
The Rectilinear Steiner Arborescence Problem. Algorithmica 7(2&3): 277-288 (1992) - [j11]J. Ramanujam, P. Sadayappan:
Tiling Multidimensional Itertion Spaces for Multicomputers. J. Parallel Distributed Comput. 16(2): 108-120 (1992) - [j10]Scott McMillan, David E. Orin, P. Sadayappan:
Toward super-real-time simulation of robotic mechanisms using a parallel integration method. IEEE Trans. Syst. Man Cybern. 22(2): 384-391 (1992) - [c30]Scott McMillan, Ponnuswamy Sadayappan, David E. Orin:
Efficient dynamic simulation of multiple manipulator systems with singularities. ICRA 1992: 299-304 - [c29]Sandeep K. S. Gupta, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson:
On the Synthesis of Parallel Programs from Tensor Product Formulas for Block Recursive Algorithms. LCPC 1992: 264-280 - [c28]S. D. Kaushik, Sanjay Sharma, Chua-Huang Huang, Jeremy R. Johnson, Rodney W. Johnson, P. Sadayappan:
An Algebraic Theory for Modeling Direct Interconnection Networks. SC 1992: 488-497 - [c27]Sanjay Sharma, Chua-Huang Huang, P. Sadayappan:
On Data Dependence Analysis for Compiling Programs on Distributed-Memory Machines (Extended Abstract). SIGPLAN Workshop 1992: 13-16 - [c26]Sandeep K. S. Gupta, S. D. Kaushik, Chua-Huang Huang, John R. Johnson, Rodney W. Johnson, P. Sadayappan:
On the Automatic Generation of Data Distributions. SIGPLAN Workshop 1992: 82 - [c25]Sandeep K. S. Gupta, S. D. Kaushik, Chua-Huang Huang, John R. Johnson, Rodney W. Johnson, P. Sadayappan:
A Methodology for Generating Data Distributions to Optimize Communication. SPDP 1992: 436-441 - 1991
- [j9]V. Prasad Krothapalli, P. Sadayappan:
Removal of Redundant Dependences in DOACROSS Loops with Constant Dependences. IEEE Trans. Parallel Distributed Syst. 2(3): 281-289 (1991) - [j8]J. Ramanujam, P. Sadayappan:
Compile-Time Techniques for Data Distribution in Distributed Memory Machines. IEEE Trans. Parallel Distributed Syst. 2(4): 472-482 (1991) - [c24]Kalluri Eswar, P. Sadayappan, V. Visvanathan:
Multifrontal Factorization of Sparse Matrices on Shared-Memory Multiprocessors. ICPP (3) 1991: 159-166 - [c23]Scott Whitman, P. Sadayappan:
Computer Graphics Rendering on a Shared Memory Multiprocessor. ICPP (3) 1991: 197-200 - [c22]Scott McMillan, David E. Orin, Ponnuswamy Sadayappan:
Real-time robot dynamic simulation on a vector/parallel supercomputer. ICRA 1991: 1836-1841 - [c21]Chua-Huang Huang, P. Sadayappan:
Communication-Free Hyperplane Partitioning of Nested Loops. LCPC 1991: 186-200 - [c20]V. Prasad Krothapalli, P. Sadayappan:
Removal of Redundant Dependences in DOACROSS Lops with Constant Dependences. PPoPP 1991: 51-60 - [c19]J. Ramanujam, P. Sadayappan:
Tiling multidimensional iteration spaces for nonshared memory machines. SC 1991: 111-120 - 1990
- [j7]Fikret Erçal, J. Ramanujam, P. Sadayappan:
Task Allocation onto a Hypercube by Recursive Mincut Bipartitioning. J. Parallel Distributed Comput. 10(1): 35-44 (1990) - [j6]P. Sadayappan, Fikret Erçal, J. Ramanujam:
Cluster partitioning approaches to mapping parallel programs onto a hypercube. Parallel Comput. 13(1): 1-16 (1990) - [c18]V. Prasad Krothapalli, P. Sadayappan:
Exploiting Parallelism Through Run-Time Analysis on a Vector Processor (Abstract). ACM Conference on Computer Science 1990: 434 - [c17]J. Ramanujam, P. Sadayappan:
Tiling of Iteration Spaces for Multicomputers. ICPP (2) 1990: 179-186 - [c16]V. Prasad Krothapalli, P. Sadayappan:
Dynamic Scheduling of DOACROSS Loops for Multiprocessors. PARBASE / Architectures 1990: 141-160
1980 – 1989
- 1989
- [j5]P. Sadayappan, V. Visvanathan:
Efficient sparse matrix factorization for circuit simulation on vector supercomputers. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 8(12): 1276-1285 (1989) - [j4]P. Sadayappan, Yong-Long Calvin Ling, Karl W. Olson, David E. Orin:
A restructurable VLSI robotics vector processor architecture for real-time control. IEEE Trans. Robotics Autom. 5(5): 583-599 (1989) - [c15]P. Sadayappan, V. Visvanathan:
Efficient Sparse Matrix Factorization for Circuit Simulation on Vector Supercomputers. DAC 1989: 13-18 - [c14]Amr Zaky, P. Sadayappan:
Optimal Static Scheduling of Sequential Loops on Multiprocessors. ICPP (3) 1989: 130-137 - [c13]Fikret Erçal, P. Sadayappan:
One-to-one mapping of process graphs onto a hypercube. ICS 1989: 91-98 - [c12]Ponnuswamy Sadayappan, Sailesh K. Rao:
Communication reduction for distributed sparse matrix factorization on a processor mesh. SC 1989: 371-379 - [c11]J. Ramanujam, P. Sadayappan:
A methodology for parallelizing programs for multicomputers and complex memory multiprocessors. SC 1989: 637-646 - 1988
- [j3]Cevdet Aykanat, Füsun Özgüner, Fikret Erçal, P. Sadayappan:
Iterative Algorithms for Solution of Large Sparse Systems of Linear Equations on Hypercubes. IEEE Trans. Computers 37(12): 1554-1568 (1988) - [j2]P. Sadayappan, V. Visvanathan:
Circuit Simulation on Shared-Memory Multiprocessors. IEEE Trans. Computers 37(12): 1634-1642 (1988) - [c10]Fikret Erçal, J. Ramanujam, P. Sadayappan:
Task allocation onto a hypercube by recursive mincut bipartitioning. C³P 1988: 210-221 - [c9]P. Sadayappan, V. Visvanathan:
Comparative analysis of approaches to hardware acceleration for sparse-matrix factorization. ICCD 1988: 32-35 - [c8]J. Ramanujam, P. Sadayappan:
Optimization by neural networks. ICNN 1988: 325-332 - [c7]Ashok K. Goel, J. Ramanujam, P. Sadayappan:
Towards a 'neural' architecture for abductive reasoning. ICNN 1988: 681-688 - [c6]Yong-Long Calvin Ling, P. Sadayappan, Karl W. Olson, David E. Orin:
A VLSI robotics vector processor for real-time control. ICRA 1988: 303-308 - [c5]P. Sadayappan, V. Visvanathan:
Parallelization and performance evaluation of circuit simulation on a shared-memory multiprocessor. ICS 1988: 254-265 - [c4]V. Prasad Krothapalli, P. Sadayappan:
An approach to synchronization for parallel computing. ICS 1988: 573-581 - 1987
- [j1]P. Sadayappan, Fikret Erçal:
Nearest-Neighbor Mapping of Finite Element Graphs onto Processor Meshes. IEEE Trans. Computers 36(12): 1408-1424 (1987) - [c3]P. Sadayappan, Fikret Erçal, Steven Martin:
Mapping Finite Element Graphs onto Processor Meshes. ICPP 1987: 192-195 - [c2]P. Sadayappan, Fikret Erçal:
Cluster-Partitioning Approaches to Mapping Parallel Programs onto a Hypercube. ICS 1987: 475-497 - 1985
- [c1]V. Ashok, Roger L. Costello, P. Sadayappan:
Modeling switch-level simulation using data flow. DAC 1985: 637-644
Coauthor Index
aka: Jagannathan Ramanujam
aka: Gerald M. Sabin
aka: Naga Vydyanathan
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 22:21 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint