You are here

P Sadayappan

  • Professor, Computer Science & Engineering
  • 595 Dreese Laboratories
    2015 Neil Ave
    Columbus, OH 43210
  • 614-292-0053

Honors

  • 2016

    Joel & Ruth Spira Excellence in Teaching Award. .

  • 2015

    Outstanding Teaching Award. .

  • 2008

    Outstanding Teaching Award. .

  • 2008

    Lumley Research Award. .

  • 2006

    Outstanding Service Award. .

  • 2004

    Best Paper Award. .

  • 2003

    Best Paper Award. .

  • 2002

    Lumley Research Award. .

  • 1999

    Outstanding Service Award. .

  • 1997

    Lumley Research Award. .

Journal Articles

2015

  • Venmugil Elango, Naser Sedaghati, Fabrice Rastello, Louis-Noël Pouchet, J. Ramanujam, Radu Teodorescu, P. Sadayappan, 2015, "On Using the Roofline Model with Lower Bounds on Data Movement." ACM Transactions on Architecture and Code Optimization 11, no. 4, 67:1 - 67:23.
  • Arash Ashari, Naser Sedaghati, John Eisenlohr, P. Sadayappan, 2015, "A model-driven blocking strategy for load balanced sparse matrix-vector multiplication on GPUs." J. Parallel Distrib. Comput 76, 3 - 15.
  • Martin Kong, Antoniu Pop, Louis-Noël Pouchet, R. Govindarajan, Albert Cohen, P. Sadayappan, 2015, "Compiler/Runtime Framework for Dynamic Dataflow Parallelization of Tiled Programs." ACM Transactions on Architecture and Code Optimization 11, no. 4, 61:1 - 61:30.

2014

  • Tobias Grosser, Sven Verdoolaege, Albert Cohen, P. Sadayappan, 2014, "The Relation Between Diamond Tiling and Hexagonal Tiling." Parallel Processing Letters 24, no. 3, 20 -

2013

  • Daniel Lowell, Jeswin Godwin, Justin Holewinski, Deepan Karthik, Chekuri Choudary, Azamat Mametjanov, Boyana Norris, Gerald Sabin, P. Sadayappan, Jason Sarich, 2013, "Stencil-Aware GPU Optimization of Iterative Solvers." SIAM J. Scientific Computing 35, no. 5, 209 - 228.
  • Park,Eunjung; Cavazos,John; Pouchet,Louis-Noel; Bastoul,Cedric; Cohen,Albert; Sadayappan,P, 2013, "Predictive Modeling in a Polyhedral Optimization Space." INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING 41, no. 5, 704 - 750.
  • Fauzia,Naznin; Elango,Venmugil; Ravishankar,Mahesh; Ramanujam,J; Rastello,Fabrice; Rountev,Atanas; Pouchet,Louis-Noel; Sadayappan,P, 2013, "Beyond Reuse Distance Analysis: Dynamic Analysis for Characterization of Data Locality Potential." ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION 10, no. 4, 53 -
  • Tavarageri,Sanket; Ramanujam,J; Sadayappan,P, 2013, "Adaptive parallel tiled code generation and accelerated auto-tuning." INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS 27, no. 4, 412 - 425.

2012

  • Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Gerald Baumgartner, J. Ramanujam, and P. Sadayappan, 2012, "Empirical performance model-driven data layout optimization and library call selection for tensor contraction expressions." J. Parallel Distrib. Comput. 72, no. 3, 338 - 352.
  • Kevin Stock, Louis-Noël Pouchet, and P. Sadayappan, 2012, "Using machine learning to improve automatic vectorization." ACM Transactions on Architecture and Code Optimization 8, no. 4, 50 -

2011

  • Chi-Chung Lam, Thomas Rauber, Gerald Baumgartner, Daniel Cociorva, and P. Sadayappan, 2011, "Memory-optimal evaluation of expression trees involving large objects." Computer Languages, Systems & Structures 37, no. 2, 63 - 75.
  • Xintian Yang, Srinivasan Parthasarathy, P. Sadayappan, 2011, "Fast Sparse Matrix-Vector Multiplication on GPUs: Implications for Graph Mining." pVLDB 4, no. 4, 231 - 242.
  • Naga Vydyanathan, Ümit V. Çatalyürek, Tahsin M. Kurç, P Sadayappan, and Joel H. Saltz, 2011, "Optimizing latency and throughput of application workflows on clusters." Parallel Computing 37, no. 10-11, 694 - 712.
  • Lam, CC; Rauber, T; Baumgartner, G; Cociorva, D; Sadayappan, P, 2011, "Memory-optimal evaluation of expression trees involving large objects." COMPUTER LANGUAGES SYSTEMS & STRUCTURES 37, no. 2, 63 - 75.

2010

  • Kumar, VS; Kurc, T; Ratnakar, V; Kim, J; Mehta, G; Vahi, K; Nelson, YL; Sadayappan, P; Deelman, E; Gil, Y; Hall, M; Saltz, J, 2010, "Parameterized specification, configuration and execution of data-intensive scientific workflows." CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS 13, no. 3, 315 - 333.

2009

  • Hartono, A.; Lu, Q.; Henretty, T.; Krishnamoorthy, S.; Zhang, H.; Pitzer, R.; Sadayappan, P.; Baumgartner, G.; Ramanujam, J.; Bernholdt, D. E.; Nooijen, M., 2009, "Performance optimization of tensor contraction expressions for many-body methods in quantum chemistry." Journal of Physical Chemistry A 113, no. 45, 12715 - 12723.
  • Vydyanathan, N.; Krishnamoorthy, S.; Sabin, G. M.; Catalyurek, U. V.; Kurc, T.; Saltz, J. H.; Catalyurek, U. V.; Sadayappan, P.; Saltz, J. H., 2009, "An integrated approach to locality-conscious processor allocation and scheduling of mixed-parallel applications." IEEE Transactions on Parallel and Distributed Systems 20, no. 8, 1158 - 1172.

2008

  • Shet, A. G.; Sadayappan, P.; Bernholdt, D. E.; Nieplocha, J.; Tipparaju, V., 2008, "A framework for characterizing overlap of communication and computation in parallel applications." Cluster Computing 11, no. 1, 75 - 90.
  • Dinan, J.; Sabin, G.; Sadayappan, P.; Olivier, S.; Prins, J.; Tseng, C. -W., 2008, "A message passing benchmark for unbalanced applications." Simulation Modelling Practice and Theory 16, no. 9, 1177 - 1189.

2007

  • S. Krishnamoorthy, G. Baumgartner, C. Lam, J. Nieplocha, and P. Sadayappan, 2007, ""Efficient Search-Space Pruning for Integrated Fusion and Tiling Transformations"." Concurrency and Computation: Practice and Experience 19(18), 2425 - 2443.
  • Gao, X.; Krishnamoorthy, S.; Sahoo, S. K.; Lam, C. -C.; Sadayappan, P.; Baumgartner, G.; Ramanujam, J., 2007, "Efficient search-space pruning for integrated fusion and tiling transformations." Concurrency Computation Practice and Experience 19, no. 18, 2425 - 2443.

2006

  • A. Auer, G. Baumgartner, D. E. Bernholdt, A. Bibireata, V. Choppella, D. Cociorva, X. Gao, R.Harrison, S. Krishanmoorthy, S. Krishnan, C.-C. Lam, M. Nooijen, R. Pitzer, J. Ramanujam, P. Sadayappan, and A. Sibiryakov, 2006, "“Automatic Code Generation for Many-Body Electronic Structure Methods: The Tensor Contraction Engine”." Molecular Physics 104, no. 2, 211 - 228.
  • S. Krishnan, S. Krishnamoorthy, G. Baumgartner, C. Lam, J. Ramanujam, P. Sadayappan and V. Choppella, 2006, "“Efficient Synthesis of Out-of-Core Algorithms Using a Nonlinear Optimization Solver”." Journal of Parallel and Distributed Computing 66(5), 659 - 673.

2005

  • R. Kettimuthu, V. Subramani, S. Srinivasan, T. B. Gopalsamy, D. K. Panda and P. Sadayappan, 2005, "“Selective Preemption Strategies for Parallel Job Scheduling”." Intl. Journal of High Performance Computing and Networking Vol. 3, Nos. 2/3, 122 - 152.
  • G. Baumgartner, A. Auer, D. E. Bernholdt, A. Bibireata, V. Choppella, D. Cociorva, X. Gao, R. J. Harrison, S. Hirata, J. Ramanujam, S. Krishanmoorthy, S. Krishnan, C. Lam, Q. Lu, M. Nooijen, R. Pitzer, P. Sadayappan and A. Sibiryakov, 2005, "“Synthesis of High-Performance Parallel Programs for a Class of ab initio Quantum Chemistry Models”." Proceedings of the IEEE 93, no. 2, 276 - 292.

2004

  • S. Srinivasan, S. Krishnamoorthy and P. Sadayappan, 2004, "“A Robust Scheduling Strategy for Moldable Scheduling of Parallel Jobs”." Intl. Journal of High Performance Computing and Networking Vol. 2, No.2/3/4, 120 - 132.
  • S. Krishnamoorthy, G. Baumgartner, D. Cociorva, C. Lam and P. Sadayappan, 2004, "“Efficient Parallel Out-of-Core Matrix Transposition”." Intl. Journal of High Performance Computing and Networking Vol. 2(2/3/4), 110 - 119.

2001

  • N. S. Sundar, D. N. Jayasimha, D. K. Panda and P. Sadayappan, 2001, "“Hybrid Algorithms for Complete Exchange in 2D Meshes”." IEEE Transactions on Parallel and Distributed Systems Vol. 12, no. 12, 1201 - 1218.

1998

  • P. Sadayappan, F. Ercal and J. Ramanujam, 1998, "“Partitioning Graphs on Message-Passing Machines by Pairwise Mincut”." Information Sciences 111, 223 - 237.
  • Gupta,S,KS; Huang,C,H; Sadayappan,P; Johnson,R,W, 1998, "A technique for overlapping computation and communication for block recursive algorithms." CONCURRENCY-PRACTICE AND EXPERIENCE 10, no. 2, 73 - 90.

1997

  • C. Lam, C.-H. Huang and P. Sadayappan, 1997, "“Optimal Algorithms for All-to-all Complete Exchange on Rings and Tori"." Journal of Parallel and Distributed Computing 43, no. 1, 1 - 13.
  • S. K. S. Gupta, C.-H. Huang, P. Sadayappan, and R. W. Johnson, 1997, "“A Technique for Overlapping Computation and Communication for Block Recursive Algorithms”." Concurrency: Practiceand Experience 9, no. 12,
  • C. Lam, P. Sadayappan and R. Wenger, 1997, "”On Optimizing a Class of Multi-Dimensional Loops with Reductions for Parallel Execution”." Parallel Processing Letters 7, no. 2, 157 - 168.
  • Lam,C,C; Huang,C,H; Sadayappan,P, 1997, "Optimal algorithms for all-to-all personalized communication on rings and two dimensional tori." JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING 43, no. 1, 3 - 13.

1996

  • S. K. S. Gupta, C.-H. Huang, P. Sadayappan, and R. W. Johnson, 1996, "“A Framework for Generating Distributed-Memory Parallel Programs for Block Recursive Algorithms,”." Journal of Parallel and Distributed Computing 34, no. 2, 137 - 153.
  • S. K. S. Gupta, S. D. Kaushik, C.-H. Huang, and P. Sadayappan, 1996, "“Compiling Array Expressions for Efficient Execution on Distributed-Memory Machines"." Journal of Parallel and Distributed Computing 32, no. 2, 155 - 172.
  • S.D. Kaushik, S. Sharma, C.-H. Huang, J.R. Johnson, R. W. Johnson, and P. Sadayappan, 1996, "“An Algebraic Theory for Modeling Direct Interconnection Networks”." Journal of Information Science and Engineering 10, no. 1, 25 - 49.
  • Y. S. Choi-Grogan, K. Eswar, P. Sadayappan, and R. Lee, 1996, "“Sequential and Parallel Implementations of a Partitioning Finite Element Method,”." IEEE Transactions on Antennas and Propagation 44, no. 12, 1609 - 1616.
  • H. Gupta and P. Sadayappan, 1996, "“Communication-Efficient Matrix Multiplication on Hypercubes”." Parallel Computing 22, 75 - 99.
  • ChoiGrogan,Y,S; Eswar,K; Sadayappan,P; LEE,R, 1996, "Sequential and parallel implementations of the partitioning finite-element method." IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION 44, no. 12, 1609 - 1616.
  • S. D. Kaushik, C.-H. Huang and P. Sadayappan, 1996, ""Efficient Index Set Generation for Compiling HPF Array Statements on Distributed Memory Machines”." Journal of Parallel and Distributed Computing 38, no. 2, 237 - 247.

1993

  • Huang,C,H; Sadayappan,P, 1993, "COMMUNICATION-FREE HYPERPLANE PARTITIONING OF NESTED LOOPS." JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING 19, no. 2, 90 - 102.

1992

  • Huang,C,H; Sadayappan,P, 1992, "COMMUNICATION-FREE HYPERPLANE PARTITIONING OF NESTED LOOPS." LECTURE NOTES IN COMPUTER SCIENCE 589, 186 - 200.
  • Ramanujam,J; Sadayappan,P, 1992, "TILING MULTIDIMENSIONAL ITERATION SPACES FOR MULTICOMPUTERS." JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING 16, no. 2, 108 - 120.

1990

  • Sadayappan,P; Ercal,F; Ramanujam,J, 1990, "CLUSTER PARTITIONING APPROACHES TO MAPPING PARALLEL PROGRAMS ONTO A HYPERCUBE." PARALLEL COMPUTING 13, no. 1, 1 - 16.
  • Ercal,F; Ramanujam,J; Sadayappan,P, 1990, "TASK ALLOCATION ONTO A HYPERCUBE BY RECURSIVE MINCUT BIPARTITIONING." JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING 10, no. 1, 35 - 44.

1989

  • Sadayappan,P; LING,Y,LC; OLSON,K,W; Orin,D,E, 1989, "A RESTRUCTURABLE VLSI ROBOTICS VECTOR PROCESSOR ARCHITECTURE FOR REAL-TIME CONTROL." IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION 5, no. 5, 583 - 599.
  • Hayot,F; Mandal,M; Sadayappan,P, 1989, "IMPLEMENTATION AND PERFORMANCE OF A BINARY LATTICE GAS ALGORITHM ON PARALLEL PROCESSOR SYSTEMS." JOURNAL OF COMPUTATIONAL PHYSICS 80, no. 2, 277 - 287.
  • Sadayappan,P; Visvanathan,V, 1989, "EFFICIENT SPARSE-MATRIX FACTORIZATION FOR CIRCUIT SIMULATION ON VECTOR SUPERCOMPUTERS." IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS 8, no. 12, 1276 - 1285.

1988

  • Sadayappan,P; Visvanathan,V, 1988, "CIRCUIT SIMULATION ON SHARED-MEMORY MULTIPROCESSORS." IEEE TRANSACTIONS ON COMPUTERS 37, no. 12, 1634 - 1642.
  • Sadayappan,P; Ercal,F, 1988, "CLUSTER-PARTITIONING APPROACHES TO MAPPING PARALLEL PROGRAMS ONTO A HYPERCUBE." LECTURE NOTES IN COMPUTER SCIENCE 297, 475 - 497.
  • Aykanat,C; Ozguner,F; Ercal,F; Sadayappan,P, 1988, "ITERATIVE ALGORITHMS FOR SOLUTION OF LARGE SPARSE SYSTEMS OF LINEAR-EQUATIONS ON HYPERCUBES." IEEE TRANSACTIONS ON COMPUTERS 37, no. 12, 1554 - 1568.

1987

  • Sadayappan,P; Ercal,F, 1987, "NEAREST-NEIGHBOR MAPPING OF FINITE-ELEMENT GRAPHS ONTO PROCESSOR MESHES." IEEE TRANSACTIONS ON COMPUTERS 36, no. 12, 1408 - 1424.

Presentations

  • "Compiler Optmization for Heterogeneous Computing (Keynote)." 2011, Presented at Workshop on Characterizing Applications for Heterogeneous Exascale Systems (CACHES 2011),
  • "Domain-Specific Compiler Optimization for High-Performance Computing." 2011, Presented at Indian Institute of Technology, Bombay,
  • "Compiler optimization for high-performance computing." 2011, Presented at Indian Institute of Technology, Madras,
  • "Software Challenges for High Performance Computing." 2011, Presented at Indian Institute of Science, Bengaluru, India,
  • "Domain-Specific Frameworks for High-Performance Computing." 2011, Presented at Indian Association for the Cultivation of Science,
  • "Pattern-Based Compiler Optimization for Performance Portability." 2012, Presented at CNRS. Lyon, France,
  • "Domain-specific abstractions for performance portability." 2012, Presented at Imperial College, London,
  • "Domain-specific abstractions for performance portability." 2012, Presented at University of Illinois at Urbana-Champaign,
  • "Domain-Specific Abstractions for High-Performance Computing." 2012, Presented at Distinguished Seminar Series, Department of Computer Science, University of Illinois,
  • "Tiling: Progress and Challenges." 2014, Presented at SIAM Parallel Processing Conference: MiniSymposium on Tiling,
  • "Distributed Contraction of Tensors." 2014, Presented at SIAM Parallel Processing: Workshop on Parallel Quantum Chemistry,
  • "Domain-Specific Abstractions for Compiler Optimization." 2014, Presented at Seminar at Stony Brook University,
  • "Challenges in Optimization of Stencil Computations." 2013, Presented at Workshop on Optimizing Stencil Computations (WOSC) 2013, help with OOPSLA/SPLASH 2013,
  • "Future Computational Challenges." 2014, Presented at Software Innovation Institute for Computational Chemistry and Materials Modeling,
  • "Domain-specific abstractions for compiler optimization." 2014, Presented at Seminar at University of Utah,

Papers in Proceedings

2017

  • Samyam Rajbhandari, Fabrice Rastello, Karol Kowalski, Sriram Krishnamoorthy, P Sadayappan "Optimizing the Four-Index Integral Transform Using Data Movement Lower Bounds Analysis." New York, (1 2017). 327 - 340.
  • Israt Nisa, Aravind Sukumaran-Rajam, Rakshith Kunchum, P Sadayappan "Parallel CCD++ on GPU for Matrix Factorization." New York, (2 2017). 73 - 83.

2016

  • Compiler Support for Software Cache Coherence "Sanket Tavarageri, Wooil Kim, Josep Torrellas, P Sadayappan." New York, (12 2016). 341 - 350.
  • Wenlei Bao, Changwan Hong, Sudheer Chunduri, Sriram Krishnamoorthy, Louis-Noël Pouchet, Fabrice Rastello, P Sadayappan "Static and dynamic frequency scaling on multicore CPUs." JOURNAL: "ACM Transactions on Architecture and Code Optimization (TACO)." (12 2016). #51 -
  • Prashant Singh Rawat, Changwan Hong, Mahesh Ravishankar, Vinod Grover, Louis-Noël Pouchet, Atanas Rountev, P Sadayappan "Resource conscious reuse-driven tiling for GPUs." New York, (9 2016). 99 - 111.
  • Timothy Carpenter, Fabrice Rastello, P Sadayappan, Anastasios Sidiropoulos "Brief Announcement: Approximating the I/O Complexity of One-Shot Red-Blue Pebbling." New York, (7 2016). 161 - 163.
  • Wenlei Bao, Sriram Krishnamoorthy, Louis-Noël Pouchet, Fabrice Rastello, P. Sadayappan "PolyCheck: dynamic verification of iteration space transformations on affine programs." New York, (1 2016). 539 - 554.
  • Samyam Rajbhandari, Jinsung Kim, Sriram Krishnamoorthy, Louis-Noël Pouchet, Fabrice Rastello, Robert J. Harrison, P. Sadayappan "On fusing recursive traversals of K-d trees." New York, (3 2016). 152 - 162.
  • Lukasz Domagala, Duco van Amstel, Fabrice Rastello, P. Sadayappan "Register allocation and promotion through combined instruction scheduling and loop unrolling." New York, (3 2016). 143 - 151.
  • Changwan Hong, Wenlei Bao, Albert Cohen, Sriram Krishnamoorthy, Louis-Noël Pouchet, Fabrice Rastello, J Ramanujam, P Sadayappan "Effective padding of multidimensional arrays to avoid cache conflict misses." New York, (6 2016). 129 - 144.
  • Martin Kong, Louis-Noël Pouchet, P Sadayappan, Vivek Sarkar "PIPES: a language and compiler for task-based programming on distributed-memory clusters." New York, (11 2016). #39 -
  • Samyam Rajbhandari, Jinsung Kim, Sriram Krishnamoorthy, Louis-Noel Pouchet, Fabrice Rastello, Robert J Harrison, P Sadayappan "A domain-specific compiler for a parallel multiresolution adaptive numerical simulation environment." New York, (11 2016). #40 -

2015

  • Naznin Fauzia, Louis-Noël Pouchet, P. Sadayappan "Characterizing and enhancing global memory data coalescing on GPUs." New York, (2 2015). 10 -
  • Arash Ashari, Shirish Tatikonda, Matthias Boehm, Berthold Reinwald, Keith Campbell, John Keenleyside, P. Sadayappan "On optimizing machine learning workloads via kernel fusion." New York, (2 2015). 173 - 182.
  • Tobias Grosser, Sebastian Pop, Louis-Noël Pouchet, P. Sadayappan, Sebastian Pop "Optimistic Delinearization of Parametrically Sized Arrays." New York, (6 2015). 351 - 360.
  • Venmugil Elango, Fabrice Rastello, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan "On Characterizing the Data Access Complexity of Programs." New York, (1 2015). 567 - 580.
  • Naser Sedaghati, Te Mu, Louis-Noël Pouchet, Srinivasan Parthasarathy, P. Sadayappan "Automatic Selection of Sparse Matrix Representation on GPUs." New York, (6 2015). 99 - 108.
  • Rajkumar Kettimuthu, Gayane Vardoyan, Gagan Agrawal, P. Sadayappan, Ian T. Foster "An elegant sufficiency: load-aware differentiated scheduling of data transfers." New York, (11 2015). 46:1 - 12.
  • Mahesh Ravishankar, Roshan Dathathri, Venmugil Elango, Louis-Noël Pouchet, J. Ramanujam, Atanas Rountev, P. Sadayappan "Distributed memory code generation for mixed Irregular/Regular computations." New York, (2 2015). 65 - 75.

2014

  • Venmugil Elango, Fabrice Rastello, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan "On Characterizing the Data Movement Complexity of Computational DAGs for Parallel Execution." in SPAA 2014. New York, (6 2014). 10 -
  • Kevin Stock, Martin Kong, Tobias Grosser, Louis-Noël Pouchet, Fabrice Rastello, J. Ramanujam, P. Sadayappan "A framework for enhancing data reuse via associative reordering." in PLDI 2014. New York, (6 2014). 12 -
  • Tobias Grosser, Albert Cohen, Justin Holewinski, P. Sadayappan, Sven Verdoolaege "Hybrid Hexagonal/Classical Tiling for GPUs." in CGO 2014. New York, (2 2014). 66 - 75.
  • Sanket Tavarageri, Sriram Krishnamoorthy, P. Sadayappan "Compiler-assisted detection of transient memory errors." in PLDI 2014. New York, (6 2014). 10 -
  • Samyam Rajbhandari, Akshay Nikam, Pai-Wei Lai, Kevin Stock, Sriram Krishnamoorthy, P. Sadayappan "A Communication-Optimal Framework for Contracting Distributed Tensors." New York, (11 2014). 375 - 386.
  • Arash Ashari, Naser Sedaghati, John Eisenlohr, Srinivasan Parthasarathy, P. Sadayappan: "Fast Sparse Matrix-Vector Multiplication on GPUs for Graph Applications." New York, (11 2014). 781 - 792.

2013

  • Louis-Noel Pouchet, Peng Zhang, P. Sadayappan, and Jason Cong "Polyhedral-based data reuse optimization for configurable computing." New York, (1 2013). 29 - 38.
  • Tobias Grosser, Albert Cohen, Paul H. J. Kelly, J. Ramanujam, P. Sadayappan, and Sven Verdoolaege "Split tiling for GPUs: automatic parallelization using trapezoidal tiles." New York, (3 2013). 24 - 31.
  • Thomas Henretty, Richard Veras, Franz Franchetti, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan "A stencil compiler for short-vector SIMD architectures." in ICS 2013. New York, (6 2013). 13 - 24.
  • Pai-Wei Lai, Kevin Stock, Samyam Rajbhandari, Sriram Krishnamoorthy, P. Sadayappan "A framework for load balancing of tensor contraction expressions via dynamic task partitioning." in Supercomputing. New York, (11 2013). 13 -
  • Pai-Wei Lai, Humayun Arafat, Venmugil Elango, Ponnuswamy Sadayappan "Accelerating Strassen-Winograd's matrix multiplication algorithm on GPUs." in HiPC 2013. Los Alamitos, (12 2013). 139 - 148.
  • Martin Kong, Richard Veras, Kevin Stock, Franz Franchetti, Louis-Noël Pouchet, P. Sadayappan "When polyhedral transformations meet SIMD code generation." in PLDI 2013. New York, (6 2013). 127 - 138.
  • Ye Wang, Srinivasan Parthasarathy, P. Sadayappan "Stratification driven placement of complex data: A framework for distributed data analytics." in ICDE 2013. Los Alamitos, (6 2013). 709 - 720.

2012

  • Qingpeng Niu, James Dinan, Qingda Lu, and P. Sadayappan "PARDA: A Fast Parallel Reuse Distance Analysis Algorithm." Shanghai, (5 2012). 1 - 10.
  • Justin Holewinski, Louis-Noël Pouchet, and P. Sadayappan "High-performance code generation for stencil computations on GPU architectures." New York, (6 2012). 1 - 10.
  • Jun Shirako, Kamal Sharma, Naznin Fauzia, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan, and Vivek Sarkar "Analytical Bounds for Optimal Tile Size Selection." Estonia, (3 2012). 101 - 121.
  • Mahesh Ravishankar, John Eisenlohr, Louis-Noël Pouchet, J. Ramanujam, Atanas Rountev, and P. Sadayappan "Code generation for parallel execution of a class of irregular loops on distributed memory systems." Los Alamitos, (11 2012). 1 - 11.
  • Pai-Wei Lai, Huaijian Zhang, Samyam Rajbhandari, Edward Valeev, Karol Kowalski, P. Sadayappan "Effective Utilization of Tensor Symmetry in Operation Optimization of Tensor Contraction Expressions." New York, (6 2012). 412 - 421.
  • Md. Humayun Arafat, James Dinan, Sriram Krishnamoorthy, Theresa Windus, and P. Sadayappan "Load Balancing of Dynamical Nucleation Theory Monte Carlo Simulations Through Resource Sharing Barriers." Shanghai, (5 2012). 1 - 10.
  • Justin Holewinski, Ragavendar Ramamurthi, Mahesh Ravishankan, Naznin Fauzia, Louis-Noel Pouchet, Atanas Rountev, and P. Sadayappan "Dynamic Trace-Based Analysis of Vectorization Potential of Applications." Beijing, (6 2012). 1 - 10.
  • Shirako, Jun, Kamal Sharma, Naznin Fauzia, Louis-Noël Pouchet, J. Ramanujam, P. Sadayappan, and Vivek Sarkar "Analytical bounds for optimal tile size selection." in Compiler Construction. Heidelberg, (6 2012). 101 - 121.

2011

  • Sanket Tavarageri, Louis-Noël Pouchet, J. Ramanujam, Atanas Rountev, and P. Sadayappan "Dynamic selection of tile sizes." Bengaluru, (12 2011). 1 - 10.
  • Nawab Ali, Sriram Krishnamoorthy, Niranjan Govind, Karol Kowalski, and P. Sadayappan "Application-Specific Fault Tolerance via Data Access Characterization." Bordeaux, (8 2011). 340 - 352.
  • Pouchet, LN; Bondhugula, U; Bastoul, C; Cohen, A; Ramanujam, J; Sadayappan, P; Vasilache, N "Loop Transformations: Convexity, Pruning and Optimization." in ACM Symposium on Principles of Programming Languages (POPL 2011). Columbus, (1 2011). 549 - 561.
  • Eunjung Park, Louis-Noël Pouchet, John Cavazos, Albert Cohen, P. Sadayappan "Predictive modeling in a polyhedral optimization space." in International Symposium on Code Generation and Optimization (CGO 2011). (4 2011).
  • Naser Sedaghati, Renji Thomas, Louis-Noël Pouchet, Radu Teodorescu, and P. Sadayappan "StVEC: A Vector Instruction Extension for High Performance Stencil Computation." in PACT. Galveston, (10 2011). 1 - 10.
  • Kevin Stock, Thomas Henretty, Iyyappa Murugandi, P. Sadayappan, Robert Harrison "Model-Driven SIMD Code Generation for a Multi-Resolution Tensor Kernel." in IEEE International Parallel and Distributed Processing Symposium (IPDPS 2011). Anchorage, (5 2011). 1 - 10.
  • Thomas Henretty, Kevin Stock, Louis-Noël Pouchet, Franz Franchetti, J. Ramanujam, P. Sadayappan "Data Layout Transformation for Stencil Computations on Short-Vector SIMD Architectures." in International Conference on Compiler Construction (CC 2011). (3 2011).

2010

  • James Dinan, Pavan Balaji, Ewing Lusk, Rajeev Thakur, P. Sadayappan "Hybrid parallel programming with MPI and Unified Parallel C." in Proceedings of ACM International Conference on Computing Frontiers, CF 2010. (5 2010).
  • Muthu Manikandan Baskaran, J. Ramanujam, P. Sadayappan "Automatic C-to-CUDA code generation for affine programs." in 19th International Conference on Compiler Construction, CC 2010. (3 2010). 244 - 263.
  • Atanas Rountev, Kevin Van Valkenburgh, Dacong Yan, P. Sadayappan "Understanding parallelism-inhibiting dependences in sequential Java programs." in International Conference on Software Maintenance (ICSM 2010). (9 2010).
  • Muthu Manikandan Baskaran, Albert Hartono, Sanket Tavarageri, Thomas Henretty, J. Ramanujam, P. Sadayappan "Parameterized tiling revisited." in 8th International Symposium on Code Generation and Optimization (CGO 2010). (4 2010).
  • Giridhar Sreenivasa Murthy, Mahesh Ravishankar, Muthu Manikandan Baskaran, P. Sadayappan "Optimal loop unrolling for GPGPU programs." in Proceedings of IEEE International Parallel & Distributed Processing Symposium, IPDPS 2010. (4 2010).
  • A. Hartono, M. Baskaran, J. Ramanujam, and P. Sadayappan "Parametric Tiled Loop Generation for Effective Parallel Execution on Multicore Processors." in Proc. 24th IEEE International Parallel and Distributed Processing Symposium (IPDPS). (1 2010).
  • Louis-Noël Pouchet, Uday Bondhugula, Cédric Bastoul, Albert Cohen, J. Ramanujam, P. Sadayappan "Combined Iterative and Model-driven Optimization in an Automatic Parallelization Framework." in Supercomputing 2010 (SC 2010). (11 2010).

2009

  • Qingda Lu, Christophe Alias, Uday Bondhugula, Thomas Henretty, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan, Yongjian Chen, Haibo Lin, Tin-fook Ngai "Data Layout Transformation for Enhancing Data Locality on NUCA Chip Multiprocessors." in Proceedings of the 18th International Conference on Parallel Architectures and Compilation Techniques, PACT 2009. (9 2009).
  • Nawab Ali, Philip H. Carns, Kamil Iskra, Dries Kimpe, Samuel Lang, Robert Latham, Robert B. Ross, Lee Ward, P. Sadayappan "Scalable I/O forwarding framework for high-performance computing systems." in Proceedings of the 2009 IEEE International Conference on Cluster Computing. (9 2009).
  • James Dinan, D. Brian Larkins, P. Sadayappan, Sriram Krishnamoorthy, Jarek Nieplocha "Scalable work stealing." in roceedings of the ACM/IEEE Conference on High Performance Computing, SC 2009. (11 2009).
  • Vijay S. Kumar, P. Sadayappan, Gaurang Mehta, Karan Vahi, Ewa Deelman, Varun Ratnakar, Jihie Kim, Yolanda Gil, Mary W. Hall, Tahsin M. Kurç, Joel H. Saltz "An integrated framework for performance-based optimization of scientific workflows." in Proceedings of the 18th ACM International Symposium on High Performance Distributed Computing, HPDC 2009. (6 2009).
  • Muthu Manikandan Baskaran, Nagavijayalakshmi Vydyanathan, Uday Bondhugula, J. Ramanujam, Atanas Rountev "Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors." in Proceedings of the 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2009. (2 2009).
  • Albert Hartono, Muthu Manikandan Baskaran, Cédric Bastoul, Albert Cohen, Sriram Krishnamoorthy, Boyana Norris, J. Ramanujam, P. Sadayappan "Parametric multi-level tiling of imperfectly nested loops." in Proceedings of the International Conference on Supercomputing. (6 2009).
  • Jiang Lin, Qingda Lu, Xiaoning Ding, Zhao Zhang, Xiaodong Zhang, P. Sadayappan "Enabling software management for multicore caches with a lightweight hardware support." in roceedings of the ACM/IEEE Conference on High Performance Computing, SC 2009. (11 2009).
  • Albert Hartono, Boyana Norris, and P. Sadayappan "Annotation-based empirical performance tuning using Orio." in Proceedings of the 2009 IEEE International Parallel and Distributed Processing Symposium. (5 2009).
  • Qingda Lu, Jiang Lin, Xiaoning Ding, Zhao Zhang, Xiaodong Zhang, P. Sadayappan "Soft-OLP: Improving Hardware Cache Performance through Software-Controlled Object-Level Partitioning." in Proceedings of the 18th International Conference on Parallel Architectures and Compilation Techniques, PACT 2009. (9 2009).

2008

  • M. Baskaran, U. Bondhugula, J. Ramanujam, A. Rountev, and P. Sadayappan "“A compiler framework for optimization of affine loop nests for GPGPUs”." in ACM International Conferenceon Supercomputing (ICS). (1 2008).
  • G. Khanna, U. Catalyurek, T. Kurc, R. Kettimuthu, P. Sadayappan, and J. H. Saltz "“A dynamic scheduling approach for coordinated wide-area data transfers using GridFTP,”." in Proc. 22nd IEEE International Parallel and Distributed Processing Symposium (IPDPS). (4 2008).
  • G. Khanna, U. Catalyurek, T. Kurc, R. Kettimuthu, P. Sadayappan, I. Foster, J. H. Saltz "Using overlays for efficient data transfer over shared wide-area networks." in Proc. Supercomputing (SC). (11 2008).
  • D. B. Larkins, J. Dinan, S. Krishnamoorthy, S. Parthasarathy, A. Rountev, and P. Sadayappan "“Global trees: a framework for linked data structures on distributed memory parallel systems”." in Proc. Supercomputing (SC). (11 2008).
  • U. Bondhugula, M. Baskaran, S. Krishnamoorthy, J. Ramanujam, A. Rountev and P. Sadayappan "Automatic transformations for communication-minimized parallelization and locality optimization in the polyhedral model." in International Conference on Compiler Construction(CC). (1 2008).
  • U. Bondhugula, A. Hartono, J. Ramanujam, and P. Sadayappan "“A Practical Automatic Polyhedral Parallelizer and Locality Optimizer,”." in ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI). (1 2008).
  • J. Lin, Q. Lu, X. Ding, Z. Zhang, X. Zhang, and P. Sadayappan "Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems." in 14th International Symposium on High-Performance Computer Architecture (HPCA). (2 2008).
  • M. Baskaran, U. Bondhugula, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan "Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories." in 13th ACM SIGPLAN Symposium on Principlesand Practice of Parallel Programming (PPoPP). (1 2008).

2007

  • U. Bondhugula, J. Ramanujam, and P. Sadayappan "Automatic Mapping of Nested Loops to FPGAs." in Proc. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP). (3 2007).
  • S. Krishnamoorthy. M. Baskaran, U. Bondhugula, J. Ramanujam, A. Rountev, and P. Sadayappan "Effective automatic parallelization of stencil computations." in ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI). (1 2007).

2006

  • S. Krishnamoorthy, U. Catalyurek, J. Nieplocha, A. Rountev and P. Sadayappan "Hypergraph Partitioning for Automatic Memory Hierarchy Management." in Proc. Supercomputing (SC). (11 2006).
  • U. Bondhugula, A. Devulapalli, J. Dinan, J. Fernando, P. Wyckoff, E. Stahlberg, and P. Sadayappan. "Hardware/Software Integration for FPGA-based All-Pairs Shortest-Paths." in IEEE Symposium on Field-Programmable Custom Computing Machines (FCCM). (4 2006).
  • Q. Lu, S. Krishnamoorthy and P. Sadayappan "Combining analytical and empirical approachesin tuning matrix transposition." in Proc. Intl. Conf. on Parallel Architectures and Compilation Techniques (PACT 2006). (9 2006).
  • U. Bondhugula, A. Devulapalli, J. Fernando, P. Wyckoff, and P. Sadayappan "Parallel FPGA based All-Pairs Shortest-Paths in a Directed Graph." in Proc. 20th IEEE International Parallel and Distributed Processing Symposium (IPDPS). (4 2006).

2005

  • A. Hartono, A. Sibiryakov, M. Nooijen, S. Hirata, D. Bernholdt, G. Baumgartner, J. Ramanujam, R. Pitzer, C. Lam and P. Sadayappan "Automated Operation Minimization for Tensor Contraction Expressions in Electronic Structure Calculations." in Proceedings of International Conference on Computational Science (ICCS). (5 2005).
  • S. K. Sahoo S. Krishnamoorthy, R. Panuganti and P. Sadayappan "Integrated Loop Optimizations for Data Locality Enhancement of Tensor Contraction Expressions." in Proc. Supercomputing (SC). (11 2005).
  • S. Sahoo, R. Panuganti, S. Krishnamoorthy and P. Sadayappan "Cache Miss Characterizationand Data Locality Optimization for Imperfectly Nested Loops on Shared Memory Multiprocessors." in Proceedings of International Parallel and Distributed Processing Symposium (IPDPS). (5 2005).
  • X. Gao, S. Sahoo, Q. Lu, G. Baumgartner, C.-C. Lam, J. Ramanujam and P. Sadayappan "Performance Modeling and Optimization of Parallel Out-of-Core Tensor Contractions." in Proceedings of ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPOPP). (6 2005).
  • G. Sabin, V. Sahasrabudhe, and P. Sadayappan "Assessment and Enhancement Of Meta-Schedulers for Multi-Site Job Sharing." in Proc. 15th IEEE Symp. High Perf. Distributed Computing (HPDC). (7 2005).

2004

  • S. Krishnan, S. Krishnamoorthy, G. Baumgartner, C.-C. Lam, P. Sadayappan, J. Ramanujamand V. Choppella, "Efficient Synthesis of Out-of-Core Algorithms Using a Nonlinear Optimization Solver." in Proceedings of International Parallel and Distributed Processing Symposium (IPDPS). (4 2004).

2003

  • G. Sabin, R. Kettimuthu, A. Rajan, and P. Sadayappan "Scheduling of Parallel Jobs in a Heterogeneous Multi-Site Environment." in Proceedings of the Ninth Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP). (7 2003).
  • D. Cociorva, X. Gao, S. Krishnan, G. Baumgartner, C. Lam, P. Sadayappan and J. Ramanujam "Global Communication Optimization for Tensor Contraction Expressions under Memory Constraints." in Proceedings of International Parallel and Distributed Processing Symposium (IPDPS). (4 2003).
  • S. Krishnan, S. Krishnamoorthy, G. Baumgartner, D. Cociorva, C.-C. Lam, P. Sadayappan, J. Ramanujam, D. E. Bernholdt and V. Choppella "Data Locality Optimization for Synthesis of Efficient Out-of-Core Algorithms." in Proc. Tenth Intl. Conf. on High Performance Computing (HiPC). (12 2003).
  • M. Islam, P. Balaji, P. Sadayappan and D.K. Panda "QoPS: A QoS based scheme for Parallel Job Scheduling." in Proceedings of the Ninth Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP). (7 2003).

2002

  • S. Srinivasan, V. Subramani, R. Kettimuthu, P. Holenarsipur and P. Sadayappan "Effective Selection of Partition Sizes for Moldable Scheduling of Parallel Jobs." in Proc. Ninth Intl. Conf. on High Performance Computing (HiPC). (12 2002).
  • D. Cociorva, G. Baumgartner, C. Lam, P. Sadayappan, J. Ramanujam, M. Nooijen, D. Bernholdt, R. Harrison and R. Pitzer "A High-Level Approach to Synthesis of High-Performance Codes for Quantum Chemistry." in Proceedings of Supercomputing (SC). (11 2002).
  • D. Cociorva, G. Baumgartner, C. Lam, P. Sadayappan, J. Ramanujam, M. Nooijen, D. Bernholdt and R. Harrison "Space-Time Trade-Off Optimization for a Class of Electronic Structure Calculations." in Proceedings of ACM SIGPLAN 2002 Conference on Programming Language Design and Implementation (PLDI). (6 2002).
  • V. Subramani, R. Kettimuthu, S. Srinivasan and P. Sadayappan "Distributed Job Scheduling on Computational Grids using Multiple Simultaneous Requests." in Proc. 11th IEEE Symp. High Perf. Distributed Computing (HPDC 2002). (7 2002).
  • S. Srinivasan, R. Kettimuthu, V. Subramani and P. Sadayappan "Selective Reservation Strategies for Backfill Job Scheduling." in Proc. of 8th Workshop on Job Scheduling Strategies for Parallel Processing (JSSPP). (7 2002).

2001

  • D. Cociorva, G. Baumgartner, D. Bernholdt, R. Harrison, M. Nooijen, J. Ramanujam, P. Sadayappan, and J. Wilkins "Towards Automatic Synthesis of High-Performance Codes for Electronic Structure Calculations: Data Locality Optimization." in Proceedings of Eighth Intl.Conf. on High Performance Computing (HiPC). (12 2001).
  • Amit Singhal, Mohammad Banikazemi, P. Sadayappan, and D. K. Panda "Efficient Multicast Algorithms for Heterogeneous Switch-based Irregular Networks of Workstations." in Proceedings of International Parallel and Distributed Processing Symposium (IPDPS). (4 2001).
  • D. Cociorva, C. Lam, G. Baumgartner, J. Ramanujam, P. Sadayappan, and J. Wilkins "Loop Optimization for a Class of Memory-Constrained Computations." in Proc. of ACM Intl. Conf. on Supercomputing (ICS). (6 2001).