Home
About Our Project
Participants
Publications
Reports
PModel related projects

Publications

Argonne National Laboratory

William Gropp and Rajeev Thakur, “Thread Safety in an MPI Implementation: Requirements and Analysis,” Parallel Computing, (33)9:595-604, September 2007.

Rajeev Thakur and William Gropp, “Test Suite for Evaluating Performance of MPI Implementations That Support MPI_THREAD_MULTIPLE,” in  Proc. Of the 14th European PVM/MPI Users’ Group Meeting (Euro PVM/MPI 2007), September 2007. (selected as outstanding paper)

Jesper Larsson Traff, William Gropp, and Rajeev Thakur, “Self-Consistent MPI Performance Requirements,” in  Proc. Of the 14th European PVM/MPI Users’ Group Meeting (Euro PVM/MPI 2007), September 2007. (selected as outstanding paper)

Jesper Larsson Traff, William Gropp, and Rajeev Thakur, “Self-Consistent MPI Performance Requirements,” in preparation for submission to ACM Transactions on Programming Languages and Systems, 2008.

Ewing Lusk and Katherine Yelick, “Languages for High-Productivity Computing: The DARPA HPCS Language Project,” Parallel Processing Letters, Vol. 17, No. 1 (2007) 89-102.

G. Bosilca, D. Buntinas, R. Graham, G. Vallee, G. Watson, "Scalable Tools Communication Infrastructure." Submitted to the 6th annual Symposium on OSCAR and HPC Cluster Systems (OSCAR '08), Feb. 2008.

The Ohio State University

M. Koop, T. Jones, and D.K. Panda.  MVAPICH-Aptus:  Scalable High-Performance Multi-Transport MPI over InfiniBand.  In Int’l Parallel and Distributed Processing Symposium (IPDPS), 2008.

M. Koop, S. Sur, Q. Gao, and D.K. Panda.  High Performance MPI Design using Unreliable Datagram for Ultra-Scale InfiniBand Clusters. In International Conference on Supercomputing (ICS07), 2007.

M. Koop, S. Sur, and D.K. Panda.  Zero-Copy Protocol for MPI using InfiniBand Unreliable Datagram.  In IEEE International Conference on Cluster Computing (Cluster’07), 2007.

Matthew J. Koop, Terry Jones, and Dhabaleswar K. Panda.  Reducing Connection Memory Requirements of MPI for InfiniBand Clusters:  A message Coalescing Approach.  In International Symposium on Cluser Computing and the Grid, pages 495-504, 2007.

Sriram Krishnamoorthy, Juan Piernas Canovas, Vinod Tipparaju and Jarek Nieplocha, and P. Sadayappan.  Non-collective parallel i/o for global address space programming models.  In Proceedings of Cluster 2007, Sept. 2007.

Amith R. mamidala, Debraj De. Abhinav Vishnu, Sundeep Narravula, and Dhabaleswar K. Panda.  Scalable Collective Communication for Next-Generation Multicore Clusters with InfiniBand.  In technical Report No. OSU-CISRC-6/07-TR49, 2007.

Amith R. Mamidala, Sundeep Narravula, Abhinav Vishnu, Gopalakrishnan Santhanaraman, and Dhabaleswar K. Panda.  On Using Connection-Oriented vs. Connection-Less Transport for Performance and Scalability of Collective and One-Sided Operations:  Trade-offs and Impact.  In Symposium on Principles and Practices of Parallel programming, pages 46-54, 2007.

Sundeep Narravula, Amith Mamidala, Abhinav Vishnu, Gopal Santhanaraman, and Dhabaleswar K. Panda.  High Performance MPI over iWARP:  Early Experiences.  In International Conference on Parallel Processing, 2007.

Nework-Based Computing Laboratory.  MVAPICH/MVAPICH2:  MPI-1/MPI-2 for InfiniBand and iWARP.  http://mvapich.cse.ohio-state.edu.

S. Sur. M. J. Koop, L. Chai, and D.K. Panda.  Performance Analysis and Evluation of Mellanox ConnectX InfiniBand Architecture with Multi-Core Platforms.  In International Workshop on High-Level Parallel programming Models and Supportive Environments, IPDPS, 2007.

Abhinav Vishnu, Matthew J. Koop, Adam Moody, Amith R. Mamidala, Sundeep Narravula, and Dhabaleswar K. Panda.  Hot-Spot Avoidance With Multi-Pathing Over InfiniBand:  An MPI Perspective.  In International Symposium on Cluster Computing and the Grid. Pages 479-486, 2007.

Pacific Northwest National Laboratory

Manojkumar Krishnan, S. Bohn, W. Cowley, Vernon L. Crow and Jarek Nieplocha}, “Scalable Visual Analytics of Massive Textual Datasets”, in IPDPS, 2007,  pp. 1-10.

Vinod Tipparaju, Andriy Kot, Jarek Nieplocha, Monika ten Bruggencate, and Nikos Chrisochoides, “Evaluation of Remote Memory Access Communication on the Cray XT3”, IPDPS, 2007,  pp. 1-7.

Aniruddha G. Shet, P. Sadayappan, David E. Bernholdt, Jarek Nieplocha and Vinod Tipparaju, “A Performance Instrumentation Framework to Characterize Computation-Communication Overlap in Message-Passing Systems”, 2006.

Sriram Krishnamoorthy, Juan Piernas Canovas, Vinod Tipparaju, Jarek Nieplocha
 and P. Sadayappan, “Non-Collective Parallel I/O for Global Address Space Programming Models”, in CLUSTER, 2007.

Michael Blocksome, Charles Archer, Todd Inglett, Patrick McCarthy, Michael Mundy, Joe Ratterman, A. Sidelnik, Brian Smith, George Almãsi, José G. Castaños, Derek Lieber, José E. Moreira, Sriram Krishnamoorthy, Vinod Tipparaju, Jarek Nieplocha, “Blue Gene system software - Design and implementation of a one-sided communication interface for the IBM eServer Blue Gene”, in SC, 2006, pge 120.

Chris Oehmen and Jarek Nieplocha, “ScalaBLAST: A Scalable Implementation of BLAST for High-Performance Data-Intensive Bioinformatics Analysis”, in IEEE Trans. Parallel Distrib. Syst., 17  (8) pp  740-749, 2006.

Jarek Nieplocha, Bruce Palmer, Vinod Tipparaju, Manojkumar Krishnan, Harold Trease and Edoardo Aprà, “Advances, Applications and Performance of the Global Arrays Shared Memory Programming Toolkit”, in Int. J. High Perform. Comput. Appl.,
20, (2), pp 1094-3420, 2006.

Rice University

C. Coarfa. Portable High Performance and Scalability of Global Address Space Languages. Ph.D. thesis, Rice University, Department of Computer Science, January 2007.

C. Coarfa, J. Mellor-Crummey, N. Froyd, and Y. Dotsenko. Scalability analysis of SPMD codes using expectations. In Proceedings of the International Conference on Supercomputing, Seattle,WA, June 2007.

Y. Dotsenko. Expressiveness, Programmability and Portable High Performance of Global Address Space Languages. Ph.D. thesis, Rice University, Department of Computer Science, January 2007.

University of California at Berkeley

Alfredo Buttari, Jack Dongarra, Parry Husbands, Jakub Kurzak and Katherine Yelick, “Multithreading for synchronization tolerance in matrix factorization,” The proceedings of the SciDAC 2007 Conference, Boston, Massachusetts, July 24-28, 2007.  Published in the Journal of Physics: Conference Series. Volume 78, 2007, June, 2007.

Jimmy Su and Katherine Yelick, “Automatic Performance Debugging in Partitioned Global Address Space Programs” 20th International Workshop on Languages and Compilers for Parallel Computing (LCPC), Urbana, Illinois, October 2007. To appear in Springer Lecture Notes in Computer Science.

Parry Husbands and Katherine Yelick, “Multithreading and One-Sided Communication in Parallel LU Factorization.” Proceedings of Supercomputing (SC07), Reno, NV, November, 2007.

Tong Wen, Jimmy Su, Phillip Colella, Katherine Yelick and Noel Keen, “An Adaptive Mesh Refinement Benchmark for Modern Parallel Programming Languages.” Proceedings of Supercomputing (SC07), Reno, NV, November 2007.

Amir Kamil and Katherine Yelick, “Hierarchical Pointer Analysis for Distributed Programs,” Static Analysis Symposium (SAS), Kongens Lyngby, Denmark, August 22-24, 2007.

Katherine Yelick, Paul Hilfinger, Susan Graham, Dan Bonachea, Jimmy Su, Amir Kamil, Kaushik Datta, Phillip Colella, and Tong Wen, “Parallel Languages and Compilers: Perspective from the Titanium Experience.” Journal of High Performance Computing Applications, August 2007, vol. 21, pp. 266-290.

K. Yelick, D. Bonachea, W.-Y. Chen, P. Colella, K. Datta, J. Duell, S. Graham, P. Hargrove, P. Hilfinger, P. Husbands, C. Iancu, A. Kamil, R. Nishtala, J. Su, M. Welcome, T. Wen, “Productivity and Performance Using Partitioned Global Address Space Languages,” Proceedings of Parallel Symbolic Computation (PASCO), London, Ontario, July 27-28, 2007.

Ewing Lusk and Katherine Yelick, “Languages for High-Productivity Computing: The DARPA HPCS Language Project,” Parallel Processing Letters, Vol. 17, No. 1 (2007) 89-102.

Wei Chen, Dan Bonachea, Costin Iancu, and Katherine Yelick, “Automatic Nonblocking Communication for Partitioned Global Address Space Programs,” Proceedings of the International Conference on Supercomputing (ICS), Seattle, Washington, June 16-17, 2007.

Shivali Agarwal, Rajkishore Barik, Dan Bonachea, Vivek Sarkar, Rudrapatna Shyamasundar, Katherine Yelick, “Deadlock-Free Scheduling of X10 Computations with Bounded Resources,” Symposium on Parallel Algorithms and Architecture (SPAA), San Diego California, June 9-11, 2007.

University of Houston

Laksono Adhianto. A new Framework for Analyzing, Modeling and Optimizing MPI and/or OpenMP Applications. PhD thesis, Department of Computer Science, University of Houston, 2007.

Barbara Chapman and Lei Huang.  Enhancing openmp and its implementation for programming multicore systems.  In Parallel Computing 2007 (ParCo 2007), September 2007.

Barbara Chapman, Gabriele Jost, and Ruud va der Pas.  Using OpenMP.  The MIT Press, Cambridge, Massachusetts and London, England, 2007.

Barbara M. Chapman, Lei Huang, Haoqiang Jin, Gabriele Jost, and Bronis R. de Supinski.  Extending OpenMP worksharing directives for multi-threading.  In Europar 2006, pages 645-654, 2006.

Deepak Eachempati, Lei Huang, and Barbara m. Chapman.  Strategies and implementation for translating openmp code for clusters.  In HPCC, pages 420-431, 2007.

Lei Huang, Barbara Chapman, and Chunhua Liao.  An implementation and evaluation of thread subteam for OpenMP extensions.  In Workshop on Programming Models for Ubiquitous Parallelism  (PMUP 06), Seattle, WA, September 2006.

Lei Huang, Girija Sethuraman, and Barbara Chapman.  Parallel data flow analysis for openmp programs.  In Proceedings of IWOMP 2007, June, 2007.

Haoqiang Jin, Barbara Chapman, and Lei Huang.  Performance evaluation of a multi-zzone application in different openmp approaches.  In Proceedings of IWOMP 2007, June, 2007.

Chunhua Liao.  A Compile-time OpenMP Cost Model.  Phdthesis, Department of Computer Science, University of Houston, 2007.

Chunhua Liao and Barbara Chapman.  Invited paper:  A compile-time cost model for openmp.  In 12th International Workshop on high-Level Parallel Programming Models and Supportive Environmnets (HIPS), March 2007.

Chunhua Liao, Oscar Hernandez, Barbara Chapman, Wenguang Chen, and Weimin Zheng.  Openuh:  an optimizing, portable openmp compiler:  Research articles.  Concurr. Comput.: Pract. Exper., 19(18):2317-2332, 2007.

The OpenUH compiler project.  http://www.cs.uh.edu/~openuh, 2005.

Girija Sethuraman.  Parallel data flow analysis for openmp programs.  Master’s thesis, Department of computer Science, University of Houston, Spring, 2007.