Skip to Main Content U.S. Department of Energy
Fundamental and Computational Sciences Directorate

Staff information

Leon Song

High Performance Computing
Pacific Northwest National Laboratory
PO Box 999
MSIN: J4-30
Richland, WA 99352


Dr. Shuaiwen Leon Song is a research staff scientist for PAL lab at Pacific Northwest National Lab. He graduated with a Master's and Ph.D. degree in Computer Science And Applications from Virginia Tech in May 2013. He was a member of SCAPE lab directed by Dr. Kirk W. Cameron at Virginia Tech. The central theme of his research is to improve power and performance efficiency in high performance computing (HPC) systems and applications. In the past, he worked very closely with government and industry research labs including PNNL, LLNL, and NEC research America. He is a recipient of 2011 Paul E. Torgersen excellent research award and 2011 LLNL ISCR scholar. His research page is located at:

Research Interests

  • Performance and Energy modeling/analysis for HPC systems, highly efficient Parallel system and application design
  • Performance/Power optimization on Multi-core and Many-core architectures (e.g. emergent many-core accelerators)
  • Power-aware computing and energy-efficient design for large scale distributed systems
  • Runtime System

Education and Credentials

  • Ph.D. in Computer Science and Application, Virginia Tech, May 2013
  • Master's in Computer Science and Application, Virginia Tech, May 2009

Affiliations and Professional Service

  • IEEE professional
  • ACM professional
  • Upsilon Pi Epsilon

Awards and Recognitions

  • IEEE/ACM SC'12 travel grant
  • ACM PACT'12 ACM SRC research competition travel grant by Microsoft Research
  • IEEE/ACM SC'11 selected doctoral research showcase
  • 2011 Paul E. Torgersen excellent Ph.D. research award
  • NSF/TCPP travel award for IPDPS 2011, Alaska
  • 2011 ISCR scholar, Lawrence Livermore National Lab
  • Outstanding model award, National Mathematic Modeling Contest, 2005

PNNL Publications


  • Li C, S Song, H Dai, A Sidelnik, S Hari, and H Zhou. 2015. "Locality-Driven Dynamic GPU Cache Bypassing." In Proceedings of the 29th ACM on International Conference on Supercomputing (ICS 2015), June 8-11, 2015, Newport Beach, California, pp. 66-77.  ACM , New York, NY.  doi:10.1145/2751205.2751237
  • Tan L, S Song, P Wu, Z Chen, R Ge, and DJ Kerbyson. 2015. "Investigating the Interplay between Energy Efficiency and Resilience in High Performance Computing." In IEEE International Parallel and Distributed Processing Symposium (IPDPS 2015), May 25-29, 2015, Hyderabad, India, pp. 786-796.  IEEE Computer Society, Los Alamitos.  doi:10.1109/IPDPS.2015.108
  • You Y, H Fu, S Song, A Randles, DJ Kerbyson, A Marquez, G Yang, and A Hoisie. 2015. "Scaling Support Vector Machines On Modern HPC Platforms." Journal of Parallel and Distributed Computing 76:16-31.  doi:10.1016/j.jpdc.2014.09.005


  • Li B, HC Chang, S Song, CY Su, T Meyer, J Mooring, and K Cameron. 2014. "Extending PowerPack for Profiling and Analysis of High Performance Accelerator-Based Systems." Parallel Processing Letters 24(4):Article No. 144200.  doi:10.1142/S0129626414420018
  • Li B, HC Chang, S Song, CY Su, T Meyer, J Mooring, and K Cameron. 2014. "The Power-Performance Tradeoffs of the Intel Xeon Phi on HPC Applications." In IEEE International Parallel & Distributed Processing Symposium Workshops (IPDPSW 2014), May 19-23, 2014, Phoenix, Arizona, pp. 1448-1456.  IEEE, Piscataway, NJ.  doi:10.1109/IPDPSW.2014.162
  • Marquez A, JB Manzano Franco, S Song, B Meister, S Shrestha, T St. John, and GR Gao. 2014. "ACDT: Architected Composite Data Types Trading-in Unfettered Data Access for Improved Execution." In The 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS 2014), December 16-19, 2015, Hsinchu, Taiwan, pp. 289-297.  IEEE, Piscataway, NJ.  doi:10.1109/PADSW.2014.7097820
  • You Y, S Song, and DJ Kerbyson. 2014. "An Adaptive Cross-Architecture Combination Method for Graph Traversal." In Proceedings of the 28th ACM international conference on Supercomputing (ICS'14), June 10-13, 2014, Munich, Germany, pp. 169-169.  Association for Computing Machinery , New York, NY.  doi:10.1145/2597652.2600110
  • You Y, H Fu, S Song, M Mehri Dehanavi, L Gan, X Huang, and G Yang. 2014. "Evaluating Multi-core Architectures through Accelerating the Three-Dimensional Lax-Wendroff Correction." International Journal of High Performance Computing Applications 28(3):301-318.  doi:10.1177/1094342014524807
  • You Y, S Song, H Fu, A Marquez, M Mehri Dehanavi, KJ Barker, K Cameron, A Randles, and G Yang. 2014. "MIC-SVM: Designing A Highly Efficient Support Vector Machine For Advanced Modern Multi-Core and Many-Core Architectures." In IEEE 28th International Parallel and Distributed Processing Symposium (IPDPS 2014), May 19-23, 2014, Phoenix, Arizona, pp. 809-818.  IEEE Computer Society, Los Alamitos, CA.  doi:10.1109/IPDPS.2014.88


  • Vishnu A, S Song, A Marquez, KJ Barker, DJ Kerbyson, K Cameron, and P Balaji. 2013. "Designing Energy Efficient Communication Runtime Systems: A View from PGAS Models." Journal of Supercomputing 63(3):691-709 .  doi:10.1007/s11227-011-0699-9
  • Li B, S Song, I Bezakova, and K Cameron. 2013. "EDR: An Energy-Aware Runtime Load Distribution System for Data-Intensive Applications in the Cloud." In IEEE International Conference on Cluster Computing (CLUSTER 2013), September 23-27, 2013, Indianapolis, IN, pp. 1-8.  Institute of Electrical and Electronics Engineers , Piscataway, NJ.  doi:10.1109/CLUSTER.2013.6702674
  • Song S, KJ Barker, and DJ Kerbyson. 2013. "Unified Performance and Power Modeling of Scientific Workloads." In E2SC '13 Proceedings of the 1st International Workshop on Energy Efficient Supercomputing, November 17-21, 2013, Denver, Colorado, p. Article No. 4.  Association for Computing Machinery, New York, NY.  doi:10.1145/2536430.2536435
  • Song S, NR Tallent, and A Vishnu. 2013. "Exploring Machine Learning Techniques For Dynamic Modeling on Future Exascale Systems." In Modeling & Simulation of Exascale Systems & Applications: Workshop on Modeling & Simulation of Exascale Systems & Applications, September 18-19, 2013, Seattle, Washington.  US Department of Energy, Office of Advanced Scientific Computing Research, Washington DC. 


  • Song S, C Si Yu, R Ge, A Vishnu, and K Cameron. 2011. "Iso-Energy-Efficiency: An Approach to Power Constrained Parallel Computation." In IEEE International Parallel & Distributed Processing Symposium (IPDPS 2011), May 16-20, 2011, Anchorage, Alaska, pp. 128-139.  IEEE, Piscataway, NJ.  doi:10.1109/IPDPS.2011.22


  • Vishnu A, HJJ van Dam, WA De Jong, P Balaji, and S Song. 2010. "Fault Tolerant Communication Runtime Support for Data-Centric Programming Models." In International Conference on High Performance Computing (HiPC 2010), December 19-22, 2010, Goa, India.  International Electrical and Electronics Engineers, Piscataway, NJ.  doi:10.1109/HIPC.2010.5713195
  • Vishnu A, S Song, A Marquez, KJ Barker, DJ Kerbyson, K Cameron, and P Balaji. 2010. "Designing Energy Efficient Communication Runtime Systems for Data Centric Programming Models." In IEEE/ACM Internationall Conference on Green Computing and Communications (GreenCom 2010) and the International Conference on Cyber, Physical and Social Computing (CPSCom 2010), December 18-20, 2010, Hangzhou, China, ed. P Zhu, et al, pp. 229-236.  Institute of Electrical and Electronics Engineers, Inc., Piscatawy, NJ.  doi:10.1109/GreenCom-CPSCom.2010.133

Science at PNNL

Research Areas


User Facilities

Research Highlights

View All Research Highlights & Staff Accomplishments

RSS Feed