Linked e-resources
Details
Table of Contents
[I]. Compilers for parallel computing: Using hardware counters to predict vectorization / Neftali Watkinson, Aniket Shivam, Zhi Chen, Alexander Veidenbaum, Alexandru Nicolau, and Zhangxiaowen Gong
Software cache coherent control by parallelizing compiler / Boma A. Adhi, Masayoshi Mase, Yuhei Hosokawa, Yohei Kishimoto, Taisuke Onishi, Hiroki Mikami, Keiji Kimura, and Hironori Kasahara
Polyhedral compilation support for C++ features : a case study with CPPTRAJ / Amit Roy, Daniel Roe, Mary Hall, and Thomas Cheatham
Language-agnostic optimization and parallelization for interpreted languages / Michelle Mills Strout, Saumya Debray, Kate Isaacs, Barbara Kreaseck, Julio Cárdenas-Rodríguez, Bonnie Hurwitz, Kat Volk, Sam Badger, Jesse Bartels, Ian Bertolacci, Sabin Devkota, Anthony Encinas, Ben Gaska, Brandon Neth, Theo Sackos, Jon Stephens, Sarah Willer, and Babak Yadegari.
[II]. Performance modeling and instrumentation: Memory distance measurement for concurrent programs / Hao Li, Jialiang Chang, Zijiang Yang, and Steve Carr
Efficient cache simulation for affine computations / Wenlei Bao, Prashant Singh Rawat, Martin Kong, Sriram Krishnamoorthy, Louis-Noel Pouchet, and P. Sadayappan
ADLER : adaptive sampling for precise monitoring / Arnamoy Bhattacharyya and Cristiana Amza
How low can you go? / Henry Dietz
Memory-access-pattern analysis techniques for OpenCL kernels / Gangwon Jo, Jaehoon Jung, Jiyoung Park, and Jaejin Lee.
[III]. Parallel programming and abstractions: Characterizing performance of imbalanced collectives on hybrid and task centric runtimes for two-phase reduction / Udayanga Wickramasinghe and Andrew Lumsdaine
Abstract representation of shared data for heterogeneous computing / Tushar Kumar, Aravind Natarajan, Wenjia Ruan, Mario Badr, Dario Suarez Gracia, and Calin Cascaval
Parallel roles for practical deterministic parallel programming / Michael Faes and Thomas R. Gross
Mozart : efficient composition of library functions for heterogeneous execution / Rajkishore Barik, Tatiana Shpeisman, Hongbo Rong, Chunling Hu, Victor W. Lee, Todd A. Anderson, Greg Henry, Hai Liu, Youfeng Wu, Paul Petersen, and Geoff Lowney
Lock-free transactional adjacency list / Zachary Painter, Christina Peterson, and Damian Dechev.
[IV]. GPU applications: Efficient inspected critical sections in data-parallel GPU codes / Thorsten Blaß, Michael Philippsen, and Ronald Veldema
Scalable top-K query processing using graphics processing unit / Yulin Zhang, Hui Fang, and Xiaoming Li
Fast dynamic graph algorithms / Gaurav Malhotra, Hitish Chappidi, and Rupesh Nasre.
[V]. Posters: Towards fine-grained dataflow parallelism in big data systems / Sebastian Ertel, Justus Adam, and Jeronimo Castrillon
JIT for intelligent manufacturing / Lei Zhang, L. N. C. Annadorai, Atin Angrish, Xipeng Shen, Binil Starly, Yuan-Shin Lee, and Paul Cohen
Static reuse time analysis using dependence distance / Dong Chen, Fangzhou Liu, Chen Ding, and Chucheow Lim
Analyzing auto-vectorization rates and quality in C/C++ compilers / Angela Pohl, Biagio Cosenza, and Ben Juurlink.
Software cache coherent control by parallelizing compiler / Boma A. Adhi, Masayoshi Mase, Yuhei Hosokawa, Yohei Kishimoto, Taisuke Onishi, Hiroki Mikami, Keiji Kimura, and Hironori Kasahara
Polyhedral compilation support for C++ features : a case study with CPPTRAJ / Amit Roy, Daniel Roe, Mary Hall, and Thomas Cheatham
Language-agnostic optimization and parallelization for interpreted languages / Michelle Mills Strout, Saumya Debray, Kate Isaacs, Barbara Kreaseck, Julio Cárdenas-Rodríguez, Bonnie Hurwitz, Kat Volk, Sam Badger, Jesse Bartels, Ian Bertolacci, Sabin Devkota, Anthony Encinas, Ben Gaska, Brandon Neth, Theo Sackos, Jon Stephens, Sarah Willer, and Babak Yadegari.
[II]. Performance modeling and instrumentation: Memory distance measurement for concurrent programs / Hao Li, Jialiang Chang, Zijiang Yang, and Steve Carr
Efficient cache simulation for affine computations / Wenlei Bao, Prashant Singh Rawat, Martin Kong, Sriram Krishnamoorthy, Louis-Noel Pouchet, and P. Sadayappan
ADLER : adaptive sampling for precise monitoring / Arnamoy Bhattacharyya and Cristiana Amza
How low can you go? / Henry Dietz
Memory-access-pattern analysis techniques for OpenCL kernels / Gangwon Jo, Jaehoon Jung, Jiyoung Park, and Jaejin Lee.
[III]. Parallel programming and abstractions: Characterizing performance of imbalanced collectives on hybrid and task centric runtimes for two-phase reduction / Udayanga Wickramasinghe and Andrew Lumsdaine
Abstract representation of shared data for heterogeneous computing / Tushar Kumar, Aravind Natarajan, Wenjia Ruan, Mario Badr, Dario Suarez Gracia, and Calin Cascaval
Parallel roles for practical deterministic parallel programming / Michael Faes and Thomas R. Gross
Mozart : efficient composition of library functions for heterogeneous execution / Rajkishore Barik, Tatiana Shpeisman, Hongbo Rong, Chunling Hu, Victor W. Lee, Todd A. Anderson, Greg Henry, Hai Liu, Youfeng Wu, Paul Petersen, and Geoff Lowney
Lock-free transactional adjacency list / Zachary Painter, Christina Peterson, and Damian Dechev.
[IV]. GPU applications: Efficient inspected critical sections in data-parallel GPU codes / Thorsten Blaß, Michael Philippsen, and Ronald Veldema
Scalable top-K query processing using graphics processing unit / Yulin Zhang, Hui Fang, and Xiaoming Li
Fast dynamic graph algorithms / Gaurav Malhotra, Hitish Chappidi, and Rupesh Nasre.
[V]. Posters: Towards fine-grained dataflow parallelism in big data systems / Sebastian Ertel, Justus Adam, and Jeronimo Castrillon
JIT for intelligent manufacturing / Lei Zhang, L. N. C. Annadorai, Atin Angrish, Xipeng Shen, Binil Starly, Yuan-Shin Lee, and Paul Cohen
Static reuse time analysis using dependence distance / Dong Chen, Fangzhou Liu, Chen Ding, and Chucheow Lim
Analyzing auto-vectorization rates and quality in C/C++ compilers / Angela Pohl, Biagio Cosenza, and Ben Juurlink.