Jiayuan Meng
Title
Cited by
Year
Rodinia: A benchmark suite for heterogeneous computing
S Che, M Boyer, J Meng, D Tarjan, JW Sheaffer, SH Lee, K Skadron
2009 IEEE international symposium on workload characterization (IISWC), 44-54, 2009
Cited by: 3823 | Year: 2009
A performance study of general-purpose applications on graphics processors using CUDA
S Che, M Boyer, J Meng, D Tarjan, JW Sheaffer, K Skadron
Journal of parallel and distributed computing 68 (10), 1370-1380, 2008
Cited by: 931 | Year: 2008
Dynamic warp subdivision for integrated branch and memory divergence tolerance
J Meng, D Tarjan, K Skadron
Proceedings of the 37th annual international symposium on Computer …, 2010
Cited by: 357 | Year: 2010
Performance modeling and automatic ghost zone optimization for iterative stencil loops on GPUs
J Meng, K Skadron
Proceedings of the 23rd international conference on Supercomputing, 256-265, 2009
Cited by: 202 | Year: 2009
GROPHECY: GPU performance projection from CPU code skeletons
J Meng, VA Morozov, K Kumaran, V Vishwanath, TD Uram
Proceedings of 2011 International Conference for High Performance Computing …, 2011
Cited by: 133 | Year: 2011
Best-effort parallel execution framework for recognition and mining applications
J Meng, S Chakradhar, A Raghunathan
2009 IEEE International Symposium on Parallel & Distributed Processing, 1-12, 2009
Cited by: 133 | Year: 2009
Improving GPU performance prediction with data transfer modeling
M Boyer, J Meng, K Kumaran
2013 IEEE International Symposium on Parallel & Distributed Processing …, 2013
Cited by: 79 | Year: 2013
Increasing memory miss tolerance for SIMD cores
D Tarjan, J Meng, K Skadron
Proceedings of the Conference on High Performance Computing Networking …, 2009
Cited by: 77 | Year: 2009
Avoiding cache thrashing due to private data placement in last-level cache for manycore scaling
J Meng, K Skadron
2009 IEEE international conference on computer design, 282-288, 2009
Cited by: 72 | Year: 2009
A performance study for iterative stencil loops on GPUs with ghost zone optimizations
J Meng, K Skadron
International Journal of Parallel Programming 39, 115-142, 2011
Cited by: 67 | Year: 2011
Exploiting the forgiving nature of applications for scalable parallel execution
J Meng, A Raghunathan, S Chakradhar, S Byna
2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010
Cited by: 57 | Year: 2010
Workflow performance improvement using model-based scheduling over multiple clusters and clouds
K Maheshwari, ES Jung, J Meng, V Morozov, V Vishwanath, R Kettimuthu
Future Generation Computer Systems 54, 206-218, 2016
Cited by: 40 | Year: 2016
Exploiting inter-thread temporal locality for chip multithreading
J Meng, JW Sheaffer, K Skadron
2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010
Cited by: 37 | Year: 2010
Best-effort semantic document search on GPUs
S Byna, J Meng, A Raghunathan, S Chakradhar, S Cadambi
Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics …, 2010
Cited by: 37 | Year: 2010
Systems and methods for implementing best-effort parallel computing frameworks
S Chakradhar, A Raghunathan, J Meng
US Patent 8,286,172, 2012
Cited by: 35 | Year: 2012
Dataflow-driven GPU performance projection for multi-kernel transformations
J Meng, VA Morozov, V Vishwanath, K Kumaran
SC'12: Proceedings of the International Conference on High Performance …, 2012
Cited by: 31 | Year: 2012
Skope: A framework for modeling and exploring workload behavior
J Meng, X Wu, V Morozov, V Vishwanath, K Kumaran, V Taylor
Proceedings of the 11th ACM Conference on Computing Frontiers, 1-10, 2014
Cited by: 29 | Year: 2014
Dynamic warp subdivision for integrated branch and memory latency divergence tolerance
K Skadron, J Meng, D Tarjan
US Patent App. 13/040,045, 2011
Cited by: 26 | Year: 2011
Robust SIMD: Dynamically adapted SIMD width and multi-threading depth
J Meng, JW Sheaffer, K Skadron
2012 IEEE 26th international parallel and distributed processing symposium …, 2012
Cited by: 24 | Year: 2012
A multiple SIMD, multiple data (MSMD) architecture: Parallel execution of dynamic and static SIMD fragments
Y Wang, S Chen, J Wan, J Meng, K Zhang, W Liu, X Ning
2013 IEEE 19th International Symposium on High Performance Computer …, 2013
Cited by: 21 | Year: 2013