Jiayuan Meng

Cited by

	All	Since 2019
Citations	6211	2105
h-index	20	12
i10-index	26	17

580

290

145

435

2008200920102011201220132014201520162017201820192020202120222023202426 87 193 249 345 419 510 565 527 513 513 487 430 419 307 314 148

Public access

View all

2 articles

1 article

available

not available

Based on funding mandates

Jiayuan Meng

Argonne National Laboratory

Verified email at alcf.anl.gov - Homepage

High Performance Computing Computer Architecture Modeling Simulation Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Rodinia: A benchmark suite for heterogeneous computing S Che, M Boyer, J Meng, D Tarjan, JW Sheaffer, SH Lee, K Skadron 2009 IEEE international symposium on workload characterization (IISWC), 44-54, 2009	3698	2009
A performance study of general-purpose applications on graphics processors using CUDA S Che, M Boyer, J Meng, D Tarjan, JW Sheaffer, K Skadron Journal of parallel and distributed computing 68 (10), 1370-1380, 2008	921	2008
Dynamic warp subdivision for integrated branch and memory divergence tolerance J Meng, D Tarjan, K Skadron Proceedings of the 37th annual international symposium on Computer …, 2010	350	2010
Performance modeling and automatic ghost zone optimization for iterative stencil loops on GPUs J Meng, K Skadron Proceedings of the 23rd international conference on Supercomputing, 256-265, 2009	199	2009
GROPHECY: GPU performance projection from CPU code skeletons J Meng, VA Morozov, K Kumaran, V Vishwanath, TD Uram Proceedings of 2011 International Conference for High Performance Computing …, 2011	133	2011
Best-effort parallel execution framework for recognition and mining applications J Meng, S Chakradhar, A Raghunathan 2009 IEEE International Symposium on Parallel & Distributed Processing, 1-12, 2009	131	2009
Improving GPU performance prediction with data transfer modeling M Boyer, J Meng, K Kumaran 2013 IEEE International Symposium on Parallel & Distributed Processing …, 2013	79	2013
Increasing memory miss tolerance for SIMD cores D Tarjan, J Meng, K Skadron Proceedings of the Conference on High Performance Computing Networking …, 2009	77	2009
Avoiding cache thrashing due to private data placement in last-level cache for manycore scaling J Meng, K Skadron 2009 IEEE international conference on computer design, 282-288, 2009	72	2009
A performance study for iterative stencil loops on GPUs with ghost zone optimizations J Meng, K Skadron International Journal of Parallel Programming 39, 115-142, 2011	65	2011
Exploiting the forgiving nature of applications for scalable parallel execution J Mengte, A Raghunathan, S Chakradhar, S Byna 2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010	57	2010
Workflow performance improvement using model-based scheduling over multiple clusters and clouds K Maheshwari, ES Jung, J Meng, V Morozov, V Vishwanath, R Kettimuthu Future Generation Computer Systems 54, 206-218, 2016	40	2016
Exploiting inter-thread temporal locality for chip multithreading J Meng, JW Sheaffer, K Skadron 2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010	37	2010
Best-effort semantic document search on GPUs S Byna, J Meng, A Raghunathan, S Chakradhar, S Cadambi Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics …, 2010	37	2010
Systems and methods for implementing best-effort parallel computing frameworks S Chakradhar, A Raghunathan, J Meng US Patent 8,286,172, 2012	35	2012
Dataflow-driven GPU performance projection for multi-kernel transformations J Meng, VA Morozov, V Vishwanath, K Kumaran SC'12: Proceedings of the International Conference on High Performance …, 2012	30	2012
Skope: A framework for modeling and exploring workload behavior J Meng, X Wu, V Morozov, V Vishwanath, K Kumaran, V Taylor Proceedings of the 11th ACM Conference on Computing Frontiers, 1-10, 2014	29	2014
Dynamic warp subdivision for integrated branch and memory latency divergence tolerance K Skadron, J Meng, D Tarjan US Patent App. 13/040,045, 2011	26	2011
Robust SIMD: Dynamically adapted SIMD width and multi-threading depth J Meng, JW Sheaffer, K Skadron 2012 IEEE 26th international parallel and distributed processing symposium …, 2012	24	2012
A multiple SIMD, multiple data (MSMD) architecture: Parallel execution of dynamic and static SIMD fragments Y Wang, S Chen, J Wan, J Meng, K Zhang, W Liu, X Ning 2013 IEEE 19th International Symposium on High Performance Computer …, 2013	20	2013

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by