Publications
"ZSim: Fast and Accurate Microarchitectural Simulation of Thousand-Core Systems",
Proceedings of the 40th annual International Symposium in Computer Architecture (ISCA-40), Tel-Aviv, Israel, 06/2013.
Download: paper (454.61 KB)
"The ZCache: Decoupling Ways and Associativity",
Annual IEEE/ACM International Symposium on Microarchitecture (MICRO'43), Atlanta, GE, 12/2010.
Download: paper (276.4 KB); slides (752.04 KB)
"Vector vs. Superscalar and VLIW Architectures for Embedded Multimedia Benchmarks",
Proceedings of the 35th Annual ACM/IEEE International Symposium on Microarchitecture (MICRO), Istanbul, Turkey, pp. 283–293, 11/2002.
"Vantage: Scalable and Efficient Fine-Grain Cache Partitioning",
International Symposium on Computer Architecture (ISCA), San Jose, CA, 06/2011.
Download: paper (753.6 KB); slides (1.74 MB)
"Understanding Sources of Inefficiency in General-purpose Chips",
Proceedings of the 37th Annual International Symposium on Computer Architecture, New York, NY, USA, ACM, pp. 37–47, 2010.
Download: paper (455.9 KB)
"Understanding Sources of Ineffciency in General-purpose Chips",
Commun. ACM, vol. 54, no. 10, New York, NY, USA, ACM, pp. 85–93, 2011.
Download: paper (2.83 MB)
"Transactional Memory Coherence and Consistency",
Proceedings of the 31st Annual International Symposium on Computer Architecture (ISCA), Munich, Germany, pp. 102–, 6/2004.
"Towards Energy-proportional Datacenter Memory with Mobile DRAM",
Proceedings of the 39th Annual International Symposium on Computer Architecture, Washington, DC, USA, IEEE Computer Society, pp. 37–48, 2012.
Download: paper (5.08 MB)
"Towards Energy Proportionality for Large-Scale Latency-Critical Workloads",
International Symposium on Computer Architecture, Minneapolis, Minnesota, 06/2014.
Download: Paper (897.16 KB); Slides (6.82 MB)
"Time and Cost-Efficient Modeling and Generation of Large-Scale TPC Workloads",
TPC Technology Conference on Performance Evaluation & Benchmarking (TPCTC), Seattle, WA, 08/2011.
Download: paper (931.08 KB)
"TETRIS: Scalable and Efficient Neural Network Acceleration with 3D Memory",
The 22nd ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Xi'an, China, 04/2017.
Download: paper (1.93 MB); slides (1.06 MB)
"Tarcil: Reconciling Scheduling Speed and Quality in Large Shared Clusters",
ACM Symposium on Cloud Computing (SOCC), Kohala Coast, HI, USA, 08/2015.
Download: paper (1.92 MB)
Tarcil: High Quality and Low Latency Scheduling in Large, Shared Clusters,
: Stanford University, 11/2014.
Download: paper (1.85 MB)
"The Stream Virtual Machine",
Proceedings of the 13th International Conference on Parallel Architectures and Compilation Techniques (PACT), pp. 267–277, 9/2004.
"Storage I/O Generation and Replay for Datacenter Applications",
IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Austin, TX, 04/2011.
Download: paper (260.56 KB)
"Server Engineering Insights for Large-Scale Online Services",
IEEE Micro, vol. 30, no. 4, Los Alamitos, CA, USA, IEEE Computer Society Press, pp. 8–19, 2010.
Download: paper (496.53 KB)
"Security Implications of Data Mining in Cloud Scheduling",
IEEE Computer Architecture Letters (CAL), 07/2015.
Download: paper (842.94 KB)
"Security Implications of Data Mining in Cloud Scheduling",
IEEE Computer Architecture Letters, vol. 15, issue 2, 07/2016.
Download: 2016.seccloud.cal_.pdf (574.4 KB)
"SCD: A Scalable Coherence Directory with Flexible Sharer Set Encoding",
Proceedings of the 18th international symposium on High Performance Computer Architecture (HPCA-18), New Orleans, LA, 02/2012.
Download: paper (424.5 KB); slides (418.71 KB)
"Scalable Vector Processors for Embedded Systems",
IEEE Micro, vol. 23, no. 6, pp. 36–45, 11/2003.
"Scalable and Efficient Fine-Grained Cache Partitioning with Vantage",
IEEE Micro's Top Picks from the Computer Architecture Conferences, vol. 32, no. 3, May-June, 2012.
Download: paper (1.05 MB)
"Resource Efficienct Computing for Warehouse-scale Datacenters",
Conference on Design Automation and Test in Europe (DATE), Grenoble, France, 03/2013.
Download: paper (332.62 KB)
"ReFlex: Remote Flash == Local Flash",
22nd ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS) , Xi'an, China, 04/2017.
Download: paper (961.78 KB)
"Reconciling High Server Utilization and Sub-millisecond Quality-of-Service",
Proceedings of the 2014 EuroSys Conference, Amsterdam, Netherlands, 04/2014.
"Quasar: Resource-Efficient and QoS-Aware Cluster Management",
19th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), Salt Lake City, UT, 03/2014.
Download: paper (1.73 MB); slides (3.66 MB)