ARDAVAN PEDRAM

email: Perdavan at gmail dot com

 

CV

 

 

 

 

 

Research Associate

Department of Electrical Engineering

Stanford University

 

I am a member of the Pervasive Prallelism Laboratory (PPL) working with Professor Kunle Olukotun.

 

My work on algorithm/architecture codesign of specialized accelerators originated two National Science Foundation Awards in [2012 and 2016] and is a core part of a third one [2014].

I specifically work on hardware-software co-design (algorithm for architecture) of special purposed accelerators for high performance linear algebra, machine-learning, and signal processing applications.

Here you find the poster of my research that I presented in the Stanford's Science Teaching through ARt (STAR) exhibition for high school students.

 

I received my PhD in Computer Engineering from the department of Electrical and Computer Engineering at The University of Texas at Austin in 2013.
My PhD supervisors are professors Robert van de Geijn and Andreas Gerstlauer.

 


 

PRISM Project:

Download the presentation

Award: National Science Foundation (NSF) Grant

 


 

Dissertation:

 

“Algorithm/Architecture Codesign of Low Power and High Performance Linear Algebra Compute Fabrics”

 

Download the PDF

Poster: TCPP Best Poster Award in IPDPS 2013 conference PhD Forum

Award: National Science Foundation (NSF) Grant

 


 

Software:

 

The Linear Algebra Processor (LAP) simulator is available under free BSD license.

The cycle accurate simulator engine is functional and performs the actual computations on the simulated hardware. Therefore, debugging under this environment is easy.

 


 

Dark Memory:

 

Looking at the power dissipation in modern architectures the memory system contributes well over 50% of the total system power. So, given Amdahl’s Law, changing the compute engine without improving the memory can only have a modest (less than 2x) change in energy efficiency.

Large gains in efficiency are only possible if the DRAM and memory hierarchy are mostly idle. We refer to this desirable state as Dark Memory, and it only occurs for applications with an extreme form of locality.

Watch The Presentation

 


 

Refereed Publications:

 

1-      David Koeplinger, Matt Feldman, Raghu Prabhakar, Yaqi Zhang, Stefan Hadjis, Ruben Fiszel, Tian Zhao, Luigi Nardi, Ardavan Pedram, Christos Kozyrakis, and Kunle Olukotun:

Spatial: A Language and Compiler for Application Accelerators,

The 39th ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI 2018).

 

2-      Yuanfang Li and Ardavan Pedram:

CATERPILLAR: Coarse Grain Reconfigurable Architecture for Accelerating the Training of Deep Neural Networks,

The 28th IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP2017).

Best Paper Award

 

3-      Raghu Prabhakar, Yaqi Zhang, David Koeplinger, Matt Feldman, Tian Zhao, Stefan Hadjis, Ardavan Pedram, Christos Kozyrakis, and Kunle Olukotun:

Plasticine: A Reconfigurable Architecture For Parallel Patterns,

The 44th International Symposium on Computer Architecture (ISCA 2017).

IEEE Micro Top Picks 2018

 

4-      Ardavan Pedram, Stephen Richardson, Sameh Galal, Shahar Kvatinsky, and Mark A. Horowitz:

Dark Memory and Accelerator-Rich System Optimization in the Dark Silicon Era,

IEEE Design and Test Magazine Special Issue on Dark Silicon, April 2017.

 

5-      Artem Vasilyev, Nikhil Bhagdikar, Ardavan Pedram, Stephen E Richardson, Shahar Kvatinsky, and Mark Horowitz:

Evaluating Programmable Architectures for ISP and Computer Vision!,

The 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016. (Micro 49)

 

6-      Heonjae Ha, Ardavan Pedram, Stephen Richardson, Shahar Kvatinsky, and Mark Horowitz:

Improving Energy Efficiency of DRAM by Exploiting Half Page Row Access,

The 49th Annual IEEE/ACM International Symposium on Microarchitecture, 2016. (Micro 49)

 

7-      Mochamad Asri, Ardavan Pedram, Lizy K. John, and Andreas Gerstlauer:

Simulator Calibration for Accelerator-Rich Architecture Studies,

International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation (SAMOS 2016).

 

8-      Song Han, Xingyu Liu, Huizi Mao, Jing Pu, Ardavan Pedram, Mark A. Horowitz, and William J. Dally:

EIE: Efficient Inference Engine on Compressed Deep Neural Network,

The 43rd International Symposium on Computer Architecture (ISCA 2016).

 

9-      Ardavan Pedram, John McCalpin, and Andreas Gerstlauer:

A Highly Efficient Multicore Floating-Point FFT Architecture Based on Hybrid Linear Algebra/FFT Cores,

The Journal of Signal Processing Systems, Springer, 2014.

 

10-      Ardavan Pedram, Andreas Gerstlauer, and Robert van de Geijn:

"Algorithm, Architecture, and Floating-Point Unit Codesign of a Matrix Factorization Accelerator,"

IEEE Transactions on Computers (TC) Special Section on Computer Arithmetic, August 2014.

 

11-      Ardavan Pedram, John McCalpin, and Andreas Gerstlauer:

Transforming a Linear Algebra Core to an FFT Accelerator,”

The 24th IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP2013).

 

12-      Ardavan Pedram, Andreas Gerstlauer, and Robert van de Geijn:

"Floating Point Architecture Extensions for Optimized Matrix Factorization,"

The 21st IEEE International Symposium on Computer Arithmetic (ARITH21).

 

13-      Ardavan Pedram, Robert van de Geijn, and Andreas Gerstlauer:

"Codesign Tradeoffs for High-Performance, Low-Power Linear Algebra Architectures,"

IEEE Transactions on Computers (TC) Special Issue on Energy Efficient Computing, Volume 61, Issue 12, Page(s) 1724 – 1736, December 2012. 

 

14-      Ardavan Pedram, Andreas Gerstlauer, and Robert van de Geijn:

"On the Efficiency of Register File versus Broadcast Interconnect for Collective Communications in Data-Parallel Hardware Accelerators,"

The 24th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2012). (Acceptance rate 28%)

 

15-   Ardavan Pedram, Syed Gilani, Nam Sung Kim, Robert van de Geijn, Mike Schulte,and Andreas Gerstlauer:

"A Linear Algebra Core Design For Efficient Level-3 BLAS,"

The 23rd IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP2012).

 

16-   Ardavan Pedram, Andreas Gerstlauer, and Robert van de Geijn:

"A High-performance, Low-power Linear Algebra Core,"

The 22nd IEEE International Conference on Application-specific Systems, Architectures and Processors (ASAP2011). (Acceptance rate 25%)

 

17-   Ardavan Pedram, David Craven, and Andreas Gerstlauer:

"Modeling Cache Effects at the Transaction Level,"

International Embedded Systems Symposium (IESS2009).

Best Paper Runner Up

 

18-   Ardavan Pedram, Mohammad Reza Jamali, Caro Lucas, and Syed Mehdi Fakhraie:

"Local Linear Model Tree (LOLIMOT) Reconfigurable Parallel Hardware,"

Transactions on Engineering, Computing and Technology, Volume 13, Page(s) 96-101, May 2006.

 


 

 

 

Photography