Parallel and Customized Computer Architecture Course
“Parallel and Customized Computer Architecture” course (ECE-GY 9413) at NYU Tandon by Brandon Reagen
“Parallel and Customized Computer Architecture” course (ECE-GY 9413) at NYU Tandon by Brandon Reagen
This paper introduces HAAC, a novel garbled circuits accelerator and compiler, to make privacy-preserving computation more feasible. HAAC is a hardware-softw...
In two-party machine learning prediction services, clients seek to utilize a remote server’s trained machine learning model without exposing sensitive data. ...
The Ring-Learning-with-Errors (RLWE) method is crucial for enhancing security and privacy in various technologies like homomorphic encryption and post-quantu...
In this paper, the researchers propose PRIME, a novel PIM architecture designed to accelerate NN applications within ReRAM-based main memory. PRIME allows fo...
The paper introduces Gemmini, an open-source tool for creating DNN accelerators. Gemmini generates a diverse range of efficient ASIC accelerators using a fle...
This paper introduces GenAx, a specialized accelerator designed to expedite the read alignment process, which is a time-intensive task in genome sequencing. ...
The paper details the development of Graphicionado, a specialized accelerator designed for efficient graph analytics. The researchers optimized both data pro...
This paper assesses the effectiveness of a custom ASIC called a Tensor Processing Unit (TPU), which has been utilized in datacenters since 2015 to expedite t...
As hardware costs decrease and processing requirements become clearer, specialized systems are increasingly developed; however, the lack of a standardized ap...
In this paper, the researchers introduce a novel dataflow technique termed row-stationary (RS), aimed at minimizing energy consumption related to data moveme...
This paper introduces an Energy Efficient Inference Engine (EIE), aiming to process inference tasks on compressed network models efficiently. EIE achieves en...
In this paper, the authors present MANIC, a novel architecture tailored for the ultra-low-power sensor domain. MANIC achieves high energy efficiency while re...
The paper proposes a customized semi-programmable loop accelerator architecture that balances customization for efficiency with the flexibility to execute mu...
This paper presents the polymorphous TRIPS architecture, which can be configured for different granularities and types of parallelism. The results indicate s...
This paper introduces the Raw microprocessor, which employs a scalable ISA to solve the growing wire-delay problem. By offering a parallel, software interfac...
This paper compares general-purpose processors, including CMPs, to ASICs in terms of cost-effectiveness, performance, and energy efficiency. It focuses on a ...
The paper models multicore scaling limits to measure the speedup potential for parallel workloads for the next five technology generations. The findings indi...
The paper presents WaveScalar, an alternative to superscalar design, a dataflow ISA and execution model designed for scalable, simple, high-performance proce...
The RISC-V V extension has recently reached the 1.0-Frozen Status. This paper introduces the initial open-source implementation, explores its influence on la...
The paper showcases CODE, a scalable vector microarchitecture that enhances performance and tackles issues in vector processors such as the complexity and si...
This report highlights VIRAM architecture’s efficiency for embedded multimedia applications, utilizing EEMBC benchmarks. With compact code, low power consump...
This report delves into the fundamentals of Vector Processors. The report outlines the architecture, functionality, challenges, performance enhancements, as ...
In this post, I summarize the various searching algorithms discussed in AlgoExpert - Algorithm Questions Course and implement them in Python.
In this post, I summarize the various sorting algorithms discussed in AlgoExpert - Algorithm Questions Course and implement them in Python.
Within this PDF, you’ll find my comprehensive notes covering the “Linear Algebra for Machine Learning and Data Science” Course offered by DeepLearning.AI
This is the summary of what I learned from Kunal Kushwaha’s YouTube Course on Computer Networking.
This is the summary of what I learned from Kunal Kushwaha’s YouTube Course on Git and GitHub, and a few extra notes taken while researching.
These are my notes on the Coursera Course - Learning How to Learn. The course is instructed by Dr. Barbara Oakley and Terrence Sejnowski.
These are my notes on Introduction to Self Driving Cars course on Coursera. It is instructed by professors from the University of Toronto.
“Deep Learning” course (ECE-GY 7123) at NYU Tandon by Chinmay Hegde
“Advanced Machine Learning” course (ECE-GY 7143) at NYU Tandon by Anna Choromanska