default search action
47th ISCA 2020: Virtual Event / Valencia, Spain
- 47th ACM/IEEE Annual International Symposium on Computer Architecture, ISCA 2020, Virtual Event / Valencia, Spain, May 30 - June 3, 2020. IEEE 2020, ISBN 978-1-7281-4661-4
- Bülent Abali, Bart Blaner, John J. Reilly, Matthias Klein, Ashutosh Mishra, Craig B. Agricola, Bedri Sendir, Alper Buyuktosunoglu, Christian Jacobi, William J. Starke, Haren Myneni, Charlie Wang:
Data Compression Accelerator on IBM POWER9 and z15 Processors : Industrial Product. 1-14 - G. Glenn Henry, Parviz Palangpour, Michael Thomson, J. Scott Gardner, Bryce Arden, Jim Donahue, Kimble Houck, Jonathan Johnson, Kyle O'Brien, Scott Petersen, Benjamin Seroussi, Tyler Walker:
High-Performance Deep-Learning Coprocessor Integrated into x86 SoC with Server-Class CPUs Industrial Product. 15-26 - Narasimha Adiga, James Bonanno, Adam Collura, Matthias Heizmann, Brian R. Prasky, Anthony Saporito:
The IBM z15 High Frequency Mainframe Branch Predictor Industrial Product. 27-39 - Brian Grayson, Jeff Rupley, Gerald D. Zuraski, Eric Quinnell, Daniel A. Jiménez, Tarun Nakra, Paul Kitchin, Ryan Hensley, Edward Brekelbaum, Vikas Sinha, Ankit Ghiya:
Evolution of the Samsung Exynos CPU Microarchitecture. 40-51 - Chen Chen, Xiaoyan Xiang, Chang Liu, Yunhai Shang, Ren Guo, Dongqi Liu, Yimin Lu, Ziyi Hao, Jiahui Luo, Zhijian Chen, Chunqiang Li, Yu Pu, Jianyi Meng, Xiaolang Yan, Yuan Xie, Xiaoning Qi:
Xuantie-910: A Commercial Multi-Core 12-Stage Pipeline Out-of-Order 64-bit High Performance RISC-V Processor with Vector Extension : Industrial Product. 52-64 - Ali Ansari, Pejman Lotfi-Kamran, Hamid Sarbazi-Azad:
Divide and Conquer Frontend Bottleneck. 65-78 - Sumeet Bandishte, Jayesh Gaur, Zeev Sperber, Lihu Rappoport, Adi Yoaz, Sreenivas Subramoney:
Focused Value Prediction. 79-91 - Adarsh Chauhan, Jayesh Gaur, Zeev Sperber, Franck Sala, Lihu Rappoport, Adi Yoaz, Sreenivas Subramoney:
Auto-Predication of Critical Branches. 92-104 - Vinesh Srinivasan, Rangeen Basu Roy Chowdhury, Eric Rotenberg:
Slipstream Processors Revisited: Exploiting Branch Sets. 105-117 - Samuel Pakalapati, Biswabandan Panda:
Bouquet of Instruction Pointers: Instruction Pointer Classifier-based Spatial Hardware Prefetching. 118-131 - Sam Ainsworth, Timothy M. Jones:
MuonTrap: Preventing Cross-Domain Spectre-Like Attacks by Capturing Speculative State. 132-144 - Dennis Abts, Jonathan Ross, Jonathan Sparling, Mark Wong-VanHaren, Max Baker, Tom Hawkins, Andrew Bell, John Thompson, Temesghen Kahsai, Garrin Kimmell, Jennifer Hwang, Rebekah Leslie-Hurd, Michael Bye, E. R. Creswick, Matthew Boyd, Mahitha Venigalla, Evan Laforge, Jon Purdy, Purushotham Kamath, Dinesh Maheshwari, Michael Beidler, Geert Rosseel, Omar Ahmad, Gleb Gagarin, Richard Czekalski, Ashay Rane, Sahil Parmar, Jeff Werner, Jim Sproch, Adrián Macías, Brian Kurtz:
Think Fast: A Tensor Streaming Processor (TSP) for Accelerating Deep Learning Workloads. 145-158 - Victor A. Ying, Mark C. Jeffrey, Daniel Sánchez:
T4: Compiling Sequential Code for Effective Speculative Parallelization in Hardware. 159-172 - Moyang Wang, Tuan Ta, Lin Cheng, Christopher Batten:
Efficiently Supporting Dynamic Task Parallelism on Heterogeneous Cache-Coherent Systems. 173-186 - Shenghsun Cho, Han Chen, Sergey Madaminov, Michael Ferdman, Peter A. Milder:
Flick: Fast and Lightweight ISA-Crossing Call for Heterogeneous-ISA Environments. 187-198 - Mark Sutherland, Siddharth Gupta, Babak Falsafi, Virendra J. Marathe, Dionisios N. Pnevmatikatos, Alexandros Daglis:
The NEBULA RPC-Optimized Architecture. 199-212 - Nathaniel Bleier, Muhammad Husnain Mubarik, Farhan Rasheed, Jasmin Aghassi-Hagmann, Mehdi B. Tahoori, Rakesh Kumar:
Printed Microprocessors. 213-226 - Jawad Haj-Yahya, Mohammed Alser, Jeremie S. Kim, Abdullah Giray Yaglikçi, Nandita Vijaykumar, Efraim Rotem, Onur Mutlu:
SysScale: Exploiting Multi-domain Dynamic Voltage and Frequency Scaling for Energy Efficient Mobile Processors. 227-240 - Shulin Zhao, Haibo Zhang, Sandeepa Bhuyan, Cyan Subhra Mishra, Ziyu Ying, Mahmut T. Kandemir, Anand Sivasubramaniam, Chita R. Das:
Déjà View: Spatio-Temporal Compute Reuse for' Energy-Efficient 360° VR Video Streaming. 241-253 - Tae Jun Ham, David Bruns-Smith, Brendan Sweeney, Yejin Lee, Seong Hoon Seo, U. Gyeong Song, Young H. Oh, Krste Asanovic, Jae W. Lee, Lisa Wu Wills:
Genesis: A Hardware Acceleration Framework for Genomic Data Analysis. 254-267 - Jian Weng, Sihao Liu, Vidushi Dadu, Zhengrong Wang, Preyas Shah, Tony Nowatzki:
DSAGEN: Synthesizing Programmable Spatial Accelerators. 268-281 - Nikola Samardzic, Weikang Qiao, Vaibhav Aggarwal, Mau-Chung Frank Chang, Jason Cong:
Bonsai: High-Performance Adaptive Merge Tree Sorting. 282-294 - Gangwon Jo, Heehoon Kim, Jeesoo Lee, Jaejin Lee:
SOFF: An OpenCL High-Level Synthesis Framework for FPGAs. 295-308 - Matthew Vilim, Alexander Rucker, Yaqi Zhang, Sophia Liu, Kunle Olukotun:
Gorgon: Accelerating Machine Learning from Relational Data. 309-321 - Jaeyoung Jang, Sungjun Jung, Sunmin Jeong, Jun Heo, Hoon Shin, Tae Jun Ham, Jae W. Lee:
A Specialized Architecture for Object Serialization with Applications to Big Data Analytics. 322-334 - Ilkwon Byun, Dongmoon Min, Gyu-hyeon Lee, Seongmin Na, Jangwoo Kim:
CryoCore: A Fast and Dense Processor Architecture for Cryogenic Computing. 335-348 - Surya Narayanan, Karl Taht, Rajeev Balasubramonian, Edouard Giacomin, Pierre-Emmanuel Gaillardon:
SpinalFlow: An Architecture and Dataflow Tailored for Spiking Neural Networks. 349-362 - Sonali Singh, Anup Sarma, Nicholas Jao, Ashutosh Pattnaik, Sen Lu, Kezhou Yang, Abhronil Sengupta, Vijaykrishnan Narayanan, Chita R. Das:
NEBULA: A Neuromorphic Spin-Based Ultra-Low Power Architecture for SNNs and ANNs. 363-376 - Di Wu, Jingjie Li, Ruokai Yin, Hsuan Hsiao, Younghyun Kim, Joshua San Miguel:
UGEMM: Unary Computing Architecture for GEMM Applications. 377-390 - Ioannis Karageorgos, Karthik Sriram, Ján Veselý, Michael Wu, Marc Powell, David A. Borton, Rajit Manohar, Abhishek Bhattacharjee:
Hardware-Software Co-Design for Brain-Computer Interfaces. 391-404 - Xinhui Zhu, Weixiang Jiang, Fangming Liu, Qixia Zhang, Li Pan, Qiong Chen, Ziyang Jia:
Heat to Power: Thermal Energy Harvesting and Recycling for Warm Water-Cooled Datacenters. 405-418 - Yifan Yang, Zhaoshi Li, Yangdong Deng, Zhiwei Liu, Shouyi Yin, Shaojun Wei, Leibo Liu:
GraphABCD: Scaling Out Graph Analytics with Asynchronous Block Coordinate Descent. 419-432 - Nagadastagiri Challapalle, Sahithi Rampalli, Linghao Song, Nandhini Chandramoorthy, Karthik Swaminathan, John Sampson, Yiran Chen, Vijaykrishnan Narayanan:
GaaS-X: Graph Analytics Accelerator Supporting Sparse Data Representation using Crossbar Architectures. 433-445 - Vijay Janapa Reddi, Christine Cheng, David Kanter, Peter Mattson, Guenther Schmuelling, Carole-Jean Wu, Brian Anderson, Maximilien Breughe, Mark Charlebois, William Chou, Ramesh Chukka, Cody Coleman, Sam Davis, Pan Deng, Greg Diamos, Jared Duke, Dave Fick, J. Scott Gardner, Itay Hubara, Sachin Idgunji, Thomas B. Jablin, Jeff Jiao, Tom St. John, Pankaj Kanwar, David Lee, Jeffery Liao, Anton Lokhmotov, Francisco Massa, Peng Meng, Paulius Micikevicius, Colin Osborne, Gennady Pekhimenko, Arun Tejusve Raghunath Rajan, Dilip Sequeira, Ashish Sirasao, Fei Sun, Hanlin Tang, Michael Thomson, Frank Wei, Ephrem Wu, Lingjie Xu, Koichi Yamada, Bing Yu, George Yuan, Aaron Zhong, Peizhao Zhang, Yuchen Zhou:
MLPerf Inference Benchmark. 446-459 - Mario Badr, Carlo Delconte, Isak Edo, Radhika Jagtap, Matteo Andreozzi, Natalie D. Enright Jerger:
Mocktails: Capturing the Memory Behaviour of Proprietary Mobile Architectures. 460-472 - Mahmoud Khairy, Zhesheng Shen, Tor M. Aamodt, Timothy G. Rogers:
Accel-Sim: An Extensible Simulation Framework for Validated GPU Modeling. 473-486 - Alexey Lavrov, David Wentzlaff:
HyperTRIO: Hyper-Tenant Translation of I/O Addresses. 487-500 - Dimitrios Skarlatos, Umur Darbaz, Bhargava Gopireddy, Nam Sung Kim, Josep Torrellas:
BabelFish: Fusing Address Translations for Containers. 501-514 - Chloe Alverti, Stratos Psomadakis, Vasileios Karakostas, Jayneel Gandhi, Konstantinos Nikas, Georgios I. Goumas, Nectarios Koziris:
Enhancing and Exploiting Contiguity for Fast Memory Virtualization. 515-528 - Prakash Murali, Dripto M. Debroy, Kenneth R. Brown, Margaret Martonosi:
Architecting Noisy Intermediate-Scale Trapped Ion Quantum Computers. 529-542 - Jinglei Cheng, Haoqing Deng, Xuehai Qian:
AccQOC: Accelerating Quantum Optimal Control Based Pulse Generation. 543-555 - Adam Holmes, Mohammad Reza Jokar, Ghasem Pasandi, Yongshan Ding, Massoud Pedram, Frederic T. Chong:
NISQ+: Boosting quantum computing power by approximating quantum error correction. 556-569 - Yongshan Ding, Xin-Chuan Wu, Adam Holmes, Ash Wiseth, Diana Franklin, Margaret Martonosi, Frederic T. Chong:
SQUARE: Strategic Quantum Ancilla Reuse for Modular Quantum Programs via Cost-Effective Uncomputation. 570-583 - Miao Cai, Chance C. Coats, Jian Huang:
HOOP: Efficient Hardware-Assisted Out-of-Place Update for Non-Volatile Memory. 584-596 - Jian Zhou, Amro Awad, Jun Wang:
Lelantus: Fine-Granularity Copy-On-Write Operations for Secure Non-Volatile Memories. 597-609 - Xueliang Wei, Dan Feng, Wei Tong, Jingning Liu, Liuqing Ye:
MorLog: Morphable Hardware Logging for Atomic Persistence in Non-Volatile Main Memory. 610-623 - Rajat Kateja, Nathan Beckmann, Gregory R. Ganger:
TVARAK: Software-Managed Hardware Offload for Redundancy in Direct-Access NVM Storage. 624-637 - Jeremie S. Kim, Minesh Patel, Abdullah Giray Yaglikçi, Hasan Hassan, Roknoddin Azizi, Lois Orosa, Onur Mutlu:
Revisiting RowHammer: An Experimental Analysis of Modern DRAM Devices and Mitigation Techniques. 638-651 - Vaibhav Gogte, William Wang, Stephan Diestelhorst, Peter M. Chen, Satish Narayanasamy, Thomas F. Wenisch:
Relaxed Persist Ordering Using Strand Persistency. 652-665 - Haocong Luo, Taha Shahroodi, Hasan Hassan, Minesh Patel, Abdullah Giray Yaglikçi, Lois Orosa, Jisung Park, Onur Mutlu:
CLR-DRAM: A Low-Cost DRAM Architecture Enabling Dynamic Capacity-Latency Trade-Off. 666-679 - Yuanchao Xu, Chencheng Ye, Yan Solihin, Xipeng Shen:
Hardware-Based Domain Virtualization for Intra-Process Isolation of Persistent Memory Objects. 680-692 - Joohyeong Yoon, Won Seob Jeong, Won Woo Ro:
Check-In: In-Storage Checkpointing for Key-Value Store System Leveraging Flash-Based SSDs. 693-706 - Jiyong Yu, Namrata Mantri, Josep Torrellas, Adam Morrison, Christopher W. Fletcher:
Speculative Data-Oblivious Execution: Mobilizing Safe Prediction For Safe and Efficient Speculative Execution. 707-720 - Mohammadkazem Taram, Ashish Venkat, Dean M. Tullsen:
Packet Chasing: Spying on Network Packets over a Cache Side-Channel. 721-734 - Meysam Taassori, Rajeev Balasubramonian, Siddhartha Chhabra, Alaa R. Alameldeen, Manjula Peddireddy, Rajat Agarwal, Ryan Stutsman:
Compact Leakage-Free Support for Integrity and Reliability. 735-748 - Zhenyu Xu, Thomas Mauldin, Zheyi Yao, Shuyi Pei, Tao Wei, Qing Yang:
A Bus Authentication and Anti-Probing Architecture Extending Hardware Trusted Computing Base Off CPU Chips and Beyond. 749-761 - Rasool Sharifi, Ashish Venkat:
CHEx86: Context-Sensitive Enforcement of Memory Safety via Microcode-Enabled Capabilities. 762-775 - Joongun Park, Naegyeong Kang, Taehoon Kim, Youngjin Kwon, Jaehyuk Huh:
Nested Enclave: Supporting Fine-grained Hierarchical Isolation with SGX. 776-789 - Liu Ke, Udit Gupta, Benjamin Youngjae Cho, David Brooks, Vikas Chandra, Utku Diril, Amin Firoozshahian, Kim M. Hazelwood, Bill Jia, Hsien-Hsin S. Lee, Meng Li, Bert Maher, Dheevatsa Mudigere, Maxim Naumov, Martin Schatz, Mikhail Smelyanskiy, Xiaodong Wang, Brandon Reagen, Carole-Jean Wu, Mark Hempstead, Xuan Zhang:
RecNMP: Accelerating Personalized Recommendation with Near-Memory Processing. 790-803 - Peng Gu, Xinfeng Xie, Yufei Ding, Guoyang Chen, Weifeng Zhang, Dimin Niu, Yuan Xie:
iPIM: Programmable In-Memory Image Processing Accelerator Using Near-Bank Architecture. 804-817 - Benjamin Y. Cho, Yongkee Kwon, Sangkug Lym, Mattan Erez:
Near Data Acceleration with Concurrent Host Access. 818-831 - Weitao Li, Pengfei Xu, Yang Zhao, Haitong Li, Yuan Xie, Yingyan Lin:
Timely: Pushing Data Movements And Interfaces In Pim Accelerators Towards Local And In Time Domain. 832-845 - Yue Zha, Jing Li:
Hyper-Ap: Enhancing Associative Processing Through A Full-Stack Optimization. 846-859 - R. David Evans, Lufei Liu, Tor M. Aamodt:
JPEG-ACT: Accelerating Deep Learning via Transform-based Lossy Compression. 860-873 - Naorin Hossain, Caroline Trippel, Margaret Martonosi:
TransForm: Formally Specifying Transistency Models and Synthesizing Enhanced Litmus Tests. 874-887 - Nicolai Oswald, Vijay Nagarajan, Daniel J. Sorin:
HieraGen: Automated Generation of Concurrent, Hierarchical Cache Coherence Protocols. 888-899 - Faruk Guvenilir, Yale N. Patt:
Tailored Page Sizes. 900-912 - Chang Hyun Park, Sanghoon Cha, Bokyeong Kim, Youngjin Kwon, David Black-Schaffer, Jaehyuk Huh:
Perforated Page: Supporting Fragmented Memory Allocation for Large Pages. 913-925 - Esha Choukse, Michael B. Sullivan, Mike O'Connor, Mattan Erez, Jeff Pool, David W. Nellans, Stephen W. Keckler:
Buddy Compression: Enabling Larger Memory for Deep Learning and HPC Workloads on GPUs. 926-939 - Eunjin Baek, Dongup Kwon, Jangwoo Kim:
A Multi-Neural Network Acceleration Architecture. 940-953 - Yang Zhao, Xiaohan Chen, Yue Wang, Chaojian Li, Haoran You, Yonggan Fu, Yuan Xie, Zhangyang Wang, Yingyan Lin:
SmartExchange: Trading Higher-cost Memory Storage/Access for Lower-cost Computation. 954-967 - Ranggi Hwang, Taehun Kim, Youngeun Kwon, Minsoo Rhu:
Centaur: A Chiplet-based, Hybrid Sparse-Dense Accelerator for Personalized Recommendations. 968-981 - Udit Gupta, Samuel Hsia, Vikram Saraph, Xiaodong Wang, Brandon Reagen, Gu-Yeon Wei, Hsien-Hsin S. Lee, David Brooks, Carole-Jean Wu:
DeepRecSys: A System for Optimizing End-To-End At-Scale Neural Recommendation Inference. 982-995 - Benjamin Klenk, Nan Jiang, Greg Thorson, Larry Dennison:
An In-Network Architecture for Accelerating Shared-Memory Multiprocessor Collectives. 996-1009 - Zhuoran Song, Bangqi Fu, Feiyang Wu, Zhaoming Jiang, Li Jiang, Naifeng Jing, Xiaoyao Liang:
DRQ: Dynamic Region-based Quantization for Deep Neural Network Acceleration. 1010-1021 - Alexandru Dutu, Matthew D. Sinclair, Bradford M. Beckmann, David A. Wood, Marcus Chow:
Independent Forward Progress of Work-groups. 1022-1035 - Aditya K. Kamath, Alvin A. George, Arkaprava Basu:
ScoRD: A Scoped Race Detector for GPUs. 1036-1049 - Nastaran Hajinazar, Pratyush Patel, Minesh Patel, Konstantinos Kanellopoulos, Saugata Ghose, Rachata Ausavarungnirun, Geraldo F. Oliveira, Jonathan Appavoo, Vivek Seshadri, Onur Mutlu:
The Virtual Block Interface: A Flexible Alternative to the Conventional Virtual Memory Framework. 1050-1063 - Jie Zhang, Myoungsoo Jung:
ZnG: Architecting GPU Multi-Processors with New Flash for Scalable Data Analysis. 1064-1075 - Ben Feinberg, Benjamin C. Heyman, Darya Mikhailenko, Ryan Wong, An C. Ho, Engin Ipek:
Commutative Data Reordering: A New Technique to Reduce Data Movement Energy on Sparse Inference Workloads. 1076-1088 - Bojian Zheng, Nandita Vijaykumar, Gennady Pekhimenko:
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training. 1089-1102 - Gyusun Lee, Wenjing Jin, Wonsuk Song, Jeonghun Gong, Jonghyun Bae, Tae Jun Ham, Jae W. Lee, Jinkyu Jeong:
A Case for Hardware-Based Demand Paging. 1103-1116
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.