{"636776":{"#nid":"636776","#data":{"type":"event","title":"PhD Defense by Hyoukjun Kwon","body":[{"value":"\u003Cp\u003E\u003Cstrong\u003ETitle:\u003C\/strong\u003E\u0026nbsp;Data- and Communication-centric Approaches to Model and Design Flexible Deep Neural Network Accelerators\u003C\/p\u003E\r\n\r\n\u003Cp\u003EHyoukjun Kwon\u003Cbr \/\u003E\r\nPhD Candidate\u0026nbsp;\u003Cbr \/\u003E\r\nSchool of Computer Science\u003Cbr \/\u003E\r\nGeorgia Institute of Technology\u0026nbsp;\u003Cbr \/\u003E\r\n\u003Ca href=\u0022http:\/\/hyoukjunkwon.com\/\u0022\u003Ehttp:\/\/hyoukjunkwon.com\/\u003C\/a\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cstrong\u003EDate:\u003C\/strong\u003E\u0026nbsp;Thursday, July 16th, 2020\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cstrong\u003ETime:\u003C\/strong\u003E\u0026nbsp;3 -5 pm\u003Cbr \/\u003E\r\n\u003Cstrong\u003ELocation:\u0026nbsp;\u003C\/strong\u003E\u0026nbsp;\u003Ca href=\u0022https:\/\/bluejeans.com\/219451978\u0022\u003Ehttps:\/\/bluejeans.com\/219451978\u003C\/a\u003E\u0026nbsp;(remote)\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cstrong\u003ECommittee:\u003C\/strong\u003E\u003Cbr \/\u003E\r\nDr. Tushar Krishna (advisor), School of Electrical and Computer Engineering, Georgia Institute of Technology\u003Cbr \/\u003E\r\nDr. Vivek Sarkar, School of Computer Science, Georgia Institute of Technology\u003Cbr \/\u003E\r\nDr. Hyesoon Kim, School of Computer Science, Georgia Institute of Technology\u003Cbr \/\u003E\r\nDr. Alexey Tumanov, School of Computer Science, Georgia Institute of Technology\u003Cbr \/\u003E\r\nDr. Micahel Pellauer, Architecture Research Group, NVIDIA\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cstrong\u003EAbstract:\u003C\/strong\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003EDeep neural network (DNN) acceleration has emerged as an enabler of many applications such as image classification, face recognition, natural language processing, that was challenging to achieve high operational performance (i.e., accuracy or quality of outputs). Since recent DNNs involve billions of multiply-and-accumulate (MAC) operations with millions of parameters, DNN accelerators, specialized hardware for DNN computation, have emerged. However, designing dedicated hardware for each DNN model requires high development costs while DNN models and algorithms rapidly evolve. In addition, specializing a DNN accelerator for one DNN model with limited support for compiler mappings often leads to inefficiency for other DNN models. Therefore, this thesis explores flexible DNN accelerator designs that support diverse compiler mappings (i.e., dataflow + tile sizes for each data dimension) to adapt to new DNN models without re-designing hardware.\u003C\/p\u003E\r\n\r\n\u003Cp\u003EThis thesis first focuses on the modeling costs and benefits of mapping choices to quantify the potential costs and benefits of mapping choices considering underlying hardware. We codify the cost model and implement \u003Cem\u003EMAESTRO\u003C\/em\u003E, and perform case studies that show no single mapping is ideal for all the layers. For the flexible DNN accelerator designs, this thesis addresses the challenge from two perspectives: reconfigurability and heterogeneity.\u003C\/p\u003E\r\n\r\n\u003Cp\u003EFor the reconfigurability approach, this thesis focuses on the data movement since the cost of data movement dominates in DNN accelerators, and the rearranging the data movement is effectively equivalent to programming a DNN accelerator considering the nature of predefined target application. We propose a light-weight network-on-chip (NoC) architecture, \u003Cem\u003EMicroswitch NoC\u003C\/em\u003E, specialized for DNN accelerator traffic while providing sufficient flexibility for any dataflow. We also present a reconfigurable DNN accelerator design, \u003Cem\u003EMAERI\u003C\/em\u003E, that employs reconfigurable data distribution and reduction NoCs that support all the communication patterns in DNN accelerators and perform reduction inside NoC switches (i.e., in-network-processing style). MAERI enables to map computations on compute units without underutilizing PEs for any irregular DNN computations resulting from diverse layers and various optimizations (e.g., cross layer mapping, sparsity, etc.).\u003C\/p\u003E\r\n\r\n\u003Cp\u003EFor the heterogeneity approach, this thesis explores heterogeneous DNN accelerators (HDAs), which contains multiple sub-accelerators that contain different amount of hardware resources and run different dataflows. For the HDA-based approach, this thesis proposes a comprehensive HDA optimization framework, \u003Cem\u003EHerald\u003C\/em\u003E, that automatically explore optimization opportunities of mapping DNN layers to a sub-accelerator with the lowest EDP at run time and proper hardware resource partitioning at design time. Finally, we formally define the mapping flexibility so that we can quantify the degree of flexibility of flexible accelerators, which enables comprehensive comparison across flexible DNN accelerators.\u003C\/p\u003E\r\n","summary":null,"format":"limited_html"}],"field_subtitle":"","field_summary":"","field_summary_sentence":[{"value":"Data- and Communication-centric Approaches to Model and Design Flexible Deep Neural Network Accelerators"}],"uid":"27707","created_gmt":"2020-07-07 15:35:34","changed_gmt":"2020-07-07 15:35:34","author":"Tatianna Richardson","boilerplate_text":"","field_publication":"","field_article_url":"","field_event_time":{"event_time_start":"2020-07-16T16:00:00-04:00","event_time_end":"2020-07-16T18:00:00-04:00","event_time_end_last":"2020-07-16T18:00:00-04:00","gmt_time_start":"2020-07-16 20:00:00","gmt_time_end":"2020-07-16 22:00:00","gmt_time_end_last":"2020-07-16 22:00:00","rrule":null,"timezone":"America\/New_York"},"extras":[],"groups":[{"id":"221981","name":"Graduate Studies"}],"categories":[],"keywords":[{"id":"100811","name":"Phd Defense"}],"core_research_areas":[],"news_room_topics":[],"event_categories":[{"id":"1788","name":"Other\/Miscellaneous"}],"invited_audience":[{"id":"78771","name":"Public"},{"id":"174045","name":"Graduate students"},{"id":"78751","name":"Undergraduate students"}],"affiliations":[],"classification":[],"areas_of_expertise":[],"news_and_recent_appearances":[],"phone":[],"contact":[],"email":[],"slides":[],"orientation":[],"userdata":""}}}