{"49646":{"#nid":"49646","#data":{"type":"event","title":"CSE Seminar: Tiankai Tu","body":[{"value":"\u003Cp\u003E\u003Cstrong\u003ETiankai Tu\u003C\/strong\u003E\u003Cbr \/\u003EComputer Scientist\u003Cbr \/\u003ED. E. Shaw Research\u003C\/p\u003E\u003Cp\u003EFor more information please contact Dr. George Biros at \u003Ca href=\u0022mailto:gbrios@cc.gatech.edu\u0022\u003Egbrios@cc.gatech.edu\u003C\/a\u003E\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003E\u0022Accelerating Parallel Analysis of Scientific Simulation Data via Zazen\u0022\u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003EAbstract:\u0026nbsp; \u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003EAs a new generation of parallel supercomputers enables researchers to conduct scientific simulations of unprecedented scale and resolution, terabyte-scale simulation output has become increasingly commonplace. Analysis of such massive data sets is typically I\/O-bound: many parallel analysis programs spend most of their execution time reading data from disk rather than performing useful computation. To overcome this I\/O bottleneck, we have developed a new data access method. Our main idea is to cache a copy of simulation output files on the local disks of an analysis cluster\u2019s compute nodes, and to use a novel task-assignment protocol to co-locate data access with computation. We have implemented our methodology in a parallel disk cache system called Zazen. By avoiding the overhead associated with querying metadata servers and by reading data in parallel from local disks, Zazen is able to deliver a sustained read bandwidth of over 20 gigabytes per second on a commodity Linux cluster with 100 nodes, approaching the optimal aggregated I\/O bandwidth attainable on these nodes. Compared with conventional NFS, PVFS2, and Hadoop\/HDFS, respectively, Zazen is 75, 18, and 6 times faster for accessing large (1-GB) files, and 25, 13, and 85 times faster for accessing small (2-MB) files. We have deployed Zazen in conjunction with Anton\u2014a special-purpose supercomputer that dramatically accelerates molecular dynamics (MD) simulations\u2014and have been able to accelerate the parallel analysis of terabyte-scale MD trajectories by about an order of magnitude.\u003C\/p\u003E\u003Cp\u003E\u003Cstrong\u003EBio:\u003C\/strong\u003E\u003C\/p\u003E\u003Cp\u003ETiankai Tu is a computer scientist at D. E. Shaw Research, where he architects and implements parallel software systems for analyzing very long molecular dynamics trajectories.\u0026nbsp; He is also a visiting scientist at the University of Texas at Austin, developing parallel adaptive mesh refinement (AMR) algorithms for simulating global mantle convection on petascale computers.\u0026nbsp; Tiankai earned a Ph.D. in Computer Science from Carnegie Mellon University, where he developed computational database systems and parallel algorithms for simulating earthquake ground motion on terascale systems. \u003C\/p\u003E\u003Cp\u003EHe received the Gordon Bell Award for Special Achievement in 2003, the SC06 HPC Analytics Challenge Award in 2006, the TeraGrid Capability Computing Challenge Award in 2008, and the SC09 Best Poster Award in 2009.\u0026nbsp; He was also a finalist for the SC06 Best Student Paper Award, the SC08 Best Technical Paper Award, and the 2008 Gordon Bell Award for Special Achievement.\u003C\/p\u003E\u003Cp\u003E~~~~~~~~~~~~~~~~\u003C\/p\u003E\u003Cp\u003EYou are cordially invited to attend a reception in the lounge next to Klaus 1324 before the seminar to chat informally with faculty and students. Refreshments will be provided.\u003C\/p\u003E\u003Cp\u003ETo receive future announcements, please sign up to the cse-seminar email list:\u003C\/p\u003E\u003Cp\u003E\u003Ca href=\u0022https:\/\/mailman.cc.gatech.edu\/mailman\/listinfo\/cse-seminar\u0022 target=\u0022_blank\u0022\u003Ehttps:\/\/mailman.cc.gatech.edu\/mailman\/listinfo\/cse-seminar\u003C\/a\u003E\u003C\/p\u003E","summary":null,"format":"limited_html"}],"field_subtitle":"","field_summary":"","field_summary_sentence":[{"value":"Accelerating Parallel Analysis of Scientific Simulation Data via Zazen"}],"uid":"27154","created_gmt":"2010-01-22 11:34:50","changed_gmt":"2016-10-08 01:49:32","author":"Louise Russo","boilerplate_text":"","field_publication":"","field_article_url":"","field_event_time":{"event_time_start":"2010-01-22T13:00:00-05:00","event_time_end":"2010-01-22T14:00:00-05:00","event_time_end_last":"2010-01-22T14:00:00-05:00","gmt_time_start":"2010-01-22 18:00:00","gmt_time_end":"2010-01-22 19:00:00","gmt_time_end_last":"2010-01-22 19:00:00","rrule":null,"timezone":"America\/New_York"},"extras":[],"groups":[{"id":"37041","name":"Computational Science and Engineering"}],"categories":[],"keywords":[{"id":"3498","name":"cse graduate programs"},{"id":"3497","name":"cse seminar"}],"core_research_areas":[],"news_room_topics":[],"event_categories":[],"invited_audience":[],"affiliations":[],"classification":[],"areas_of_expertise":[],"news_and_recent_appearances":[],"phone":[],"contact":[{"value":"\u003Cp\u003ELometa Mitchell\u003C\/p\u003E\u003Cp\u003EPhone: 404-385-4785\u003C\/p\u003E\u003Cp\u003EEmail: lometa@cc.gatech.edu\u003C\/p\u003E","format":"limited_html"}],"email":[],"slides":[],"orientation":[],"userdata":""}}}