{"621214":{"#nid":"621214","#data":{"type":"news","title":"Cleaning Up Those Messy Notebooks Just Got A Lot Easier  ","body":[{"value":"\u003Cp\u003EThis year\u0026rsquo;s\u0026nbsp;\u003Ca href=\u0022https:\/\/chi2019.acm.org\/\u0022\u003EACM CHI Conference on Human Factors in Computing Systems\u003C\/a\u003E\u0026nbsp;best paper award winner\u0026nbsp;has developed a set of tools to help programmers and data scientists clean up their computational notebooks so they can program more effectively and efficiently.\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u0026ldquo;Programming in computational notebooks is helpful for seeing intermediate pieces of code and results interlaced together, but often these notebooks become very long and messy. This likely resonates with many students, but also data science and industry professionals, since it is a widely used technology,\u0026rdquo; said\u0026nbsp;\u003Ca href=\u0022https:\/\/www.cse.gatech.edu\/\u0022\u003ESchool of Computational Science and Engineering\u003C\/a\u003E\u0026nbsp;(CSE) Ph.D. student and co-investigator of the paper,\u0026nbsp;\u003Ca href=\u0022https:\/\/fredhohman.com\/\u0022\u003E\u003Cstrong\u003EFred Hohman\u003C\/strong\u003E\u003C\/a\u003E.\u003C\/p\u003E\r\n\r\n\u003Cp\u003EThe set of tools, called code gathering tools, allow the user to go to any part of a long notebook, such as a certain variable or equation hidden in messy code, and pull out the relevant information.\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u0026ldquo;What we did is create a means to pull out, or gather, a desired item out of a large notebook and show all its changes from previous versions. This will show you what minimal set of code you need to get a certain result,\u0026rdquo; said Hohman.\u003C\/p\u003E\r\n\r\n\u003Cp\u003EIn conjunction with efficiency, this tool also helps with reproducibility, sharing code, and communication by helping analysts find, clean, recover, and compare versions of code in cluttered, inconsistent notebooks.\u0026nbsp;\u003C\/p\u003E\r\n\r\n\u003Cp\u003EAccording to the paper,\u0026nbsp;\u003Ca href=\u0022http:\/\/andrewhead.info\/assets\/pdf\/notebook-gathering.pdf\u0022\u003E\u003Cem\u003EManaging Messes in Computational Notebooks\u003C\/em\u003E\u003C\/a\u003E, the tools also archive all versions of code outputs, allowing analysts to review these versions and recover the subsets of code that produced them. These subsets can serve as succinct summaries of analysis activity or starting points for new analyses.\u0026nbsp;\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cstrong\u003E[Related Links: \u003Ca href=\u0022https:\/\/gvu.gatech.edu\/chi-2019\u0022\u003EGeorgia Tech Research Integrates Human Capabilities with Machine Advances for Positive Impact in Society\u003C\/a\u003E]\u0026nbsp;\u003C\/strong\u003E\u003C\/p\u003E\r\n","summary":null,"format":"limited_html"}],"field_subtitle":"","field_summary":"","field_summary_sentence":[{"value":"CSE Ph.D. student Fred Hohman co-authors CHI 2019 Best Paper Award Winner on cleaning up computational notebooks."}],"uid":"34540","created_gmt":"2019-05-02 13:58:18","changed_gmt":"2019-05-03 13:31:55","author":"Kristen Perez","boilerplate_text":"","field_publication":"","field_article_url":"","dateline":{"date":"2019-05-02T00:00:00-04:00","iso_date":"2019-05-02T00:00:00-04:00","tz":"America\/New_York"},"extras":[],"hg_media":{"621213":{"id":"621213","type":"image","title":"Managing Messy Notebooks","body":null,"created":"1556805301","gmt_created":"2019-05-02 13:55:01","changed":"1556805308","gmt_changed":"2019-05-02 13:55:08","alt":"A diagram showing a \u0027messy notebook\u0027 that leads to an \u0027execution log\u0027 with revisions of the messy notebook and the outcome of slicing the log to an \u0027ordered, minimal, complete slices\u0027 noebook.","file":{"fid":"236629","name":"managingmessynotebooks.png","image_path":"\/sites\/default\/files\/images\/managingmessynotebooks.png","image_full_path":"http:\/\/hg.gatech.edu\/\/sites\/default\/files\/images\/managingmessynotebooks.png","mime":"image\/png","size":99307,"path_740":"http:\/\/hg.gatech.edu\/sites\/default\/files\/styles\/740xx_scale\/public\/images\/managingmessynotebooks.png?itok=NrLzc6uj"}}},"media_ids":["621213"],"groups":[{"id":"47223","name":"College of Computing"},{"id":"431631","name":"OMS"},{"id":"50877","name":"School of Computational Science and Engineering"}],"categories":[{"id":"8862","name":"Student Research"}],"keywords":[{"id":"181220","name":"cse-ml"},{"id":"181216","name":"cc-research"}],"core_research_areas":[],"news_room_topics":[],"event_categories":[],"invited_audience":[],"affiliations":[],"classification":[],"areas_of_expertise":[],"news_and_recent_appearances":[],"phone":[],"contact":[{"value":"\u003Cp\u003EKristen Perez\u003C\/p\u003E\r\n\r\n\u003Cp\u003ECommunications Officer\u003C\/p\u003E\r\n","format":"limited_html"}],"email":["kristen.perez@cc.gatech.edu"],"slides":[],"orientation":[],"userdata":""}}}