{"71094":{"#nid":"71094","#data":{"type":"news","title":"Georgia Tech to Analyze Massive Data Sets Using Visual Analytics","body":[{"value":"\u003Cp\u003EEnormous amounts of data are being generated in health care, computational biology, homeland security and other areas, but analyzing these massive and unstructured data sets has proven cumbersome and difficult. An emerging research field known as data and visual analytics is helping sift through such mountains of information to find and put together individual pieces of a picture.\u003C\/p\u003E\n\u003Cp\u003EThe Georgia Institute of Technology has received a five-year grant to lead and coordinate a new initiative that will develop foundational research in massive data analysis and visual analytics. A research team headed by Haesun Park, a professor and associate chair in the Computational Science and Engineering Division of the Georgia Tech College of Computing, will investigate ways to improve the visual analytics of massive data sets through machine learning, numerical algorithms and optimization, computational statistics, and information visualization. \n\u003C\/p\u003E\n\u003Cp\u003E\u0022Developing new and improved mathematical and computational methodologies will further enable systems developers, intelligence analysts, biologists and health care workers to implement new methods to \u0027detect the expected and discover the unexpected\u0027 among massive data sets,\u0022 Park explained.\n\u003C\/p\u003E\n\u003Cp\u003EThe $3 million joint National Science Foundation and Department of Homeland Security grant establishes Georgia Tech as the lead academic research institution for all national Foundations of Data and Visual Analytics (FODAVA) research efforts. Seven other FODAVA Partnership Awards will be announced later this year, all working in conjunction with eleven Georgia Tech investigators to advance the field. 
\n\u003C\/p\u003E\n\u003Cp\u003EOver the next five years, the Georgia Tech-led research team will work to establish FODAVA as a distinct research field and build a community of top-quality researchers that will collaborate on research workshops and conferences, industry engagement and technology transfer. \n\u003C\/p\u003E\n\u003Cp\u003E\u0022FODAVA seeks to put an improved science base under one portion of the problem - how can we transform large, complex data sets into reduced computational models or mathematical formalisms that retain the information content while better supporting the human in extracting critical information from the data,\u0022 said Lawrence Rosenblum, program director for graphics and visualization at the National Science Foundation. \u0022Scientific advances here are critical to future advances in the science of data and visual analytics that will keep us safe and provide technological and commercial advances that benefit mankind.\u0022\n\u003C\/p\u003E\n\u003Cp\u003EGeorgia Tech\u0027s expertise in advanced computer-based analysis, probability and statistics, numerical algorithms and optimization, machine learning, and human-computer interaction techniques provides a strong foundation to lead this new initiative. \n\u003C\/p\u003E\n\u003Cp\u003EPark specializes in using numerical linear algebra and optimization techniques to develop computer-based algorithms that dramatically reduce the dimension and number of data points in massive data sets. Dimension reduction is essential for efficient processing of high-dimension data sets while removing the noise in the data. \n\u003C\/p\u003E\n\u003Cp\u003EPark is especially interested in developing methods for dimension reduction that exploit prior knowledge in the data sets - such as clustered structures and non-negativity. This process is important because it leads to more accurate classification and prediction results. 
\n\u003C\/p\u003E\n\u003Cp\u003EAlexander Gray, an assistant professor in the Computational Science and Engineering Division of the College of Computing, has experience developing efficient algorithms that allow statistical and machine learning methods to be applied to massive data sets. He applies ideas from computational geometry and computational physics to statistical computations.\n\u003C\/p\u003E\n\u003Cp\u003E\u0022Reducing the computation time for an analysis from hours to seconds makes all the difference, since data analysis is inherently an iterative and interactive process,\u0022 explained Gray, also a principal investigator on the project.\n\u003C\/p\u003E\n\u003Cp\u003ELarge data sets may also include multiple objects of high dimensionality, such as images, that must be analyzed based on a relatively small number of samples. The mathematical analysis of problems like these requires expertise in statistics and probability methods, which Georgia Tech School of Mathematics professor and principal investigator Vladimir Koltchinskii will contribute to the new initiative. \n\u003C\/p\u003E\n\u003Cp\u003EOnce massive amounts of data are collected and processed, relevant information must be pulled from the data and presented using visual and interactive means. John Stasko, a principal investigator on this project and professor in the School of Interactive Computing, conducts research in the field of visual analytics. \n\u003C\/p\u003E\n\u003Cp\u003EHe heads a team that developed Jigsaw, a visual analytics system that helps analysts better assess, analyze and make sense of large document collections. 
The system provides multiple coordinated views to show connections between entities extracted from a document collection.\n\u003C\/p\u003E\n\u003Cp\u003E\u0022Jigsaw essentially acts as a visual index of the document collection - helping analysts identify particular documents to read and examine next,\u0022 explained Stasko, whose team won the university division of the 2007 Visual Analytics Science and Technology contest using Jigsaw.\n\u003C\/p\u003E\n\u003Cp\u003EStasko also serves as Georgia Tech\u0027s director in the Department of Homeland Security-sponsored SouthEast Regional Visualization and Analytics Center (SRVAC), a regional center created in 2006 to perform research in visual analytics. SRVAC is a partnership between Georgia Tech and the University of North Carolina Charlotte, and is one of five national university centers connected to the National Visualization and Analytics Center located at Pacific Northwest National Laboratory.\n\u003C\/p\u003E\n\u003Cp\u003EAll of the steps involved in massive data analysis and visual analytics - data collection, processing, analysis and visualization - require optimization. Renato Monteiro, a professor in the H. Milton Stewart School of Industrial and Systems Engineering and principal investigator, specializes in this research field. \n\u003C\/p\u003E\n\u003Cp\u003E\u0022This new center provides me the opportunity to apply optimization techniques to new and unique problems and applications that I haven\u0027t studied in the past,\u0022 said Monteiro.\n\u003C\/p\u003E\n\u003Cp\u003EFrom law enforcement and intelligence gathering to electronic health records and computational biology, the accurate and timely analysis of massive amounts of information is critical to deeper understanding and effective decision making. 
\n\u003C\/p\u003E\n\u003Cp\u003E\u0022Collaborations across Georgia Tech\u0027s computing, engineering and mathematics disciplines aim to develop better scientific and foundational methods to help practitioners in many different lines of work analyze and interactively explore large data sets more efficiently and effectively,\u0022 Park added. \n\u003C\/p\u003E\n\u003Cp\u003E\u003Cstrong\u003EResearch News \u0026amp; Publications Office\u003Cbr \/\u003E\nGeorgia Institute of Technology\u003Cbr \/\u003E\n75 Fifth Street, N.W., Suite 100\u003Cbr \/\u003E\nAtlanta, Georgia  30308  USA\n\u003C\/strong\u003E\u003C\/p\u003E\n\u003Cp\u003EMedia Relations Contacts: Abby Vogel (404-385-3364); E-mail: (\u003Ca href=\u0022mailto:avogel@gatech.edu\u0022\u003Eavogel@gatech.edu\u003C\/a\u003E) or John Toon (404-894-6986); E-mail: (\u003Ca href=\u0022mailto:jtoon@gatech.edu\u0022\u003Ejtoon@gatech.edu\u003C\/a\u003E).\n\u003C\/p\u003E\n\u003Cp\u003E\u003Cstrong\u003ETechnical Contact:\u003C\/strong\u003E Haesun Park (404-385-2170); E-mail: (\u003Ca href=\u0022mailto:hpark@cc.gatech.edu\u0022\u003Ehpark@cc.gatech.edu\u003C\/a\u003E)\n\u003C\/p\u003E\n\u003Cp\u003E\u003Cstrong\u003EWriter:\u003C\/strong\u003E Abby Vogel\n\u003C\/p\u003E","summary":null,"format":"limited_html"}],"field_subtitle":[{"value":"$3 million award will build a foundation for emerging research field"}],"field_summary":[{"value":"The Georgia Institute of Technology has received a five-year, $3 million grant from the National Science Foundation and the Department of Homeland Security to lead and coordinate a new initiative that will develop foundational research in massive data analysis and visual analytics.","format":"limited_html"}],"field_summary_sentence":[{"value":"$3M awarded for data analysis and visual analytics initiative"}],"uid":"27206","created_gmt":"2008-08-04 00:00:00","changed_gmt":"2016-10-08 03:03:19","author":"Abby Vogel 
Robinson","boilerplate_text":"","field_publication":"","field_article_url":"","dateline":{"date":"2008-08-06T00:00:00-04:00","iso_date":"2008-08-06T00:00:00-04:00","tz":"America\/New_York"},"extras":[],"hg_media":{"71095":{"id":"71095","type":"image","title":"Jigsaw","body":null,"created":"1449177348","gmt_created":"2015-12-03 21:15:48","changed":"1475894628","gmt_changed":"2016-10-08 02:43:48"}},"media_ids":["71095"],"related_links":[{"url":"http:\/\/www.cc.gatech.edu\/directory\/faculty\/faculty\/directory\/john-stasko","title":"John Stasko"},{"url":"http:\/\/www.isye.gatech.edu\/faculty-staff\/profile.php?entry=rm88","title":"Renato Monteiro"},{"url":"http:\/\/www.math.gatech.edu\/people\/faculty\/vlad.html","title":"Vladimir Koltchinskii"},{"url":"http:\/\/www.cc.gatech.edu\/directory\/faculty\/faculty\/directory\/alexander-gray","title":"Alexander Gray"},{"url":"http:\/\/www.cc.gatech.edu\/directory\/faculty\/faculty\/directory\/haesun-park","title":"Haesun Park"}],"groups":[{"id":"1188","name":"Research Horizons"}],"categories":[{"id":"153","name":"Computer Science\/Information Technology and Security"},{"id":"145","name":"Engineering"},{"id":"146","name":"Life Sciences and Biology"},{"id":"147","name":"Military 
Technology"},{"id":"135","name":"Research"}],"keywords":[{"id":"7258","name":"algebra"},{"id":"5660","name":"algorithms"},{"id":"3929","name":"analysis"},{"id":"7251","name":"analytics"},{"id":"277","name":"Biology"},{"id":"5637","name":"Computational"},{"id":"438","name":"data"},{"id":"5270","name":"FODAVA"},{"id":"398","name":"health"},{"id":"7259","name":"high-dimension"},{"id":"3928","name":"homeland"},{"id":"1620","name":"Information"},{"id":"3823","name":"learning"},{"id":"5424","name":"Linear"},{"id":"7254","name":"machine"},{"id":"7255","name":"numerical"},{"id":"7261","name":"NVAC"},{"id":"1377","name":"optimization"},{"id":"7256","name":"probability"},{"id":"7260","name":"reduction"},{"id":"167055","name":"security"},{"id":"170864","name":"set"},{"id":"170865","name":"SRVAC"},{"id":"167169","name":"statistics"},{"id":"7252","name":"visual"},{"id":"7257","name":"visualization"}],"core_research_areas":[],"news_room_topics":[],"event_categories":[],"invited_audience":[],"affiliations":[],"classification":[],"areas_of_expertise":[],"news_and_recent_appearances":[],"phone":[],"contact":[{"value":"\u003Cstrong\u003EAbby Robinson\u003C\/strong\u003E\u003Cbr \/\u003EResearch News and Publications\u003Cbr \/\u003E\u003Ca href=\u0022http:\/\/www.gatech.edu\/contact\/index.html?id=avogel6\u0022\u003EContact Abby Robinson\u003C\/a\u003E\u003Cbr \/\u003E\u003Cstrong\u003E404-385-3364\u003C\/strong\u003E","format":"limited_html"}],"email":["abby@innovate.gatech.edu"],"slides":[],"orientation":[],"userdata":""}}}