{"604576":{"#nid":"604576","#data":{"type":"news","title":"HQ Insiders Find More than Game Analysis in HQ Trivia Dataset","body":[{"value":"\u003Ch2\u003E\u003Cem\u003E\u003Cstrong\u003EGeorgia Tech computing students create an investigative analysis with instantaneous question-response dataset\u003C\/strong\u003E\u003C\/em\u003E\u003C\/h2\u003E\r\n\r\n\u003Cp\u003EHQ Trivia is a gaming app that has become a daily tradition for a growing league of devotees across the United States. The accessibility of the app places the chance of winning a large monetary prize into the hands of millions of players, twice a day.\u003C\/p\u003E\r\n\r\n\u003Cp\u003EA recent analysis conducted by Georgia Tech computing students, that began as a project aiming to breakdown the game\u0026rsquo;s questions according to difficulty, has progressed to something much larger.\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u0026ldquo;I think we have the first instantaneous question-response database that maintains millions of responses,\u0026rdquo; said \u003Cstrong\u003EJustin Melnick\u003C\/strong\u003E, online master of science in analytics\u0026nbsp;student from Georgia Tech, about the analysis that he and partner \u003Cstrong\u003EDavid Milmont\u003C\/strong\u003E, a data scientist at a fintech company and part of Georgia Tech\u0026rsquo;s Analytic MicroMasters program, have created.\u003C\/p\u003E\r\n\r\n\u003Cp\u003EMelnick and Milmont are the two minds driving the HQ Trivia research group, \u003Ca href=\u0022http:\/\/hqinsiders.com\/\u0022\u003EHQ Insiders\u003C\/a\u003E.\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cstrong\u003EBecoming the HQ Insiders\u003C\/strong\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003EThe duo first began breaking down data on HQ questions\u0026nbsp;that was publicly available and\u0026nbsp;manually inputting entries for analysis. They then discussed their findings via a \u003Ca href=\u0022https:\/\/www.reddit.com\/r\/hqtrivia\/\u0022\u003Esubreddit\u003C\/a\u003E in which they used the handle, HQ Insiders.\u003C\/p\u003E\r\n\r\n\u003Cp\u003ENot long after the team disclosed their \u003Ca href=\u0022http:\/\/hqinsiders.com\/savagery-board\/\u0022\u003EHQ question analysis\u003C\/a\u003E, they were contacted by the \u003Cem\u003EWashington Post\u003C\/em\u003E for an \u003Ca href=\u0022https:\/\/www.washingtonpost.com\/graphics\/2018\/business\/hq-trivia\/?utm_term=.f563f83f0983\u0022\u003EHQ feature\u003C\/a\u003E\u0026nbsp;showcasing their findings. However, the \u003Cem\u003EWashington Post\u003C\/em\u003E encouraged the team to gather more data before moving forward - something that would require the team to seek an alternative to manual data entry.\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u0026ldquo;We knew through YouTube that a lot of fans had been screen recording the game and we could go back and archive the games available,\u0026rdquo; explained Melnick. \u0026ldquo;We used Amazon Mechanical Turk to create a HTML for transcribing videos. Then, using heuristics and other data cleaning tools we ensured that the information logged was accurate.\u0026rdquo;\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cstrong\u003ELetting the Data Speak for Itself\u003C\/strong\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003EThus far, the team has been able to gather data from 630,872,741 player responses, with 2,486 questions over 205 games.\u003C\/p\u003E\r\n\r\n\u003Cp\u003EThe likes of a data set of this size, language variance, and instantaneous response has never been collected before and can reveal much more than what the HQ Insiders set out to find. And, with an incoming of requests from several outlets to discuss the data collected from different angles, the team now has to just decide what direction they wish to pursue first.\u003C\/p\u003E\r\n\r\n\u003Cp\u003EThe CEO and founder of HQ, \u003Cstrong\u003ERus Yusupov\u003C\/strong\u003E, was even impressed by the team\u0026rsquo;s analysis and messaged them with words of encouragement, while even going as far as \u003Ca href=\u0022https:\/\/twitter.com\/rus\/status\/970719631542546432\u0022\u003Ereposting the story on his social media accounts\u003C\/a\u003E.\u003C\/p\u003E\r\n\r\n\u003Cp\u003EOne thing that is very unique compared to other analysis is the way in which the HQ Insiders are scraping the data. Milmont said, \u0026ldquo;We\u0026rsquo;re able to log quickly because of the automation we have developed using Python and SQL around data collection and cleaning.\u0026rdquo;\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u003Cstrong\u003EBeing Prepared to Answer\u003C\/strong\u003E\u003C\/p\u003E\r\n\r\n\u003Cp\u003EMelnick is a student of CSE Associate Professor \u003Ca href=\u0022https:\/\/www.cc.gatech.edu\/~dchau\/\u0022\u003E\u003Cstrong\u003EPolo Chau\u0026rsquo;s\u003C\/strong\u003E\u003C\/a\u003E CSE-6242-OAN course, Data and Visual Analytics, which introduces students to techniques and tools for analyzing and visualizing data at a scale.\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u0026ldquo;Without that class we would not have had a clear direction on how to support the Washington Post with our data, our ideas going forward to assist other media outlets, and having a presence on the internet,\u0026quot; said Melnick.\u003C\/p\u003E\r\n\r\n\u003Cp\u003E\u0026quot;It is very empowering, and we feel like what we know isn\u0026rsquo;t in a textbook, but knowledge through real-world trial and error. It is a very tough class, but when the opportunity knocks from the real world, we are better prepared to answer.\u003C\/p\u003E\r\n","summary":null,"format":"limited_html"}],"field_subtitle":"","field_summary":"","field_summary_sentence":[{"value":"Georgia Tech computing students create an investigative analysis with instantaneous question-response dataset."}],"uid":"34540","created_gmt":"2018-04-02 15:40:50","changed_gmt":"2018-04-09 02:53:52","author":"Kristen Perez","boilerplate_text":"","field_publication":"","field_article_url":"","dateline":{"date":"2018-04-02T00:00:00-04:00","iso_date":"2018-04-02T00:00:00-04:00","tz":"America\/New_York"},"extras":[],"hg_media":{"604578":{"id":"604578","type":"image","title":"HQ Insiders- Graph 2","body":null,"created":"1522685409","gmt_created":"2018-04-02 16:10:09","changed":"1522685409","gmt_changed":"2018-04-02 16:10:09","alt":"","file":{"fid":"230483","name":"HQInsidersgraph2.jpg","image_path":"\/sites\/default\/files\/images\/HQInsidersgraph2.jpg","image_full_path":"http:\/\/hg.gatech.edu\/\/sites\/default\/files\/images\/HQInsidersgraph2.jpg","mime":"image\/jpeg","size":1191732,"path_740":"http:\/\/hg.gatech.edu\/sites\/default\/files\/styles\/740xx_scale\/public\/images\/HQInsidersgraph2.jpg?itok=n6zU7jb-"}}},"media_ids":["604578"],"groups":[{"id":"47223","name":"College of Computing"},{"id":"50877","name":"School of Computational Science and Engineering"}],"categories":[],"keywords":[{"id":"177609","name":"HQ"},{"id":"177610","name":"HQ Insiders"},{"id":"177611","name":"OMSA"},{"id":"83261","name":"Polo Chau"},{"id":"76791","name":"GTPE"},{"id":"426","name":"isye"},{"id":"172588","name":"CoB"},{"id":"11559","name":"CSE computational science engineering"}],"core_research_areas":[{"id":"39431","name":"Data Engineering and Science"}],"news_room_topics":[],"event_categories":[],"invited_audience":[],"affiliations":[],"classification":[],"areas_of_expertise":[],"news_and_recent_appearances":[],"phone":[],"contact":[],"email":["kristen.perez@cc.gatech.edu"],"slides":[],"orientation":[],"userdata":""}}}