{"id":3145,"date":"2015-10-23T15:22:18","date_gmt":"2015-10-23T05:22:18","guid":{"rendered":"http:\/\/mappingonlinepublics.net\/?p=3145"},"modified":"2015-10-23T15:28:45","modified_gmt":"2015-10-23T05:28:45","slug":"anyone-for-some-quick-crowdsourced-twitter-research","status":"publish","type":"post","link":"https:\/\/mappingonlinepublics.net\/dev\/2015\/10\/23\/anyone-for-some-quick-crowdsourced-twitter-research\/","title":{"rendered":"Anyone for Some Quick Crowdsourced Twitter Research?"},"content":{"rendered":"<p>Taking a quick break from <a href=\"http:\/\/snurb.info\/taxonomy\/term\/154\">the AoIR 2015 liveblogging at <em>snurb.info<\/em><\/a>: <a href=\"http:\/\/snurb.info\/node\/2021\">today\u2019s presentation by Fabio Giglietto, Luca Rossi and Jiyoung Kim<\/a> got me thinking. They built on <a href=\"http:\/\/snurb.info\/files\/2012\/Quantitative%20Approaches%20to%20Comparing%20Communication%20Patterns%20on%20Twitter.pdf\">a paper by Stefan Stieglitz and me<\/a> which compared some basic properties of a large number of hashtag datasets (and some keyword-based datasets, too), and used these to classify different hashtag uses (mainly distinguishing between crisis events and media audiencing).<\/p>\n<p>Back then, we looked at the percentage of tweets containing URLs, and the percentage of tweets that were retweets, as well as the total number of tweets in each dataset:<\/p>\n<p><a href=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2015\/10\/image3.png\"><img decoding=\"async\" loading=\"lazy\" style=\"background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border-width: 0px;\" title=\"image\" src=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2015\/10\/image_thumb3.png\" alt=\"image\" width=\"519\" height=\"484\" border=\"0\" \/><\/a><br \/>\n<span style=\"font-size: x-small;\">From: Axel Bruns and Stefan Stieglitz. 
\u201c<a href=\"http:\/\/snurb.info\/files\/2012\/Quantitative%20Approaches%20to%20Comparing%20Communication%20Patterns%20on%20Twitter.pdf\">Quantitative Approaches to Comparing Communication Patterns on Twitter.<\/a>\u201d In Klaus Bredl, Julia H\u00fcnniger, and Jakob Linaa Jensen, eds., <a href=\"http:\/\/www.amazon.com\/gp\/product\/041581832X\/ref=as_li_ss_tl?ie=UTF8&amp;camp=1789&amp;creative=390957&amp;creativeASIN=041581832X&amp;linkCode=as2&amp;tag=snurbaxelbrun-20\"><em>Methods for Analyzing Social Media<\/em><\/a>. 20-44.<\/span><\/p>\n<p>I\u2019m keen to update that study with new data from more recent hashtags, and we\u2019ve already started to work through our own archived datasets to generate further metrics. But our datasets are limited to the research interests we\u2019ve pursued over time, and to Australian and international topics.<\/p>\n<p>So, I\u2019m wondering whether we could build this up to a much larger collection by taking a collaborative, crowdsourced approach: if anyone else out there has <em>Twitter<\/em> datasets from the past few years, could you run a handful of quick analyses over your archives and share the results? What we\u2019d need are:<\/p>\n<ul>\n<li>Hashtag(s) or keyword(s) used to capture the dataset<\/li>\n<li>Timeframe of capture (from\/to date)<\/li>\n<li>Total number of tweets<\/li>\n<li>Total number of tweets containing URLs \u2013 using the regular expression \/http\/<\/li>\n<li>Total number of tweets that are retweets \u2013 using the regular expression \/(\\&quot;@|RT @|MT @|via @)[A-Za-z0-9_]+\/<\/li>\n<\/ul>\n<p>You could leave those details in the comments attached to this post, or email them to me at a.bruns(at)qut.edu.au.<\/p>\n<p>This is an experiment, in the spirit of AoIR collegiality. 
Would anyone be interested in sharing the metrics for their datasets? In return, I\u2019d be very happy to include you as a contributing author in the paper we\u2019ll eventually develop from this. Thanks in advance!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Taking a quick break from the AoIR 2015 liveblogging at snurb.info: today\u2019s presentation by Fabio Giglietto, Luca Rossi and Jiyoung Kim got me thinking. They built on a paper by Stefan Stieglitz and me which compared some basic properties of a large number of hashtag datasets (and some keyword-based datasets, too), and used these to &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/mappingonlinepublics.net\/dev\/2015\/10\/23\/anyone-for-some-quick-crowdsourced-twitter-research\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Anyone for Some Quick Crowdsourced Twitter Research?&#8221;<\/span><\/a><\/p>\n<p><!-- AddThis Advanced Settings generic via filter on get_the_excerpt --><!-- AddThis Share Buttons generic via filter on get_the_excerpt 
--><\/p>\n","protected":false},"author":2,"featured_media":3143,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"nf_dc_page":"","footnotes":""},"categories":[176,8],"tags":[303,304,82,50,298],"class_list":["post-3145","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-processing","category-twitter","tag-analysis","tag-aoir-2015","tag-hashtags","tag-metrics","tag-twitter","entry"],"_links":{"self":[{"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/posts\/3145","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/comments?post=3145"}],"version-history":[{"count":5,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/posts\/3145\/revisions"}],"predecessor-version":[{"id":3150,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/posts\/3145\/revisions\/3150"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/media\/3143"}],"wp:attachment":[{"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/media?parent=3145"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/categories?post=3145"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/tags?post=3145"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}