{"id":1801,"date":"2012-09-29T10:52:38","date_gmt":"2012-09-29T00:52:38","guid":{"rendered":"http:\/\/mappingonlinepublics.net\/?p=1801"},"modified":"2012-09-29T10:59:02","modified_gmt":"2012-09-29T00:59:02","slug":"a-quick-recap-of-twitter-research-approaches","status":"publish","type":"post","link":"https:\/\/mappingonlinepublics.net\/dev\/2012\/09\/29\/a-quick-recap-of-twitter-research-approaches\/","title":{"rendered":"A Quick Recap of Twitter Research Approaches"},"content":{"rendered":"<p>Any self-respecting Internet researcher will already be aware that the <a href=\"http:\/\/aoir.org\/\">Association of Internet Researchers<\/a> (AoIR) is <em>the<\/em> place to keep track of what\u2019s cutting edge in the field. On the <a href=\"http:\/\/aoir.org\/email-list\/\">AoIR mailing-list<\/a>, there\u2019s been an interesting discussion over the last few days about the <a href=\"http:\/\/listserv.aoir.org\/pipermail\/air-l-aoir.org\/2012-September\/026449.html\">available tools for tracking, capturing, and analysing <em>Twitter<\/em> data<\/a> \u2013 and since nobody had yet mentioned <em>yourTwapperkeeper<\/em> as a useful solution for many standard tasks, I\u2019ve just posted an overview of how we\u2019ve been approach such research.<\/p>\n<p>As I haven\u2019t had much time recently to post many updates on this Website about our methodological work, I thought I\u2019d cross-post my message to the mailing-list here. For those who\u2019ve come to <em>Mapping Online Publics<\/em> only recently, perhaps it also provides a useful summary of where we\u2019re at so far.<\/p>\n<p>&nbsp;<\/p>\n<p>Erica Ciszek asked:<\/p>\n<blockquote><p>&gt; I was wondering if anyone can suggest particular tools for aggregating and<br \/>\n&gt; analyzing Twitter content.<\/p><\/blockquote>\n<p>Maybe I\u2019m old-school on this, but I&#8217;m surprised no-one&#8217;s mentioned <em>yourTwapperkeeper<\/em> yet \u2013 in my experience, very straightforward to set up (all you need is a standard LAMP server setup to run it on), and fine for most standard <em>Twitter<\/em> capture tasks (e.g. tracking hashtags, keywords, specific users, etc.). It\u2019s open source and available here:<\/p>\n<blockquote><p><a href=\"https:\/\/github.com\/jobrieniii\/yourTwapperKeeper\">https:\/\/github.com\/jobrieniii\/yourTwapperKeeper<\/a><\/p><\/blockquote>\n<p>We\u2019ve made some modifications to more easily export datasets in CSV\/TSV-format datasets \u2013 see details here:<\/p>\n<blockquote><p><a href=\"http:\/\/mappingonlinepublics.net\/2012\/01\/09\/twapperkeeper-and-beyond-a-reminder\/\">http:\/\/mappingonlinepublics.net\/2012\/01\/09\/twapperkeeper-and-beyond-a-reminder\/<\/a><\/p><\/blockquote>\n<p>Personally, I don\u2019t trust most out-of-the-box <em>Twitter<\/em> analytics tools, and prefer to roll my own \u2013 for processing CSV\/TSV datasets containing <em>Twapperkeeper<\/em>-format data, I\u2019ve been using the scriptable command-line tool <em>Gawk<\/em> with great success. A collection of <em>Gawk<\/em> scripts for standard <em>Twapperkeeper<\/em> data processing tasks is available under a Creative Commons licence here:<\/p>\n<blockquote><p><a href=\"http:\/\/mappingonlinepublics.net\/2011\/06\/22\/gawk-scripts-for-processing-twitter-data-vol-1\/\">http:\/\/mappingonlinepublics.net\/2011\/06\/22\/gawk-scripts-for-processing-twitter-data-vol-1\/<\/a><\/p><\/blockquote>\n<p>Additionally, my \u2018Swiss army knife\u2019 <em>Gawk<\/em> script for extracting activity metrics from a <em>Twapperkeeper<\/em> dataset is here:<\/p>\n<blockquote><p><a href=\"http:\/\/mappingonlinepublics.net\/2012\/01\/31\/more-twitter-metrics-metrify-revisited\/\">http:\/\/mappingonlinepublics.net\/2012\/01\/31\/more-twitter-metrics-metrify-revisited\/<\/a><\/p><\/blockquote>\n<p>The question of developing standard, case-independent metrics for the description of <em>Twitter<\/em> activity patterns is something Stefan Stieglitz and I are taking up in two forthcoming papers (happy to share drafts \u2013 email me off-list). The keynote which Jean Burgess and I presented at the recent Conference on Science and the Internet foreshadows some of this discussion, though:<\/p>\n<blockquote><p>Axel Bruns and Jean Burgess. \u201c<a href=\"http:\/\/snurb.info\/files\/2012\/Notes%20towards%20the%20Scientific%20Study%20of%20Public%20Communication%20on%20Twitter.pdf\">Notes towards the Scientific Study of Public Communication on Twitter.<\/a>\u201d Keynote presented at the Conference on Science and the Internet, D\u00fcsseldorf, 4 Aug. 2012. (The slides and video of the presentation are here: <a href=\"http:\/\/snurb.info\/node\/1678\">http:\/\/snurb.info\/node\/1678<\/a>.)<\/p><\/blockquote>\n<p>Detailed notes on how we use these scripts to process <em>Twitter<\/em> data, and additional processing tools, are also on our Website \u2013 see <a href=\"http:\/\/mappingonlinepublics.net\/category\/twitter\/\">http:\/\/mappingonlinepublics.net\/category\/twitter\/<\/a> for more.<\/p>\n<p>For network visualisation, I recommend the open source software <em>Gephi<\/em>. My article in <em>Information, Communication &amp; Society<\/em> describes how I\u2019ve used <em>yourTwapperkeeper<\/em>, <em>Gawk<\/em> and <em>Gephi<\/em> to create dynamic visualisations of <em>Twitter<\/em> conversation networks:<\/p>\n<blockquote><p>Axel Bruns. \u201cHow Long Is a Tweet? Mapping Dynamic Conversation Networks on Twitter Using Gawk and Gephi.\u201d <em>Information, Communication &amp; Society<\/em>, 17 Nov. 2011. <a href=\"http:\/\/dx.doi.org\/10.1080\/1369118X.2011.635214\">http:\/\/dx.doi.org\/10.1080\/1369118X.2011.635214<\/a><\/p><\/blockquote>\n<p>For more sophisticated, \u2018big data\u2019 research (i.e. upwards of a few million tweets per dataset), the <em>yourTwapperkeeper<\/em> approach is less useful (the LAMP framework just isn&#8217;t built for big data), and you\u2019ll probably need to build your own customised solution. Eugene Liang and I discuss the pros and cons of both approaches in a recent article in First Monday (while we frame this in a crisis communication context, the discussion applies well beyond this):<\/p>\n<blockquote><p>Axel Bruns and Yuxian Eugene Liang. \u201c<a href=\"http:\/\/firstmonday.org\/htbin\/cgiwrap\/bin\/ojs\/index.php\/fm\/article\/viewArticle\/3937\/3193\">Tools and Methods for Capturing Twitter Data during Natural Disasters.<\/a>\u201d <em>First Monday<\/em> 17.4 (2012).<\/p><\/blockquote>\n<p>Hope that helps.<\/p>\n<p>&lt;insert obligatory plug for the \u201c<a href=\"https:\/\/www.conftool.com\/aoir-ir13\/index.php?page=browseSessions&amp;search=digital+data\">Digital Data \u2013 Lost, Found, and Made<\/a>\u201d panel at the upcoming <a href=\"http:\/\/ir13.aoir.org\/\">AoIR conference<\/a>, where I&#8217;m sure we can discuss the question of <em>Twitter<\/em> research methods some more as well&gt;<\/p>\n<!-- AddThis Advanced Settings generic via filter on the_content --><!-- AddThis Share Buttons generic via filter on the_content -->","protected":false},"excerpt":{"rendered":"<p>Any self-respecting Internet researcher will already be aware that the Association of Internet Researchers (AoIR) is the place to keep track of what\u2019s cutting edge in the field. On the AoIR mailing-list, there\u2019s been an interesting discussion over the last few days about the available tools for tracking, capturing, and analysing Twitter data \u2013 and &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/mappingonlinepublics.net\/dev\/2012\/09\/29\/a-quick-recap-of-twitter-research-approaches\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;A Quick Recap of Twitter Research Approaches&#8221;<\/span><\/a><\/p>\n<p><!-- AddThis Advanced Settings generic via filter on get_the_excerpt --><!-- AddThis Share Buttons generic via filter on get_the_excerpt --><\/p>\n","protected":false},"author":2,"featured_media":1324,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"nf_dc_page":"","footnotes":""},"categories":[175,176,8],"tags":[7,23,297,298,111],"class_list":["post-1801","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-capture","category-processing","category-twitter","tag-gawk","tag-gephi","tag-methods","tag-twitter","tag-yourtwapperkeeper","entry"],"_links":{"self":[{"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/posts\/1801","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/comments?post=1801"}],"version-history":[{"count":2,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/posts\/1801\/revisions"}],"predecessor-version":[{"id":1803,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/posts\/1801\/revisions\/1803"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/media\/1324"}],"wp:attachment":[{"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/media?parent=1801"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/categories?post=1801"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/tags?post=1801"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}