{"id":2344,"date":"2013-07-08T10:00:05","date_gmt":"2013-07-08T00:00:05","guid":{"rendered":"http:\/\/mappingonlinepublics.net\/?p=2344"},"modified":"2013-07-09T17:43:04","modified_gmt":"2013-07-09T07:43:04","slug":"introducing-twitter-follower-accession-graphs","status":"publish","type":"post","link":"https:\/\/mappingonlinepublics.net\/dev\/2013\/07\/08\/introducing-twitter-follower-accession-graphs\/","title":{"rendered":"Introducing Twitter Follower Accession Graphs"},"content":{"rendered":"<p>It\u2019s been uncommonly quiet on this site for the last few weeks, as we\u2019ve been busy with various projects, and overseas for a range of research activities. This may well prove to be the calm before the storm, though \u2013 we have a number of very exciting new social media research outcomes in the pipeline, and <a href=\"http:\/\/mappingonlinepublics.net\/2013\/06\/27\/spill-how-twitter-reacted-to-the-labor-leadership-challenge\/\">our preliminary analysis of Twitter use in the latest Labor leadership spill<\/a> a couple of weeks ago was just the start.<\/p>\n<p>To rekindle our interest in new methodological advances in <em>Twitter<\/em> research, here\u2019s a new method which my colleagues Darryl Woodford, Troy Sadkowsky and I came across on Tony Hirst\u2019s fabulous blog <em><a href=\"http:\/\/blog.ouseful.info\/2013\/04\/05\/estimated-follower-accession-charts-for-twitter\/\">OUseful.info<\/a><\/em> a few months ago \u2013 and as Tony points out, there\u2019s also <a href=\"http:\/\/research.microsoft.com\/en-us\/um\/people\/jchayes\/papers\/timestamps.pdf\">a Microsoft Research paper<\/a> which pursues a similar approach, so this method may have been developed almost simultaneously by a number of researchers. What we\u2019re about to do is quite possibly more difficult to explain than to demonstrate, so let\u2019s start with an illustration which shows the followers of <span style=\"text-decoration: line-through;\">former<\/span> reinstated Australian Prime Minister Kevin Rudd\u2019s account @KRuddMP.<\/p>\n<p>(We gathered these data before Rudd\u2019s return to the top job, so any effects of the latest Labor leadership change won\u2019t show up here just yet \u2013 we\u2019ll examine these new changes in a future post, once the dust has settled.)<\/p>\n<p>The following graph draws on two key points. First, the list of current followers for any one account which is returned by the <em>Twitter<\/em> API is in (almost) perfect chronological order; in Rudd\u2019s case, from follower number one to follower 1.2 million or so (at the time we gathered the list). Following Tony\u2019s example, we call the position of each follower in the overall list their <em>follower accession number<\/em>. Second, through a different API request we are able to determine when each of these followers first joined <em>Twitter<\/em>, practically down to the second they joined \u2013 that\u2019s their <em>account creation date<\/em>. If we plot those two parameters against one another, this is what we get (as always, click on the images to access larger versions\u2026):<\/p>\n<p><a href=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2013\/06\/KRuddMP-plain2.png\"><img decoding=\"async\" style=\"background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border-width: 0px;\" title=\"KRuddMP-plain\" alt=\"KRuddMP-plain\" src=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2013\/06\/KRuddMP-plain_thumb2.png\" width=\"1204\" border=\"0\" \/><\/a><\/p>\n<p>It\u2019s immediately obvious that a curve emerges which looks suspiciously like a growth curve for Rudd\u2019s follower base. As <em>Twitter<\/em> tells us, Rudd joined on 18 October 2008 \u2013 quite possibly without using the @KevinRuddPM username at the time; that handle was revealed only on 13 November that year, <a href=\"http:\/\/www.lifehacker.com.au\/2008\/11\/pm_kevin_rudd_joins_twitter\/\">it seems<\/a>, and changed to @KRuddMP after he lost the leadership to Julia Gillard in June 2010 (the existing follower base remains unaffected by such renamings of a <em>Twitter<\/em> account). Soon after, he began to accumulate followers, and his numbers swelled rapidly from mid-2009 onwards.<\/p>\n<p>But why this pronounced curve, when (in principle) <em>Twitter<\/em> accounts of any age could become followers at any time? Well, here\u2019s the clever bit of logic that Tony Hirst came up with:<\/p>\n<ol>\n<li>None of the follower accounts could <em>become<\/em> followers before they themselves were created.<\/li>\n<li>Some of the follower accounts will have become followers quite shortly after they were created \u2013 within minutes or hours. (This is even more likely in recent times, since <em>Twitter<\/em> began recommending high-profile accounts that new users may find interesting to follow.)<\/li>\n<li>Thus, the lower edge of the graph above is a pretty good approximation of the size of the follower base at any one moment \u2013 and we\u2019re able to trace over time how an account\u2019s follower base has developed.<\/li>\n<\/ol>\n<p>In other words: from time to time, an account will follow @KRuddMP very shortly after it itself has been created. This means that any new followers joining after that account <em>must<\/em> have followed @KRuddMP at a time <em>after<\/em> that date; we are therefore able to calculate an approximate join date for those accounts even if their own creation dates are well in the past. Going through the list of followers from number one to number 1.2 million, noting down each most recent account creation date we come across and assigning it as a join date to all later followers until we come across an even more recent account creation date, we are effectively tracing @KRuddMP\u2019s follower growth curve, then. <em>(See \u2013 I did say this was easier to illustrate than to explain\u2026)<\/em><\/p>\n<p>As long as the target account itself remains the same, this works, incidentally, even if the account has been renamed \u2013 as is the case here, since we know that @KevinRuddPM changed its name to @KRuddMP following the 2010 ALP leadership spill. Interestingly, a pronounced vertical line also emerges \u2013 most of Rudd\u2019s follower accounts were created after February 2009 or so, while the space to the left of that line is comparatively empty. This isn\u2019t unique to Rudd, however: overall, most <em>Twitter<\/em> accounts in the world were created after that time, as the site emerged as a mass medium, so the darker shading to the right of that divide simply shows when <em>Twitter<\/em> itself became a popular social media platform.<\/p>\n<p>There is one significant <em>caveat<\/em>, however: gathering an account\u2019s list of followers, we only gather information about those followers who <em>at present<\/em> still remain as followers. If there was a large group of followers who left at some point before we gathered our data, we will not find any evidence of that exodus; we can only see when those followers who have stuck with the target account until now began to follow it. (Also, if someone followed, unfollowed, and then re-followed an account, we will only be able to see their presence since the last time they re-followed.)<\/p>\n<p>There\u2019s also a smaller issue with the <em>Twitter<\/em> API which seems to result in some recent followers to be placed (erroneously) at the start of the list \u2013 in Rudd\u2019s case, there are a few accounts which were created in 2012 that show up amongst Rudd\u2019s first few hundred followers, which is obviously impossible. So, we\u2019ll ignore the first few followers in the list from now on. On that basis, then, If we isolate the growth curve only, here\u2019s what we\u2019re left with:<\/p>\n<p><a href=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2013\/06\/KRuddMP-growth-curve.png\"><img decoding=\"async\" style=\"background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border-width: 0px;\" title=\"KRuddMP-growth curve\" alt=\"KRuddMP-growth curve\" src=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2013\/06\/KRuddMP-growth-curve_thumb.png\" width=\"1204\" border=\"0\" \/><\/a><\/p>\n<h1>Spam Followers?<\/h1>\n<p>But something\u2019s not quite right yet. The Rudd graph above shows a <em>very<\/em> rapid growth in followers which seems to begin as abruptly as it ends. Between late June 2009 and late January 2010, @KevinRuddPM (as it was known then) picked up a whopping 700,000 new followers, for no obvious reason; its follower accession rate was substantially smaller both before and after this time. Here\u2019s the account\u2019s estimated follower growth per day:<\/p>\n<p><a href=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2013\/06\/KRuddMP-growth-per-day2.png\"><img decoding=\"async\" style=\"background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border-width: 0px;\" title=\"KRuddMP-growth per day\" alt=\"KRuddMP-growth per day\" src=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2013\/06\/KRuddMP-growth-per-day_thumb2.png\" width=\"1204\" border=\"0\" \/><\/a><\/p>\n<p>A closer look at some of the accounts which joined during this time reveals some unusual patterns. Many of the new follower accounts began to follow Rudd essentially within minutes of being created; many follow only a handful of other accounts, and usually those of celebrities such as @PaulaAbdul and @CindyCrawford; many never bothered to change their Twitter user avatar away from the default \u2018egg\u2019; many posted only a handful of times, often promoting mail-order pharmaceuticals or \u2018guaranteed followback\u2019 schemes; and many haven\u2019t tweeted again since 2011 or so. In short: these clearly aren\u2019t genuine followers, but spam accounts which are attempting to give themselves the veneer of respectability by following a range of leading <em>Twitter<\/em> users (with an apparent preference for accounts whose identity has been verified by <em>Twitter<\/em>).<\/p>\n<p><strong><span style=\"color: #ff0000;\">Update:<\/span><\/strong> Andrew Richardson (<a href=\"http:\/\/twitter.com\/Andrew303\">@Andrew303<\/a>) has helpfully pointed out that <em>Twitter<\/em> introduced the first iteration of its Suggested User List in early 2009, featuring a handful of major \/ important \/ notable <em>Twitter<\/em> users whom new users were all but forced to follow as part of the sign-on process. Rudd was one of the very few Australians on that (global) list, and \u2013 obviously \u2013 benefitted significantly. I strongly suspect that <em>Twitter <\/em>spammers also followed these accounts disproportionately in order to look \u2018legit\u2019.<\/p>\n<p>Here\u2019s a graph which distinguishes the number of very newly created accounts (created an hour ago or less) that followed @KevinRuddPM \/ @KRuddMP from the rest of his new followers each day. The result is striking: between late June 2009 and late January 2010, the influx of such new followers is massive, peaking at over 5,000 followers per day. (And remember that we\u2019re only seeing those of these spam accounts which still exist today; many others may have been deleted by Twitter for spamming in the meantime.)<\/p>\n<p><a href=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2013\/06\/KRuddMP-spam-followers-per-day2.png\"><img decoding=\"async\" style=\"background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border-width: 0px;\" title=\"KRuddMP-spam followers per day\" alt=\"KRuddMP-spam followers per day\" src=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2013\/06\/KRuddMP-spam-followers-per-day_thumb2.png\" width=\"1204\" border=\"0\" \/><\/a><\/p>\n<p><strong>To be perfectly clear about this (and avoid any <a href=\"http:\/\/www.menzieshouse.com.au\/2011\/09\/exclusive-investigation-kevin-rudd-buying-twitter-followers-to-boost-leadership-bid.html\">conspiracy theorists<\/a> or lazy journalists who want to turn this into a story about \u201cKevin Rudd\u2019s fake followers\u201d): I\u2019m not suggesting here that Rudd \u2018invited\u2019 or even \u2018bought\u2019 these spam followers in order to boost his numbers and appear more liked. Rather, the opposite is most likely true: these spam accounts followed Rudd precisely <em>because<\/em> he is a prominent <em>Twitter<\/em> user, with a verified account \u2013 following his account makes <em>them<\/em> look more legit. <span style=\"color: #ff0000;\">Update:<\/span> plus, as Andrew has pointed out, many new users followed Rudd simply because <em>Twitter<\/em> itself told them to, as part of the sign-on process.<\/strong><\/p>\n<p>The kinks at the start and end of this period of spam follower influx are very pronounced; if we eliminate the spammers (and assume there are no genuine reasons for a significant increase and slowdown in Rudd\u2019s follower accession rate at these points), the corrected graph should show a much smoother growth curve. A little trial and error reveals that we can remove the majority of suspicious followers by disregarding any accounts which followed Rudd within 90 minutes of their own creation; applied to the entire dataset of over 1.2 million followers, this removes more than half of Rudd\u2019s followers.<\/p>\n<h1>Tracking Events through the Accession Timeline<\/h1>\n<p>Based on this approach, then, it now becomes possible to trace the impact of events in the target account\u2019s timeline on their follower base \u2013 at least where they\u2019ve had a positive impact, since we can\u2019t identify unfollowings. After dropping any users who followed Rudd within 90 minutes of creating their accounts (removing most spam accounts, but probably also a number of genuine followers), we\u2019re left with about 570,000 followers, whose accession is distributed as follows:<\/p>\n<p><a href=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2013\/06\/KRuddMP-growth-curve-corrected2.png\"><img decoding=\"async\" style=\"background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border-width: 0px;\" title=\"KRuddMP-growth curve (corrected)\" alt=\"KRuddMP-growth curve (corrected)\" src=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2013\/06\/KRuddMP-growth-curve-corrected_thumb2.png\" width=\"1204\" border=\"0\" \/><\/a><\/p>\n<p>At first glance, this may not look like much: from early 2009 onwards, Rudd\u2019s follower accession curve looks fairly steady and uneventful. But the devil is in the detail. Switching to a view which shows the estimated number of new followers per day, we can clearly identify several notable spikes in Rudd\u2019s follower accession numbers. These are closely aligned with various recent events, including the 2010, 2012, and 2013 leadership spills (and in the case of 2012, Rudd\u2019s resignation as Foreign Minister which preceded it). There\u2019s also a notable lull in follower accession following the 2010 election which (eventually) returned Julia Gillard as Prime Minister, indicating perhaps a momentary honeymoon for Gillard and a corresponding lack of interest in Rudd\u2019s return as PM. Interestingly, we can also make out a sharp spike on the day that Australian media reported that <a href=\"http:\/\/www.sbs.com.au\/news\/article\/1586904\/Rudd-reaches-million-followers-on-Twitter\">@KRuddMP had surpassed the 1 million followers mark<\/a> \u2013 followers beget followers, it seems.<\/p>\n<p><a href=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2013\/06\/KRuddMP-growth-per-day-corrected.png\"><img decoding=\"async\" style=\"background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border-width: 0px;\" title=\"KRuddMP-growth per day (corrected)\" alt=\"KRuddMP-growth per day (corrected)\" src=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2013\/06\/KRuddMP-growth-per-day-corrected_thumb.png\" width=\"1204\" border=\"0\" \/><\/a><\/p>\n<p>It\u2019s not in isolation that this approach is most powerful, though. So, <a href=\"http:\/\/mappingonlinepublics.net\/?p=2408\">in a follow-up post<\/a> we\u2019ll look at the comparative <em>Twitter<\/em> follower accession rates of a greater number of leading Australian politicians, to see how they may reflect the waxing and waning fortunes of the various political parties. Stay tuned!<\/p>\n<p>Finally: I\u2019m sorry to say this, but contrary to what we usually do at Mapping Online Publics we won\u2019t be able to share the tools that we\u2019ve used to create these accession graphs \u2013 for the moment, they\u2019re built to run on very specific infrastructure and they wouldn\u2019t be much use to anyone outside of the project team. With a little knowledge of the <em>Twitter<\/em> API, and a lot of patience in gathering the data, you should be able to replicate this approach reasonably easily, though. Again, I recommend Tony Hirst\u2019s excellent site <em><a href=\"http:\/\/blog.ouseful.info\/2013\/04\/05\/estimated-follower-accession-charts-for-twitter\/\">OUseful.info<\/a><\/em> for more on this method.<\/p>\n<!-- AddThis Advanced Settings generic via filter on the_content --><!-- AddThis Share Buttons generic via filter on the_content -->","protected":false},"excerpt":{"rendered":"<p>It\u2019s been uncommonly quiet on this site for the last few weeks, as we\u2019ve been busy with various projects, and overseas for a range of research activities. This may well prove to be the calm before the storm, though \u2013 we have a number of very exciting new social media research outcomes in the pipeline, &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/mappingonlinepublics.net\/dev\/2013\/07\/08\/introducing-twitter-follower-accession-graphs\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;Introducing Twitter Follower Accession Graphs&#8221;<\/span><\/a><\/p>\n<p><!-- AddThis Advanced Settings generic via filter on get_the_excerpt --><!-- AddThis Share Buttons generic via filter on get_the_excerpt --><\/p>\n","protected":false},"author":2,"featured_media":2368,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[180,175,173,176,8],"tags":[253,254,10,251,252,11,298],"class_list":["post-2344","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-analysis","category-capture","category-politics-2","category-processing","category-twitter","tag-kruddmp","tag-accession","tag-australia","tag-followers","tag-kevin-rudd","tag-politics","tag-twitter","entry"],"_links":{"self":[{"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/posts\/2344","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/comments?post=2344"}],"version-history":[{"count":11,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/posts\/2344\/revisions"}],"predecessor-version":[{"id":2416,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/posts\/2344\/revisions\/2416"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/media\/2368"}],"wp:attachment":[{"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/media?parent=2344"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/categories?post=2344"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/tags?post=2344"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}