{"id":2816,"date":"2014-08-04T08:30:00","date_gmt":"2014-08-03T22:30:00","guid":{"rendered":"http:\/\/mappingonlinepublics.net\/?p=2816"},"modified":"2014-08-04T20:39:36","modified_gmt":"2014-08-04T10:39:36","slug":"first-steps-in-exploring-the-australian-twittersphere","status":"publish","type":"post","link":"https:\/\/mappingonlinepublics.net\/dev\/2014\/08\/04\/first-steps-in-exploring-the-australian-twittersphere\/","title":{"rendered":"First Steps in Exploring the Australian Twittersphere"},"content":{"rendered":"<p><em>Twitter<\/em> is widely used in Australia, but we don\u2019t actually know such a great deal about the structure and dynamics of the Australian Twittersphere. Back in 2011\/12, <a href=\"http:\/\/mappingonlinepublics.net\/2012\/04\/01\/many-maps-of-the-australian-twittersphere\/\">our research began to identify Australian Twitter users and map their follower\/followee connections<\/a> in order to develop a better understanding of the structure of the network and from this determine some of the key themes and topics driving activity in the Australian Twittersphere, and we\u2019re currently in the process of substantially extending this work. In this post I\u2019m starting to share some first findings from this work.<\/p>\n<h1>Methods<\/h1>\n<p>First things first: here\u2019s our methodology for getting to this point. Over the course of several months in 2013, the tools developed by <a href=\"http:\/\/socialmedia.qut.edu.au\/2013\/06\/25\/towards-data-science\/\">our data scientist Troy Sadkowsky<\/a> used the Twitter API to access the publicly available profile information for each account then in existence; we simply pinged every user ID from 0 through to (at that point) upwards of 2 billion, and recorded the information returned. This resulted in data for some 750 million accounts \u2013 the size of the global <em>Twitter<\/em> userbase (or more precisely, account base) around September 2013. (We\u2019ll share some analysis of the global trends in <em>Twitter<\/em> account sign-ups in a separate post in the near future.) This comprehensive snapshot of global <em>Twitter<\/em> accounts provides us with an opportunity to go looking specifically for Australian users. To do so, we drew on three key elements of each user profile: the free-text profile description and location fields as entered by the account creator, as well as the profile timezone they chose from the pull-down menu of presets offered by <em>Twitter<\/em>. On the basis of the latter, we selected all users who had chosen one of the eight state-based Australian timezone options, while for the former two fields, we developed a long list of search terms relating to Australian towns, cities, and states, and to Australia itself, using a number of common variations. Any account that matched our criteria for \u201cAustralianness\u201d in any of these three fields has been included in our selection. To go through the full list of search terms would take up another post, but we worked with a list of the 50-odd largest cities in Australia, added in a handful of popular variations, included the state names and their abbreviations, and also used terms such as \u201cAustralia\u201d, \u201cStralya\u201d, \u201cdown under\u201d, and others. Following a test run, we further refined these terms, to include popular misspellings (\u201cAustalia\u201d, \u201cTasmainia\u201d) and remove false positives. This turned out to be a somewhat time-consuming exercise: many place names in Australia are re-used from Europe (\u201cPerth\u201d, \u201cIpswich\u201d) or duplicated in other new world countries (Brisbane, California; Victoria, British Columbia); some Australian place names also appear in popular media (some users claim to be from the \u201cCity of Townsville\u201d or indeed the \u201cCiudad de Townsville\u201d in homage to the <em>Powerpuff Girls<\/em>, or from <em>Finding Nemo<\/em>\u2019s \u201c42 Wallaby Way, Sydney\u201d). Where possible we\u2019ve filtered out any false positives which could be clearly identified. In the end, this process of filtering the total dataset of over 750 million <em>Twitter<\/em> accounts left us with some 2.8 million accounts whom we are confident to classify as \u2018Australian\u2019 for the purposes of this study. For many of these, we are also able to assign a likely state and\/or city, based on which of our search terms helped identify the account; here, we give greatest credence to the information contained in the location field of the <em>Twitter<\/em> profile, followed by description and timezone. Where we identified users <em>only<\/em> based on their timezone, we have assigned a state, but have refrained from assigning them to the state\u2019s capital city. Inevitably, some false positives will remain in our dataset, and some accounts will be miscategorised \u2013 \u201cSydneysider now living in Melbourne\u201d or \u201cAustralian in New York\u201d may lead to false location assignments, and descriptions like \u201cKorean student in Brisbane\u201d or even \u201cDreaming of travelling through Australia\u201d would have matched our search terms, but do not relate to the accounts of Australian users in a narrow sense. However, given the size of the total dataset our best-match approach using automated processes is the best option available to us, and I\u2019d guess that some 90-95% of the accounts we\u2019ve matched are genuine Australian users: either Australians in Australia, Australians elsewhere in the world, or non-Australians living in Australia. The outliers from this population are likely to show up in our further analysis, too. There will also be some false negatives, of course: accounts which give no indication of their Australian connections anywhere in their location, description, or timezone details (including users who have filled in none of their profile details at all). It seems likely that the greatest number of these will be amongst the most recently registered accounts (whose owners may not yet have had a chance to fully customise their <em>Twitter<\/em> settings), so we\u2019ll largely ignore this group for now \u2013 we\u2019ll re-run our survey of the total <em>Twitter<\/em> userbase at some point in the future to examine how these accounts may have developed, as well as to gather data on the accounts which were created after our initial data-gathering exercise finished in September 2013.<\/p>\n<h1>Findings<\/h1>\n<p>By the end of August 2013, then, the Australian Twittersphere included some 2.79 million accounts, by our criteria. <em>Per capita<\/em>, using <a href=\"http:\/\/www.abs.gov.au\/AUSSTATS\/abs@.nsf\/allprimarymainfeatures\/4C1B7DF31E5FD78FCA257CFB0014E8B2?opendocument\">the Australian Bureau of Statistics\u2019 figures for September 2013<\/a>, this would translate to a 12% sign-up rate, though that figure must be viewed with some caution: some <em>Twitter<\/em> users will operate multiple accounts (e.g. for private and professional use), while in other cases several users will share the same group account. This is why we\u2019re careful here to speak of 2.79 million <em>accounts<\/em>, rather than users. This figure is in line with <a href=\"http:\/\/www.socialmedianews.com.au\/social-media-statistics-australia-june-2014\/\">existing reports and guesstimates for the size of the Australian Twittersphere<\/a>, if somewhat below <a href=\"http:\/\/mumbrella.com.au\/nielsen-launches-tv-twitter-ratings-australia-215590\">the 4 million Australian accounts that Twitter, Inc. itself apparently boasted some months ago<\/a>. Figures from the company itself should always be taken with a grain of salt, of course; they\u2019re largely released for corporate promotion reasons, and may well reflect the total number of Australian-based accounts ever created, rather than the number of accounts which are still in existence at present (which is what we measured). On the other hand, there is also an unknown number of accounts which our methods would not identify as Australian, based on publicly available profile details, but which Twitter, Inc. (which would have identified the IP address from which a <em>Twitter<\/em> account was created) would classify as Australian. This also explains some of the discrepancy in numbers. Here\u2019s how that population has grown month by month over the seven years covered by our dataset (click on the images for larger versions): <a href=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2014\/07\/Australian-Twitter-Accounts.png\"><img loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border: 0px;\" title=\"Australian Twitter Accounts\" src=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2014\/07\/Australian-Twitter-Accounts_thumb.png\" alt=\"Australian Twitter Accounts\" width=\"609\" height=\"484\" border=\"0\" \/><\/a> From a slow start over the first couple of years (which is similar outside of Australia, too), there\u2019s finally a sudden and rapid rise in new registrations per month in early 2009, peaking at over 100,000 new account registrations each in March and April 2009. (And there may well have been more than this: the 100,000+ accounts we see joining in these months are only those which were still in existence when we gathered our data in late 2013, of course.) From this early excitement, things slow down considerably towards the end of 2009 \u2013 and then trends start to point upwards again: the average number of new accounts joining per month during the following years is somewhere around 40-50,000. Finally, there is a substantial increase in new registrations in August 2013; this may be partly related to the impending federal election, but probably also reflects the fact that <em>Twitter<\/em>\u2019s spam bot-checking systems may not yet have had a chance to remove any offending new accounts. We should also note, though, that what our data cannot (yet) tell us is the number of accounts which are being deleted each month, and how those deletions compare to the influx of new accounts. We\u2019ll have a better indication of this after the next iteration of our survey, which will allow us to examine the discrepancies between the two datasets: accounts present in the September 2013 dataset but absent from the new iteration must have been deleted (by their owners, or by Twitter, Inc.) in the meantime. State-by-state patterns vary quite considerably at times. There are unusual spikes in ACT and Queensland account registrations between April and September 2012, for example which do not appear to be motivated by specific local events; ACT sign-ups per month rise from below 1,000 to over 4,000 accounts during that period, for example. From a preliminary review of the accounts which joined during that time, it appears that a considerable number of them belong to fans of The Janoskians, One Direction, and other teen bands, so perhaps there was a concerted effort by some of these bands to get their fans on <em>Twitter<\/em>? <a href=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2014\/07\/Australian-Twitter-Accounts-by-State.png\"><img loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border: 0px;\" title=\"Australian Twitter Accounts (by State)\" src=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2014\/07\/Australian-Twitter-Accounts-by-State_thumb.png\" alt=\"Australian Twitter Accounts (by State)\" width=\"605\" height=\"484\" border=\"0\" \/><\/a> Other spikes are clearly driven by more sinister motives. The large spike in generically \u2018Australian\u2019 accounts in January 2013 is caused almost entirely by a large number of spam bots being created at virtually the same time, for example: of the 1,106 new accounts on 16 January 2013 alone, we counted 170 accounts claiming to be \u201cAustralia&#8217;s support member for the Global Information Network\u201d; 153 offering \u201cAustralian Business for Sale listings\u201d; 155 promoting \u201csoftware and services in Singapore. Australia. China and Japan\u201d; and 164 accounts claiming to be an \u201cIndependent Mortgage Broker in Australia\u201d \u2013 that\u2019s almost two thirds of the \u2018Australian\u2019 accounts for that day. Clearly <em>Twitter<\/em>\u2019s spam account filters still have some way to go. But genuine events in the world also result in increased sign-ups. During the first quarter of 2011, for example, we see a considerable spike in new Queensland-based accounts on 11 and 12 January, <a href=\"http:\/\/mappingonlinepublics.net\/2011\/01\/17\/the-queensland-floods-on-twitter-a-brief-first-look\/\">as floodwaters threaten inner-city Brisbane<\/a>, and during the following days; in Victoria, New South Wales, and other states the sign-up rate also increases notably. Similarly, as a devastating earthquake hits Christchurch, New Zealand, on 22 February, Australians also sign up in larger numbers than usual. The pattern does not repeat (other than perhaps in Queensland, once again) following the 11 March earthquake and tsunami on the east coast of Japan, however. <a href=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2014\/07\/Australian-Twitter-Accounts-Q1-2011.png\"><img loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border: 0px;\" title=\"Australian Twitter Accounts (Q1-2011)\" src=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2014\/07\/Australian-Twitter-Accounts-Q1-2011_thumb.png\" alt=\"Australian Twitter Accounts (Q1-2011)\" width=\"610\" height=\"484\" border=\"0\" \/><\/a> The graph above also shows a considerable dip in new registrations on 18 February 2011 \u2013 this may well be due to an outage in <em>Twitter<\/em>\u2019s account registration systems. The geographical distribution of these accounts should necessarily be treated with a certain degree of caution, given the vagaries of correctly identifying cities and states from the free text provided by users in the location and description fields. However, the patterns we\u2019re able to determine from our best guess at the likely location of each user do reflect both the overall distribution of the Australian population and the relative likelihood (based on infrastructural and socioeconomic factors) of local residents joining <em>Twitter<\/em> that we would expect to see: <a href=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2014\/07\/Australian-Twitter-Accounts-geo.png\"><img loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border: 0px;\" title=\"Australian Twitter Accounts (geo)\" src=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2014\/07\/Australian-Twitter-Accounts-geo_thumb.png\" alt=\"Australian Twitter Accounts (geo)\" width=\"610\" height=\"484\" border=\"0\" \/><\/a> The major population centres are clearly leading the way. Sign-up rates <em>per capita<\/em> seem to be strongest in the state capitals and on the Gold Coast, but this may be an artefact of our approach, which focussed on identifying mentions of the 50-odd major population centres in Australia in the location and description fields of users\u2019 <em>Twitter<\/em> profiles. Because of the greater national and international recognition of such centres, city users may state that they\u2019re from state capitals while those from small rural and regional locations might just mention their state. In a further iteration of our work, we\u2019ll check against a longer list of localities in Australia, and the patterns may well change. We\u2019re on more solid ground when we examine the sign-up rates for each state. This aggregates users who name specific cities with those who only specify a state, and accounts for some 2.4 million of our total 2.8 million identified accounts \u2013 about 420,000 accounts we identified as \u2018Australian\u2019 referenced only generic terms (\u201cAustralia\u201d, \u201cdown under\u201d, etc.), but did not include any more specific location details. <a href=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2014\/07\/Australian-Twitter-Accounts-State-and-City.png\"><img loading=\"lazy\" decoding=\"async\" style=\"background-image: none; padding-left: 0px; padding-right: 0px; display: inline; padding-top: 0px; border: 0px;\" title=\"Australian Twitter Accounts (State and City)\" src=\"https:\/\/mappingonlinepublics.net\/dev\/wp-content\/uploads\/2014\/07\/Australian-Twitter-Accounts-State-and-City_thumb.png\" alt=\"Australian Twitter Accounts (State and City)\" width=\"608\" height=\"484\" border=\"0\" \/><\/a> For most states, the sign-up rate ranges between 8 and 11 per cent, with Queensland and (perhaps somewhat surprisingly) the Northern Territory taking the lead of this group. There are likely to be any number of factors which have resulted in these slight differences in <em>Twitter<\/em> adoption across the country; for Queensland, for example, the well-publicised utility of <em>Twitter<\/em> during recent natural disasters may well have contributed to an above-average take-up. If the 420,000 accounts which we could not allocate to any specific state were distributed proportional to the states\u2019 population figures, this would boost each sign-up rate by another 1.8 percentage points, incidentally. But the major story here is of course the ACT, which records a whopping <em>per capita<\/em> take-up rate of 30%. We\u2019ll have to look more closely into what factors are responsible for this pattern \u2013 but so far we have not seen any indications that an unusually large number of false positives have slipped through our net. There are, however, unusually many accounts whose only identifying feature is their ACT timezone setting, and it is always possible that people from other UTC+10 timezones (for example in the northern hemisphere) might have chosen the ACT timezone rather than searching for their own options in the pull-down menu available on the <em>Twitter<\/em> site. Another factor that might drive the abnormally high number of accounts with some relation to the ACT is a combination of the socioeconomic make-up of the ACT population, and the fact that (as the seat of the federal government) there will be a very substantial number of organisational accounts, politicians, journalists, public servants, and other likely <em>Twitter<\/em> adopters in Canberra and surrounds. Additionally, there may also be a significant discrepancy between the number of formally registered ACT residents and the number of people who actually live and\/or work in Canberra at least part of the time. If we break down state numbers per city, the capital cities unsurprisingly account for the majority of <em>Twitter<\/em> accounts. There are also many accounts for which a city couldn\u2019t be determined \u2013 these are accounts which merely chose an Australian timezone, which named only their state in the location or description field, or which stated a location other than the 50-odd most populous Australian cities we searched for. Further, though, it is also notable that Queensland\u2019s <em>Twitter<\/em> population appears to be most geographically dispersed: in addition to the Gold Coast (which is a major population centre in its own right, of course), it also boasts the widest range of other centres with <em>Twitter<\/em> userbases numbering above 1,000 accounts. This is largely reflecting the population distribution across various regional centres in central and far north Queensland, but may also point to the useful role <em>Twitter<\/em> now regularly plays during Queensland\u2019s summer storm season. So much for a first overview of the overall figures. Over the next months, we\u2019ll delve much more deeply into the patterns which this massive dataset of Australian <em>Twitter<\/em> accounts reveals \u2013 and we\u2019ll also develop a number of approaches to mapping the follower\/followee networks of this <em>Twitter<\/em> population.<\/p>\n<!-- AddThis Advanced Settings generic via filter on the_content --><!-- AddThis Share Buttons generic via filter on the_content -->","protected":false},"excerpt":{"rendered":"<p>Twitter is widely used in Australia, but we don\u2019t actually know such a great deal about the structure and dynamics of the Australian Twittersphere. Back in 2011\/12, our research began to identify Australian Twitter users and map their follower\/followee connections in order to develop a better understanding of the structure of the network and from &hellip; <\/p>\n<p class=\"link-more\"><a href=\"https:\/\/mappingonlinepublics.net\/dev\/2014\/08\/04\/first-steps-in-exploring-the-australian-twittersphere\/\" class=\"more-link\">Continue reading<span class=\"screen-reader-text\"> &#8220;First Steps in Exploring the Australian Twittersphere&#8221;<\/span><\/a><\/p>\n<p><!-- AddThis Advanced Settings generic via filter on get_the_excerpt --><!-- AddThis Share Buttons generic via filter on get_the_excerpt --><\/p>\n","protected":false},"author":2,"featured_media":2809,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[175,176,8],"tags":[278,10,279,277,298,276],"class_list":["post-2816","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-capture","category-processing","category-twitter","tag-adoption","tag-australia","tag-geographic-distribution","tag-history","tag-twitter","tag-userbase","entry"],"_links":{"self":[{"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/posts\/2816","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/comments?post=2816"}],"version-history":[{"count":4,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/posts\/2816\/revisions"}],"predecessor-version":[{"id":2820,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/posts\/2816\/revisions\/2820"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/media\/2809"}],"wp:attachment":[{"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/media?parent=2816"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/categories?post=2816"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mappingonlinepublics.net\/dev\/wp-json\/wp\/v2\/tags?post=2816"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}