First look at analysing threaded Twitter discussions from large archives using NodeXL #moocmooc

This post is a bit messy. I got caught trying out too many ideas at once, but hopefully you'll still find it useful.

Sheila recently posted Analytics and #moocmooc, in which she collects some thoughts on the role of analytics in courses and how some of the templates I’ve developed can give you an overview of what is going on. As I commented on the post, I still think there is more work to do to make archives from event hashtags more useful, even if that just means surfacing the tweets that got the most ‘reaction’.

There are three main reactions that are relatively easy to extract from Twitter: retweets, favourites and replies. There are issues with what these actions actually indicate, as well as with the reliability of the data. For example, users ‘favourite’ tweets for different reasons, and not everyone uses a Twitter client that can send a proper reply, or uses the reply button when there is one (if you start a message as an @reply without clicking a reply button, Twitter loses the thread).

But let's ignore these issues for now and start with the hypothesis that a reaction to a tweet is worth further study. Let's also, for now, narrow the focus to threaded discussions. How might we do this? As mentioned in Sheila's post, we’ve been archiving #moocmooc tweets using the Twitter Archiving Google Spreadsheet (TAGS v3). As well as the tweet text, other metadata is recorded, including a unique identifier for each tweet and, where available, the id of the tweet it is replying to.

Google Spreadsheet columns

We could just filter the spreadsheet for rows with reply ids, but let's take a visual approach. Having downloaded the data as an Excel file, we can open it using the free add-in NodeXL.
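For comparison, the plain filtering route could be sketched in a few lines of Python. This is only a rough sketch under assumptions: it supposes the archive has been exported as CSV, and the column names `id_str` and `in_reply_to_status_id_str` are my guesses at the headers, so check them against your own sheet. The sample rows are made up.

```python
import csv
import io

# Assumed column names; check them against the header row of your own export.
ID_COL = "id_str"
REPLY_COL = "in_reply_to_status_id_str"


def reply_edges(rows):
    """Return (reply_id, parent_id) pairs for rows that record a reply id."""
    edges = []
    for row in rows:
        parent = (row.get(REPLY_COL) or "").strip()
        if parent:  # rows without a reply id are not part of a thread
            edges.append((row[ID_COL].strip(), parent))
    return edges


# Tiny made-up sample standing in for a CSV export of the archive.
SAMPLE = """id_str,from_user,text,in_reply_to_status_id_str
1,alice,What is a MOOC?,
2,bob,@alice a big open course,1
3,carol,Unrelated tweet,
4,alice,@bob thanks!,2
"""

edges = reply_edges(csv.DictReader(io.StringIO(SAMPLE)))
print(edges)
```

Each pair is a reply and the tweet it answers, which is the same kind of edge list a graphing tool needs as input.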

NodeXL allows us to graph connections, in this case conversation threads. It also allows us to do other useful things, like grouping conversations together to make further analysis easier. Skipping over the detail, here’s what you get if you condense 6,500 #moocmooc tweets into grouped conversations.

moocmooc grouped conversations
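One way to get a similar grouping outside NodeXL is to treat every reply as an edge and collect the connected components, so that each component is one conversation. The sketch below does this with a small union-find. It is an illustration of the idea, not necessarily the grouping method NodeXL applied here, and the edge list is made up.

```python
from collections import defaultdict


def group_conversations(edges):
    """Group tweet ids into conversations (connected components of reply edges)."""
    parent = {}

    def find(x):
        # Follow parent pointers to the component root, shortening the path as we go.
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for reply_id, parent_id in edges:
        root_a, root_b = find(reply_id), find(parent_id)
        if root_a != root_b:
            parent[root_a] = root_b  # merge the two threads

    groups = defaultdict(set)
    for node in list(parent):
        groups[find(node)].add(node)
    # Largest conversations first; ids sorted within each conversation.
    return sorted((sorted(g) for g in groups.values()), key=len, reverse=True)


# Made-up (reply_id, parent_id) pairs.
edges = [("2", "1"), ("4", "2"), ("7", "6")]
conversations = group_conversations(edges)
print(conversations)
```

Note that a parent id can point at a tweet that never made it into the archive. It still ends up as a node in the conversation, which is worth remembering when counting tweets per thread.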

This is more than just a pretty picture. In NodeXL I’ve configured it so that when I hover over each dot, which represents an individual tweet, I get a summary of what was said, by whom and when (shown below).

NodeXL being used to examine nodes

It’s probably not too surprising to see strings of conversations, but by graphing what was an archive of over 6,500 tweets we can start focusing on what might be interesting subsets and conversation shapes. There are some interesting patterns that emerge:

conversation group 1 conversation group 2 conversation group 3

Within NodeXL I can extract these for further analysis. So the middle image can be viewed as:

Examination of conversation group 2

There’s a lot more you can do with this type of data: start looking at how many people are involved in conversations, the number of questions per conversation and lots more. I should also say, before I forget, that NodeXL can be configured to collect Twitter search results with its built-in Twitter search tool. It can also be configured to do the collection on a regular basis (hmm, I should really have a go at doing that myself). So potentially you’ve got a nice little tool to analyse Twitter conversations in real time …
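As a rough sketch of the kind of follow-up counting mentioned above, the snippet below takes conversations as lists of tweet ids and reports how many people took part and how many tweets contain a question mark. The `tweets` lookup, its sample contents and the "contains a ?" test for questions are all assumptions made for illustration.

```python
def conversation_stats(conversations, tweets):
    """Summarise each conversation.

    conversations: list of lists of tweet ids.
    tweets: dict mapping tweet id -> (author, text).
    """
    stats = []
    for ids in conversations:
        # Ids missing from the archive (e.g. the original tweet) are skipped.
        known = [tweets[i] for i in ids if i in tweets]
        stats.append({
            "tweets": len(ids),
            "people": len({author for author, _ in known}),
            # Crude assumption: any tweet containing "?" counts as a question.
            "questions": sum(1 for _, text in known if "?" in text),
        })
    return stats


# Made-up sample data.
tweets = {
    "1": ("alice", "What is a MOOC?"),
    "2": ("bob", "@alice a big open course"),
    "4": ("alice", "@bob thanks!"),
    "6": ("dave", "Anyone blogging this?"),
    "7": ("erin", "@dave yes, link soon"),
}
conversations = [["1", "2", "4"], ["6", "7"]]
stats = conversation_stats(conversations, tweets)
print(stats)
```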

If you’d like to explore the data more it’s available from the NodeXLGraphGallery. I’m going off to play some more ;)

Posted in NodeXL, Twitter

16 thoughts on “First look at analysing threaded Twitter discussions from large archives using NodeXL #moocmooc”

  1. Lorna M. Campbell

    Brilliant work Martin! Being able to analyse and view data and interactions like this opens up so many possibilities for those facilitating and participating in moocs and other large scale online events.

  11. Su Butcher (@SuButcher)

    This is great Martin- would love to use NodeXL too for some of our discussions.
    I’ve downloaded the plugin but can’t seem to work out how to add it to Excel. I’m using version 2011 for mac – so I’m going to ask on the Codeplex site as they don’t seem to have any visible information about how to install.
    S

      1. Su Butcher (@SuButcher)

        That’s great – I knew you’d know.
        I’ve got a netbook that runs Windows 7 but I can’t justify buying a whole new Office suite just for this feature – so would be very interested in your web tool when it turns up :-)
