Tag Archives: InfoVis

Network analysis of the #hcsmca Twitter community: lurking as a form of legitimate peripheral participation?

In the ethnography, “Situated Learning” (Lave & Wenger, 1991) it was observed that learning a trade or profession such as a tailor or midwifery was best supported by engaging in this activity within the actual community in which it was taking place. In this context the learner, as an apprentice, can be exposed to others with varied skill levels within that particular job or trade from which they can learn. Initially they may engage in some limited tasks such as maintaining inventories of equipment or tools and ordering supplies. Over time and with more exposure to the task their role will evolve and increase in responsibility. For this to take place they must learn from others with more experience. Some members of this particular community may have expert status whereas others may be at more of an intermediary level. At the beginning those new to the community participate only on a peripheral level. As novices they have yet to learn the terms, concepts and practices that would allow them to engage in the profession in a meaningful way. For example, someone new to programming may subscribe to a mailing list or follow a newsgroup that discusses the computer language they want to learn. These groups are often composed of individuals with varying levels (novices, intermediaries, experts) of skill level forming what has been termed “communities of practice”. This legitimate peripheral participation or “lurking” is an acceptable and supported behaviour amongst many well established online communities. After reading the messages for a period of time novices may feel more comfortable and post questions of their own. This may lead to some form of debate amongst other participants in which new knowledge is co-created. Novices may contribute in other ways by sharing information related to issues they have already encountered. For example, the novice programmer may have been advised before participating in the message forum that using an integrated development environment (IDE) will aid their learning of how to program. Over time the community shares their experiences and members of all levels engage and learn from and with each other. This phenomena has been documented amongst mailing lists and newsgroups.

But what about the newer forms of social media such as Twitter?

Founded by social media expert and plain language writer Colleen Young (@colleen_young) the Health Care Social Media in Canada (hcsmca) Twitter-based community was designed as a means by which Canadians with an interest in social media within a health care context could exchange information. By posting tweets using the acronym, “hcsmca” those wanting to share and learn more about this topic area can follow the posts. Each week the community meets for a live tweet-up in which messages are exchanged in real time providing for a more conversational tone to the exchange. I have participated in this community almost since its inception. Over this time I have wondered about the types of connections that were being formed, what information is being shared and learned and how effective Twitter is as forms of information dissemination in this context.

To explore this further I examined the network relationships in the hcsmca community with NodeXL (http://nodexl.codeplex.com/). Using the import tool I limited the results to 100 people for this initial exploration. I requested edges (or connections) for each of these Twitter scenarios: “follows” relationship (an individual and their followers), “replies-to relationship in tweet” (a reply to an individual tweet), “mentions relationship in tweet” (a tweet that mentions a user) and a “tweet that is not a reply-to or mention” (a posted message or tweet). NodeXL calculates a variety of statistics related to network analysis. By using filters you can refine the resulting graph in form that provides meaning.

Image I provides one static representation of a many possible layouts of the results. The NodeXL tool allows for more dynamic views (e.g. colour coded relationships between users such as “follows”, “replies-to relationship in tweet” and depictions of the other metrics mentioned above). It also provides for the ability to re-position the location of each user. Image I (below) demonstrates one instance of these options.

Network relationships of hcsmca Twitter communityImage I: Network analysis of #hcsmca community – November 26th, 2011

To better view the relationships I limited the out degree (people with the most connections) to seven. I then arranged the display from left to right by eigenvector centrality (a measure of importance in the network). Community leader Colleen Young, who often moderates the weekly tweet chats is positioned at the far left as she has the highest eigenvector centrality in this group. @DoctorFullerton is next, @nursefriendly and @ehealthmusings follow and so on. What may be of most interest are the two outliers positioned on the far right: @infoway and @jasonboies. They were represented in the graph because they had an out degree value greater than seven. However, I am curious as to why they had no connections to the remaining members in this particular snapshot of the #hcsmca community tweets. Does this indicate some form of lurking? How can this behaviour be explained?

In order to understand this further a content analysis of the tweets will be conducted. In the next installment I will explore the contents of these tweets using Netlytic (http://netlytic.org/), an Internet Community Text Analyzer.

Reference

Lave, J., & Wenger, E. (1991). Situated learning: legitimate peripheral participation. Cambridge [England] ; New York: Cambridge University Press.

Recommended Reading

Hansen, D. L., Schneiderman, B., & Smith, M. A. (2010).  Analyzing social media networks with NodeXL: insights from a connected world. Burlington, MA: Morgan Kaufmann.

Valente, T. W. (2010).  Social networks and health: models, methods, and applications. Oxford ; New York: Oxford University Press.

Thanks to @marc_smith for his assistance.

Tables as a form of information visualization

Readers, you may find this blog posting of interest:

http://datamining.typepad.com/data_mining/2010/08/the-interpretation-of-tables-in-texts-2000.html

First, this guy (not to be rude, his name is Matthew Hurst) did his PhD on the depiction of data in tables. This is interesting in of itself. By tables I mean a plain old box with fields in rows and columns. It may seem “useless” or “stupid” to a lot of people but how many of us read data in this format today? Excel alone means probably millions. I, for one, am glad that people are working on ways to improve this.

Now comes the value added part. The author goes on to reference an article, “Exploiting a Web of Semantic Data for Interpreting Tables”, which can be found here:

http://journal.webscience.org/322/

The abstract states:

Much of the world’s knowledge is contained in structured documents like spreadsheets, database relations and tables in documents found on the Web and in print. The information in these tables might be much more valuable if it could be appropriately exported or encoded in RDF, making it easier to share, understand and integrate with other information. This is especially true if it could be linked into the growing linked data cloud. We describe techniques to automatically infer a (partial) semantic model for information in tables using both table headings, if available, and the values stored in table cells and to export the data the table represents as linked data. The techniques have been prototyped for a subset of linked data that covers the core of Wikipedia.

I’m looking forward to what this collaboration yields.

Visualization: Indexed. By Jessica Hagy

In honor of Jessica’s fine (last but not least) chapter in, “Beautiful Visualization” I offer my own , perhaps feeble attempt, at the fine art of indexing information. Try to imagine it is on an index card.

middle

venn diagram: me, you, stickiness

Can you guess the reference? Hint: It is the title of a song.

Take 2: the iPad’s competition

This post just in from Flowing Data, one of my favourite data visualization blogs. A diagram of where the iPad fits in compared to available technologies.

ipad competition

ipad competition

This is an interesting analysis. I had not considered it as a gaming console. Probably because I don’t play computer games.  Not sure if any of the games available allow for multi-player functionality.

As an e-Reader, yes I think that’s a fair comparison as I mentioned in my original post.  The iPad doesn’t use e ink but it makes up for this by displaying in colour IMHO.

As a computer, no, I don’t think it is in the same ballpark.  You can’t multi-task with an iPad. No comparison there.

As a “catch-up” I don’t think that is a fair either. None of those products have been around long enough nor are they in use enough to merit comparison.

I think the graphic is bang on when it outlines planned uses by percentage (Internet surfing = 68%, email = 44%, e-books = 37%, reading newspapers = 28%, watching video 24%).  A lot of consumption and little collaboration.

BTW, those categories don’t add up to one hundred percent. I wonder how the question to collect that information was worded?

None of these, with the exception of email, are really all that collaborative (depending on what one does while surfing, of course) in nature. No specific mention of “social networking” as a category  What happened to email being dead and  Facebook and Twitter now rule? I guess the authors’ of this survey didn’t see that post!

Warning – too much consumption instead collaboration may lead to isolation and a diminished awareness of “what all the cool kids” are doing.

Mixing methods to improve decision making: visualization

It is generally accepted amongst social scientists who collect information or measure phenomena that the combination of quantitative and qualitative methods yields the richest data. The information collected on a survey, in which many of the questions are close-ended can be contextualized by the material from the qualitative part of the study. In the field of mixed methods there are a variety of techniques to execute these measures such as collecting both kinds of data at the same time or obtaining the information sequentially, giving yourself time to evaluate the initial findings to strengthen your (and ultimately the readers’) understanding, interpretations and recommendations based on the findings.

Another important component to understanding results is the use of graphs. Many statistical packages provide ways to produce bar charts, line graphs, box plots and other similar ways to visualize information. These can also increase understanding the information. A picture can really be a thousand words.

But what about data/findings/visualizations to support decision making?

Perhaps an even better way to conduct mixed method research is to include a third component, a visualization that represents both the quantitative and qualitative findings. In particular, the ways in which information can be used to support decision making. For example, the web site Patients Like Me provides graphs for quantitative information such as age, gender, top treatments. It also provides forum postings, which contain qualitative information about the patients’ experiences. But all of this data is separate from each other. You cannot cross tabulate age with gender within an illness.

What if these were combined into a graph so that a 29 year old woman, recently diagnosed with MS, experiencing fatigue and considering taking steroids, could find a way to a way to help make this decision? Wouldn’t this make for a better collaboration?

Social networking and the cure for cancer

Well, one might ask, “What does social networking have to do with the cure for cancer”? I’ve been wondering that myself…I think that the kind of information that is shared in health-related social networks, especially as they become more and more refined in the data they collect (e.g. www.patientslikeme.com) are going to be very revealing in terms of patterns once the data is mined and visualized. Let’s not forget the pioneering work by John Snow, who used mapping to determine the origin of a cholera outbreak in London during the 1850s. I’m sure his colleagues thought he was crazy, running up and down the street, pen and paper in hand, making note of the locations where the deceased resided! Sure research on a cellular and biomedical level in relation to cancer treatments is necessary. Hopefully though one day Tak Mak will come out from under the fume hood and see what the social scientists are doing…