Updated: 10-08

Facebook Dataset Identified

It now appears that the Tastes, Ties, and Time dataset has been identified.  According to privacy scholar Michael Zimmer, the dataset of Facebook profiles is from Harvard College.  In my original post on the matter, I discussed how “fingerprints” of friend networks could be used to identify the dataset.  Such complicated measures turned out to be unnecessary.  Using the codebook and statements from the researchers, Dr. Zimmer was able to narrow down and ultimately identify the source of the dataset.  Importantly, now that the source is known, it would be trivial to run a network comparison and produce probability estimates for the identities of the individuals in the anonymized set.

In an article to be published in Social Networks (Lewis et al., 2008), the authors provide more insight into the set.  This information seems to support the Harvard hypothesis, providing demographic information on the sample that could be correlated with statistics from the registrar.  This information, once semi-private, is now completely public.  It is only a matter of time before a grad student or assistant prof, seeking a publication and a little press, identifies the set (and no, it won’t be me).

In the discussion between Zimmer and the PI, a number of common themes emerge.  They include the notion that Facebook users have no right to privacy, and that by sharing, users actually intend for the information to be public.  This is a straw man argument, one that assumes an intentionality on the part of the individual that simply does not exist.  Even in a semi-public space like Facebook, our expected audience is quite small (a recent survey found that most users expected their profiles to be viewed primarily by their close group of friends).

This episode is an important example for IRBs*, which have widely different interpretations of social network research.  The goal of the IRB is to protect subjects from harm that arises from the research process.  I agree that subjects who post public profiles are open to research, as long as the research does not make them personally identifiable and properly protects them.  This is clearly a different case, in which data sourced for acceptable research purposes was repurposed, and in its current form poses a risk to the subjects.  I want to be clear about this point, though.  The original research mission (to collect and analyze a set with proper safeguards) was within bounds; the follow-up distribution is the element that poses the risk.

The researchers should have convened a panel with a privacy expert (like Dr. Zimmer) to assess the risks of data disclosure to the human subjects.  Had such a panel been convened, I am confident that the PIs would have assessed the risks of disclosure in a different light.  Perhaps that is the takeaway from this situation.  Research that pushes the boundaries of technology and privacy presents IRBs with unique challenges.  Some IRBs respond conservatively, stifling research and innovation.  Finding the balance that encourages innovative research while protecting subjects is a challenge, and perhaps the right place for an expert mediator.  Should Schools of Information prepare information ethicists for this role?

* IRB = Institutional Review Board, a panel of local experts in research ethics and methodology that oversees institutional research, both in industry and academia.

Lewis, K., Kaufman, J., Gonzalez, M., Wimmer, A., and Christakis, N.  (2008).  Tastes, ties, and time: A new social network dataset using Facebook.com.  Social Networks, In Press, Accepted Manuscript.  http://www.sciencedirect.com/science/article/B6VD1-4T3M686-1/1/9c1b6aafad0f69c524f7c5f982eb2268


