The Evolution of Privacy on Facebook

About

Facebook is a great service. I have a profile, and so does nearly everyone I know under the age of 60.

However, Facebook hasn't always managed its users' data well. In the beginning, it restricted the visibility of a user's personal information to just their friends and their "network" (college or school). Over the past couple of years, the default privacy settings for a Facebook user's personal information have become more and more permissive. Facebook has also changed how personal information is classified several times, sometimes in ways that have confused its users. This has largely been part of Facebook's effort to correlate, publish, and monetize its social graph: a massive database of entities and links that covers everything from where you live to the movies you like and the people you trust.

This blog post by Kurt Opsahl at the EFF gives a brief timeline of Facebook's Terms of Service changes through April of 2010. It's a great overview, but I was a little disappointed it wasn't an actual timeline: hence my initial inspiration for this infographic.

Let me be clear about something: I like Facebook. It's helped me reconnect with dozens of people with whom I'd lost touch, and I admire the work their team does. I hope your takeaway from this infographic isn't "I'm deleting my account"; rather, I hope it's "I'm checking my privacy settings right now, and changing them to a level with which I'm comfortable".

Data

The data for this chart was derived from my interpretation of the Facebook Terms of Service over the years, along with my personal memories of the default privacy settings for different classes of personal data. The population sizes are statistics from Google, the Facebook Data Team, and wild guesses based on what seemed reasonable to me.

I welcome data corrections, so please leave a comment below if you have better numbers to share.

Types of Personal Data

Facebook's classification system for personal data has changed significantly over the years. I tried to capture broad topics that have remained relatively consistent over that time, but they might need some explanation.

Likes: A person, band, movie, web page, or any other entity represented in Facebook's social graph that has a "like" button. "Likes" started with status updates, but have now grown to encompass pretty much everything. In Facebook Newspeak, they're a "Connection".
Name, Picture, Gender, Birthday, Contact Info: Self-explanatory.
Extended Profile Data: Your family members, city, place of birth, religious views, favorite authors, schools attended -- anything that is an entity you can list a relationship to in your profile.
Friends: The people you've friended
Networks: The personal networks you've set up on Facebook (e.g. colleges & universities or companies).
Wall posts & Photos: Self-explanatory.

Audiences

Audience sizes are based on averages, interpolations of those averages across time, and guesses from my personal experience where that data was unavailable.

One thing you may notice is that by 2009, the term "Network" for the inner circle is replaced by "FoF", or "Friends of Friends". Facebook introduced this in 2008 to cater to users whose networks were too large to be manageable. My guess is that this effectively shrank the potential number of people who could see this particular kind of data. I ballparked an estimate for the average size of this extended friend network by taking the average number of friends a user had in 2009 (130) and assuming there was on average a 50% overlap with each of their friends, yielding an average of 130 × 130 × 0.5 = 8450 people.
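
The arithmetic behind that ballpark can be sketched in a few lines of Python. This is only an illustration: the function name and the formula (friends × friends × the share of each friend's list that doesn't overlap with yours) are my assumptions about how to express the estimate, not code from the sketch itself. With 130 friends, an overlap of one half reproduces the 8450 figure.

```python
def estimate_fof_audience(avg_friends: int, overlap: float) -> int:
    """Rough friends-of-friends audience size (hypothetical helper).

    Assumes every friend also has `avg_friends` friends, and that a
    fraction `overlap` of each friend's list is already in your own.
    """
    if not 0.0 <= overlap <= 1.0:
        raise ValueError("overlap must be between 0 and 1")
    # Each of your friends contributes only the non-overlapping share of their list.
    return round(avg_friends * avg_friends * (1.0 - overlap))


print(estimate_fof_audience(130, 0.5))  # 8450
```

Changing the overlap shows how sensitive the estimate is: with no overlap at all, the same 130 friends would give 16900 people.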

Implementation

The audience scale is logarithmic, so that we can compare audience sizes of 100 and 1 billion. I also did a big no-no and mapped the audience size to the length of the slice, not its area. I don't feel too terrible about this, because the area comparison is already distorted by the log scale. Plus, frankly, mapping to length just looks better.
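
As a rough illustration of that mapping (not the actual sketch source), here is how an audience size could be turned into a slice length on a log scale. The function name, the 300-pixel maximum length, and the choice to start the scale at an audience of 1 are all assumptions made for the example.

```python
import math


def slice_length(audience: float,
                 max_audience: float = 1e9,
                 max_length: float = 300.0) -> float:
    """Map an audience size to a slice length on a log10 scale.

    Hypothetical parameters: `max_audience` is the largest audience on
    the chart and `max_length` is the slice length (in pixels) it gets.
    """
    if audience < 1:
        return 0.0  # log10 is negative or undefined below 1; draw nothing
    # Position of this audience between 1 and max_audience, in log space.
    fraction = math.log10(audience) / math.log10(max_audience)
    # Clamp so audiences beyond the maximum don't overflow the chart.
    return max_length * min(fraction, 1.0)


for size in (100, 8450, 1_000_000_000):
    print(size, round(slice_length(size), 1))
```

On this scale an audience of 100 still gets about two ninths of the full length, which is why small and very large audiences can share one chart. Mapping the value to area instead would mean taking the square root of `fraction` before scaling the length.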

I built this sketch using Processing.js. You're welcome to download the source. Sorry, no Internet Explorer.

About me

My name's Matt McKeon. I'm a developer with the Visual Communication Lab at IBM Research's Center for Social Software. The views expressed here are my own, and do not reflect those of IBM. You can find me on Twitter and (hah) Facebook.

UPDATES

05/19/10 5:50PM EST - Since I've received quite a few inquiries, I've clarified my position on usage of this graphic online and in print.
05/15/10 11:15AM EST - Haggai Hundert was kind enough to show me proof that Photo Albums existed in 2005. Therefore, I've reinstituted them for that year. Thanks Haggai!
05/12/10 10:40PM EST - Found an official blog post that suggests Photo Albums didn't exist in 2005. I've grayed them out for that year.
05/12/10 8:30PM EST - For completeness, I've included "Contact Info" as a personal data category. I'm making the assumption that it's defaulted to Friends Only since 2005.
05/11/10 11:45PM EST - After reading a couple of old posts on the Facebook blog, I decided that my using "Network" to refer to both Networks and Friends-of-Friends was too confusing, and my guesses on the sizes of these networks were sloppy. I've reduced my estimates of the FoF network, and fixed the title so that it changes at what I think is the appropriate time. See the "Audiences" section of the writeup for details.
05/11/10 8:30AM EST - Added subtitle and WARNING blurb below vis.
05/10/10 11:10PM EST - Found an error: the "Networks" slice should be tracking the "Name" and "Picture" slices, since those are revealed when searching for a user.
05/10/10 9:30PM EST - After further reflection, I've decided that "Demographics" is too broad a category. I split it into "Birthday" and "Gender"; the rest is captured by "Extended Profile Data".
05/10/10 6:30PM EST - Blurb added to the About section RE: I like Facebook, really, I do!
05/10/10 8:30AM EST - Corrections: Jaoe and becca pointed out that Wall Posts and Photos now default to "Everyone"; Abbey recalls that Name and Picture have been available to all Facebook users since 2005.
05/07/10 11:30PM EST - the "Like" category is now grayed out before 2009 (thanks "pops"!) and photos are now shown as being available to your network by 2006 (thanks theharmonyguy!).