What Facebook Looks Like Today

Facebook's world mapBy now, everyone knows that Facebook has pretty much won the social media wars, as early entrant and rival MySpace hemorrhages employees as it tucks its tail between its legs and slinks away.

And Facebook itself is a good chronicler of the hyperactivity of Facebookers wordwide. Recently, it published some stats on “what 20 minutes on Facebook looks like.” Among the revelations:

 ~10.2 million comments uploaded every 20 minutes
 ~2.7 million photos uploaded
 ~2.0 million “friend” requests accepted
 ~1.8 million status updates posted
 ~1.6 million wall posts
 ~1.5 million event invites sent out
 ~1.3 million photos tagged
 ~1 million links shared

Fan designations (or “likes”) are now reaching stratospheric proportions for some celebrities. And who were the most popular in 2010 based the “most liked” status? The results show a major skew towards the younger generation … and toward entertainers rather than political, scientific or academic leaders:

 Lady Gaga: ~25 million people “like”
 Eminem: ~24 million people
 Megan Fox: ~20 million people
 Vin Diesel: ~19 million people
 Rihanna: ~19 million people

Where does President Barack Obama rank by comparison? He’s at ~17 million “likes” – right along with Bob Marley, Li’l Wayne, Justin Bieber and Shakira.

Personally, I found the trends in relationship status to be the most interesting. There were quite a few relationship changes … but perhaps not as many as you might expect considering that there are an estimated 600 million active users on Facebook these days.

For the record, here’s what happened with personal relationships in 2010:

 ~44 million people changed their status to “single”
 ~37 million changed their status to “married”
 ~28 million changed their status to “in a relationship”
 ~6 million changed their status to “engaged”
 ~3 million changed their status to “it’s complicated”

Notice that the number of people who migrated away from marriage were nearly equally matched by those becoming engaged or getting hitched. As the famous French saying goes, Plus ça change, plus c’est la même chose. (The more things change, the more they stay the same.)

The Big Dig: Scraping and Scooping the Web

Data ScrapersI’ve blogged before about how the Internet is making people’s lives pretty much an open book.

Most people who are online are pretty aware of how their reputation can be affected by their Facebook or MySpace pages and other public or quasi-public online information. But The Wall Street Journal has been publishing a series of stories on how much more pervasive than that digital snooping has become.

The series is titled “What They Know” … and it’s well-worth checking out. The most recent article appeared on the front page of the October 12, 2010 edition of the WSJ, and focuses on the phenomenon of “data scraping.”

For those who aren’t familiar with the term, “scraping” is a method by which sophisticated software is used to access and scoop up information that has been posted anonymously on sites that are supposed to be closed to prying eyes. One example cited in the WSJ article of a site that has been scraped is PatientsLikeMe, which has message boards and forums dealing with mental disorders, depression and other issues that most people would prefer to keep private.

People who post on discussion forums like these do so using pseudonyms, and the identity of the posters is carefully guarded by the host sites.

But it turns out that these sites are little match for the sophisticated IT capabilities of companies like Nielsen and PeekYou, who are in the business of matching psychographics as well as demographics to individual people for purposes of serving up relevant advertising — and goodness knows what else.

Think of it as the “lifestyle” direct mail lists of yesteryear – but now on steroids.

PeekYou has applied for a patent on a system whereby it matches real people to the pseudonyms used on forums, blogs, Twitter and other social media outlets. Taking a “peek” at the company’s patent application reveals the great lengths their systems go to ferret out and cross-analyze small, innocuous bits of information that, taken together, find the “needle in a haystack” match to the actual individual:

 Birthday match
 Age match
 First name match
 Nickname match
 Middle name match
 Middle initial match
 Gender match
 e-Mail address match
 Phone number match
 Physical address match
 Username match

When you consider that the same type of powerful computers that are used to analyze and process search engine queries are the ones processing millions or billions of information bits and instantaneously testing and slotting them based on relational patterns … it’s not hard to understand how, over time, eerily accurate portraits of individuals can be drawn that not only correctly reflect the “demographics” of the person, but also a host of psychographic and behavioral aspects such as:

 Shopping habits
 Recreational pursuits
 Personal finance profile
 Health information
 Political leanings
 Hobbies and interests
 Spirituality/religiosity
 Sexual preference or sexual proclivities

The WSJ articles detail how web sites are attempting to stay one step ahead of the “scrapers” by employing software that alerts them to suspicious “bot” activity on forums and other password-protected areas. It’s often a losing battle … and is that particularly surprising?

These days, not even the Orthodox monks at Mount Athos are protected, probably!

Where are Newspapers Now?

Newspaper ad revenues continue in the doldrums.John Barlow of Barlow Research Associates, Inc. reminds me that it’s been awhile since I blogged about the dire straits of America’s newspaper industry. The twin whammies of a major economic recession along with the rapidly changing ways Americans are getting their news have hammered advertising revenues and profits, leading to organizational restructuring, bankruptcies, and more.

But with the recession bottoming out (hopefully?), there was hope that the decline in newspaper ad revenues might be arrested as well.

Well, the latest industry survey doesn’t provide much cause for celebration. A poll of ~2,700 small and mid-size businesses conducted this summer by Portland, OR-based market research firm ITZBelden and the American Press Institute finds that ~23% of these businesses plan to cut back on newspaper advertising this year.

The kicker is that these revenues are being spent, but they’re being put to use in other advertising media.

The ITZBelden survey found that a similar ~23% of companies plan to up their 2010 digital ad spending anywhere from 10% to 30%. This compares to only about 10% planning to increase their print advertising by similar proportions.

Moreover, the survey findings reveal that small and mid-size U.S. businesses have moved into digital marketing in a significant way. Not only do more than 80% of them maintain web sites, they’re active in other areas, including:

 ~45% maintain a Facebook or MySpace page
 ~23% are engaged in online couponing
 ~13% are involved with Craigslist
 ~10% are listed on Yelp! or similar user-review sites

One area which is still just a relative blip on the screen is mobile advertising, in that fewer than 4% of the respondents reported activities in that advertising category.

Where are these advertisers planning to put their promotional funds going forward? While newspapers should continue to represent around one quarter of the expenditures, various digital media expenditures will account for ~13% of the activity, making this more important than direct mail, TV and Yellow Pages advertising.

There was one bright spot for newspapers in the survey, however. Respondents expressed a mixture of confusion and bewilderment about the constantly evolving array of digital marketing communications options opening up … and they’re looking for support from media experts to guide their plans and activities.

And where do they see this expert advice coming from? Newspaper ad reps.

Perhaps the Yellow Book’s “Beyond Yellow” small business advertising campaign – you know, the one that touts not only the Yellow Pages advertising but also web development, online advertising, search marketing and mobile advertising – is onto something.

The End of Privacy

An article by technology author Steve Lohr published last week in The New York Times caught my eye. Titled “How Privacy Vanishes Online,” it explores how conventional notions of “privacy” have become obsolete over the past several years as more people engage in cyber/social interaction and web e-commerce.

What’s happening is that seemingly innocuous bits of information are being collected, “read” and reassembled by computers to build a person’s identity without requiring direct access to the information.

In effect, technology has provided the tools whereby massive amounts of information can be collected and crunched to establish patterns and discern all sorts of “private” information.

The proliferation of activity on social networking sites such as Flickr, Facebook and LinkedIn is making it easier than ever to assemble profiles that are uncanny in their accuracy.

Pulling together disparate bits of information helps computers establish a “social signature” for an individual, which can then be used to determine any number of characteristics such as marital status, relationship status, names and ages of children, shopping habits, brand preferences, personal hobbies and other interests, favorite causes (controversial or not), charitable contributions, legal citations, and so on.

One of the more controversial experiments was conducted by MIT researchers last year, dubbed “Project Gaydar.” In a review of ~4,000 Facebook profiles, computers were able to use the information to predict male sexual preference with nearly 80% accuracy – even when no explicit declaration of sexual orientation was made on the profiles.

Others, however, have pointed to positive benefits of data mining and how it can benefit consumers. For instance, chain grocery stores can utilize data collected about product purchases made by people who use store loyalty cards, enabling the chains to provide shoppers relevant, valuable coupon offers for future visits.

Last year, media company Netflix awarded a substantial prize to a team of computer specialists who were able to develop software capabilities to analyze the movie rental behavior of ~500,000 Netflix subscribers … and significantly improve the predictive accuracy of product recommendations made to them.

To some, the Netflix program is hardly controversial. To others, it smacks of the “big brother” snooping that occurred in an earlier time during the Supreme Court confirmation hearings for Robert Bork and Clarence Thomas, when over-zealous Senate staffers got their hands on movie store rental records to determine what kind of fare was being watched by the nominees and their families.

Indeed, last week Netflix announced that it will not be moving forward with a subsequent similar initiative. (In all likelihood, this decision was influenced by pending private litigation more than any sort of altruism.)

Perhaps the most startling development on the privacy front comes courtesy of Carnegie Mellon University, where two researchers have run an experiment wherein they have been able to correctly predict the Social Security numbers for nearly 10% of everyone born between 1989 and 2003 – almost 5 million people.

How did they do it? They started by accessing publicly available information from various sources including social networking sites to collect two critical pieces of information: birthdate, plus city or state of birth. This enabled the researchers to determine the first three digits of each Social Security number, which then provided the baseline for running repeat cycles of statistical correlation and inference to “crack” the Social Security Administration’s proprietary number assignment system.

So as it turns out, it’s not enough anymore merely to be concerned about what you might have revealed in cyberspace on a self-indulgent MySpace page or in an ill-advised newsgroup post.

Social Security numbers … passwords … account numbers … financial data. Today, they’re all fair game.