Jobs & Community
- Post your job here!
  Anywhere, anytime - Telecom Ramblings
- Featured Job Listings
  List one or more jobs!
  Reach thousands of industry insiders!
- Visit our Jobs Board

Community Resources

How to keep Big Data (theoretically) anonymous

May 17th, 2013 by Telecom Asia · 1 Comment

This article was authored by John C. Tanner, and was originally posted on telecomasia.net.

ITEM: Researchers claim to have developed a model for crunching the Big Data generated from cellular networks without revealing the identities of mobile users.

The research team from AT&T, Rutgers University, Princeton, and Loyola University has built a “mobility model” of Los Angeles and New York City, using location data points from mobile voice calls and text messages on AT&T’s network in those two cities. The model aggregates the data, produces representative “synthetic call records”, then mathematically obscures any data that could identify people, reports Technology Review:

The new approach starts by aggregating traces of real human movements, then identifying common locations that might indicate home, work, or school. Next, it creates a set of transportation models. These models generate route tracks of people that the researchers call “synthetic,” because they are merely representative of the aggregate data, and not of actual people.

But the third part is the key. Even these supposedly synthetic records can closely match real ones (especially when the underlying aggregate sample is small). So an algorithm, using an emerging technique known as differential privacy, calculates exactly how high this risk is, and how to reduce it by altering the data.

In other words, you can inject “noise” into the model, such as changing the aggregated home and work locations or call times to reduce reliance on the data of a single user.

That’s key because other research has already demonstrated that it’s possible totake anonymous mobile user data and pin down a person’s name and address with it. In March, researchers at MIT and the Université Catholique de Louvain in Belgium took data from a million and a half mobile users and managed to identify 95% of them using just four location reference points.

The question, of course, is to what extent the above model will work in the real world, or how long it will take for someone to find a way around it.

For that matter, there’s also the question of how closely cellcos will follow that model, which will depend on things like the local regulatory environment and whether they can monetize anonymous data as effectively as identifiable data. While anonymous data is theoretically useful for things like street traffic planning and mapping out things like ethnic divides, malaria outbreaks and poverty levels, there’s not necessarily a commercial business model in there.

Also, there’s arguably competitive pressure from OTT internet players like Google and Facebook who are already gathering tons of user data for the benefit of their advertising customers (and, sometimes, whatever government agencies might want access to it). If they aren’t keeping all their data anonymous, why should cellcos be expected to do so?

Meanwhile, while we’re on Big Data, the commercial value therein and the level of anonymity it provides, you might want to check out this piece from Kate Crawford of the MIT Centre for Civic Media, which addresses five myths about Big Data, including this one: “Big Data Is Anonymous, so It Doesn’t Invade Our Privacy.”

Crawford’s verdict: “Flat-out wrong.”

If you haven't already, please take our Reader Survey! Just 3 questions to help us better understand who is reading Telecom Ramblings so we can serve you better!

Join the Discussion!

1 Comment, Add Yours!

Anonymous says:

May 17, 2013 at 10:27 am

“there’s not necessarily a commercial business model in there” – could be another use is actually intended

Reply

Ramblings’ Jobs

Post a Job - Just $99/30days
Event Calendar

Johnny on More Dark Fiber For Crown Castle: “Why no mention of Crown Castle?” Apr 10, 20:55

Rob Powell on DDoS attacks may no longer be new, but they’re still an evolving threat: “The article title explicitly says “DDoS attacks may no longer be new”…” Feb 6, 20:24

Communication automation on It Takes a Village to Deliver a Service: How Networks Can De-Complexify an Expanding Ecosystem: “Communication APIs can be a major boost to businesses and it’s a shame that more small businesses aren’t even aware…” Jan 16, 21:10

mhammett on HPE Buys Juniper, Looks Toward AI: “This sounds absolutely terrible for the marketplace.” Jan 10, 11:54

J on DDoS attacks may no longer be new, but they’re still an evolving threat: “New? What are you talking about? Ddos has been going on for over 20 years…ignorant.” Dec 28, 16:17

Eric S on Network Maps: Canada: “Check out this interactive map for Southern Ontario Canada” Dec 28, 13:56

Stewart Greer on AI for the Telecoms Contact Center: Think Marathon, not Sprint: ““Caught in the gravitational pull of this AI-powered galaxy! Navigating through the cosmos of smart features and seamless interactions is…” Nov 29, 01:30

Steve on Zayo Acquired Globalways, Expands in Europe: “Exciting news about Zayo’s acquisition of Globalways GmbH, a German metro fiber provider! With 360 route kilometers of fiber in…” Nov 21, 14:00

David Marsh on Consulting Services: “Do you still provide consulting ?” Nov 21, 00:01

mhammett on California Turns to Lumen for the Middle Mile: “That seems a bit of a contradiction.” Oct 26, 10:42

Telecom Ramblings

Jobs & Community

Community Resources

Featured Articles

How to keep Big Data (theoretically) anonymous

May 17th, 2013 by Telecom Asia · 1 Comment

Join the Discussion!

1 Comment, Add Yours!

Leave a Comment

Ramblings’ Jobs

Event Calendar

Recent Comments