Most people understand the implications of virtual meetings through emails and phone calls. Some of us are even aware that they are being constantly tracked. However, we assume that our physical meetings remain private. It is extremely unlikely that someone would follow everyone around all the time. Thus we seem to care less, if not at all, about the privacy of our physical meetings.

Multiple systems collect location data at scale. Telecom, tech companies, and governments log user location with more precision than most people expect.

Tracking technologies: Cellular location identification, GPS systems, Wi-Fi Localization

Google was found to be collecting location data from Android devices even when GPS was explicitly turned off, potentially affecting over two billion monthly active users worldwide.

You’d think this volume of data, spanning zettabytes, is too much to make any sense out of. Making sense is equivalent to finding needles in a hay stack. There has also been research into location privacy and customer pushback has led to adoption of recent privacy techniques into how some companies collect data.

But scale and algorithms are good at exactly this kind of problem. Patterns that would be invisible to a human analyst combing through raw data are easy for machines to pick out. Moreover these privacy techniques are at a relatively nascent stage and they do not hide everything about the user.

Recent research has pointed out that we can infer sensitive information about people and who they meet, just from location data.

At a population scale, location data alone is enough to piece together who you know, where you go, and when you meet.

NSA Co-Traveller Program

With this much location data available, it’s possible to piece together who meets whom across large populations.

A couple who meet frequently are likely to be socially connected — friends, relatives, co-workers, partners. Using this intuition, researchers show that one can accurately infer users’ social connections from location data of a significant population. Where and when people meet can also reveal the nature of relationship between the pair. For example, frequent meetings at a coffee shop near an office might indicate colleagues, while a weekly visit to a hospital might indicate a health situation.

Working with the InfoLab at USC, my thesis introduced this problem and proposed methods to preserve the privacy of meetings, including co-location k-anonymity. The work was published in IEEE TKDE.