PetaPixel

CrowdOptic Discovers Islands of Popular Photo Subjects in Oceans of Images

We live in a world that’s teeming with digital photographs. More photos are now uploaded every two minutes than were created during the entire 1800s. Facebook is seeing thousands of photographs uploaded to its servers every second of the day, and Instagram was flooded with 10 storm-related photos per second during Hurricane Sandy.

With such a large quantity of photographs flooding the web, it’s clear that visual data mining will be an in-demand market in the coming years as more and more people look to glean valuable images from the torrent of useless pixels. One of the companies trying to occupy this space is CrowdOptic, a San Francisco-based startup that’s building some pretty interesting location-based photo curation technologies.

While organizing and browsing photos by their embedded location data is already a common (and controversial) practice, CrowdOptic’s technology is different. Instead of focusing purely on where photographs were taken, their goal is to determine what that photos were taken of.

When there are tens, hundreds, or thousands of photos of a particular subject uploaded to the web every second, how does one separate the noteworthy photos from the unless images?

For example, let’s say you were a photo editor at a major newspaper looking for the most significant photographs captured by the public during Hurricane Sandy. Browsing through the endless stream of socially-shared photographs is impractical, and doing searches based on popular tags results in too much “noise” and too little “signal.”

That’s where CrowdOptic comes in. Its software can take hundreds of thousands of photographs of a particular subject and then use the images’ compass and EXIF data to figure out common points of focus.

As a demonstration of its Enterprise Photo Curation engine, the company fed the software roughly 1000 photographs of Hurricane Sandy shared through various social media channels. An analysis on the photographs took a little more than 1 second to complete, and revealed that the single most popular subject in the set was a tree that was uprooted by the winds:

The second most significant cluster in the set was of a flooded region in Woods Hole, Falmouth, MA:

As another demo, the developers also ran the algorithm on a set of 1000 random socially-shared photos of the London Olympics. Using GPS, compass, time-stamps, and triangulation techniques, the program determined that the most popular subject was the ArcelorMittal Orbit observation tower outside the Olympic Stadium in Olympic Park, Stratford, London:

CrowdOptic CEO and serial entrepreneur Jon Fisher tells us that while many companies offer “geocoding” of photos, his company is currently the only one that offers this “focalcoding”. That’s a word we might be hearing much more in the near future.


 
 
  • JosephRT

    I love the “London Olympus…” freudian slip.

  • http://www.petapixel.com Michael Zhang

    Thanks for the catch! I type one way more than the other :)

  • http://www.davidsanger.com David Sanger

    It would be great to press Apple and other to add compass, azimuth and accelerometer data to the EXIF standard.

    Then I can readily see and app that shows me, if I am standing in a specific spot, what other shots have been taken of the same subject.

  • Exifa

    Compass is standard in EXIF

  • http://garyobrien.com Gary O’Brien

    Doesn’t Facebook strip EXIF and IPTC data?

  • Exifa

    Social seems to be at different stages with EXIF, I know Flickr and Picassa leave it in etc.

  • nate parker

    Awesome! I lived in Woods Hole when I was in High School!

  • http://www.davidsanger.com David Sanger

    yes but I can also see the compass, azimuth and accelerometer data being embedded as a default in every photo