bmichelsonz

Big Data meets Collective Intelligence

by brenda michelson (bmichelsonz) on 04-11-2011 09:22 AM - last edited on 04-11-2011 11:34 AM

As social technologies and applications took off, the ethos of collective intelligence permeated our behavior as individualsconsumers, employees and citizens.  Everyone within reach of an internet connection or mobile phone has the ability to contribute and consume information, which can be augmented or acted upon by other individuals, local or afar, in a variety of pursuits.

The typical connection between social technologies and collective intelligence is the reams of data shared by individuals via venues such as Facebook, Twitter, Google+ and Wikipedia.  Collective intelligence as source of big data.

Recently though, I've noticed the emergence of companies that are applying collective intelligence to solve big data problems.  And not their big data problems, but yours.

First up, featured in a good series on the WSJ's Venture Capital Dispatch blog is 1010data.  According to the dispatch [emphasis is mine]:

"1010data set out to build a scalable, easy-to-use data warehouse, enabling customers to replace on-premise database software, machines and staff with a subscription fee. But what it’s built is much more than a hosted data service — it has pioneered a new type of data marketplace. Customers now come to 1010data not only to outsource the management of their own data, but to mix it together with data from third parties, such as credit bureaus and retailers, and analyze it all in a single place."

The dispatch describes Dollar General Corp's use case, which includes sharing (for a fee) its information with suppliers:

"How it’s been used: Dollar General Corp. became a customer about three years ago when it began using 1010data to analyze basket data to determine what items were commonly sold together. It gained insights such as that people who bought Gatorade most frequently bought laxatives as well.

Dollar shortly moved its entire data warehouse onto 1010data, including all point-of-sale and inventory data, making up 17 billion and 35 billion rows of data, respectively. The retailer provides consumer goods companies with access to the data about its own products and, for a higher price, products of its competitors, with some limitations. For example, companies can’t see inventory data on competing products."

A startup going even deeper on the collective intelligence aspect is Kaggle.  According to an All Things D post, Kaggle is applying the Netflix Prize model to all sorts of big data problems:

"Kaggle has facilitated breakthroughs in NASA’s analysis of dark matter, improved Allstate’s actuarial methods, predicted many of the top finishers of the Eurovision Song Contest, and is currently hosting a $3 million prize to devise ways to reduce unnecessary hospitalizations.

...Kaggle founder and CEO Anthony Goldbloom, a former statistician for the Australian treasury, says his company addresses “a serious market failure.” That is: Companies have data and can’t analyze it as well as they’d like, and academia is desperate for real-world data sets."

Interesting, Kaggle has found a way around the data scientist shortage:

"Today, Kaggle’s 17,000 participating data scientists — which happen to include former Netflix Prize winners — participate mostly for the challenge and the chance to prove themselves. But Goldbloom and Howard want to make it worth their while.

“In five years’ time, I want to do 10,000 competitions per year,” Goldbloom said. “I’m hoping competitors can earn a full-time income from Kaggle.”

And yes, you can submit a private problem, under NDA:

"On that front, Kaggle will now start paying small groups of data scientists — selected based on their past performance on Kaggle — to analyze sensitive data sets for which companies require NDAs."

Big data, innovating business and inspiring innovative business models.  Opportunity abounds.

 

Post a Comment
Be sure to enter a unique name. You can't reuse a name that's already in use.
Be sure to enter a unique email address. You can't reuse an email address that's already in use.
Type the characters you see in the picture above.Type the words you hear.

The HP Input Output site is sponsored by HP and features articles and content from HP and third-party contributors. Third-party articles and content, while paid for by HP, do not necessarily represent the views and opinions of HP. HP does not endorse this content and is not responsible for its accuracy, availability and quality.

Follow Us
Spotlight
The Permissions Your Database Users Really Need (Video) The 16 Linux Shell Commands Every Desktop Linux User Should Know 7 Deadly Sins of Job Searching: Why You Still Don't Have a Job, and How to Get Back on Track 9 Tech Analogies That No Longer Mean Anything To Those Young Whippersnappers
┼ Based on energy, paper and toner savings from regular printer usage. Results may vary.