Sarah Palin’s coke habit…

Sarah Palin’s coke habit, Elvis spottings on Hollywood Boulevard, and the moon landing conspiracy…these are some of the things you’d be seeing on Wavii if we believed everything that came in through our crawler.

It might seem obvious to you that these stories can’t be trusted, but one of the things that makes Wavii unique is that machines are creating the content for our feed, not people. This means we have to build systems that can make judgment calls about what info to trust, and what info to discard or put on hold to verify later.

How does our system know what to trust?

Sometimes it’s useful to think of our system as a small child learning about her world. Like this child, a machine learning algorithm can take everything it sees and hears into its view of the world. But, also like a child, the machine only knows what you’ve told it, and lots of things that are obvious to us adults aren’t so obvious to it.

Let’s pursue this analogy a little further and follow six-year-old Jane through a day in her life.

Jane

Jane’s day starts over breakfast one morning, when her older brother John tells her that the weatherman is predicting snow, and they are probably going to get an early dismissal from school.  The “weatherman” sounds like a trustworthy source, but last week John told Jane that it was raining cats and dogs, so this is likely some kind of trick. Jane is learning…she decides not to believe him.

When she gets on the bus to go school a few minutes later she overhears someone in the back talking about snow. She doesn’t hear the full sentence, but she is certain that she hears the words “snow” and “early dismissal.”  What are the odds, she thinks — especially after what John said this morning?

When she gets to school, two of her friends tell her that they heard there is going to be an early dismissal. And by snack time, there is an excited buzz around the classroom: everybody is talking about it! Jane overhears one teacher saying to another, “Right, we’re going home early.  Preposterous!” This seems to be almost certain confirmation of Jane’s hopes — since the teacher never lies — except that Jane doesn’t know what “preposterous” means.

The day drags on, and, sadly, there’s no snow in sight. Jane wonders how something she heard from so many people is wrong.  Then at recess she runs into her brother. He’s laughing his heart out and tells her that he started the rumor on the bus and it spread like wildfire! Jane definitely learned her lesson…

Wait, is Wavii just a bunch of six-year-olds?? I’m confused!

No, but our systems face the same two problems that Jane does:

  • Incomplete information about the world: In the same way that Jane didn’t know whether her brother was joking, where her classmates were getting their information from, or what a “weatherman” was, so too do our systems operate with only a partial picture of what the world looks like.
  • Noisy stream of data:  Just as Jane had to deal with overheard snippets of sentences on the bus and unfamiliar vocabulary from her teacher, our systems sometimes get only fragments of things or see unfamiliar language.

So Jane did exactly what all of us do on a daily basis when trying to process what we see and hear:

  • Used her prior experience to inform present decisions
  • Updated her beliefs as she got each new piece of evidence
  • Maintained an awareness of her own confidence in her beliefs

At Wavii, we incorporate these same strategies into making decisions about what news to post.  Prior experience can tell our technology that the New York Times is more trustworthy than National Enquirer, or help it to decide whether “Barack Obama’s engagement in Afghanistan” is the same type of engagement as “Prince William and Kate Middleton’s engagement”.  And like Jane, our tech will update its beliefs as it gets confirmation of a story from more sources — but also keep an eye out for the Johns of the news world.  Finally, our tech can quantify its own confidence in its output based on factors like how much  data it has seen about a subject and how well it has performed in the past.

We’ve found that thinking about how humans approach a learning problem can often provide a good starting point for brainstorming computational solutions. A small child’s perspective is especially useful because it forces us to trim down our assumptions. And who knows, if we’re lucky, in a few years our small child will grow up and start being an insolent teenager!

Posted in Product | 1 Comment

Wavii…is alive!

Hello interwebs! Starting today, we’re launching Wavii to the world…!

Sign up at our website or get our iPhone app.

Brief is better

Remember when you used to receive emails about your friends’ adventures? Julie would write you a pages-long email about her trip to Italy….the food she ate…the sites she saw….the guy she met. That was great, but it took a lot of time and it’s easy to lose interest. Then Facebook came along and these messages got way shorter as quick status updates…Julie checked-in to a few places, posted photos, and changed her relationship status. So much faster to keep up by reading a couple of feed items…

We’re doing something similar at Wavii, but instead of keeping up on your friends we help you keep up with everything else. Start by following topics you care about…maybe the President and his rivals, a few celebs, your favorite startup, some gadgets you own, the companies you invest in, or tons more. Then, Wavii scours the web for what’s happening with them and automatically builds status updates for your news feed…like who won the election by how much, which celebrities are dating, who acquired who, movie releases, scandalous patent lawsuits…you get the idea.

Following topics to personalize your Wavii feed

Keeping you up to date with the world

If you start preaching that people need better ways to find relevant content, you’ll probably see a lot of heads nod. When it comes to staying up on stuff you care about, there’s never enough time. And man, is it a tough problem to solve!

A lot of companies are working on this, but they’re all focusing on one thing…making it prettier and prettier to find and read the articles. Problem is, sometimes you just don’t have that kind of time. Here’s a crazy idea…instead of giving me an article to read each and every time I want to know what happened….just give me a quick “status update.”

DATING: Miley Cyrus and Justin Bieber

ACQUISITION: Zynga

NEW MUSIC VIDEO: Mary J Blige

Enter Wavii…that’s what we’re doing. We’re putting the story first, helping you keep up with news feeds filled with brief status updates about what happened. Come give it a shot, and tell us what you think.

What’s new?

You may have read about Wavii back in January when we kicked off our private beta, or better yet, maybe you were a part of it. In any case, you might need to squint to recognize us — as part of our first time opening the service to everyone, a lot has changed…

Wavii for iPhone

The full Wavii experience in your pocket. Need I say more?

Getting Wavii setup Wavii feed on your iPhone

Wavii is social

Keeping up on the world is far more fun when you have someone to share it with. Tell us what you like, love, and love to hate! @mention your friends, share with Facebook or Twitter, your friends are at your fingertips.

React and discuss what's happening with friends

Take a little peek!

Each status update in your news feed tells you the brief story, but sometimes you want more. A quick click on any update let’s you peek right in…read the articles, find related stories, and see what your friends think.

Dive into news events with peek

Discover

Apparently there’s an election later this year? Why aren’t you following your favorite candidate, race, or political pundit yet!? Or maybe you’re more a techie…follow your favorite startup or VC. No matter what you’re in to our discover page will help you get started today following the things you love!

Political topics recommended for you

Huge thanks to the Wavii team…they’ve been working as hard as they can to get us here today. We really hope you enjoy the product and looking forward to hearing what you think!

Adrian

Posted in Featured, Product | 2 Comments

T-shirt love!

Remember that back in January we ran a little game for you to earn early access to Wavii?

Wavii's Invitation Funnel

Everyone who earned an invite has been an awesome help testing Wavii and providing feedback. Even better, a few hundred of you earned our custom Wavii shirt!










Thanks for tweeting them out!

Adrian

Posted in Community, Product | Leave a comment

Wavii’s in a bit of a sticky situation!

This is what happens when your production engineering team finds the sticky stash :-)

IMG_4848
IMG_4878
IMG_4854
IMG_4871

See them all here!

Happy April Fools’ Day!

Adrian

Posted in Product | Leave a comment

#sxsw #waviigram

A few of our team went down to Austin for SXSW this year…and decided, what could be more appropriate than bringing along our friend Austin?

Turns out he was pretty popular!
























Mr. Powers was also nice enough to hand out some cool Wavii SWAG…

…except that he soon ran out!

‘Till next year Austin!

Adrian

Posted in Product | Leave a comment

Things that are like the things you like

How on earth are you meant to keep track of everything happening these days?

The Internet! It’s just getting bigger by the second, isn’t it? So here at Wavii we’re doing our best to not only distill the news we find into bite size chunks but to help you find the bites you want.

A key aspect of helping you discover content is through recommendations, and one of the fundamental technologies we use to do this is the collaborative filter.

The main idea behind a collaborative filter is that we think you’ll like things that are like the things you like.

Breaking this idea into two parts…

We think you’ll like the things that are like the things you like.

How do we know what you like? Well, we decide this based on your interaction with the site including, primarily, the topics you already follow.

We think you’ll like the things that are like the things you like.

How do we know what things are like each other? This is where the art of recommendation systems comes into play; what “signals” can we use to find topics that are like each other? Once we have them, how can we combine them?

Huh? How?

For example, one of the many signals we use at Wavii to denote similarity between topics is the idea of bilateral links on Wikipedia.

Consider the Wikipedia page for 90s grunge sensation Pearl Jam. In their infobox (that summary box on the right hand side) you can see there’s a link to their lead singer, Eddie Vedder.

If we then go look at the page for Eddie Vedder we can see there’s a link back to Pearl Jam.

We use this bilateral link as a signal to say that Pearl Jam and Eddie Vedder have some sort of relationship. If we consider a whole bunch of these links we can build reasonably interesting graphs:

Click through to view a bigger version, can you spot the great Kardashian constellation?

The challenge lies in how we combine this signal with the numerous others we have and we’re working hard on making it perfect! If you feel like having a hack yourself here’s the link data.

Posted in Engineering | Leave a comment

Spread the ♥ via Open Source

Today we are releasing Spread. In the midst of our kerfuffling about Hadoop we wrote a tiny app to make MapReduce-like operations just another part of the unix toolchain. Spread’s only charter is to shuffle data across a fleet of machines so that identical keys are guaranteed to exist on the same machine. That single operational concept provides all of MapReduce when used with sort, awk, grep, and fleet tools like Chef.

Spread is moving into a friendly neighborhood of Wavii open source projects:

rails-dev-tweaks

rails-dev-tweaks speed up your Rails 3.1+ development environment.  It disables unnecessary work by your gems for certain kinds of requests, such as assets. Rather than waiting for all your dependencies to improve their performance in development mode, use a sledge hammer to make life that much better!

pfp

pfp is a pretty fast statistical parser for probabilistic context free grammars. pfp uses the exhaustive CYK algorithm found in the Stanford NLP parser (and gratefully uses its trained grammar), but boasts very healthy speed improvements, and a flexible set of API bindings.

Listy

Listy is a deterministic caching mechanism for django projects. It will keep the cache in-sync with the database by updating during changes instead of relying on timeouts. As implied by the name, Listy supports looking up lists of objects.

We’re happy to share some of our projects with you – we certainly benefit from the open-source community.  And if you’re reading this, then so do you!

Posted in Engineering | Leave a comment

Testers, testers, testers, testers, testers!

(sorry, couldn’t resist)

Okay, we love testers…and especially the hundreds of you that have sent us tons of feedback on where we’re doing something right and where we still need to get the house in order. I can’t tell you how awesome this is for us….so thank you!

This feedback has driven a bunch of feature updates. Here are a few.

Peek view

One of the most common requests is for a quick way to dive deeper into the stories surfaced in your feed…to read more of the source articles, find related content, etc., without leaving Wavii. We listened, and built a new preview pane we call the ‘peek view,’ which allows you to get all these details without ever leaving your feed — it’s snappy too! We’ve just started rolling out a preview version of it, and we’ll be turning on more and more parts of it over the next week or two. Start using it today by clicking on any feed item…

Happy, frownie, love…surprised?

Share your reaction to what’s happening in your feed. Did a startup acquisition make you angry? Or, did you like it, or LOVE it!? From what we heard in your feedback it did, and you wanted to tell the world about it. Now you can – just pick the face that matches your mood.

Follow story types too!

At first release, we only gave the option for you to follow people, products, companies, etc. We know you want more variety, so instead of just following the actors in a story, you can now the follow the story types — like ‘movie trailer releases,’ ‘celebrity breakups,’  ’quarterly earnings,’ and many more. So you don’t have to follow each and every movie that comes out, or every celebrity couple, etc., to get the latest. We’ve just turned on the first handful, and more are coming soon.

Quarterly earnings is a story topics that you can follow

Families

You don’t want to be following “iPhone” and miss what’s happening with the iPhone 4s. Or follow Facebook, but miss important news about Zuck because you forgot to follow him too. We came up with a solution for that – starting in the next few days we’ll begin pushing news about closely related things into your feed. If we wind up showing you something you really don’t want, you can easily tell us that and we’ll fix it going forward :-)

Keep the feedback coming, and if you’re not in yet hold tight (or register for early access), we’re weeks away from letting everyone into Wavii.

More to come!

Dan

Posted in Community, Product | 3 Comments

NSFW: Data Porn

It’s been busy here at Wavii — our first 3,000 users were let on to the site! We know a lot of you are still waiting for invitations, but don’t worry, the gates will be opening soon. In the meantime, we thought we’d share some fun numbers from last week.

To kick off our beta testing we wanted a small set of adventurous users to help find kinks in the system (thank you!). So we started by inviting 1,000 readers from TechCrunch and PandoDaily.

Wavii's Invitation FunnelInspired by our friends over at Everlane

But we also wanted an ongoing trickle of new users as well, so we created a way for more people to get in — each person who signs up at our home page is given a unique URL that they can share with their friends on Facebook and Twitter. As more of your friends use your link, the sooner you will be invited to Wavii.

Look at the Curves on Those!

There has been a lot of interest in how the invitation process went, so we started diving into the numbers… It turns out that building spreadsheets and trawling our databases is tedious and boring; we can do better than that.

I’m really excited to share with you a d3.js powered force-directed graph of our sign-ups!

Watch the HTML5 visualization:

Unfortunately that visualization is a little intense for some slower computers (hey, it’s my first d3 visualization – and it’s got a lot of room for improvement).

We’ve also made a YouTube version of the same thing:

Each bubble is a person that signed up on our site. People who arrived from TechCrunch are on the left in green, and people from PandoDaily are on the right in orange, and everyone else comes in from the bottom. Bubbles get larger and spawn smaller bubbles each time they refer someone, and then slowly die off as people stopped using their links.

  • Kevin Rose’s tweet caused 1,119 sign-ups, about 45 of those within a minute. He also drew other influencers like Dave Powell, who referred nearly 400 himself.
  • Marissa Mayer’s tweet caused 473 sign-ups, garnered influencers like Ricardo Galli and Adam Nash, and triggered an article.
  • PandoDaily caused 813 – taking 39 minutes to get to 100 – and TechCrunch caused 731, taking 44 minutes for their first hundred.
  • My favorite tidbit is the tweet by Guns N’ Roses. Apparently, every few weeks they tweet their followers in an effort to cause some havoc on the web and take down websites – we were their latest target. Thankfully, our servers held strong and they didn’t succeed :P

The Raw Numbers

  • So far, over 30,000 people have signed up.
  • Of those sign-ups, 14% managed to convince at least one other person to sign up on their behalf. Because of this, they got to see a sneek peek of the site.
  • Another 3% managed to get 5+ friends to sign up and are comfortably in the priority queue for our next major wave of Wavii invites.
  • 2% convinced 10+ friends to sign up, giving them an immediate invite to the site.
  • Finally, and much to our CEO Adrian’s chagrin, 1% will be getting snazzy new T-Shirts by referring 20+ friends.

Oh, There’ll be More

We hope you found this interesting; and we welcome any questions you might have. Also, if you feel like mucking about with the raw data for the visualization, feel free!

We are working hard to iterate upon our user feedback – and we will be getting the rest of our invites out as soon as possible.

-Ian

Posted in Product | 5 Comments

Wavii: Instant news feeds for any topic

If you’ve been following this blog, you know that we’re making Facebook out of Google… be one of the first 1,000 people today to sign up for early access via TechCrunch or PandoDaily, and we’ll send you an invite to our private beta.

Today, most people get their news by searching Google or browsing sensationalized headlines that repeat the same story. If you’re a bit savvier you use Twitter or a news reader, hoping they’ll point to stuff you care about… but no where can you simply follow your interests, like that politician you love to hate, your company, your sports team, this summer’s movies, etc.

Why not? Because today, nobody is creating feeds for all of these topics.

On Facebook you can get a feed for your friends. They can do this because your friends check-in when they go places, tag photos, like pages, update their relationship status, and more. Each of these common actions tells Facebook what’s happening and it converts them into feed items.

Wavii is creating these feeds for everything else — by teaching computers to do the work your friends do on Facebook.

Follow nearly any topic

You can follow a much broader range of topics, because we don’t need that topic to be involved to create its feed. Wavii’s engine reads the web, learns what’s happening about that topic, and automatically creates a news event for its feed.

During our beta we’re not supporting every topic yet, but we’re adding more each day.

Wavii feed

Clear and unbiased

Even if the same story in the news is written 6 different ways in 18 different articles, we simplify it all into a single, bite-sized news event in your feed…no spin, no repetitive articles all saying the same thing.

Of course, you can click on any feed item to dive into the details…see the sources, related events, etc.

Get the full story for any topic

You can browse a “timeline” for any topic on Wavii, to quickly learn what’s happening today or what happened in the past. The topic’s page also includes related topics so you can easily find more topics to follow.

Conversation starters

New social websites often feel like ghost towns. Nobody wants to post on a new site until their friends have already posted…chicken…egg. At Wavii this isn’t an issue. We fill your feed with events about the topics you follow, so there’s always something new to discuss, share, and more.

Only what you want

Your feed is personalized using what you actually follow, so it’s just what you care about. To make this easier, we look at your Facebook account and suggest some things for you to follow… and as you use Wavii we learn what you like, and make more recommendations based on that.

Select from recommended topics when setting up your Wavii account

Tailored content for each type of news event

We show the right visuals and content to enhance common types of stories. For example, news about an acquisition shows the company’s previous acquisitions, or an engagement between two celebrities shows a ring and an image of them together.

Building Wavii

In our perfect world, you can follow any topic you want. That’s a lot of topics, and to manually create feeds for each one like Facebook does would require thousands of people inputting data all day long…not at all practical. So instead, we’re training computers to do the work. Our engine reads everything being created across the web (e.g., articles, blogs, tweets, pictures), figures out what’s going on, and immediately creates feed items with tailored visualizations, related data, images, videos, etc. The hard part is teaching the system to identify the right topics when they’re mentioned (is “George Bush” referring to Senior or Junior), and to understand exactly what’s happening no matter how it’s written (bought, purchased, picked up, acqui-hired, and swallowed are all ways to say a company was acquired). To do this without making too many mistakes, we use a lot of data and a lot of machine learning. We’ll talk more about our tech in the coming weeks.

The result is your auto-magical news feed, instantly created for any topic you want to follow. Be one of the first to get started today!

Dan

Posted in Featured, Product | 1 Comment