Chris Dixon

Graphs

It has become customary to use “graph” to refer to the underlying data structures at social networks like Facebook. (Computer scientists call the study of graphs “network theory,” but on the web the word “network” is used to refer to the websites themselves).

A graph consists of a set of nodes connected by edges. The original internet graph is the web itself, where webpages are nodes and links are edges. In social graphs, the nodes are people and the edges friendship. Edges are what mathematicians call relations. Two important properties that relations can either have or not have are symmetry (if A ~ B then B ~ A) and transitivity (if A ~ B and B ~ C then A ~ C).

Facebook’s social graph is symmetric (if I am friends with you then you are friends with me) but not transitive (I can be friends with you without being friends with your friend).  You could say friendship is probabilistically transitive in the sense that I am more likely to like someone who is a friend’s friend then I am a user chosen at random. This is basis of Facebook’s friend recommendations.

Twitter’s graph is probably best thought of as an interest graph. One of Twitter’s central innovations was to discard symmetry: you can follow someone without them following you. This allowed Twitter to evolve into an extremely useful publishing platform, replacing RSS for many people. The Twitter graph isn’t transitive but one of its most powerful uses is retweeting, which gives the Twitter graph what might be called curated transitivity.

Graphs can be implicitly or explicitly created by users. Facebook and Twitter’s graphs were explicitly created by users (although Twitter’s Suggested User List made much of the graph de facto implicit). Google Buzz attempted to create a social graph implicitly from users’ emailing patterns, which didn’t seem to work very well.

Over the next few years we’ll see the rising importance of other types of graphs. Some examples:

Taste: At Hunch we’ve created what we call the taste graph. We created this implicitly from questions answered by users and other data sources. Our thesis is that for many activities – for example deciding what movie to see or blouse to buy – it’s more useful to have the neighbors on your graph be people with similar tastes versus people who are your friends.

Financial Trust: Social payment startups like Square and Venmo are creating financial graphs – the nodes are people and institutions and the relations are financial trust. These graphs are useful for preventing fraud, streamlining transactions, and lowering the barrier to accepting non-cash payments.

Endorsement: An endorsement graph is one in which people endorse institutions, products, services or other people for a particular skill or activity. LinkedIn created a successful professional graph and a less successful endorsement graph. Facebook seems to be trying to layer an endorsement graph on its social graph with its Like feature. A general endorsement graph could be useful for purchasing decisions and hence highly monetizable.

Local: Location-based startups like Foursquare let users create social graphs (which might evolve into better social graphs than what Facebook has since users seem to be more selective friending people in local apps). But probably more interesting are the people and venue graphs created by the check-in patterns. These local graphs could be useful for, among other things, recommendations, coupons, and advertising.

Besides creating graphs, Facebook and Twitter (via Facebook Connect and OAuth) created identity systems that are extremely useful for the creation of 3rd party graphs. I expect we’ll look back on the next few years as the golden age of graph innovation.

It’s not that seed investors are smarter – it’s that entrepreneurs are

Paul Kedrosky recently speculated that there might be seed fund “crash” looming. Liz Gannes followed up by suggesting seed investors are a fad akin to reality-TV celebrities:

In many ways, what [prominent seed funds] are saying is that they’re just smarter, and as such will outlast all the copycat and wannabe seed funders as well as the stale VCs with a fresh coat of paint. But then — Kim Kardashian is the only one who can make a living tweeting. At some point it will be quite obvious whether the super angels’ investments and strategy succeed or fail.

Here’s the key point these analyses overlook: It’s not the seed investors who are smarter – it’s the entrepreneurs. Consider the case of the last company I co-founded, SiteAdvisor. We raised our first round of $2.6M at a $2.5M pre-money valuation. After the first round of funding, investors owned 56% of the company. Moreover, the $2.6M came in 3 tranches: $500K, another $500K, and then $1.6K.  To get the 2nd and 3rd tranches we had to hit predefined milestones and re-pitch the VC partnerships. Had we instead raised the first $1M from seed funds, we would have been free to raise the remaining money at a higher valuation. In fact, after we spent less than $1M building the product, we raised more money at a $16M pre-money valuation. We never even touched the $1.6M third tranche even though it caused us to take significant dilution. This was a very common occurrence before the rise of seed funds, due to VCs pressuring entrepreneurs to raise more money than they needed so the VCs could “put more money to work.” When SiteAdvisor was eventually acquired, we had spent less than a third of the money we raised. Compare the dilution we actually took to what we could have taken had we raised seed before VC:

Professional seed funds barely existed back then, especially on the East Coast. And even if they did, I’m not sure I would have been savvy enough to opt for them over VCs. I thought the brands of the big VCs would help me and didn’t really understand the dynamics of fund raising.* Today, entrepreneurs are much savvier, thanks to the proliferation of good advice on blogs, via mentorship programs, and a generally more active and connected entrepreneur community. For example, Founder Collective recently backed two Y-Combinator startups who decided to raise money exclusively from seed investors despite having top-tier VCs throwing money at them at higher valuations. These were “hot” companies who had plenty of options but realized they’d take less start-to-exit dilution by raising money from helpful seed investors first and VCs later.

Will there be there a seed fund crash? Seed fund returns are highly correlated with VC returns which are highly correlated with public markets and the overall economy. I have no idea what the state of the overall economy will be over the next few years. Perhaps it will crash and take VCs and seed funds down with it. But I do have strong evidence that prominent seed funds will outperform top-tier VC funds, because I know the details of their investments, and that their portfolios contain the same companies as top-tier VCs except the they invested in earlier rounds at significantly lower valuations.  So unless these prominent seed funds were incredibly unlucky picking companies (and since they are extremely diversified I highly doubt that), their returns will significantly beat top-tier VC returns.

* Note that we have nothing but gratitude toward the SiteAdvisor VCs – Rob Stavis at Bessemer and Hemant Taneja at General Catalyst. They offered what was considered a market deal at the time and supported us when (almost) no one else would.