| Sun | Mon | Tue | Wed | Thu | Fri | Sat |
|---|---|---|---|---|---|---|
| 1 | 2 | |||||
| 3 | 4 |
5 |
6 |
7 | 8 |
9 |
| 10 | 11 | 12 | 13 | 14 | 15 | 16 |
| 17 | 18 | 19 |
20 | 21 | 22 | 23 |
24 |
25 |
26 |
27 | 28 |
29 |
« Auld lang syne: networks as behavioral flows | Main | Social network analysis, the NSA, and “pattern analysisâ€? »
22 December 2005
Social relations between individuals can be complex systems. How the structure of social networks impacts the behaviour of a system has been researched recently. These are i.e. power grids, neural networks, the World Wide Web or stock markets. Although different in the underlying interaction dynamics or micro-physics, all these networks have shown a tendency to self-organize in structures that share common features. In particular, the number of connections, for each element, or node, of the network follow a power law distribution. Networks that fulfill this property are referred to as scale-free (SF) networks M. Bartolozzi, D. B. Leinweber1, A. W. Thomas. (2005).
I would like to draw your attention to 2 projects which are using the power law in a direct and indirect way. First, there is the use of virtual stock markets to improve market research. Second, a recent project concerning blogs and virtual stock markets (VSMs) tries to proove the existence of powerlaw.
VSMs aven been applied in the form of a political stock market to predict the outcome of US presidential election in 1988 at the University of Iowa. The results of these studies demonstrate that the predictions outperform opinion polls in terms of forecast accuracy. Furthermore, the results of political stock markets show that VSMs can perform well even if their participants are not a representative sample. The reason is that VSMs elicit the participants' assessments of the market outcome and a rational participant should not trade according to individual preferences, but according to the prediction of the market outcome based on the overall preferences of market participants. Thus, the decision is based on the most common features an individual anticipates in the market (powerlaw). More virtual stock market research is in this area is currently underway by an international research team (Martin Spann (U Passau), Gerrit van Bruggen (EU Rotterdam), Ely Dahan (UCLA) and Gary Lilien (U Penn)). Although it is more focused on business and market research some outcomes might be useful in other research areas.
BlogShares is the exploration of an emerging social network. Blogs are valued by their incoming links from other blogs. A blog is defined as a company and links become the main source of value in the VSM. Players speculate on thousands of blogs by buying and selling shares or rather the shifts of attention within the network. Blogshares claims to have proven the powerlaw which is in this case that 20% of the blogs contain 80% of all incoming links.
Posted by Alexander Schellong at December 22, 2005 5:48 PM
Finding data out there that fits the power law is interesting -- but it is interesting in the same sense that learning that the probability that 2 or more people in a room of only 24 randomly selected people will have the same birthday is greater than 0.5. Its a neat curiosity. But can you explain exactly how learning that something fits a power law probability distribution is useful in a scientific sense? If correct, it will, by definition, tell you the probability distribution of the next item (stock, blog, etc.), but will it necessarily tell you anything (or anything important) about the social, political, economic, or other mechanism that gives rise to the observed power law histogram? This does not seem so obvious to me since different underlying mechanisms can give rise to the same aggregate distribution. So why should anyone make a fuss every time a variable seems to fit the power law (whether a distribution _looks_ like a power law statistially fits a power law density is another issue that we can skip over for now).
Posted by: Gary King at December 31, 2005 5:50 PM
To echo this point, is there anything intrinsically more interesting about saying that something fits a power law distribution as compared to a normal distribution?
It does seem to me that sometimes it is useful when there is some presumption that the distribution looks different than a power law to say that it is power law distributed. By analogy, this hanukah I was playing dreidel with my kids. The dreidel has four sides, and one assumes that each side has an equal chance of coming up. However, out of a 100+ spins, the gimmel only came up once or so, meaning that it was rather likely we had a weighted dreidel-- good to know, because the game went on forever as a result. For a social science example, Matt Hindman's work on "Googlearchy" is a nice application along these lines, where he finds that for various issue areas a fairly small number of websites dominate. This is different from the factual assertions of some that the Internet would open the doors to a larger number of perspectives. But that's different from asserting that a power law says something very specific about the process that produced that distribution. Indeed, the apparent omnipresence of powerlaws (firm sizes, city sizes, word frequency in Ulysses, degree distribution of websites, forest fires) suggests many processes lead to power laws.
I would suggest that for a given model to produce a power law distribution similar to that observed in real world data is thus a weak test of that model. It is one "peg" of a model to data, but to validate a model should require multiple pegs. (A broader discussion of the issue of model validation will wait for another entry.)
Zipf's law (and relatives) is a partial exception to this, because it specifies not just a power law distribution, but the particular shape of the distribution, which is more powerful. I do not know of a definitive explanation for why city sizes in different countries almost universally fit the same distribution (except, perhaps, the stochastic growth model of Simon's). Again, however, there may turn out to be many ways to produce Zipf's law. (If readers know of recent developments along these lines, please post.)
References:
www.princeton.edu/~mhindman/googlearchy--hindman.pdf
Simon, H.A. (1955). On a class of skew istribution functions. Biometrika, 42, 425-440.
Human Behavior and the Principle of Least Effort
GK Zipf - 1949 - Addison-Wesley
Posted by: David Lazer at January 2, 2006 11:59 PM