Main

November 20, 2009

Wall Street Journal article: Science as a Team Sport

Today I was quoted in the Wall Street Journal in an article on scientific collaborations by Robert Lee Hotz. (They also did a short video accompanying the online edition, in which I talk a little bit more about the advantages and problems of collaboration.) Here's an excerpt of the piece:

Once a mostly solitary endeavor, science in the 21st century has become a team sport. Research collaborations are larger, more common, more widely cited and more influential than ever, management studies show. Measured by the number of authors on a published paper, research teams have grown steadily in size and number every year since World War II.

To gauge the rise of team science, management experts at Northwestern University recently analyzed 2.1 million U.S. patents filed since 1975 and all of the 19.9 million research papers archived in the Institute for Scientific Information database. "We looked at the recorded universe of all published papers across all fields, and we found that all fields were moving heavily toward teamwork," says Northwestern business sociologist Brian Uzzi.

As research projects grow more complicated, management becomes a variable in every experiment. "You can't do it alone," says research management analyst Maria Binz-Scharf at City College of New York. "The question is how you put it all together."

The key is bringing the people together in the first place, which has sped technological advancements that often benefited the rest of us. The ease of global business and social networking today owes much to the World Wide Web, which was designed to aid information-sharing between scientists. It was invented at the European Organization for Nuclear Research (CERN), the home of the Large Hadron Collider.

New online science management experiments are underway. Last year, the National Science Foundation started a $50 million project to map all plant biology research, from the level of molecules to organisms to entire ecosystems, so scientists can swoop through shared data as if they were using Google Earth. Last month, U.S. computer experts launched a $12 million federal project to create a national biomedical network called VIVOweb to encourage collaborations.

Scientists are experimenting with the new technology of teamwork even in mathematics, where researchers customarily work alone.

This is such an exciting area of research. Together with Leslie Paik and Avrom Caplan (both from the City College of New York), I will be devoting a good part of the next three years to study how scientists collaborate, especially how the collaborative production of scientific knowledge changes as collaborations are increasingly virtual. This work is supported by the NSF (see here for the project abstract and here for the CCNY press release).

September 21, 2009

Communities in Networks

Uncovering the "community" structure of social networks has a long history, but communities play a pivotal role in almost all networks across disciplines. Intuitively, one can think of a network community as consisting of a group of nodes that are relatively densely connected to each other but sparsely connected to other dense groups of nodes. Communities are important because they are thought to have a strong bearing on functional units in many networks. So, for example, communities in social networks can correspond to different social groups, such as family, whereas web pages dealing with a given subject tend to form topical communities.

The concept is simple enough, but it turns out that coming up with precise mathematical definitions and algorithms for community detection is one of the most challenging problems in network science. Recently, a lot of the research in this area has been done using ideas from statistical physics, which has an arsenal of tools and concepts to tackle the problem. Unfortunately (but understandably) relatively few non-physicists like to read statistical physics papers.

Together with my colleagues Mason Porter (Oxford University) and Peter Mucha (University of North Carolina at Chapel Hill), we thought it would be useful to let others take a peek at some of this work. In an effort to put in context some of the hundreds of papers, we recently compiled an introductory review on some of our favorite approaches to community detection. While there are excellent existing reviews, our "Communities in Networks", published by Notices of the American Mathematical Society (AMS), tries to make sense of this smorgasbord of methods and, hopefully, lets a broader audience get a flavor of this exciting field.

I hope to be making a couple of postings on community structure and community detection later on. In the meantime, you can see for yourself if we have succeeded by checking out the freely accessible article on the AMS website, or by going to arXiv or SSRN.

community.gif
The largest connected component of a coauthorship network connecting physicists who have published together on networks. Each node is colored according to community membership.

July 14, 2009

The complexity of Government 2.0

In today's post, I would like to address three issues related to Government 2.0: transparency, citizenship and agenda hi-jacking.

First, while we read a lot about transparency, it is easier said than done. For example, transparency levels may be highly dependent on the government context and its potential (unintended) impact on either discloser or public behavior--whether citizens or corporations. Second, when participation is emphasized--whether online of offline--, we need to revisit our understanding of citizenship today and in the future. Thirdly, political agendas/policies may be "hi-jacked" by bottom-up Internet-based approaches of proposing alternatives which also relates back to the question of citizenship and legitimacy.

Government 2.0 is the flavor of the year. Other terms now being introduced are WikiGovernment, Collaborative Government, Information Government or the U.S. administration's Open Government. While the terms might differ and the authors that introduce them slightly vary in their description and priorities, all of them intend to convey the same ideas: participation, collaboration, transparency and technology jointly allow for a new form of government and governance. Certain things are here to stay; others will pass out of fashion quickly.

The following quotes may illustrate my concerns:

A memo released by the White House, called federal agency heads to "upgrading the capacity of regulatory agencies for using the Internet to become more open, efficient, and responsive". The National Performance Review (NPR) recommended to "[u]se information technology and other techniques to increase opportunities for early, frequent, and interactive public participation during the rulemaking process and to increase program evaluation efforts."

This sounds familiar. However, the White House memo dates back to Dec 17, 1999 and NPR's recommendation back to September 1993. Therefore, policies that connect openness and responsiveness to the potential of technology have been around for over 40 years in government. Some think that eGovernment is dead. But its ideas are quite alive; especially thoughts on eDemocracy seem to finally become reality. eGovernment (the internal/external use of technology in government) does not contradict Government 2.0 anyways. On a 50.000 foot level the use of social media in government is the use of technology.The envisioned transformation requires patience and long-term support from policy makers because government is a complex ecosystem which is resilient to change.

Along these lines, I recently read an interesting blog post (Steve Radick) which reminded me of a post I contributed to this blog (why government is ahead in web 2.0 in 2008.

Of course we should not let the past constrain our vision about the future. Yet, the past may prevent us from being overly optimistic or in other words, overly disappointed when all things envisioned don't become reality.

Transparency
The Obama administration's agenda on transparency (the latest move was making information on government IT spendingavailable) is amazing but these policies as a form of regulation are not new to government. For an overview of transparency initiatives and regulations visit freedominfo.org, wobsite.be or Wikipedia. The European Commission also introduced a directive on the re-use of public sector information in 2003. Unfortunately, it is difficult to get a full overview and understanding of the level of progress of the latter in EU Member States. Consequently, it should openly be discussed how the level of transparency of a government or any of its agencies can be measured.

While Vivek Kundra agrees in principle that all public government data should be online, he also cautions that the reality is government data sits in more than 10,000 different systems, many of them written in old programing languages or are still locked in dusty paper archives. Accordingly, eGovernment is not dead. Without the appropriate infrastructure (interoperability standards, electronic records management, enterprise architecture) projects such as data.gov can only achieve parts of their true potential.

In general, for transparency we have two primary actors: the discloser and the user. There are many ways for discloser to provide less than complete information or hide important information by providing excessive amounts of information. Placing data in the public domain does not guarantee that it will be used or used in the intended way. Data may be ignored, approached with indifference, misunderstood or misused. For example, data may make it easier for special interest groups to lobby for their own interests. Transparency activities are complex and need full commitment of a government body.

Finally, government and politics are based on the type and flow of information. Transparency policies, social media and the influx of "believers of openness" in government have slightly altered the process. That may have two effects.

On the one hand, it has become more difficult to contain information. At the same time the need to monitor the "global thought stream" is increasing to be able to proactively react to emerging "crisis". These continue to be defined by traditional media (tv, radio, press) once they declare some Internet trend "news" (Note the change: Digital collective action can quickly lead to more media coverage; past: media leading to collective action).

On the other hand, transparency and social media could lead to even tighter confidentiality protocols and altered behaviour of elected officials. "Negative" media coverage/spin continues to be "sunlight" which government tries to avoid at all costs. A recent episode of "The Daily Show" provides a case in point.

The Daily Show With Jon StewartMon - Thurs 11p / 10c
Cheney Predacted
www.thedailyshow.com
Daily Show
Full Episodes
Political HumorSpinal Tap Performance

Mainstream media also like to quote twitter messages of U.S. members of Congress and adding their spin to 140 character thoughts. Some of the early adopters still offer unique commentary. How long will this be he case?

Citizenship and Participation
Despite all the anti-American sentiment around the globe, the Obama administration has remarkably managed to export its open government policy around the globe. It spread virally through the Internet. Inspired by U.S. and UK based initiatives, individuals (early adopters) in other countries have started applying these initiatives to their national context (mostly exact copies) or supporting calls for government action ("democratization"). Numerous "experts" are presenting (mostly the same) ideas and good practice cases to government officials. Many of those officials are still struggling with the topic. For example, many are still wondering about the best way of "eParticipation" which is the current buzz.

However, there is an underlying question we need to answer that is far more complex and fundamental than eParticipation:

How do we define citizenship in an era of Government 2.0?

This requires a return to political theorists such as Aristotle, John Rawls and Jurgen Habermas as well as multi-disciplinary deliberation of what we would like citizenship to be. Because in the near future, every established form of decision making--especially on the political level--will experience collective action based on the increase of expressive capability of the Internet (Everyone can claim for a democratization of "something" pointing to the potential use of social media). In addition, the digital divide between those who are offline, those who are online and those who "live" online ("Netcitizen") continues to exist.

Similar to transparency, the opportunity to participate may simply be neglected until a true need arises. An average worker might only have 2-3 hours available per day to engage in participatory action which are competing with many other leisure activities. Consequently, there is also the issue of legitimacy of those participatory actions that were either offered by government or started by citizens.

Agenda hi-jacking
To prove my last point, I would like to draw on a current example from Europe. In November 2009, the EU Ministerial eGovernment Conference will take place in Malmoe, Sweden. It is planned to present a ministerial declaration on eGovernment in the EU for the next seven years. This declaration will be the result of back-room dealings between EU Member States (MS).

However, this year a group of people led by two companies decided to use a
social media facilitated bottom-up approach to create a declaration
alongside the official one in Malmoe for eGovernment 2015 It is also their goal to get official endorsement of their version from the European Commission. As the content of the platform is openly accessibly, ideas might even find their way into the official document. The group's motivation is probably a mix of self-marketing, fascination for social media and spirit to influence policy making.

So far, 75 individuals participated in the activity. It will be interesting to see how many people will sign the declaration. It will also be interesting to see whether and when the media will pick-up the story of alternative agenda and how much pressure this will exert on policy makers. Considering the total population of 500 Mio EU citizens, legitimacy of this initiative is questionable.

Nevertheless, the EU is at a crossroads: If it does not open up more, it will further strip itself of legitimacy. Gov 2.0 type activities provide one avenue to strengthen the EU and its institutions.

Finally, with regards to research, I see two issues. First, old and new research from various disciplines relating to Government 20 is not connected. Second, researchers can hardly keep pace with the current output of Government 2.0 policies and projects being implemented.

March 12, 2007

Microbias and Macroperformance

This is an abstract of todays PNG/CCCSN seminar with Daniel Diermeier (Kellogg School of Management, Northwestern University). We encourage you to discuss his presentation via comments on the blog.

"We use agent-based modeling to study collective problem solving in complex social networks where information aggregation and consensus building is modeled as the density classification task. We show that simple individual aggregation rules in conjunction with complex interaction patterns are highly efficient in solving the density classification task. We then investigate the effect of conservatism and partisanship on classification efficiency in large populations. We find that conservative agents enhance the populations’ ability to efficiently solve the density classification task despite large levels of noise in the system. In contrast, we find that the presence of even a small fraction of partisans holding the minority position will result in deadlock or a consensus on an incorrect answer. Our results provide a possible explanation for the emergence of conservatism and suggest that even low levels of partisanship can lead to significant social costs."

Here are the related publications:
Global Coordination in Modular Networks
Efficient system-wide coordination in noisy environments

August 16, 2006

"Network elasticity" and "individual plasticity"

I briefly want to plug these two book end constructs that I framed in a paper I wrote some years ago in the Journal of Mathematical Sociology ("The Co-evolution of Network and Individual"), which examined how networks and nodes co-evolve. Essentially, network elasticity captures how endogenous the network is—how much nodes get to choose who they connect to. Individual plasticity, in turn, captures how endogenous “attributes" are—how much individuals are affected by who they are connected to. In this paper (and the discussion here) I apply these ideas to social influence processes, but the concepts are more general than that. I would argue that different social systems differ dramatically in how elastic their networks are, and how plastic the nodes are, which, in turn, has certain systemic implications.

The idea that individuals affect their network as compared to being affected by their network are sometimes placed at opposite ends of the spectrum; but of course, they are really orthogonal processes. Since this paper was written, statistical tools (e.g., by Tom Snijders and his team with Siena) have been refined to examine just such a coevolutionary process. My focus is really on something different than estimating the underlying transition probabilities for the change in state of particular relationships or nodes.

Rather, what I am focused on are the long run dynamic systemic consequences of different levels of elasticity and plasticity. For example, in Co-evolution, I examined the social network within a government agency, where the social structure was very rigid, where the ties of a new person were pretty much the same as their predecessor, and that individuals entered when their were early in their professional career and thus pretty malleable. The result was that structure drove attitudes, not the other way around. One could produce a 2 x 2 typology of networks and plausible resulting dynamics:

High plasticity and low elasticity: homophilous network, where the social structure will drive attitudes (e.g., traditional bureaucracy).

High elasticity and low plasticity: homophilous network, where social systems will polarize along nodal characteristics.

High elasticity and high plasticity: dual possibilities of emerging with a homogeneous, cohesive, network, or polarized cliques that do not talk to each other.

Low elasticity and low plasticity: heterophilous network.

Of course, the above depends a lot on the determinants of the social structure; an inelastic network that forces you to talk with likeminded people has very different implications than an inelastic network that forces you to talk with people who are different from you.

The Co-evolution of Individual and Network" Journal of Mathematical Sociology, January 2001, 69-108.