Programming Dynamic Models in Python-Part 3: Outbreak on a Network

In this post, we will continue building on the basic models we discussed in the first and second tutorials. If you haven’t had a chance to take a look at them yet, definitely go back and at least skim them, since the ideas and code there form the backbone of what we’ll be doing here.

In this tutorial, we will build a model that can simulate outbreaks of disease on a small-world network (although the code can support arbitrary networks).  This tutorial represents a shift away from both:

a) the mass-action mixing of the first two and and

b) the assumption of social homogeneity across individuals that allowed us to take some shortcuts to simplify model code and speed execution. Put another way, we’re moving more in the direction of individual-based modeling.

When we’re done, your model should be producing plots that look like this:

Outbreak on a small-world network
Outbreak on a small-world network

Red nodes are individuals who have been infected before the end of the run, blue nodes are never-infected individuals and green ones are the index cases who are infectious at the beginning of the run.

And your model will be putting out interesting and unpredictable results such as these:

Time vs. # of cases
Time vs. # of cases

In order to do this one, though, you’re going to need to download and install have igraph for Python on your system.

Individual-Based Networks

It is important to make the subtle distinction between individual and agent based models very clear here. Although the terms  are often used interchangeably, referring to our nodes, who have no agency, per se, but are instead fairly static receivers and diffusers of infection, as agents, seems like overreaching. Were they to exhibit some kind of adaptive behavior, i.e., avoiding infectious agents or removing themselves from the population during the infective period, they then become more agent-like.

This is not to under- or over-emphasize the importance or utility of either approach, but just to keep the distinction in mind to avoid the “when all you have is a hammer, everything looks like a nail” problem.

In short, adaptive agents are great, but they’re overkill if you don’t need them for your specific problem.

Small World Networks

The guiding idea behind small-world networks is that they capture some of the structure seen in more realistic contact networks: most contacts are regular in the sense that they are fairly predicable, but there are some contacts that span tightly clustered social groups and bring them together.

In the basic small-world model, an individual is connected to some (small, typically <=8) number of his or her immediate neighbors. Some fraction of these network connections are then randomly re-wired, so that some individuals who were previously distant in network terms – i.e., connected by a large number of jumps – are now adjacent to each other. This also has the effect of shortening the distance between their neighbors and individuals on the other side of the graph. Another way of putting this is that we have shortened the average path length and increased the average reachability of all nodes.

These random connections are sometimes referred to as “weak ties”, as there are fewer of these ties that bridge clusters than there are within clusters. When these networks are considered from a sociological perspective, we often expect to find that the relationship represented by a weak tie is one in which the actors on either end have less in common with each other than they do with their ‘closer’ network neighbors.

Random networks also have the property of having short average path lengths, but they lack the clustering that gives the small-world model that pleasant smell of quasi-realism that makes them an interesting but largely tractable, testing ground for theories about the impact of social structure on dynamic processes.

Installation and Implementation Issues

If you have all the pre-requisites installed on your system, you should be able to just copy and paste this code into a new file and run it with your friendly, local Python interpreter. When you run the model, you should first see a plot of the network, and when you close this, you should see a plot of the number of infections as a function of time shortly thereafter.

Aside from the addition of the network, the major conceptual difference is that the model operates on discrete individuals instead of a homogeneous population of agents. In this case, the only heterogeneity is in the number and identity of each individual’s contacts, but there’s no reason we can’t (and many do) incorporate more heterogeneity (biological, etc.) into a very similar model framework.

With Python, this change in orientation to homogeneous nodes to discrete individuals seems almost trivial, but in other languages it can be somewhat painful. For instance, in C/++, a similar implementation would involve defining a struct with fields for recovery time and individual ID, and defining a custom comparison operator for these structs. Although this is admittedly not a super-high bar to pass, it adds enough complexity that it can scare off novices and frustrate more experienced modelers.

Perhaps more importantly, it often has the effect of convincing programmers that a more heavily object-oriented approach is the way to go, so that each individual is a discrete object. When our individuals are as inert as they are in this model, this ends up being a waste of resources and makes for significantly more cluttered code. The end result can often be a model written in a language that is ostensibly faster than Python, such as C++ or Java, that runs slower than a saner (and more readable) Python implementation.

For those of you who are playing along at home, here are some things to think about and try with this model:

  1. Change the kind of network topology the model uses (you can find all of the different networks available in igraph here).
  2. Incorporate another level of agent heterogeneity: Allow agents to have differing levels of infectivity (Easier); Give agents different recovery time distributions (Harder, but not super difficult).
  3. Make two network models – you can think of them as separate towns – and allow them to weakly influence each other’s outbreaks. (Try to use the object-oriented framework here with minimal changes to the basic model.)

That’s it for tutorial #3, (other than reviewing the comment code which is below) but definitely check back for more on network models!

In future posts, we’ll be thinking about more dynamic networks (i.e., ones where the links can change over time), agents with a little more agency, and tools for generating dynamic visualizations (i.e., movies!) of stochastic processes on networks.

That really covers the bulk of the major conceptual issues. Now let’s work through the implementation.

Click Below to Review the Implementation and Commented Code!

Continue reading “Programming Dynamic Models in Python-Part 3: Outbreak on a Network”

Christakis and Fowler in Wired Magazine

Wired

Today marks the official release of Connected: The Surprising Power of Our Social Networks and How They Shape Our Lives by Nicholas A. Christakis & James H. Fowler.  There has been some really good publicity for the book including the cover story in last Sunday’s New York Times Magazine. However, given the crisp visualizations — my favorite is the above article from Wired Magazine.  Click on the visual above to read the article!

Positive Legal Theory and a Model of Intellectual Diffusion on the American Legal Academy [Repost from 4/22]

For the third installment of posts related to Reproduction of Hierarchy? A Social Network Analysis of the American Law Professoriate, we offer a Netlogo simulation of intellectual diffusion on the network we previously visualized.  As noted in prior posts, we are interested legal socialization and its role in considering the spread of particular intellectual or doctrinal paradigms. This model captures a discrete run of the social epidemiological model we offer in the paper.   As we noted within the paper, this represents a first cut on the question—where we favor parsimony over complexity.  In reality, there obviously exist far more dynamics than we engage herein.  The purpose of this exercise is simply to begin to engage the question. In our estimation, a positive theory of law should engage the sociology of the academy — a group who collectively socialize nearly every lawyer and judge in the United States. In the paper and in the model documentation, we offer some possible model extensions which could be considered in future scholarship.

Once you click through to the model, here is how it works:

(1) Click the Setup Button in the Upper Left Corner.  This will Display the Network in the Circular Layout.

(2) Click the Layout Button.  Depending upon the speed of your machine this may take up to 30 seconds.  Stop the Layout Button by Re-Clicking the Button.

(3) Click the Size Nodes by Degree Button. You Will Notice the Fairly Central Node Colored in Red.  This is School #12 Northwestern University Law School.  Observe how we have set the default infected school as #12 Northwestern (Hat Tip to Uri Wilensky).  A Full List of School Number is available at the bottom of the page when you click through.

(4) Now, we are ready to begin.  Click the Spread Once Button.  The idea then reaches its neighbors with probability p (set as a default at .05).  You can click the Toggle Infection Tree button (at any point) to observe the discrete paths traversed by the idea.

(5) Click the Spread Once Button, again and again.  Notice the plot tracking the time on the x axisand the number of institution infected on the y axis.  This is an estimate of the diffusion curve for the institution.

(6) To restart the simulation, click the Reinfect One button.  Prior to hitting this button, slide theInfected Slider to any Law School you would like to observe.  Also, feel free to adjust the p slider to increase or decrease the infectiousness of the idea.

Please comment if you have any difficulty or questions.  Note you must have Java 1.4.1 + installed on your computer.  The Information Technology professionals at many institutions will have already installed this on your machine but if not you will need to download it.   We hope you enjoy!

The S.I.R. Model — A Simple Model With Applications to Swine Flu, etc.

 

Virus on a Network

Last week we offered a model of intellectual diffusion built upon a standard fare social epidemiology model.  Given recent events within the United States, Mexico and potentially worldwide, we thought it would be worthwhile to highlight the classic S.I.R. (Susceptible, Infected, Recovered) model.  Netlogo offers a user friendly version of the model.  Using this platform, we hope the exploration of the dynamics of S.I.R. might prove illuminating.    

First, various hosts have different levels of interactions (work, home, transit, etc.) and so this network approach represents a blunt measure.   To start the model at the default parameters, push the SETUP Button and then the GO Button.  As the model runs, the plot tracks the Susceptible, Infected, Recovered.  The model contains a variety of  “sliders.”  The model can be rerun at lots of combinations of parameter levels.  Those “sliders” fall into several categories: Network Attributes, Virus Attributes, Node Attributes.   The full documentation is available here.  

With respect to the swine flu, one important parameter is the delay between when an individual becomes infectious and when that individual is likely to become symptomatic.  This parameter can be tuned in the simulation above using VIRUS-CHECK-FREQUENCY slider.  From the documentation… “Infected nodes are not immediately aware that they are infected. Only every so often (determined by the VIRUS-CHECK-FREQUENCY slider) do the nodes check whether they are infected by a virus.”  

An additional parameter worthy of consideration is the VIRUS-SPREAD-CHANCE.  Consider this slider as a rough measure of the underlying infectiousness of the virus in question.        

It is important to note the above simulation is an incredible simplification of the world faced by public health officials.  Additionally, this version of the model was designed to consider the spread of disease on a computer network.  Notwithstanding these limitations, we thought it useful to highlight a computational approach to this important matter of public concern.

The Revolution Will Not Be Televised — But Will it Come from HLS or YLS ? A Social Network Analysis of the Legal Academy (Part IV)

Law Prof Diffusion

This is the final installment of posts related to Reproduction of Hierarchy? A Social Network Analysis of the American Law Professoriate. Thanks for your emails.

Here is the plot we provide within the paper.  As a general proposition, we believe this represents an upper bound measure for the intellectual reach of an agenda offered by a given institution.  With respect to our version of the Reed Frost Epidemiological Model, we use the p parameter to model “idea infectiousness.”  When p = 1 every institution “contacted” by the idea is infected with the idea. When p = 0 no institution “contacted” by the idea is infected.  In this version, we use the programming language python to run the model 500 times per institution. The above plot represents an estimate of the “diffusion curve” for each of the 184 institutions in our model. Building off central limit type properties, this leaves a far better estimate of reach than is offered in the single model run from the previous Netlogo GUI.

A cursory review of the above plot demonstrates, we are far from the land of linearity.  Namely, a large number of institutions are able to reach much of the graph with very small changes in the value of p.

In the Structure of Scientific Revolutions, Kuhn quotes from Max Planck:  “a new scientific truth does not triumph by convincing its opponents and making them see the light, but rather because its opponents eventually die, and a new generation grows up that is familiar with it.” Following Planck, we believe retirement is indeed be an important mechanism.  However, we also argue the nature of the p parameter is a relevant consideration.  In fact, unpacking various dimensions of p is the key to the broader model. Specifically, what are the properties of an idea that generate its infectiousness? Of course, we might like to believe infectiousness is related to a class of normatively attractive properties such as promoting efficiency or justice.  However, it is not clear that this follows.

We took no pass on the question of whether some institutions would be better or worse at producing ideas with greater or lesser values of p. The motivated question for this post considers whether, in general, the institutions which are top producers of law professors are (1) leaders in innovation, (2) subsequent ratifiers of a newly established paradigm or (3) defenders of the status quo. In a deep sense, we are asking how to reasonably model decision making by the heterogeneous agents located at such institutions.  Do institutions reward or punish intellectual risk-taking, search, etc.?

While this is an empirical question beyond the scope of this post, it worth asking because it partially informs the micro-dynamics plausibly responsible for generating the spread of new intellectual paradigms.

Model of Intellectual Diffusion Upon the American Legal Academy


picture-211
For the third installment of posts related to Reproduction of Hierarchy? A Social Network Analysis of the American Law Professoriate, we offer a Netlogo simulation of intellectual diffusion on the network we previously visualized.  As noted in prior posts, we are interested legal socialization and its role in considering the spread of particular intellectual or doctrinal paradigms. This model captures a discrete run of the social epidemiological model we offer in the paper.   As we noted within the paper, this represents a first cut on the question—where we favor parsimony over complexity.  In the paper and in the model documentation, we offer some possible model extensions which could be considered in future scholarship.   

Once you click through to the model, here is how it works:

(1) Click the Setup Button in the Upper Left Corner.  This will Display the Network in the Circular Layout. 

(2) Click the Layout Button.  Depending upon the speed of your machine this may take up to 30 seconds.  Stop the Layout Button by Re-Clicking the Button. 

(3) Click the Size Nodes by Degree Button. You Will Notice the Fairly Central Node Colored in Red.  This is School #12 Northwestern University Law School.  Observe how we have set the default infected school as #12 Northwestern (Hat Tip to Uri Wilensky).  A Full List of School Number is available at the bottom of the page when you click through.     

(4) Now, we are ready to begin.  Click the Spread Once Button.  The idea then reaches its neighbors with probability p (set as a default at .05).  You can click the Toggle Infection Tree button (at any point) to observe the discrete paths traversed by the idea.  

(5) Click the Spread Once Button, again and again.  Notice the plot tracking the time on the x axis and the number of institution infected on the y axis.  This is an estimate of the diffusion curve for the institution.    

(6) To restart the simulation, click the Reinfect One button.  Prior to hitting this button, slide the Infected Slider to any Law School you would like to observe.  Also, feel free to adjust the p slider to increase or decrease the infectiousness of the idea. 

Please comment if you have any difficulty or questions.  Note you must have Java 1.4.1 + installed on your computer.  The Information Technology professionals at many institutions will have already installed this on your machine but if not you will need to download it.   We hope you enjoy!