political science – Computational Legal Studies

Final Data on the Physics of Law ONLINE Conference !

November 19, 2020November 23, 2020 Daniel Katz

Thanks to everyone who attended The Physics of Law Virtual Conference earlier this month. Overall, we had 292+ Attendees from 48 Countries watch the presentation of 20 Academic Papers by 62 Authors. We saw a wide range of methods from Physics, Computer Science and Applied Mathematics devoted to the exploration of legal systems and their outputs.

Methodological approaches included Agent Based Modeling, Game Theory and other Formal Modeling, Dynamics of Acyclic Digraphs, Knowledge Graphs, Entropy of Legal Systems, Temporal Modeling of MultiGraphs, Information Diffusion, etc.

NLP Methods on display included traditional approaches such as TF-IDF, n-grams, entity identification and other metadata extraction as well as more advanced methods such as Bert, Word2Vec, GloVe, etc.

Methods were then applied to topics including Attorney Advocacy Networks, Statutory Outputs from Legislatures, various bodies of Regulations, Contracts, Patents, Shell Corporations, Common Law Systems, Legal Scholarship and Legal Rules ∩ Financial Systems.

If you have an eligible paper – it is not too late to submit – papers are due in January. After undergoing the Peer Review process — Look for the Final Papers to be published in Frontiers in Physics in 2021.

Day One of The Physics of Law ONLINE Conference

November 12, 2020November 13, 2020 Daniel Katz

We are live at the Physics of Law Online Conference!

Over the next two days, we will have 20 Papers Presented from Scholars from Around the World …Click here to access the site so you can Sign Up for Day 2. If you would like to access the full agenda click here.

Election Models, Election Dynamics and Early Voting Data

October 24, 2020October 26, 2020 Daniel Katz

As it stands today, the Biden Campaign would appear quite likely but not guaranteed to win come November 3rd (or at some point thereafter). It could end early on November 3rd (if Florida appears to be trending toward Biden). Namely, it is hard to craft a scenario whereby Trump loses Florida and wins the White House. 538 has created an interactive where you can explore the inferential dynamics between the states (we learn about the likelihood in State B from the earlier results in State A). The interactive also highlights how results in early reporting states can reduce the remaining plausible paths to victory (there are only a few paths for Trump at this point).

Of course, it should be stated that remaining events or other issues could (potentially) change the dynamics or undermine the ability to leverage polls to make a proper inference. Here are few possibilities —

(1) Another October Surprise could drop between now and Election Day (there have already been several). However, it should be noted that one implication of all of this early voting is that the impact of a late October surprise is diminished.

(2) There could be systematic bias in polling (such as an unwillingness on behalf of voters to admit to pollsters their support for Trump). Alternatively, there could be a fundamental misunderstanding of the composition of the 2020 Electorate. As has been recently noted, the Trump Campaign has spent a significant amount of time on voter registration in several key battleground states. Will these newly registered folks actually vote ?

(3) Turnout dynamics associated with the cocktail of early voting (very large numbers so far), large scale absentee ballots (including rejection of ballots, delays in mail, etc.) or fear of turning up to the polls due to our latest COVID surge (the Trump campaign is counting on a Election Day surge). Any or all could impact the final outcome.

That said, if I had to bet I would bet on Biden to win (and give far better than even money).

We do have at least some information on the state of ongoing voting thanks to the Early Voting Tracking Project by Michael McDonald.

It is unprecedented turnout thus far. On its face this would purport to favor the Biden Campaign. However, the question remains whether this is merely a cannibalization of the normal Early In Person Voting and/or Election Day In Person Voting. In other words, how much will net turnout increase? Will it make a difference?

Taking Pennsylvania as a highly probable Tipping Point State, it will be interesting to see what percentage of mail in ballots are returned in the days to come. At the County level, there is significant variation in number of returned ballots thus far (even among those who have already requested a ballot).

Draft AGENDA – ‘Physics of Law’ – ONLINE Conference – November 12-13, 2020

October 21, 2020October 21, 2020 Daniel Katz

Here is the PDF of DRAFT Agenda for our Online Academic Conference entitled “The Physics of Law” which will take place on November 12-13. We have 20 Accepted Paper Abstracts from Research Teams from Around the World. Access to the Conference is FREE – but Registration is Required. Sign Up Today at PhysicsOfLaw.com !

Papers presented at this Conference are part of a Special Track for Frontiers in Physics and will appear in 2021 (after undergoing the Peer Review Process). Note although this is a technical conference — papers will reflect a range of methodological approaches (i.e. may be either Theoretical or Empirical).

The Physics of the Law – Legal Systems Through the Prism of Complexity Science [Online Conference – November 12 – 13, 2020]

October 7, 2020October 6, 2020 Daniel Katz

On November 12-13, 2020 – we will be hosting an Online Conference entitled “The Physics of Law – Legal Systems Through the Prism of Complexity Science.”

Papers presented at this Conference are part of a Special Track for Frontiers in Physics and will appear in 2021 (after undergoing the Peer Review Process).

After our Call for Papers — we have 20 Accepted Paper Abstracts for Papers which will presented at the Online Conference on November 12-13, 2020.

Note although this is a technical conference — papers will reflect a range of methodological approaches (i.e. may be either Theoretical or Empirical).

Access to the Conference is FREE – but Registration is Required. Sign Up Today at PhysicsOfLaw.com

Frontiers in Physics – Special Collection “The Physics of the Law: Legal Systems Through the Prism of Complexity Science”

August 21, 2020September 9, 2020 Daniel Katz

Open Call for Papers for a Special Collection in FRONTIERS in PHYSICS — “The Physics of the Law: Legal Systems Through the Prism of Complexity Science.” So far we have more than 30+ Scholars who have accepted our call for papers but we welcome others who would like to participate. Abstracts are due September 14th.

We welcome Original Research and Reviews where complexity science and quantitative approaches are deployed to evaluate the law / legal systems. Papers will be Peer Reviewed under the standards of Frontiers in Physics (or allied Frontiers Journals).

Papers can be empirical or theoretical but should be technical. If you have any questions feel free to message me.

An Online Virtual Conference will be held in early November.

Full Submission are due in January 2021.

More to come … please share!

Complex Societies and the Growth of the Law — v1.02

August 11, 2020September 9, 2020 Daniel Katz

Updated Version of our Paper — ’Complex Societies and the Growth of the Law’ is now on SSRN / arXiv. It is primarily a methods and measurement paper combining Network Science, Natural Language Processing, etc. to evaluate the growth of the law as a function of time. #LegalComplexity #LegalScience #NLP #NetworkScience #ComplexSystems #DataScience

SSRN LINK: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3602098
arXiv LINK: https://arxiv.org/abs/2005.07646

ABSTRACT – While a large number of informal factors influence how people interact, modern societies rely upon law as a primary mechanism to formally control human behaviour. How legal rules impact societal development depends on the interplay between two types of actors: the people who create the rules and the people to which the rules potentially apply. We hypothesise that an increasingly diverse and interconnected society might create increasingly diverse and interconnected rules, and assert that legal networks provide a useful lens through which to observe the interaction between law and society. To evaluate these propositions, we present a novel and generalizable model of statutory materials as multidimensional, time-evolving document networks. Applying this model to the federal legislation of the United States and Germany, we find impressive expansion in the size and complexity of laws over the past two and a half decades. We investigate the sources of this development using methods from network science and natural language processing. To allow for cross-country comparisons over time, we algorithmically reorganise the legislative materials of the United States and Germany into cluster families that reflect legal topics. This reorganisation reveals that the main driver behind the growth of the law in both jurisdictions is the expansion of the welfare state, backed by an expansion of the tax state.

‘Swing Voters, Swing Stocks, Swing Users: Scientists Develop a General Technique for Identifying Swing Components’

June 10, 2020September 9, 2020 Daniel Katz

Nice Shoutout (and perhaps a more approachable explanation) for our paper on the Santa Fe Institute Website.

“We propose a generalizable approach for identifying pivotal components across a wide variety of systems,” says author Edward Lee, a Program Postdoctoral Fellow who studies collective behavior at the Santa Fe Institute. “These systems go beyond voting, and include social media (like Twitter), biology (like the statistics of neurons), or finance (like fluctuations of the stock market).”

In the paper, Lee and his co-authors, Daniel Katz (Illinois Tech), Michael Bommarito (Stanford CodeX), and Paul Ginsparg (Cornell University) identify a statistical signature of pivotal components that they then trace to communities on Twitter, votes in the Supreme Court and Congress, and stock indices within financial markets. They find wide diversity in how social systems depend on sensitive points, when such points exist at all.”

Access our Paper – Sensitivity of Collective Outcomes Identifies Pivotal Components in the June 2020 Issue of Journal of the Royal Society Interface.

Sensitivity of Collective Outcomes Identifies Pivotal Components’ Published in the June Issue of The Journal of the Royal Society Interface

June 3, 2020September 9, 2020 Daniel Katz

Very excited that our paper – ‘Sensitivity of Collective Outcomes Identifies Pivotal Components’ has now been published in the June Issue of The Journal of the Royal Society Interface. https://royalsocietypublishing.org/doi/10.1098/rsif.2019.0873

Using the information geometry of minimal models from statistical physics, we develop an approach to identify pivotal components in wide variety of systems. We then apply this approach to a wide variety of empirical datasets including political voting, financial markets and social systems. We find remarkable variety from systems dominated by a median-like component to those without any single special component. Other systems (e.g., S&P sector indices) show varying levels of heterogeneity in between these extremes. Our information-geometric approach provides a principled, quantitative framework that may help assess the robustness of collective outcomes to targeted perturbation and compare social institutions, or even biological networks, with one another and across time.

#ComplexSystems #SocialPhysics #InformationGeometry #MedianVoter #Robustness #Sensitivity #Finance #PoliticalVoting

Asking About Social Circles Improves Election Predictions ( via Nature Human Behavior)

March 17, 2018 clsadmin

SCOTUS Crowdsourcing Paper Road Show (Presentation at University of Michigan Center for Political Studies / ISR)

January 12, 2018 clsadmin

Excited to take the show on the road next week where we will be presenting our SCOTUS Crowdsourcing Paper at University of Michigan Center for Political Studies and at the University of Minnesota Law School.

What A Difference 2 Percentage Points Makes and Nate Silver vs Huff Po Revisited

November 11, 2016 clsadmin

There is likely to be lots of recriminations in the pollster space but I think one thing is pretty clear — there was not enough uncertainty in the various methods of poll aggregation.

I will highlight the Huffington Post (Huff Po) model because they had such hubris about their approach. Indeed, Ryan Grimm wrote an attempted attack piece on the 538 model which stated “Silver is making a mockery of the very forecasting industry that he popularized.” (Turns out his organization was the one making the mockery)

Nate Silver responded quite correctly that this Huff Po article is “so fucking idiotic and irresponsible.” And it was indeed.
screen-shot-2016-11-11-at-11-08-37-am

Even after the election Huff Po is out there trying to characterize it as a black swan event. It is *not* a black swan event. Far from it … and among the major poll aggregators Five Thirty Eight was the closest because they had more uncertainty (which turns out was quite appropriate). Specifically, the uncertainty that cascaded through 538’s model was truthful … and just because it resulted in big bounds didn’t mean that it was a bad model, because the reality is that the system in question was intrinsically unpredictable / stochastic.

From the 538 article cited above “Our finding, consistently, was that it was not very robust because of the challenges Clinton faced in the Electoral College, especially in the Midwest, and therefore our model gave a much better chance to Trump than other forecasts did.”

Take a look again at the justification (explanation) from the Huffington Post: “The model structure wasn’t the problem. The problem was that the data going into the model turned out to be wrong in several key places.”

Actually the model structure was the problem in so much as any aggregation model should try to characterize (in many ways) the level of uncertainty associated with a particular set of information that it is leveraging.

Poll aggregation (or any sort of crowd sourcing exercise) is susceptible to systemic bias. Without sufficient diversity of inputs, boosting and related methodological approaches are *not* able to remove systematic bias. However, one can build a meta-meta model whereby one attempts to address the systemic bias after undertaking the pure aggregation exercise.

So what is the chance that a set of polls have systematic error such that the true preferences of a group of voters is not reflected? Could their be a Bradley type effect? How much uncertainty should that possibility impose on our predictions? These were the questions that needed better evaluation pre-election.

It is worth noting that folks were aware of the issue in theory but most of them discounted it to nearly zero. Remember this piece in Vanity Fair which purported to debunk the Myth of the Secret Trump Voter (which is the exact systematic bias that appeared to undermine most polls)?

Let us look back to the dynamics of this election. There was really no social stigma associated with voting for Hillary Clinton (in most social circles) and quite a bit (at least in certain social circles) with voting for Trump.

So while this is a set back for political science, I am hoping what comes from all of this is better science in this area (not a return to data free speculation (aka pure punditry)).

P.S. Here is one more gem from the pre-election coverage – Election Data Hero Isn’t Nate Silver . It’s Sam Wang (the Princeton Professor had HRC at more than a 99% chance of winning). Turns out this was probably the worst performing model because it has basically zero model meta-uncertainty.

Evaluating Counterfactual History – Which Cases Will be Affected by the Passing of Justice Scalia?

February 17, 2016 clsadmin

The Three Forms of (Legal) Prediction: Experts, Crowds and Algorithms — Professors Daniel Martin Katz & Michael J. Bommarito

September 3, 2015 clsadmin

The Three Forms of (Legal) Prediction: Experts, Crowds and Algorithms — Professors Daniel Martin Katz & Michael J. Bommarito from Daniel Martin Katz

Econometrics (hereinafter Causal Inference) versus Machine Learning

September 2, 2015 clsadmin

Perhaps some hyperbolic language in here but the basic idea is still intact … for law+economics / empirical legal studies – the causal inference versus machine learning point is expressed in detail in this paper called “Quantitative Legal Prediction.” Mike Bommarito and I have made this point in these slides, these slides, these slides, etc. Mike and I also make this point on Day 1 of our Legal Analytics Class (which really could be called “machine learning for lawyers”).

Daniel Martin Katz, Ron Dolin & Michael Bommarito, Legal Informatics, Cambridge University Press (2021) (Edited Volume) < Cambridge >

Corinna Coupette, Janis Beckedorf, Dirk Hartung, Michael Bommarito, & Daniel Martin Katz, Measuring Law Over Time: A Network Analytical Framework with an Application to Statutes and Regulations in the United States and Germany, 9 Front. Phys. 658463 (2021) < Frontiers in Physics > < Supplemental Material >

Daniel Martin Katz, Legal Innovation (Book Forward) in Mapping Legal Innovation: Trends and Perspectives (Springer) (Antoine Masson & Gavin Robinson, eds.) (2021) < Springer >

Michael Bommarito, Daniel Martin Katz & Eric Detterman, LexNLP: Natural Language Processing and Information Extraction For Legal and Regulatory Texts in Research Handbook on Big Data Law (Edward Elgar Press) (Roland Vogl, ed.) (2021) < Edward Elgar > < Github > < SSRN > < arXiv >

Daniel Martin Katz, Corinna Coupette, Janis Beckedorf & Dirk Hartung, Complex Societies and the Growth of the Law, 10 Scientific Reports 18737 (2020) < Nature Research > < Supplemental Material >

Edward D. Lee, Daniel Martin Katz, Michael J. Bommarito II, Paul Ginsparg, Sensitivity of Collective Outcomes Identifies Pivotal Components, 17 Journal of the Royal Society Interface 167 (2020) < Journal of the Royal Society Interface > < Supplemental Material >

Michael Bommarito, Daniel Martin Katz & Eric Detterman, OpenEDGAR: Open Source Software for SEC EDGAR Analysis, MIT Computational Law Report (2020) < MIT Law > < Github >

J.B. Ruhl & Daniel Martin Katz, Mapping the Law with Artificial Intelligence in Law of Artificial Intelligence and Smart Machines (ABA Press) (2019) < ABA Press >

J.B. Ruhl & Daniel Martin Katz, Harnessing the Complexity of Legal Systems for Governing Global Challenges in Global Challenges, Governance, and Complexity (Edward Elgar) (2019) < Edward Elgar >

J.B. Ruhl & Daniel Martin Katz, Mapping Law’s Complexity with ‘Legal Maps’ in Complexity Theory and Law: Mapping an Emergent Jurisprudence (Taylor & Francis) (2018) < Taylor & Francis >

Michael Bommarito & Daniel Martin Katz, Measuring and Modeling the U.S. Regulatory Ecosystem, 168 Journal of Statistical Physics 1125 (2017) < J Stat Phys >

Daniel Martin Katz, Michael Bommarito & Josh Blackman, A General Approach for Predicting the Behavior of the Supreme Court of the United States, PLoS ONE 12(4): e0174698 (2017) < PLoS One >

J.B. Ruhl, Daniel Martin Katz & Michael Bommarito, Harnessing Legal Complexity, 355 Science 1377 (2017) < Science >

J.B. Ruhl & Daniel Martin Katz, Measuring, Monitoring, and Managing Legal Complexity, 101 Iowa Law Review 191 (2015) < SSRN >

Paul Lippe, Daniel Martin Katz & Dan Jackson, Legal by Design: A New Paradigm for Handling Complexity in Banking Regulation and Elsewhere in Law, 93 Oregon Law Review 831 (2015) < SSRN >

Paul Lippe, Jan Putnis, Daniel Martin Katz & Ian Hurst, How Smart Resolution Planning Can Help Banks Improve Profitability And Reduce Risk, Banking Perspective Quarterly (2015) < SSRN >

Daniel Martin Katz, The MIT School of Law? A Perspective on Legal Education in the 21st Century, Illinois Law Review 1431 (2014) < SSRN > < Slides >

Daniel Martin Katz & Michael Bommarito, Measuring the Complexity of the Law: The United States Code, 22 Journal of Artificial Intelligence & Law 1 (2014) < Springer > < SSRN >

Daniel Martin Katz, Quantitative Legal Prediction – or – How I Learned to Stop Worrying and Start Preparing for the Data Driven Future of the Legal Services Industry, 62 Emory Law Journal 909 (2013) < SSRN >

Daniel Martin Katz, Joshua Gubler, Jon Zelner, Michael Bommarito, Eric Provins & Eitan Ingall, Reproduction of Hierarchy? A Social Network Analysis of the American Law Professoriate, 61 Journal of Legal Education 76 (2011) < SSRN >

Michael Bommarito, Daniel Martin Katz & Jillian Isaacs-See, An Empirical Survey of the Written Decisions of the United States Tax Court (1990-2008), 30 Virginia Tax Review 523 (2011) < SSRN >

Daniel Martin Katz, Michael Bommarito, Juile Seaman, Adam Candeub, Eugene Agichtein, Legal N-Grams? A Simple Approach to Track the Evolution of Legal Language in Proceedings of JURIX: The 24th International Conference on Legal Knowledge and Information Systems (2011) < SSRN >

Daniel Martin Katz & Derek Stafford, Hustle and Flow: A Social Network Analysis of the American Federal Judiciary, 71 Ohio State Law Journal 457 (2010) < SSRN >

Michael Bommarito & Daniel Martin Katz, A Mathematical Approach to the Study of the United States Code, 389 Physica A 4195 (2010) < SSRN > < arXiv >

Michael Bommarito, Daniel Martin Katz & Jonathan Zelner, On the Stability of Community Detection Algorithms on Longitudinal Citation Data in Proceedings of the 6th Conference on Applications of Social Network Analysis (2010) < SSRN > < arXiv >

Michael Bommarito, Daniel Martin Katz, Jonathan Zelner & James Fowler, Distance Measures for Dynamic Citation Networks 389 Physica A 4201 (2010) < SSRN > < arXiv >

Michael Bommarito, Daniel Martin Katz & Jonathan Zelner, Law as a Seamless Web? Comparing Various Network Representations of the United States Supreme Court Corpus (1791-2005) in Proceedings of the 12th International Conference on Artificial Intelligence and Law (2009) < SSRN >

Marvin Krislov & Daniel Martin Katz, Taking State Constitutions Seriously, 17 Cornell Journal of Law & Public Policy 295 (2008) < SSRN >

Daniel Martin Katz, Derek Stafford & Eric Provins, Social Architecture, Judicial Peer Effects and the ‘Evolution’ of the Law: Toward a Positive Theory of Judicial Social Structure, 23 Georgia State Law Review 975 (2008) < SSRN >

Daniel Martin Katz, Institutional Rules, Strategic Behavior and the Legacy of Chief Justice William Rehnquist: Setting the Record Straight on Dickerson v. United States, 22 Journal of Law & Politics 303 (2006) < SSRN >

Daniel Martin Katz, Michael Bommarito, Tyler Sollinger & James Ming Chen, Law on the Market? Abnormal Stock Returns and Supreme Court Decision-Making < SSRN > < arXiv > < Slides >

Daniel Martin Katz, Michael Bommarito & Josh Blackman, Crowdsourcing Accurately and Robustly Predicts Supreme Court Decisions < SSRN > < arXiv > < Slides >

Daniel Martin Katz & Michael Bommarito, Regulatory Dynamics Revealed by the Securities Filings of Registered Companies < Slides >

Pierpaolo Vivo, Daniel Martin Katz & J.B. Ruhl (Editors), The Physics of the Law: Legal Systems Through the Prism of Complexity Science, Special Collection for Frontiers in Physics (2021 Forthcoming) < Frontiers in Physics >

Corinna Coupette, Dirk Hartung, Janis Beckedorf, Maximilian Bother & Daniel Martin Katz, Law Smells – Defining and Detecting Problematic Patterns in Legal Drafting < SSRN >

Ilias Chalkidis, Abhik Jana, Dirk Hartung, Michael Bommarito, Ion Androutsopoulos, Daniel Martin Katz & Nikolaos Aletras, LexGLUE: A Benchmark Dataset for Legal Language Understanding in English < arXiv > < SSRN >