data mining – Page 2 – Computational Legal Studies

LegalAnalyticsCourse.com — Now Online (but still in beta)

January 31, 2015 clsadmin

This intro class is designed to train students to efficiently manage, collect, explore, analyze, and communicate in a legal profession that is increasingly being driven by data.

Our goal is to equip our students to understand the process of extracting actionable knowledge from data, to distinguish themselves in legal proceedings involving data or analysis, and to assist in firm and in-house management, including billing, case forecasting, process improvement, resource management, and financial operations.

This course assumes prior knowledge of statistics, such as might be obtained in Quantitative Methods for Lawyers or through advanced undergraduate curricula. This class is not for everyone; for many, it will prove to be challenging. With that warning, we encourage you to consider your interest and career aspirations against the unique experience and value of this class. To our knowledge, this is the only existing class that teaches these quantitative skills to lawyers and law students.

Still in beta – we will be adding much more to this site as we move forward!

Legal Analytics – Introduction to the Course – Professors Daniel Martin Katz + Michael J Bommarito

January 16, 2015 clsadmin

Here is an introductory slide deck from “Legal Analytics,” a course that Mike Bommarito and I are teaching this semester. Relevant legal applications include predictive coding in e-discovery (i.e. classification), early case assessment and overall case prediction, pricing and staff forecasting, prediction of judicial behavior, etc.

As I have written in my recent article in Emory Law Journal – we are moving into an era of data driven law practice. This course is a direct response to demands from relevant industry stakeholders. For a large number of prediction tasks … humans + machines > humans or machines working alone.

We believe this is the first ever Machine Learning Course offered to law students, and it is our goal to help develop the first wave of human capital trained to thrive as this new data driven era takes hold. Richard Susskind likes to highlight this famous quote from Wayne Gretzky … “A good hockey player plays where the puck is. A great hockey player plays where the puck is going to be.”

Deep Neural Networks are Easily Fooled: High Confidence Predictions for Unrecognizable Images (via arXiv)

December 9, 2014 clsadmin

The Next Era of Designers Will Use Data as Their Medium (via Wired)

December 4, 2014 clsadmin

Using R for Quantitative Methods for Lawyers and Legal Analytics Courses (Professors Katz + Bommarito)

November 28, 2014 clsadmin

While its performance is sometimes problematic for extremely large data problems, R (with the RStudio frontend) is the data science language du jour for many small to medium data problems. Among other things, R is great because it is open source and highly customizable, with thousands of packages available to be loaded for a specific problem.

While Python and SQL are also important parts of the overall data science toolkit, we use R as our preferred language in both Quantitative Methods for Lawyers (3 credits) as well as in our Legal Analytics course (2 credits). We have found that students who are diligent can make amazing strides in a relatively short amount of time. For example, see this final project by Pat Ellis from last year’s course.

Here are some introductory resources that we have developed to get folks started:

Loading R and R Studio
R Boot Camp – Part 1 – Loading Datasets and Basic Data Exploration
Data Cleaning and Additional Resources
R Boot Camp – Part 2 – Statistical Tests Using R
Basic Data Visualization in R
Scatter Plots, Covariance, Correlation Using R
Intro to Regression Analysis Using R

Over the balance of the 2014-2015 academic year, Mike and I will be introducing a variety of new things to the quantitative sequence, including dplyr, etc. … more to come …

14th FRAP – Finance, Risk and Accounting Management Perspectives Conference @ Oriel College – Oxford

September 23, 2014 clsadmin

Tomorrow I will be presenting initial results for my new project called ‘Law on the Market’ (co-authored with Jim Chen, Michael Bommarito & Tyler Sollinger) at the Oxford FRAP Finance Conference at Oriel College!

Coding for Lawyers – A Free Online Textbook (By V. David Zvenyach)

September 14, 2014 clsadmin

Artificial Intelligence, Robotics and the Future of Jobs (via Pew Research)

August 16, 2014 clsadmin

This Computer Program Can Predict 7 out of 10 Supreme Court Decisions (via Vox.com)

August 4, 2014 clsadmin

The story is here. Full form interview with Mike + Josh is here. (I unfortunately could not participate because I was teaching my ICPSR class). Our paper is available on SSRN and on the physics arXiv.

Predicting the Behavior of the Supreme Court of the United States: A General Approach (Katz, Bommarito & Blackman)

July 24, 2014 clsadmin

Abstract: “Building upon developments in theoretical and applied machine learning, as well as the efforts of various scholars including Guimera and Sales-Pardo (2011), Ruger et al. (2004), and Martin et al. (2004), we construct a model designed to predict the voting behavior of the Supreme Court of the United States. Using the extremely randomized tree method first proposed in Geurts, et al. (2006), a method similar to the random forest approach developed in Breiman (2001), as well as novel feature engineering, we predict more than sixty years of decisions by the Supreme Court of the United States (1953-2013). Using only data available prior to the date of decision, our model correctly identifies 69.7% of the Court’s overall affirm and reverse decisions and correctly forecasts 70.9% of the votes of individual justices across 7,700 cases and more than 68,000 justice votes. Our performance is consistent with the general level of prediction offered by prior scholars. However, our model is distinctive as it is the first robust, generalized, and fully predictive model of Supreme Court voting behavior offered to date. Our model predicts six decades of behavior of thirty Justices appointed by thirteen Presidents. With a more sound methodological foundation, our results represent a major advance for the science of quantitative legal prediction and portend a range of other potential applications, such as those described in Katz (2013).”

You can access the current draft of the paper via SSRN or via the physics arXiv. Full code is publicly available on Github. See also the LexPredict site. More on this to come soon …
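
The abstract stresses that every prediction uses only data available prior to the date of decision. The sketch below illustrates that evaluation protocol only, under loud assumptions: it is not the authors' extremely randomized trees model, the `Case` record and the `walk_forward_accuracy` function are hypothetical names introduced here, and the predictor is a deliberately trivial "most common outcome so far" baseline. Cases decided on the same day are scored together, so none of them can inform a prediction about the others.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Case:
    """Hypothetical record: a decision date and a binary outcome label."""
    decided: date
    outcome: str  # e.g. "reverse" or "affirm"


def walk_forward_accuracy(cases, default="reverse"):
    """Score a 'most common outcome so far' baseline in date order.

    Each case is predicted from strictly earlier cases only. The
    `default` label is used until at least one earlier case exists.
    """
    ordered = sorted(cases, key=lambda c: c.decided)
    seen = Counter()  # outcome counts from cases decided on earlier dates
    correct = 0
    i = 0
    while i < len(ordered):
        # Collect every case decided on the same date as ordered[i].
        j = i
        while j < len(ordered) and ordered[j].decided == ordered[i].decided:
            j += 1
        batch = ordered[i:j]
        # Predict before the batch's own outcomes are revealed.
        prediction = max(seen, key=seen.get) if seen else default
        correct += sum(1 for c in batch if c.outcome == prediction)
        # Only now add the batch to the history used for later dates.
        seen.update(c.outcome for c in batch)
        i = j
    return correct / len(ordered) if ordered else 0.0


if __name__ == "__main__":
    # Made-up toy data, for illustration only.
    toy = [
        Case(date(2000, 1, 10), "reverse"),
        Case(date(2000, 1, 10), "reverse"),
        Case(date(2000, 2, 1), "affirm"),
        Case(date(2000, 3, 1), "reverse"),
        Case(date(2000, 4, 1), "affirm"),
    ]
    print(walk_forward_accuracy(toy))
```

Swapping the baseline for a real classifier would mean refitting (or updating) the model on `seen` history at each step; the date-ordered loop is the part that keeps the evaluation "fully predictive" in the sense the abstract describes.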

ZoningCheck Puts Computable Municipal Codes to Use (via TechRepublic)

July 19, 2014 clsadmin

HT: Dazza Greenwood from MIT Media Lab

Data Science, Big Data and Statistics – Can We All Live Together?

July 4, 2014 clsadmin

Data Science, Big Data and Statistics – can we all live together? from Chalmers Internal on Vimeo.

Invention as a Combinatorial Process: Evidence from U.S. Patents (via arXiv.org)

June 14, 2014 clsadmin

Twitter to Release All Tweets to Scientists (via Scientific American)

May 27, 2014 clsadmin

<Update–Only a Select Number of Institutions/Researchers Will Have Access>

Searching for Superspreaders of Information in Real-World Social Media (via ArXiv.org / MIT Technology Review)

May 15, 2014 clsadmin

Daniel Martin Katz, Ron Dolin & Michael Bommarito, Legal Informatics, Cambridge University Press (2021) (Edited Volume) < Cambridge >

Corinna Coupette, Janis Beckedorf, Dirk Hartung, Michael Bommarito, & Daniel Martin Katz, Measuring Law Over Time: A Network Analytical Framework with an Application to Statutes and Regulations in the United States and Germany, 9 Front. Phys. 658463 (2021) < Frontiers in Physics > < Supplemental Material >

Daniel Martin Katz, Legal Innovation (Book Foreword) in Mapping Legal Innovation: Trends and Perspectives (Springer) (Antoine Masson & Gavin Robinson, eds.) (2021) < Springer >

Michael Bommarito, Daniel Martin Katz & Eric Detterman, LexNLP: Natural Language Processing and Information Extraction For Legal and Regulatory Texts in Research Handbook on Big Data Law (Edward Elgar Press) (Roland Vogl, ed.) (2021) < Edward Elgar > < Github > < SSRN > < arXiv >

Daniel Martin Katz, Corinna Coupette, Janis Beckedorf & Dirk Hartung, Complex Societies and the Growth of the Law, 10 Scientific Reports 18737 (2020) < Nature Research > < Supplemental Material >

Edward D. Lee, Daniel Martin Katz, Michael J. Bommarito II, Paul Ginsparg, Sensitivity of Collective Outcomes Identifies Pivotal Components, 17 Journal of the Royal Society Interface 167 (2020) < Journal of the Royal Society Interface > < Supplemental Material >

Michael Bommarito, Daniel Martin Katz & Eric Detterman, OpenEDGAR: Open Source Software for SEC EDGAR Analysis, MIT Computational Law Report (2020) < MIT Law > < Github >

J.B. Ruhl & Daniel Martin Katz, Mapping the Law with Artificial Intelligence in Law of Artificial Intelligence and Smart Machines (ABA Press) (2019) < ABA Press >

J.B. Ruhl & Daniel Martin Katz, Harnessing the Complexity of Legal Systems for Governing Global Challenges in Global Challenges, Governance, and Complexity (Edward Elgar) (2019) < Edward Elgar >

J.B. Ruhl & Daniel Martin Katz, Mapping Law’s Complexity with ‘Legal Maps’ in Complexity Theory and Law: Mapping an Emergent Jurisprudence (Taylor & Francis) (2018) < Taylor & Francis >

Michael Bommarito & Daniel Martin Katz, Measuring and Modeling the U.S. Regulatory Ecosystem, 168 Journal of Statistical Physics 1125 (2017) < J Stat Phys >

Daniel Martin Katz, Michael Bommarito & Josh Blackman, A General Approach for Predicting the Behavior of the Supreme Court of the United States, PLoS ONE 12(4): e0174698 (2017) < PLoS One >

J.B. Ruhl, Daniel Martin Katz & Michael Bommarito, Harnessing Legal Complexity, 355 Science 1377 (2017) < Science >

J.B. Ruhl & Daniel Martin Katz, Measuring, Monitoring, and Managing Legal Complexity, 101 Iowa Law Review 191 (2015) < SSRN >

Paul Lippe, Daniel Martin Katz & Dan Jackson, Legal by Design: A New Paradigm for Handling Complexity in Banking Regulation and Elsewhere in Law, 93 Oregon Law Review 831 (2015) < SSRN >

Paul Lippe, Jan Putnis, Daniel Martin Katz & Ian Hurst, How Smart Resolution Planning Can Help Banks Improve Profitability And Reduce Risk, Banking Perspective Quarterly (2015) < SSRN >

Daniel Martin Katz, The MIT School of Law? A Perspective on Legal Education in the 21st Century, Illinois Law Review 1431 (2014) < SSRN > < Slides >

Daniel Martin Katz & Michael Bommarito, Measuring the Complexity of the Law: The United States Code, 22 Journal of Artificial Intelligence & Law 1 (2014) < Springer > < SSRN >

Daniel Martin Katz, Quantitative Legal Prediction – or – How I Learned to Stop Worrying and Start Preparing for the Data Driven Future of the Legal Services Industry, 62 Emory Law Journal 909 (2013) < SSRN >

Daniel Martin Katz, Joshua Gubler, Jon Zelner, Michael Bommarito, Eric Provins & Eitan Ingall, Reproduction of Hierarchy? A Social Network Analysis of the American Law Professoriate, 61 Journal of Legal Education 76 (2011) < SSRN >

Michael Bommarito, Daniel Martin Katz & Jillian Isaacs-See, An Empirical Survey of the Written Decisions of the United States Tax Court (1990-2008), 30 Virginia Tax Review 523 (2011) < SSRN >

Daniel Martin Katz, Michael Bommarito, Julie Seaman, Adam Candeub & Eugene Agichtein, Legal N-Grams? A Simple Approach to Track the Evolution of Legal Language in Proceedings of JURIX: The 24th International Conference on Legal Knowledge and Information Systems (2011) < SSRN >

Daniel Martin Katz & Derek Stafford, Hustle and Flow: A Social Network Analysis of the American Federal Judiciary, 71 Ohio State Law Journal 457 (2010) < SSRN >

Michael Bommarito & Daniel Martin Katz, A Mathematical Approach to the Study of the United States Code, 389 Physica A 4195 (2010) < SSRN > < arXiv >

Michael Bommarito, Daniel Martin Katz & Jonathan Zelner, On the Stability of Community Detection Algorithms on Longitudinal Citation Data in Proceedings of the 6th Conference on Applications of Social Network Analysis (2010) < SSRN > < arXiv >

Michael Bommarito, Daniel Martin Katz, Jonathan Zelner & James Fowler, Distance Measures for Dynamic Citation Networks, 389 Physica A 4201 (2010) < SSRN > < arXiv >

Michael Bommarito, Daniel Martin Katz & Jonathan Zelner, Law as a Seamless Web? Comparing Various Network Representations of the United States Supreme Court Corpus (1791-2005) in Proceedings of the 12th International Conference on Artificial Intelligence and Law (2009) < SSRN >

Marvin Krislov & Daniel Martin Katz, Taking State Constitutions Seriously, 17 Cornell Journal of Law & Public Policy 295 (2008) < SSRN >

Daniel Martin Katz, Derek Stafford & Eric Provins, Social Architecture, Judicial Peer Effects and the ‘Evolution’ of the Law: Toward a Positive Theory of Judicial Social Structure, 23 Georgia State Law Review 975 (2008) < SSRN >

Daniel Martin Katz, Institutional Rules, Strategic Behavior and the Legacy of Chief Justice William Rehnquist: Setting the Record Straight on Dickerson v. United States, 22 Journal of Law & Politics 303 (2006) < SSRN >

Daniel Martin Katz, Michael Bommarito, Tyler Sollinger & James Ming Chen, Law on the Market? Abnormal Stock Returns and Supreme Court Decision-Making < SSRN > < arXiv > < Slides >

Daniel Martin Katz, Michael Bommarito & Josh Blackman, Crowdsourcing Accurately and Robustly Predicts Supreme Court Decisions < SSRN > < arXiv > < Slides >

Daniel Martin Katz & Michael Bommarito, Regulatory Dynamics Revealed by the Securities Filings of Registered Companies < Slides >

Pierpaolo Vivo, Daniel Martin Katz & J.B. Ruhl (Editors), The Physics of the Law: Legal Systems Through the Prism of Complexity Science, Special Collection for Frontiers in Physics (2021 Forthcoming) < Frontiers in Physics >

Corinna Coupette, Dirk Hartung, Janis Beckedorf, Maximilian Böther & Daniel Martin Katz, Law Smells – Defining and Detecting Problematic Patterns in Legal Drafting < SSRN >

Ilias Chalkidis, Abhik Jana, Dirk Hartung, Michael Bommarito, Ion Androutsopoulos, Daniel Martin Katz & Nikolaos Aletras, LexGLUE: A Benchmark Dataset for Legal Language Understanding in English < arXiv > < SSRN >