David Mimno

Bio: David Mimno is an associate professor in the Department of Information Science at Cornell University. He holds a PhD from UMass Amherst and was previously the head programmer at the Perseus Project at Tufts and a researcher at Princeton University. His work has been supported by the Sloan Foundation, the NEH, and the NSF.

I supervise PhD students in Information Science and Computer Science.


[Reviewing] The four categories of acceptable papers

[Topic modeling] How LDA algorithms work

Recent workshops and tutorials:

GenLaw, ICML workshop on Generative AI and Law, Honolulu, Hawai'i, July 2023

Translation Tutorial: A Hands-On Introduction to Large Language Models for Fairness, Accountability, and Transparency Researchers, ACM FAccT tutorial, Chicago, June 2023


INFO 6010: Quantitative methods for Information Science [Spring 2023]

INFO 2950: Introduction to Data Science [Fall 2022]

INFO 3350/6350: Text mining for History and Literature [Fall 2019]

INFO 6150/CS 6788: Advanced Topic Modeling [Spring 2021]


I ran the Text as Data (TADA) 2022 conference.

The BERT for Humanists project makes large language models accessible for researchers working on text as data problems.

MALLET provides text classification and high-quality sampling-based topic modeling.

Here are some resources and tools that might be useful:

Cornell University, Department of Information Science, [lastname]@cornell.edu