Bio: David Mimno is an associate professor in the Department of Information Science at Cornell University. He holds a PhD from UMass Amherst and was previously the head programmer at the Perseus Project at Tufts and a researcher at Princeton University. His work has been supported by the Sloan Foundation, the NEH, and the NSF.
I supervise PhD students in Information Science and Computer Science.
[Reviewing] The four categories of acceptable papers
[Topic modeling] How LDA algorithms work
Recent workshops and tutorials:
GenLaw, ICML workshop on Generative AI and Law, Honolulu, Hawai'i, July 2023
Translation Tutorial: A Hands-On Introduction to Large Language Models for Fairness, Accountability, and Transparency Researchers, ACM FAccT tutorial, Chicago, June 2023
INFO 6010: Quantitative methods for Information Science [Spring 2023]
INFO 2950: Introduction to Data Science [Fall 2022]
INFO 6150/CS 6788: Advanced Topic Modeling [Spring 2021]
I ran the Text as Data (TADA) 2022 conference.
The BERT for Humanists project makes large language models accessible for researchers working on text as data problems.
MALLET provides text classification and high-quality sampling-based topic modeling.