I am an associate professor in the
College of Information and Computer Sciences
at UMass Amherst.
My group is the SLANG Lab,
part of UMass NLP.
I am also an associate director of the
Computational Social Science Institute,
and affiliated with
UMass CogSci and
the Centers for
Data Science and
Intelligent Information Retrieval.
Some current things:
Some current/recent collaborative projects:
- Co-Insights: Fostering community collaboration to combat misinformation. We are part of the UMass team, in a multi-site project with several other institutions.
- Understanding variation in African American Language: Corpus and prosodic fieldwork perspectives
with Kristine Yu,
Lisa Green,
and Meghan Armstrong-Abrami;
see also our earlier work on disparities in natural language processing.
- Analyzing Cross-country Bias in News Coverage of International Conflicts and Disasters, 2023 Interdisciplinary Research Grant Project, with
Przemyslaw Grabowicz,
Ethan Zuckerman, and Paul Musgrave.
- SaTC: Identifying the Demographic Representativeness of Social Media Polls, with
Przemyslaw Grabowicz.
- Leveraging Large Language Models to Provide Clinically Feasible Tools for Assessing Discourse in Individuals with Communication Impairments, 2024 Interdisciplinary Research Grant Project, with
Jacquie Kurland and
Anna Liu.
What can statistical text analysis tell us about society?
I develop
text analysis methods
that can help answer social science questions.
I'm interested in
statistical machine learning and
natural language processing,
especially when informed by or applied to areas like
political science or sociolinguistics.
My work often uses text data from news and social media.
There is a rich set of other faculty at UMass interested in
areas from computational social science
to natural language processing.
See the Computational Social Science Institute (CSSI) website,
and UMass NLP affiliates.
I joined UMass after receiving my PhD from
Carnegie Mellon University's
Machine Learning Department.
I have also been a
Visiting Fellow at Harvard IQSS,
and interned with the Facebook Data Science team.
Before grad school,
I worked on crowdsourced annotations at CrowdFlower / Dolores Labs,
and natural language search at Powerset.
I started studying the intersection of AI and social science
as an undergrad/masters student in
Symbolic Systems (cognitive science, more or less).
Link: Full bio.
(For others, see Google Scholar or my
Journal of Law and Courts, Mar. 2025.
arXiv:2502.08415, Feb 2025.
Proceedings of ACL 2024.
- Also plenary lightning talk (slides) at IC2S2 2024, for extended abstract titled "Making sense of public participation in rulemaking using argument explication" (Gupta, Zuckerman, and O'Connor).
Findings of ACL 2024.
First Workshop on Machine Learning for Ancient Languages (ML4AL), 2024.
NLP+CSS Workshop at NAACL 2024.
Proceedings of ACL 2023.
NeurIPS 2023 Workshop on Instruction Tuning and Instruction Following.
- Also presented at New England Natural Language Processing, 2023.
Findings of ACL: EACL 2023.
Also earlier arXiv version (2022).
NLP+CSS workshop at EMNLP 2022 (Proceedings of the Fifth Workshop on Natural Language Processing and Computational Social Science)
Abstract presented at New Ways of Analyzing Variation (NWAV50), 2022.
Proceedings of the Workshop on Noisy User-generated Text (W-NUT) at COLING 2022.
Proceedings of the 1st Field Matters Workshop on NLP Applications to Field Linguistics, at COLING 2022.
Proceedings of the 2022 ACM Conference on Computer Supported Cooperative Work (ACM CSCW 2022).
ACM Transactions on Interactive Intelligent Systems, 2022.
- Runner-up for the ACM TiiS 2022 Best Paper Award.
First Workshop on Causal Inference & NLP (CI-NLP) at EMNLP 2021.
Abstract presented at 11th Annual Conference on New Directions in Analyzing Text as Data (TADA 2021).
Abstract presented at the UnImplicit workshop at ACL-IJCNLP 2021.
Findings of ACL 2021. Also presented at the CASE workshop at ACL-IJCNLP 2021.
Global Networks. 2021.
NLP+CSS workshop at EMNLP 2020 (Proceedings of the Fourth Workshop on Natural Language Processing and Computational Social Science).
NLP+CSS workshop at EMNLP 2020 (Proceedings of the Fourth Workshop on Natural Language Processing and Computational Social Science).
Proceedings of ACL 2020.
Proceedings of EMNLP 2019.
Abram Handler and
Brendan O'Connor.
Proceedings of EMNLP 2019.
arXiv preprint, 2019.
Proceedings of EMNLP 2018.
Proceedings of ACL 2018.
Proceedings of NAACL 2018.
3rd Workshop on Noisy User-generated Text (W-NUT) at EMNLP 2017.
paper award.
Proceedings of WWW 2017.
PLOS-ONE, November 2014.
- Also arXiv:1210.5268; an earlier version appeared in Oct. 2012 and as a poster at the NIPS 2012 Workshop on Social Network and Social Media Analysis.
PhD Thesis, Carnegie Mellon University, 2014.
ACL Workshop on Interactive Language Learning, Visualization, and Interfaces, June 2014. (Proceedings of ACL 2014.)
In SemEval-2014 (Proceedings of the International (COLING) Workshop on Semantic Evaluations, Dublin, Ireland, August 2014).
Proceedings of ACL 2013.
Proceedings of ACL 2013.
Proceedings of NAACL 2013.
arXiv:1310.1975, Oct 2013.
Data Analysis Project report, Machine Learning Department, CMU.
July 2013.
In Linguistic Annotation Workshop, 2013.
In First Monday 17.3, March 2012.
In NIPS Workshop on Computational Social Science and the Wisdom of Crowds, Sierra Nevada, Spain, December 2011.
In Proceedings of EMNLP 2011.
In ACL-2011 (short paper).
In NIPS-2010 Workshop on Machine Learning and Social Computing.
In Proceedings of
EMNLP 2010 (presentation).
- Appendix
- Data
- Press coverage:
New York Times,
All Things Considered,
Washington Post,
Wall Street Journal,
Associated Press,
New Scientist,
San Francisco Chronicle,
Ars Technica,
LA Weekly,
In ICWSM-2010 (presentation).
In ICWSM-2010 (demo track).
Superficial Data Analysis: Exploring Millions of Social Stereotypes.
In Beautiful Data, ed. Toby Segaran and Jeff Hammerbacher. O'Reilly Media. 2009.
In EMNLP-2008 (presentation).