I am an assistant professor in the
College of Information and Computer Sciences
at University of Massachusetts Amherst
(since Fall 2014).
I am affiliated with the
Computational Social Science Institute,
the Initiative in Cognitive Science,
the Centers for
Data Science and
Intelligent Information Retrieval.
CS 685/690N, Advanced Natural Language Processing
Talks on current research:
Recent press coverage:
What can statistical text analysis tell us about society?
that can help answer social science
I'm interested in
statistical machine learning and
natural language processing,
especially when informed by or applied to areas like
political science or sociolinguistics.
My work often uses text data from news and social media.
See also my earlier research statement
or publications below.
There is a rich set of other faculty at UMass interested in
areas from computational social science
to natural language processing.
See the Computational Social Science Institute (CSSI) website,
and this list of computation+language researchers and courses.
I joined UMass after receiving my PhD
Carnegie Mellon University's
Machine Learning Department,
where I was advised by
Noah A. Smith.
I have also been a
Visiting Fellow at Harvard IQSS,
interned with the Facebook Data Science team.
Before grad school,
I worked on crowdsourced annotations at CrowdFlower / Dolores Labs,
as well as
"semantic" search at Powerset.
I was an undergrad and masters student in the Stanford
Symbolic Systems Program, a.k.a. cognitive science
(more or less).
(Other publications on Google Scholar or
Forthcoming, Proceedings of NAACL 2018.
Relational Summarization for Corpus Analysis.
A Probabilistic Approach for Learning with Label Proportions Applied to the US Presidential Election.
Proceedings of ICDM 2017.
Proceedings of EMNLP 2017.
Fairness, Accountability, and Transparency
in Machine Learning (FAT/ML) workshop
at KDD 2017.
3rd Workshop on Noisy User-generated Text (WNUT) at EMNLP 2017.
Proceedings of WWW 2017.
PLOS-ONE, November 2014.
- Also arXiv:1210.5268; an earlier version was from Oct. 2012 and poster at NIPS 2012 Workshop on Social Network and Social Media Analysis.
PhD Thesis, Carnegie Mellon University, 2014.
ACL Workshop on Interactive Language Learning, Visualization, and Interfaces, June 2014. (Proceedings of ACL 2014.)
In SemEval-2014 (Proceedings of the International (COLING) Workshop on Semantic Evaluations, Dublin, Ireland, August 2014).
Proceedings of ACL 2013.
Proceedings of ACL 2013.
Proceedings of NAACL 2013
arXiv:1310.1975, Oct 2013.
Data Analysis Project report, Machine Learning Department, CMU.
In Linguistic Annotation Workshop, 2013.
In First Monday 17.3, March 2012.
In NIPS Workshop on Comptuational Social Science and the Wisdom of Crowds, Sierra Nevada, Spain, December 2011.
In Proceedings of EMNLP 2011.
In ACL-2011 (short paper).
In NIPS-2010 Workshop on Machine Learning and Social Computing.
In Proceedings of
EMNLP 2010 (presentation).
- Press coverage:
New York Times,
All Things Considered,
Wall Street Journal,
San Francisco Chronicle,
In ICWSM-2010 (presentation).
In ICWSM-2010 (demo track).
In Beautiful Data, ed. Toby Segaran and Jeff Hammerbacher. O'Reilly Media. 2009.
In EMNLP-2008 (presentation).