Brendan O'Connor

Twitter: @brendan642

Machine Learning Department
School of Computer Science
Carnegie Mellon University
5000 Forbes Ave.
Pittsburgh, PA 15213

Office: GHC 8009 (find me on IM to check if I'm there or nearby)

What can statistical text analysis tell us about society? I develop practical and scientifically rigorous text mining methods that can help answer social science questions.

I'm generally interested in machine learning and natural language processing, especially when informed by or applied to areas like political science or sociolinguistics. My work often uses text data from news and social media.

I'm a fifth-year Machine Learning Ph.D. student in the School of Computer Science at Carnegie Mellon University (CMU), in Noah's ARK research group.

I have also been a Visiting Fellow at Harvard IQSS, and an intern on the Facebook Data Science team. Before grad school, I worked on crowdsourced annotations at CrowdFlower / Dolores Labs, as well as "semantic" search at Powerset. I was an undergrad at Stanford with a BS/MS in Symbolic Systems (cognitive science or so).

Videos from past presentations

Selected recent publications

(See also Google Scholar.)

Other publications

Other papers on my CV or Google Scholar.


  • TweetNLP: tokenization and part-of-speech tagging for Twitter.
  • ARKref, a coreference resolution system.
  • ParseViz - quick and dirty parse tree/dependency visualization via graphviz.
  • tsvutils, for tab-separated data processing.
  • Other misc utilities (command line, R, Python...)

Recent News

Demos etc.

  • Inactive demos

Elsewhere on the Internet

  • Twitter
  • Facebook
  • LinkedIn
  • Github
  • Gists
  • Writings:

My PGP key

Other O'Connors

There are many Brendan O'Connors in the world. If this is the wrong webpage, you may be interested in another Brendan O'Connor. My awesome sister, Maureen O'Connor, is a writer in New York.