TextXD 2019 Program

Our preliminary program is below, although we may make some adjustments in the coming weeks.

Program Overview

DayDateFocusLocation
Day 1Tuesday, Dec. 3Training workshopsSpieker Forum at Chou Hall
Day 2Wednesday, Dec. 4Talks and postersSpieker Forum at Chou Hall
Day 3Thursday, Dec. 5Talks and postersSpieker Forum a Chou Hall
Day 4Friday, Dec. 6Collaboration and codingBIDS (190 Doe)

Register for TextXD 2019 - Submit a poster application

Day 1: Tuesday, December 3rd (Workshops)

Location: Spieker Forum at Chou Hall

These workshops will generally be interactive coding sessions with jupyter notebooks, so we strongly recommend bringing a laptop with a working installation of Anaconda / Python. No prior experience with text analysis is assumed.

TimeTopicSpeakerInstitution
9amBreakfast
9:30amWelcome
9:40amText as Data Introduction
10:35amWeb APIs and ScrapingGeoff BaconUC Berkeley, Linguistics
11:30amCoffee Break
11:45amTopic modelingIlya AkdemirUC Berkeley, Law
12:45pmLunch
1:40pmWord embeddingsAlina Arseniev-KoehlerUCLA, Sociology
2:45pmSupervised machine learning
3:45pmCoffee Break
4pmDeep learningDima LituievUC San Francisco, Bakar Computational Health Sciences Institute
5pmDiscussion

Day 2: Wednesday, December 4th (Talks)

Location: Spieker Forum at Chou Hall

TimeTopicSpeakerInstitution
9amBreakfast
9:30amWelcome
9:40amKeynoteChris PottsStanford University, Linguistics
10:30amSession 1 - Psychological Threads
“I come before you a changed man”: Historical Changes in the Vocabulary of Parole Release DecisionsIsaac DalkeUC Berkeley, Sociology
“The words of trauma” - Text Analysis of the effect of War World II on Salinger’s literatureAnat Talmon, Chen EdelsburgStanford University, Psychology and Tel Aviv University
11:15amCoffee Break
11:30amSession 2 - Policy
The Effect of Gender Stereotypes on Educational Outcomes in the 1970s: A Historical Case StudyZachary BleemerUC Berkeley, Economics
State-level racial attitudes and adverse birth outcomes: applying natural language processing to Twitter data to quantify state context for pregnant womenThu NguyenUC San Francisco, Epidemiology & Biostatistics
NLP approaches to detecting behavioral failures in sustainable transportation infrastructureOmar Isaac AsensioGeorgia Institute of Technology, Public Policy
12:30pmLunch + Poster session
1:30pmKeynote: Towards Universal Language UnderstandingYunyao LiIBM, Scalable Knowledge Intelligence
2:15pmSession 3 - Theory and Methods
Interpreting and improving NLP models via disentangled interpretationsChandan SinghUC Berkeley, Computer Science
Cross-domain classificationBarea SinnoUniversity of Texas at Austin, Ohio State University
Automated methods enable direct computation on phenotypic descriptions for novel candidate gene predictionIan BraunIowa State University, Computational Biology
3:15pmCoffee Break
3:30pmSession 4 - Politics
Detecting Meaningful Multi-word Expressions in Political TextKenneth BenoitLondon School of Economics, Methodology
Who speaks for Women in the Indian Parliament?Saloni BhogaleAshoka University, Trivedi Centre
Sentiment is Not Stance: Target-Aware Classification for Political Text AnalysisSamuel E. BestvaterThe Pennsylvania State University, Political Science
4:30pmKeynoteJustin GrimmerStanford University, Political Science
5:30pmReception

Day 3: Thursday, December 5th (Talks)

Location: Spieker Forum at Chou Hall

TimeTopicSpeakerInstitution
9amBreakfast
9:30amWelcome
9:40amKeynoteKathleen CarleyCarnegie Mellon University, Computer Science
10:30amSession 5 - Innovation
Quantifying Innovation with BERT: Linguistic Prescience and Firm Stock ReturnsPaul VicinanzaStanford University, Graduate School of Business
Identifying (Dis)Continuities in Ed Tech’s Discourse of InventionSebastian Muñoz-Najar GalvezStanford University, Graduate School of Education
11:15amCoffee Break
11:30amSession 6 - Public Health
NLP for conversational dialogOrianna DeMasiUC Davis, Computer Science
#Vape: Measuring E-cigarette Influence on Instagram with Deep Learning and Text AnalysisJulia VasseyUC Berkeley, Public Health
No More Silence: Monitoring Bias with Word2VecLauren KaplanUC San Francisco, Medicine
12:30pmLunch + Poster session
1:30pmSession 7 - Lightning Talks
Hidden Political Dynasties in China: Analyzing Chinese Baby Names as Ultra-Short Political Text DataTao LiUniversity of Macau, Government & Public Administration
Are both policemen and policewomen police officers? The gender connotations of gender-fair languageAlina Arseniev-KoehlerUCLA, Sociology
Machine-learning Political Event Database SystemAlex HannaGoogle, ML Fairness
A pipeline for analyzing Akkadian textsAleksi SahalaUniversity of Helsinki, Linguistics
2pmSession 8 - Biomedical
Application of text mining methods to identify lupus nephritis from electronic health recordsMilena GianfrancescoUC San Francisco, Medicine
Unstructured Text Analysis in Electronic Health Records to Characterize Sepsis PresentationMeghana BhimaraoKaiser Permanente, Division of Research
Extracting patient-reported functional status and disease activity information from electronic health recordsTome EftimovStanford University, Biomedical Data Science
Natural language processing for automated rapid cancer ascertainmentLiyan LiuKaiser Permanente, Division of Research
3:15pmCoffee Break
3:30pmSession 9 - News and Media
“Downloading” the news: Reproducible access to text as dataCody HennesyUniversity of Minnesota, Libraries
Media Attention and Bureaucratic ResponsivenessAaron ErlichMcGill University, Political Science
How Do Threats Shift In-Group Identification?: When Natural Experiments Meet Text DataAndrew ThompsonMassachusetts Institute of Technology, Political Science
4:30pmKeynoteBrandon StewartPrinceton University, Sociology
5:30pmReception

Day 4: Friday, December 6th (Collaboration)

Location: Berkeley Institute for Data Science (190 Doe Library)

Theme: Text Analysis for Social Good

Day 4 will be at BIDS and will include a hackathon component as well as parallel breakout sessions for discussing major issues in text analysis / NLP. The hackathon will feature multiple projects with associated datasets and starter jupyter notebooks. Participants will form teams and apply text analysis methods of their choice, potentially leading to future research collaborations. Breakout sessions will feature introductory presentations followed by facilitated discussions leading to summary recommendations on the chosen topic.

TimeTopicBreakout session(s)
9amBreakfast
9:30amWelcome
9:40amProject introductions
10:00amCoding / collaborationPedagogy of Text Analysis - Evan Muzzall
11amCoffee Break
11:15amCoding / collaborationText Analysis for Social Good
12:30pmLunch
1:30pmCoding / collaborationTextXD 2020 priorities
3:00pmCoffee Break
3:15pmCoding / collaboration
4:00pmReport back & conference close