Program BigSurv20


Friday 6th November Friday 13th November Friday 20th November Friday 27th November
Friday 4th December

Not all presentations and recordings are publicly available. Please log in to access more BigSurv20 conference materials.



Friday 6th November


10:00 - 11:30 (ET, GMT-5)

7:00 - 8:30 (PT, GMT-8)

16:00 - 17:30 (CET, GMT+1)

Opening and keynote (all live)

Moderator: Peter Lugtig ([email protected])
Slack link
Quick Zoom

Detailed zoom login information

11:45 - 13:15 (ET, GMT-5)

8:45 - 10:15 (PT, GMT-8)

17:45 - 19:15 (CET, GMT+1)

Big Data Challenge Kickoff

Moderator: Barry Schouten ([email protected])
Slack link
Quick Zoom

Detailed zoom login information

11:45 - 13:15 (ET, GMT-5)

8:45 - 10:15 (PT, GMT-8)

17:45 - 19:15 (CET, GMT+1)

Classifieds: Coding open-ended responses using machine learning

Moderator: Stas Kolenikov ([email protected])
Slack link
Quick Zoom

Detailed zoom login information

Using supervised classification for categorizing answers to an open-ended question on panel participation motivation
Anna-Carolina Haensch (GESIS Leibniz Institute for the Social Sciences) - Presenting Author
Bernd Weiss (GESIS Leibniz Institute for the Social Sciences)
Katja Bitz (University of Mannheim)

A framework for using machine learning to support qualitative data coding
Amanda Smith (RTI International) - Presenting Author
Peter Baumgartner (RTI International)
Murrey Olmsted (RTI International)
Dawn Ohse (RTI International)

Training deep learning models with active learning framework to classify “other (please specify)“ comments
Xin (Rosalynn) Yang (Westat) - Presenting Author
Ting Yan (Westat)
David Cantor (Westat)

Measuring the validity of open-ended questions: Application of unsupervised learning methods
Eric Plutzer (Penn State University)
Burt Monroe (Penn State University) - Presenting Author

Download presentation



Writerly Respondents: Explaining Nonresponse and Response Length for Open-Ended Questions
Arnold Lau (Pew Research Center) - Presenting Author



11:45 - 13:15 (ET, GMT-5)

8:45 - 10:15 (PT, GMT-8)

17:45 - 19:15 (CET, GMT+1)

Improving survey questions using machine learning and AI

Moderator: Katharina Meintinger ([email protected])
Slack link
Quick Zoom

Detailed zoom login information

Open question formats: Comparing the suitability of requests for text and voice answers in smartphone surveys
Jan Karem Höhne (University of Mannheim) - Presenting Author
Annelies Blom (University of Mannheim)
Konstantin Gavras (University of Mannheim)
Melanie Revilla (RECSM-Universitat Pompeu Fabra)

The sound of respondents: How do emotional states affect the quality of voice answers in smartphone surveys?
Viewable in live Zoom session only
Christoph Kern (University of Mannheim) - Presenting Author
Jan Karem Höhne (University of Mannheim)
Stephan Schlosser (University of Göttingen)

Download presentation

Automated double-barreled question classification using machine learning
Viewable in live Zoom session only
King Chung Ho (SurveyMonkey) - Presenting Author
Shubhi Jain (SurveyMonkey)
Fernando Espino Casas (SurveyMonkey)
Zewei Zong (SurveyMonkey)
Jing Huang (Surveymonkey)

Using generative adversarial active learning to identify poor closed-ended survey responses
Viewable in live Zoom session only
Krishna Sumanth Muppalla (SurveyMonkey) - Presenting Author
Jin Yang (SurveyMonkey)
Jing Huang (SurveyMonkey)
Johan Lieu (SurveyMonkey)
Manohar Angani (SurveyMonkey)
Megha Rastogi (SurveyMonkey)

Improving SHARE translation verification
Viewable in live Zoom session only
Yi-Chen Liu (CIS, LMU Munich)
Yuri Pettinicchi (MEA-MPISOC) - Presenting Author
Alexander Fraser (CIS, LMU Munich)

Download presentation

11:45 - 13:15 (ET, GMT-5)

8:45 - 10:15 (PT, GMT-8)

17:45 - 19:15 (CET, GMT+1)

Properties of organic data and their integration with traditional surveys

Moderator: Peter Lugtig ([email protected])
Slack link
Quick Zoom

Detailed zoom login information

Can social media data complement traditional survey data? A reflexion matrix to evaluate their relevance for the study of public opinion.
Maud Reveilhac (Lausanne University, Switzerland, Faculty of Political and Social Sciences, Institute of Social Sciences, Life Course and Social Inequality Research Centre) - Presenting Author
Stephanie Steinmetz (Lausanne University, Switzerland, Faculty of Political and Social Sciences, Institute of Social Sciences, Life Course and Social Inequality Research Centre)
Davide Morselli (Lausanne University, Switzerland, Faculty of Political and Social Sciences, Institute of Social Sciences, Life Course and Social Inequality Research Centre)

Integrating organic data and designed data for higher quality measurement: Overcoming coverage limitations of big data
Viewable in live Zoom session only
Leah Christian (Nielsen) - Presenting Author
Kay Ricci (Nielsen)



Towards a total error framework for sensor and survey data
Viewable in live Zoom session only
Lukas Beinhauer (Student at UU & intern at CBS) - Presenting Author
Ger Snijkers (Senior Methodoloog at CBS)
Jeldrik Bakker (Methodoloog at CBS)

Is bigger always better? Evaluating measurement error in organic TV tuning data
Kay Ricci (Nielsen) - Presenting Author
Leah Christian (Nielsen)

Multivariate density estimation by neural networks
Dewi Peerlings (Maastricht University and Statistics Netherlands) - Presenting Author
Jan van den Brakel (Maastricht University and Statistics Netherlands)
Nalan Bastürk (Maastricht University)
Marco Puts (Statistics Netherlands)

11:45 - 13:15 (ET, GMT-5)

8:45 - 10:15 (PT, GMT-8)

17:45 - 19:15 (CET, GMT+1)

I'm sensing there's an APP for that

Moderator: Bella Struminskaya ([email protected])
Slack link
Quick Zoom

Detailed zoom login information

Using progressive web apps and computer vision to improve mobile data capture efforts
Michael Link (Abt Associates) - Presenting Author
Gabriel Krieshok (Abt Associates)



Activity trackers in social research: nonresponse and data quality issues
Vera Toepoel (Utrecht University) - Presenting Author

Download presentation



Social desirability in digital trace data collection
Florian Keusch (University of Mannheim) - Presenting Author
Ruben Bach (University of Mannheim)
Alexandru Cernat (University of Manchester)



Big data physiologic and ecological monitoring assessments and compliance in a complex, multi-site clinical longitudinal study (AURORA Study)
Charlie Knott (RTI International) - Presenting Author
Steve Gomori (RTI International)
Mai Nguyen (RTI International)
Sue Pedrazzani (RTI International)
Sridevi Sattaluri (RTI International)
Frank Mierzwa (RTI International)
Kim Chantala (RTI International)



Leveraging what’s there: A new approach to collecting screen time data in the 1970 British Cohort Study
Matt Brown (Centre for Longitudinal Studies, UCL)
Erica Wong (Centre for Longitudinal Studies, UCL) - Presenting Author



11:45 - 13:15 (ET, GMT-5)

8:45 - 10:15 (PT, GMT-8)

17:45 - 19:15 (CET, GMT+1)

Big brother, big data or both? Protecting privacy in the era of big data

Moderator: Don Jang ([email protected])
Slack link
Quick Zoom

Detailed zoom login information

Ethical issues in the use of big data for social research
Michael Weinhardt (Technische Universität Berlin) - Presenting Author



The paradox of data sharing and data privacy in the social sciences
Edward Freeland (Princeton University) - Presenting Author

Download presentation



The challenges of legal analysis, between text mining and machine learning
Viewable in live Zoom session only
Maria Francesca Romano (Scuola Superiore Sant'Anna) - Presenting Author
Giovanni Comandè (Scuola Superiore Sant'Anna)
Pasquale Pavone (Scuola Superiore Sant'Anna)
Denise Amram (Scuola Superiore Sant'Anna)

Measuring privacy and accuracy concerns for 2020 census data dissemination
Jennifer Childs (U.S. Census Bureau) - Presenting Author
Casey Eggleston (U.S. Census Bureau)
Aleia Clark Fobia (U.S. Census Bureau)

Download presentation