FROM EXPLORATORY TO CONFIRMATORY: TOWARDS DATA VISUALIZATION AS A COMPLETE ANALYSIS TOOL

dc.contributor.advisor: Elmqvist, Niklas [en_US]
dc.contributor.author: Newburger, Eric C [en_US]
dc.contributor.department: Library & Information Services [en_US]
dc.contributor.publisher: Digital Repository at the University of Maryland [en_US]
dc.contributor.publisher: University of Maryland (College Park, Md.) [en_US]
dc.date.accessioned: 2023-06-23T06:01:32Z
dc.date.available: 2023-06-23T06:01:32Z
dc.date.issued: 2023 [en_US]
dc.description.abstract: Confirmatory statistical tests, performed and written with equations, are a standard in scientific publications, but may represent a barrier to entry for novice analysts who are less familiar with purely calculative methods. Data visualization, often touted as useful for sharing completed analyses with lay audiences, is also commonly used for early-stage exploratory analysis. Could visualization support hypothesis confirmation? Do people have the visual intuitions to make use of such a tool? What would a visual statistical test look like, and what features would it require for acceptance by the scientific community?
This research begins with a crowd-sourced experiment that asked respondents to fit a normal curve to a series of data samples, displayed as bar histograms, dot histograms, box plots, or strip plots. The results suggest people have visual intuitions, though biased toward overestimating spread, for linking idealized probability distributions with real sample data. People performed differently depending upon graphic form, suggesting design choices for subsequent experiments. A second experiment tested whether novice users might be able to perform a statistical test (a t-test) using a visual analogue: two overlapping distributions, shown as overlapping normal curves, box plots, strip plots, bar histograms, or dot histograms. Respondents had some capacity for this task, performing better with normal curves than with more detailed graphics like histograms. The final investigation of this research paired the design lessons garnered during Experiments 1 and 2 with an interview study of experienced statisticians to explore the design requirements for creating acceptable visual tools for inferential statistics.
The interviews uncovered three design foci: the tool must display multiple, contrasting facets of analysis; it should connect the test back to raw data; and it should include a visual representation of real effect sizes compared to the p-value of the test statistic. The final chapter of this dissertation uses the design principles determined by these three investigations to propose a prototype visual tool for conducting a two-sample t-test, along with suggested variations for other inferential statistics. [en_US]
dc.identifier: https://doi.org/10.13016/dspace/tfmv-zgas
dc.identifier.uri: http://hdl.handle.net/1903/29999
dc.language.iso: en [en_US]
dc.subject.pqcontrolled: Information science [en_US]
dc.subject.pqcontrolled: Statistics [en_US]
dc.subject.pquncontrolled: Confirmatory statistics [en_US]
dc.subject.pquncontrolled: Data visualization [en_US]
dc.subject.pquncontrolled: Design requirements [en_US]
dc.subject.pquncontrolled: Inferential statistics [en_US]
dc.title: FROM EXPLORATORY TO CONFIRMATORY: TOWARDS DATA VISUALIZATION AS A COMPLETE ANALYSIS TOOL [en_US]
dc.type: Dissertation [en_US]

Files

Original bundle

Name: Newburger_umd_0117E_23286.pdf
Size: 5.38 MB
Format: Adobe Portable Document Format