Code Code Evolution: Understanding How People Change Data Science Notebooks Over Time

dc.contributor.authorRaghunandan, Deepthi
dc.contributor.authorRoy, Aayushi
dc.contributor.authorShi, Shenzhi
dc.contributor.authorElmqvist, Niklas
dc.contributor.authorBattle, Leilani
dc.date.accessioned2023-09-14T17:02:25Z
dc.date.available2023-09-14T17:02:25Z
dc.date.issued2023-04
dc.description.abstractSensemaking is the iterative process of identifying, extracting, and explaining insights from data, where each iteration is referred to as the “sensemaking loop.” However, little is known about how sensemaking behavior evolves from exploration and explanation during this process. This gap limits our ability to understand the full scope of sensemaking, which in turn inhibits the design of tools that support the process. We contribute the first mixed-method to characterize how sensemaking evolves within computational notebooks. We study 2,574 Jupyter notebooks mined from GitHub by identifying data science notebooks that have undergone significant iterations, presenting a regression model that automatically characterizes sensemaking activity, and using this regression model to calculate and analyze shifts in activity across GitHub versions. Our results show that notebook authors participate in various sensemaking tasks over time, such as annotation, branching analysis, and documentation. We use our insights to recommend extensions to current notebook environments.
dc.description.urihttps://doi.org/10.1145/3544548.3580997
dc.identifierhttps://doi.org/10.13016/dspace/vlce-9evi
dc.identifier.citationDeepthi Raghunandan, Aayushi Roy, Shenzhi Shi, Niklas Elmqvist, and Leilani Battle. 2023. Code Code Evolution: Understanding How People Change Data Science Notebooks Over Time. In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems (CHI ’23), April 23–28, 2023, Hamburg, Germany. ACM, New York, NY, USA, 12 pages.
dc.identifier.urihttp://hdl.handle.net/1903/30482
dc.language.isoen_US
dc.publisherAssociation for Computer Machinery (ACM)
dc.relation.isAvailableAtCollege of Computer, Mathematical & Natural Sciencesen_us
dc.relation.isAvailableAtComputer Scienceen_us
dc.relation.isAvailableAtDigital Repository at the University of Marylanden_us
dc.relation.isAvailableAtUniversity of Maryland (College Park, MD)en_us
dc.subjectcomputational notebooks
dc.subjectmachine learning
dc.subjectsensemaking
dc.subjectdata science
dc.subjectdata exploration
dc.subjectanalysis
dc.titleCode Code Evolution: Understanding How People Change Data Science Notebooks Over Time
dc.typeArticle
local.equitableAccessSubmissionNo

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Raghnandan, D et al.pdf
Size:
2.05 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.55 KB
Format:
Item-specific license agreed upon to submission
Description: