Supporting data-intensive environmental science research: data science skills for scientific practitioners of statistics

dc.contributor.advisorChairperson, Graduate Committee: Stacey Hancocken
dc.contributor.authorTheobold, Allison Shayen
dc.contributor.otherStacey Hancock was a co-author of the article, 'How environmental science graduate students acquire statistical computing skills' in the journal 'Statistics education research journal' which is contained within this dissertation.en
dc.contributor.otherStacey Hancock and Sara Mannheimer were co-authors of the article, 'Designing data science workshops for data-intensive environmental science research' submitted to the journal 'Journal of statistics education ' which is contained within this dissertation.en
dc.contributor.otherStacey Hancock was a co-author of the article, 'Data science skills in data-intensive environmental science research: the case of Alicia and Ellie' submitted to the journal 'Harvard data science review' which is contained within this dissertation.en
dc.date.accessioned2022-03-29T18:10:12Z
dc.date.available2022-03-29T18:10:12Z
dc.date.issued2020en
dc.description.abstractThe importance of data science skills for modern environmental science research cannot be understated, but graduate students in these fields typically lack these integral skills. Yet, over the last 20 years statistics preparation in these fields has grown to be considered vital, and statistics coursework has been readily incorporated into graduate programs. As 'data science' is the study of extracting value from data, the field shares a great deal of conceptual overlap with the field of Statistics. Thus, many environmental science degree programs expect students to acquire these data science skills in an applied statistics course. A gap exists, however, between the data science skills required for students' participation in the entire data analysis cycle as applied to independent research, and those taught in statistics service courses. Over the last ten years, environmental science and statistics educators have outlined the shape of the data science skills specific to research in their respective disciplines. Disappointingly, however, both sides of these conversations have ignored the area at the intersection of these fields, specifically the data science skills necessary for environmental science practitioners of statistics. This research focuses on describing the nature of environmental science graduate students' need for data science skills when engaging in the data analysis cycle, through the voice of the students. In this work, we present three qualitative studies, each investigating a different aspect of this need. First, we present a study describing environmental science students' experiences acquiring the computing skills necessary to implement statistics in their research. In-depth interviews revealed three themes in these students' paths toward computational knowledge acquisition: use of peer support, seeking out a 'singular consultant,' and learning through independent research. Motivated by the need for extracurricular opportunities for acquiring data science skills, next we describe research investigating the design and implementation of a suite of data science workshops for environmental science graduate students. These workshops fill a critical hole in the environmental science and statistics curricula, providing students with the skills necessary to retrieve, view, wrangle, visualize, and analyze their data. Finally, we conclude with research that works toward identifying key data science skills necessary for environmental science graduate students as they engage in the data analysis cycle.en
dc.identifier.urihttps://scholarworks.montana.edu/handle/1/16705en
dc.language.isoenen
dc.publisherMontana State University - Bozeman, College of Letters & Scienceen
dc.rights.holderCopyright 2020 by Allison Shay Theobolden
dc.subject.lcshEnvironmental sciencesen
dc.subject.lcshResearchen
dc.subject.lcshElectronic data processingen
dc.subject.lcshStatisticsen
dc.subject.lcshComputer scienceen
dc.subject.lcshEducation--Curriculaen
dc.titleSupporting data-intensive environmental science research: data science skills for scientific practitioners of statisticsen
dc.typeDissertationen
mus.data.thumbpage107en
thesis.degree.committeemembersMembers, Graduate Committee: Jennifer Green; Megan Wickstrom; Mark Greenwooden
thesis.degree.departmentMathematical Sciences.en
thesis.degree.genreDissertationen
thesis.degree.namePhDen
thesis.format.extentfirstpage1en
thesis.format.extentlastpage246en

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
theobold-supporting-2020.pdf
Size:
2.09 MB
Format:
Adobe Portable Document Format
Description:
Supporting data-intensive environmental science research (PDF)

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
826 B
Format:
Plain Text
Description:
Copyright (c) 2002-2022, LYRASIS. All rights reserved.