Files in this item



application/pdf147_ready.pdf (537kB)
(no description provided)PDF


Title:Examining Data Processing Work as Part of the Scientific Data Lifecycle: Comparing Practices Across Four Scientific Research Groups
Author(s):Paine, Drew; Sy, Erin; Piell, Ron; Lee, Charlotte P.
Subject(s):data science including digital curation and big data
infrastructure studies
Abstract:Data processing is work that scientists must undertake in order to make data useful for analyses, and is a key component of twenty-first century scientific research. The analysis of scientific data is contingent upon the successful collection or production and then processing of data. This qualitative research study, of four data-intensive research groups, investigates scientists engaging in data processing work practices to describe and analyze three distinctive but intertwined practices: cleaning data products, selecting a subset of a data product or assembling a new data product from multiple sources, and transforming data products into a common format. These practices are necessary for researchers to transform an initial data product in to one that is ready for scientific analysis. This research finds that data processing work requires a high level of scientific and technical competence that does not merely set up analyses, but also often shapes and is shaped by iterations of research designs and research questions themselves.
Issue Date:2015-03-15
Series/Report:iConference 2015 Proceedings
Genre:Conference Paper/Presentation
Peer Reviewed:yes
Rights Information:Copyright 2015 is held by the authors. Copyright permissions, when appropriate, must be obtained directly from the authors.
Date Available in IDEALS:2015-03-24

This item appears in the following Collection(s)

Item Statistics