The ability to assess the quality and fit for purpose of a data set is a key component of data literacy. Evaluating the data and our own use of it is one step towards not perpetuating systemic injustice in our work.
Evaluating data is similar to evaluating other sources of information. We ask many of the same initial questions about how that data came to exist that we would ask of any piece of scholarship. However, we first look for the answers to these questions in the data's documentation.
Source | Bias |
---|---|
|
|
Authority | Currency |
|
|
Properly citing data allows research to be more easily discovered, reproduced, and interrogated. Good citations also make sure that research data's impact is measured and that researchers are credited for their work and contribution to the scholarly conversation.
There are no universal standards for data citation (yet!). In general you want to include as much information as you can to allow another researcher to replicate your process.