Abstract
Data compilations expand the scope of research; however, data citation practice lags behind advances in data use. It remains uncommon for data users to credit data producers in professionally meaningful ways. In paleontology, databases like the Paleobiology Database (PBDB) enable assessment of patterns and processes spanning millions of years, up to global scale. The status quo for data citation creates an imbalance wherein publications drawing data from the PBDB receive significantly more citations (median: 4.3 +/- 3.5 citations/year) than the publications producing the data (1.4 +/- 1.3 citations/year). By accounting for data reuse where citations were neglected, the projected citation rate for data-provisioning publications approached parity (4.2 +/- 2.2 citations/year) and the impact factor of paleontological journals (n = 55) increased by an average of 13.4% (maximum increase = 57.8%) in 2019. Without rebalancing the distribution of scientific credit, emerging "big data" research in paleontology-and science in general-is at risk of undercutting itself through a systematic devaluation of the work that is foundational to the discipline.
Original language | English |
---|---|
Pages (from-to) | 165-176 |
Number of pages | 12 |
Journal | Paleobiology |
Volume | 50 |
Issue number | 2 |
Early online date | Dec 2023 |
DOIs | |
Publication status | Published - May 2024 |
Keywords
- Biodiversity
- Open science
- Paleobiology Database
- Specimen-based
- Taxonomy