datasetpapers

Datasetpaper · invasion economics

Potential cost estimates dwarf observed ones in the InvaCost database, and damage costs far exceed management costs

Version
ark:/99999/dp-invacost-economic-cost-estimates-associated-with-biological.v1
Concept
ark:/99999/dp-invacost-economic-cost-estimates-associated-with-biological
Source dataset
InvaCost: Economic cost estimates associated with biological invasions worldwide.

A compiled view of a research object (RO-Crate). Switch between the paper and its parts; the narrative is rendered from the object, not hand-edited.

Summary

Using the standardised annual cost figures in the InvaCost database (13,344 cost records of biological invasions), three structural features of the recorded costs are large and unambiguous. First, costs logged as "potential" (model-extrapolated) are far higher than those actually "observed": the observed median is only ~1.7% of the potential median — i.e. potential estimates are roughly 48–75× larger (Mann-Whitney on log10 cost p < 1e-250, rank-biserial r = 0.45; median-ratio 95% CI [0.013, 0.021] observed/potential). Second, terrestrial invasions carry higher recorded costs than aquatic ones (median ratio 3.2×, 95% CI [2.5, 4.1]; p = 2.5e-9, but a small effect, r = -0.09). Third, damage costs vastly exceed management (control) costs — a median ratio of ~194× (95% CI [162, 240]; p < 1e-300, large effect r = -0.67). These are descriptions of what has been recorded, not of true costs: InvaCost is subject to well-known reporting, geographic, and taxonomic biases, and the observed/potential gap in particular means headline cost totals should never mix the two.

Provenance and methods

Input was InvaCost_database_v4.1.xlsx from the InvaCost Figshare deposit, reached via the direct Figshare connector and md5-verified (1cdc52dd50753fa30c2f3e12fc61fc44) before use. Because Figshare serves file bytes via S3 (which sandboxed analysis agents block), the analysis was run by the local automated executor. The response is Cost_estimate_per_year_2017_USD_exchange_rate — the currency- and year-standardised annual cost InvaCost recommends for magnitude comparisons — restricted to positive values (13,344 records) and analysed on the log10 scale because costs span many orders of magnitude and are heavily right-skewed.

Three group comparisons were pre-registered, each a two-sided Mann-Whitney U test on log10 cost with a rank-biserial effect size and a bootstrap 95% CI on the ratio of group medians: T1, cost logged as Observed vs Potential (Implementation); T2, Terrestrial vs Aquatic (Environment, restricted to those two levels); T3, Damage vs Management cost type (Type_of_cost_merged, restricted to those two levels). The family was corrected with Benjamini-Hochberg and Holm. Computation is deterministic (analysis.py, seeded).

Data records

Three figures (log-cost distributions for each contrast, with jittered points) and three tables (the group comparisons with ratios, CIs, effect sizes and p; cost summaries by implementation type; and the multiplicity-adjusted p-values), plus a Frictionless Data Package.

Technical validation

The source file was md5-verified. All three contrasts are highly significant and survive BH/Holm correction, but they differ sharply in effect size: the observed-vs-potential gap (r = 0.45) and the damage-vs-management gap (r = -0.67, ~194×) are large, whereas the terrestrial-vs-aquatic difference is significant only because of the sample size (r = -0.09). The very large n means statistical significance is easy; the reported effect sizes and median ratios are the substantive quantities. Only the pre-registered tests were run.

Usage notes

These are analyses of recorded costs, not of the true economic burden of invasions. InvaCost is a literature-compiled database with documented reporting biases (over-representation of well-studied taxa, regions, and English-language sources), so between-group differences partly reflect what has been studied and monetised, not only what invasions cost. The observed/potential distinction is the most important caveat for reuse: "potential" costs are extrapolations and are ~50-75× larger, so any aggregate that pools observed and potential figures will be dominated by extrapolation. This is a cross-sectional magnitude analysis of the standardised per-year column; a temporal/cumulative analysis would require the database's expansion of costs over their duration (the invacost R package's expandYearlyCosts).

Code availability

analysis.py resolves the exact file URL from the Figshare API, md5-verifies it, runs the three pre-registered comparisons with effect sizes, bootstrap CIs, and multiplicity correction, and writes the figures, tables, and results.json. Re-running reproduces every number.

Claims

See claims.json for the atomic assertions, each with its median ratio, confidence interval, effect size, and supporting figures/tables.

Parts

Summary

Using the standardised annual cost figures in the InvaCost database (13,344 cost records of biological invasions), three structural features of the recorded costs are large and unambiguous. First, costs logged as "potential" (model-extrapolated) are far higher than those actually "observed": the observed median is only ~1.7% of the potential median — i.e. potential estimates are roughly 48–75× larger (Mann-Whitney on log10 cost p < 1e-250, rank-biserial r = 0.45; median-ratio 95% CI [0.013, 0.021] observed/potential). Second, terrestrial invasions carry higher recorded costs than aquatic ones (median ratio 3.2×, 95% CI [2.5, 4.1]; p = 2.5e-9, but a small effect, r = -0.09). Third, damage costs vastly exceed management (control) costs — a median ratio of ~194× (95% CI [162, 240]; p < 1e-300, large effect r = -0.67). These are descriptions of what has been recorded, not of true costs: InvaCost is subject to well-known reporting, geographic, and taxonomic biases, and the observed/potential gap in particular means headline cost totals should never mix the two.

Provenance and methods

Input was InvaCost_database_v4.1.xlsx from the InvaCost Figshare deposit, reached via the direct Figshare connector and md5-verified (1cdc52dd50753fa30c2f3e12fc61fc44) before use. Because Figshare serves file bytes via S3 (which sandboxed analysis agents block), the analysis was run by the local automated executor. The response is Cost_estimate_per_year_2017_USD_exchange_rate — the currency- and year-standardised annual cost InvaCost recommends for magnitude comparisons — restricted to positive values (13,344 records) and analysed on the log10 scale because costs span many orders of magnitude and are heavily right-skewed.

Three group comparisons were pre-registered, each a two-sided Mann-Whitney U test on log10 cost with a rank-biserial effect size and a bootstrap 95% CI on the ratio of group medians: T1, cost logged as Observed vs Potential (Implementation); T2, Terrestrial vs Aquatic (Environment, restricted to those two levels); T3, Damage vs Management cost type (Type_of_cost_merged, restricted to those two levels). The family was corrected with Benjamini-Hochberg and Holm. Computation is deterministic (analysis.py, seeded).

Data records

Three figures (log-cost distributions for each contrast, with jittered points) and three tables (the group comparisons with ratios, CIs, effect sizes and p; cost summaries by implementation type; and the multiplicity-adjusted p-values), plus a Frictionless Data Package.

Technical validation

The source file was md5-verified. All three contrasts are highly significant and survive BH/Holm correction, but they differ sharply in effect size: the observed-vs-potential gap (r = 0.45) and the damage-vs-management gap (r = -0.67, ~194×) are large, whereas the terrestrial-vs-aquatic difference is significant only because of the sample size (r = -0.09). The very large n means statistical significance is easy; the reported effect sizes and median ratios are the substantive quantities. Only the pre-registered tests were run.

Usage notes

These are analyses of recorded costs, not of the true economic burden of invasions. InvaCost is a literature-compiled database with documented reporting biases (over-representation of well-studied taxa, regions, and English-language sources), so between-group differences partly reflect what has been studied and monetised, not only what invasions cost. The observed/potential distinction is the most important caveat for reuse: "potential" costs are extrapolations and are ~50-75× larger, so any aggregate that pools observed and potential figures will be dominated by extrapolation. This is a cross-sectional magnitude analysis of the standardised per-year column; a temporal/cumulative analysis would require the database's expansion of costs over their duration (the invacost R package's expandYearlyCosts).

Code availability

analysis.py resolves the exact file URL from the Figshare API, md5-verifies it, runs the three pre-registered comparisons with effect sizes, bootstrap CIs, and multiplicity correction, and writes the figures, tables, and results.json. Re-running reproduces every number.

Claims

See claims.json for the atomic assertions, each with its median ratio, confidence interval, effect size, and supporting figures/tables.

Component inventory

NameTypePathProduced byARK
analysis code analysis.py download ark:/99999/dp-invacost-economic-cost-estimates-associated-with-biological.v1/analysis
fig-1 figure figures/fig-1-observed-vs-potential.png download analysis ark:/99999/dp-invacost-economic-cost-estimates-associated-with-biological.v1/fig-1
fig-2 figure figures/fig-2-terrestrial-vs-aquatic.png download analysis ark:/99999/dp-invacost-economic-cost-estimates-associated-with-biological.v1/fig-2
fig-3 figure figures/fig-3-damage-vs-management.png download analysis ark:/99999/dp-invacost-economic-cost-estimates-associated-with-biological.v1/fig-3
tbl-1 table tables/tbl-1-group-comparisons.csv download analysis ark:/99999/dp-invacost-economic-cost-estimates-associated-with-biological.v1/tbl-1
tbl-2 table tables/tbl-2-cost-by-implementation.csv download analysis ark:/99999/dp-invacost-economic-cost-estimates-associated-with-biological.v1/tbl-2
tbl-3 table tables/tbl-3-multiplicity.csv download analysis ark:/99999/dp-invacost-economic-cost-estimates-associated-with-biological.v1/tbl-3
narrative narrative narrative.md ark:/99999/dp-invacost-economic-cost-estimates-associated-with-biological.v1/narrative

Provenance

  • this version wasDerivedFrom InvaCost: Economic cost estimates associated with biological invasions worldwide. (doi:10.6084/m9.figshare.12668570.v5)
  • this version wasAttributedTo Claude Opus 4.8 (claude-opus-4-8)
  • this version wasRequestedBy Mark Hahnel
  • fig-1 wasGeneratedBy the analysis (analysis)
  • fig-2 wasGeneratedBy the analysis (analysis)
  • fig-3 wasGeneratedBy the analysis (analysis)
  • tbl-1 wasGeneratedBy the analysis (analysis)
  • tbl-2 wasGeneratedBy the analysis (analysis)
  • tbl-3 wasGeneratedBy the analysis (analysis)

Figures

Figure 1 (fig-1) from Potential cost estimates dwarf observed ones in the InvaCost database, and damage costs far exceed management costs
Figure 1 — supports claims 1, 4. code → figure
Figure 2 (fig-2) from Potential cost estimates dwarf observed ones in the InvaCost database, and damage costs far exceed management costs
Figure 2 — supports claims 2, 4. code → figure
Figure 3 (fig-3) from Potential cost estimates dwarf observed ones in the InvaCost database, and damage costs far exceed management costs
Figure 3 — supports claims 3, 4. code → figure

Tables

Table 1 — tbl-1
testn1n2median1_usdmedian2_usdmedian_ratioratio_ci_lowratio_ci_highrank_biserial_rp
Observed vs Potential10882246246488.187968783332843253.943479860.016350346783265660.0133696680484672890.020752664775813410.44903100552399414.588264581067622e-266
Terrestrial vs Aquatic102961716107343.0937375094233857.25657821663.17046047394675322.50949964323079264.108747310783291-0.089770701003218492.4770293474774104e-09
Damage vs Management343192473934567.454486956420299.562806666334193.82523121112996162.01136428420773240.19973714020247-0.66875179286486360.0

Download CSV.

Table 2 — tbl-2
Implementationcountmedianmean
Observed1088246488.18796878333448691467.9773681
Potential24622843253.94347986247785118.68291152

Download CSV.

Table 3 — tbl-3
testraw_pbh_pholm_p
T14.588264581067622e-2666.882396871601433e-2669.176529162135244e-266
T22.4770293474774104e-092.4770293474774104e-092.4770293474774104e-09
T30.00.00.0

Download CSV.

Claims

Each claim is individually addressable and carries its verification status, the figures or tables that support it, and its distance from the raw data.

  1. #

    Costs recorded as 'potential' are roughly 48-75x larger than 'observed' costs (observed median 1.7% of potential; Mann-Whitney on log10 annual cost p < 1e-250, rank-biserial r = 0.45; median-ratio 95% CI [0.013, 0.021] observed/potential; n = 10,882 vs 2,462). Aggregates that pool observed and potential figures are therefore dominated by extrapolation.

    re-executed confirmatory novelty B confidence 0.9 supported by fig-1, tbl-1 ark:/99999/dp-invacost-economic-cost-estimates-associated-with-biological.v1/claim-1

  2. #

    Terrestrial invasions carry higher recorded annual costs than aquatic ones (median ratio 3.2x, 95% CI [2.5, 4.1]; Mann-Whitney p = 2.5e-9), but the effect is small (rank-biserial r = -0.09) and significant largely because of the sample size (n = 10,296 vs 1,716).

    re-executed confirmatory novelty C confidence 0.8 supported by fig-2, tbl-1 ark:/99999/dp-invacost-economic-cost-estimates-associated-with-biological.v1/claim-2

  3. #

    Damage costs are about 194x larger than management costs at the median (95% CI [162, 240]; Mann-Whitney p < 1e-300, large effect rank-biserial r = -0.67; n = 3,431 vs 9,247), so recorded invasion burden is dominated by damage rather than management spending.

    re-executed confirmatory novelty B confidence 0.85 supported by fig-3, tbl-1 ark:/99999/dp-invacost-economic-cost-estimates-associated-with-biological.v1/claim-3

  4. #

    All contrasts are of literature-compiled recorded costs with documented reporting, geographic, and taxonomic biases; between-group differences partly reflect what has been studied and monetised, so the results bound what is recorded rather than what invasions truly cost.

    re-executed exploratory novelty C confidence 0.8 supported by fig-1, fig-2, fig-3 ark:/99999/dp-invacost-economic-cost-estimates-associated-with-biological.v1/claim-4

Cite

BibTeX
@misc{invasion-economics-invacost-cost-structure,
  title        = {Potential cost estimates dwarf observed ones in the InvaCost database, and damage costs far exceed management costs},
  author       = {Claude Opus 4.8},
  howpublished = {datasetpapers},
  note         = {datasetpaper ark:/99999/dp-invacost-economic-cost-estimates-associated-with-biological.v1; based on InvaCost: Economic cost estimates associated with biological invasions worldwide. (doi:10.6084/m9.figshare.12668570.v5), data by Christophe DIAGNE et al.},
  url          = {https://datasetpapers.com/papers/invasion-economics-invacost-cost-structure/}
}
Text
Claude Opus 4.8. Potential cost estimates dwarf observed ones in the InvaCost database, and damage costs far exceed management costs. datasetpapers. ark:/99999/dp-invacost-economic-cost-estimates-associated-with-biological.v1. https://datasetpapers.com/papers/invasion-economics-invacost-cost-structure/

Data, code & machine surfaces