1 / 39

Total Cost of Preservation Cost Modeling for Sustainable Services

CNI Spring 2012 Membership Meeting Baltimore, April 1-3, 2012. Total Cost of Preservation Cost Modeling for Sustainable Services. Stephen Abrams Patricia Cruse John Kunze University of California Curation Center California Digital Library. Outline. Goals Prior work

coxa
Download Presentation

Total Cost of Preservation Cost Modeling for Sustainable Services

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. CNI Spring 2012 Membership Meeting Baltimore, April 1-3, 2012 Total Cost of PreservationCost Modeling for Sustainable Services Stephen Abrams Patricia Cruse John Kunze University of California Curation Center California Digital Library

  2. Outline • Goals • Prior work • Modeling preservation activity • Total cost of preservation • Pay-as-you-go price model • Paid-up price model • Conclusions • Questions and discussion http://wiki.ucop.edu/display/Curation/Cost+Modeling Source: Getty Images

  3. Goals • Understand costs in order to plan for and implement sustainable preservation services • Investigate the possibility of paid-up pricing in order to address • Boom-or-bust budget cycles • Fixed-term, grant funded projects End date! Source: www.sharedidiz.com/

  4. Prior work } • Nationaal Archief (2005) http://www.nationaalarchief.nl/sites/default/files/docs/kennisbank/codpv1.pdf • LIFE (2008) http://www.life.ac.uk/ • KRDS (2010) http://www.beagrie.com/krds.php • DataSpace (2010) http://arks.princeton.edu/ark:/88435/dsp01w6634361k • Jean-Daniel Zeller (2010) “Cost of digital archiving: Is there a universal model?” 8th European Conference on Digital Archiving, Geneva, April 28-30, 2010 http://regarddejanus.files.wordpress.com/2010/05/costsdigitalarchiving-_jdz_eca2010.pdf • Rosenthal (2011) http://blog.dshr.org/2011/09/modeling-economics-of-long-term-storage.html Identification of granular cost components } Assumption of annual decrease in aggregate cost, i.e., discounted cash flow (DCF) Critique of DCF approach

  5. Key assumptions • Consider only the costs incurred by the preservation service provider • Costs of content creation by collection managers are out of scope • Costs can be categorized unambiguously as fixed or marginal, and one-time or recurring • One-time costs can be annualized over the effective lifespan of the activity or system component

  6. Cost model components  System, composed of various  Servicesfor necessary/desirable functions, running on  Servers, deployed by  Staff, in support of content  Producers, who use  Workflowsto submit instances of  ContentTypes, which occupy  Storage, and are subject to ongoing  Monitoring and periodic  Interventions

  7. Total cost of preservation Total cost to service provider Number and unit cost of Interventions System component subsumes Services and Servers Staff costs are subsumed by other components Fixed cost of System Number and unit cost of Monitoring Number and unit cost of Producers Number and unit cost of Storage Number and unit cost of Workflows Unit cost and number of Content Types

  8. Total cost of preservation • Model is rich enough to represent the full economic cost of preservation • Implemented by a spreadsheet that captures all subsidiary costs

  9. Total cost of preservation • Model is rich enough to represent the full economic cost or preservation • But service providers can customize the model to exclude components whose costs are not recoverable or are subsidized as a matter of local policy

  10. Assumption: Cost allocation • Cost of the Archive, Workflows, Content Types, Monitoring, and Interventions are “common goods” • Equally beneficial to all Providers • Properly apportioned across all Providers

  11. Cost of a single Producer Number of Storage units attributable to Producer Total cost attributable to a given Producer Number of Producers Unit cost of a Producer

  12. Assumptions: Billing • Costs are billed for at the end of the period of service • The cost model should be revenue neutral

  13. Pay-as-you-go cash flow Pay-as-you-go price for a single Producer G G G Income Cash flow diagram t = 0 1 2 3 Expense G G G Cost of a single Producer Cumulative pay-as-you-go price over time period T

  14. Cumulative pay-as-you-go price as a function of time T Cumulative pay-as-you-go G (T ) $16,000 $14,000 $12,000 $10,000 $8,000 $6,000 $4,000 $2,000 $0 Cost ($) 0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 Year (T) Cumulative pay-as-you-go price over time period T … for “forever”

  15. Assumptions: Costs over time • The aggregate cost of providing preservation service decreases overtime; and that decrease is uniform • Moore’s and Kryder’s laws $ $ Moore’s law, 1971 – 2011 Source: Wikipedia Kryder’s law, 1980 – 2012 Source: Wikipedia

  16. Assumptions: Costs over time • The aggregate cost of providing preservation service decreases overtime; and that decrease is uniform • Moore’s and Kryder’s laws • State-of-the-art tools and understanding • Productivity increases

  17. Discounted pay-as-you-go cash flow Pay-as-you-go price for a single Producer G (1–d )·G (1–d )2·G Income Discounted cash flow (DCF) diagram t = 0 1 2 3 Expense G (1–d )·G (1–d )2·G Cost of a single Producer Discounting factor Discounted pay-as-you-go price over time period T

  18. Discounted pay-as-you-go price as a function of time T (1-d)tdiscount factor Cumulative pay-as-you-go G (T ) $16,000 $14,000 $12,000 $10,000 $8,000 $6,000 $4,000 $2,000 $0 Discounted pay-as-you-go G (T,d) Discounted pay-as-you-go G (,d ) Cost ($) 0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 Year (T) Discounted pay-as-you-go price over time period T … for “forever”

  19. Discount factor • d is the weighted sum of the expected changes in number and unit cost of individual components • Weighting factors ω are the proportion that a particular component contributes to the aggregate cost G, e.g.

  20. Drawbacks to pay-as-you-go pricing • Only viable for Producers with reliable annual funding sources • Boom-or-bust budgeting or the termination of funded project work can interrupt this funding • Any interruption in proactive preservation care can lead to irretrievable data loss

  21. Assumptions: Investment return • Preservation service providers can carry forward budgetary surpluses across fiscal years • Surplus funds can be invested with the return supplementing the surplus

  22. Paid-up cash flow Paid-up price r ·[(1+r )·[(1+r) ·F–G ]–(1–d )·G ]– (1–d )2·G Investment return r ·[(1+r )· F –G ] F r ·F Income t = 0 1 2 3 Cost of a single Producer Expense G (1–d )·G (1–d )2·G (1+r )·F –G F (1+r )· [(1+r )· F –G ]– (1–d )·G (1+r )·[(1+r )· [(1+r )· F –G ]– (1–d )·G ]–(1–d )2·G Surplus Paid-up price for time period T

  23. Paid-up price as a function of time T (1–d)tdiscount factor Cumulative pay-as-you-go G (T ) $16,000 $14,000 $12,000 $10,000 $8,000 $6,000 $4,000 $2,000 $0 Discounted pay-as-you-go G (T,d) Discounted pay-as-you-go G (,d ) Cost ($) Paid-up price, for T F (T,d ,r) (1+r)tinvestment return Paid-up price, for  F (,d ,r) 0 2 4 6 8 10 12 14 16 18 20 22 24 26 28 30 Year (T) Paid-up price for time period T … for “forever”

  24. Paid-up example • Pay-as-you-go price, G$ 650 (1 TB) • Discount factor, d 5% • Investment return, r 2% • Term, T 10 years • Paid-up price, F $ 4,725 r d < $ 5,216 < $ 6,500

  25. Coefficient of permanence • It is useful to be able to transition from a pay-as-you-go to a paid-up price basis • If you’re currently paying G on a pay-as-you-go basis, you can upgrade to a paid-up basis with a one-time payment of F = G ·φ , where • Princeton DataSpace, φ ≈ 30 (T = ) • USC digital repository, φ ≈ 1.2 (T = 20)

  26. Problems with R&D • TCP modeling is dependent on the predicative reliability of rand d • For d, extrapolate from Moore’s and Kryder’s laws? ? ? Moore’s law, 1971 – 2011 Source: Wikipedia Kryder’s law, 1980 – 2012 Source: Wikipedia

  27. Problems with R&D • TCP modeling is dependent on the predicative reliability of rand d • For d, extrapolate from Moore’s and Kryder’s laws? • For r, extrapolate from 30 year Treasury bonds? ? 30 year treasuries, 2007 – 2012 Source: http://ycharts.com/indicators/30_year_treasury_rate 30 year treasuries, 1882 – 2012 Source: Robert Schiller

  28. Model the risk • Round up r and d, i.e., adding a fixed “risk premium” • Add an additional risk component R to the formula for G • Its influence on the price can grow over time, reflecting increasing uncertainty, by setting a negative discount factor dR so that 1–dR> 1 • Note, however, that if the weighted sum dbecomes less than 0 and |d | > r then G (T ) will not converge to a limit + R

  29. Recalibrate the model • G and F do not have to be fixed values over time • Periodically recalculate based on current conditions (actual costs for G ) and predictions (r and d ), and apply prospectively • Retrospective service contracts remain “locked-in”

  30. Hybrid price model • Distinguish between costs that are (relatively) easy to quantify and forecast, and those that aren’t • Use the paid-up model for the former and pay-as-you-go for the latter

  31. Hybrid price model • Distinguish between costs that are (relatively) easy to quantify and forecast, and those that aren’t • Use the paid-up model for the former and pay-as-you-go for the latter • Bit preservation only

  32. Preservation forever • Some things are intended to last forever… Source: John Church Company Source: United Artists

  33. Preservation forever • Some things are intended to last forever… ?

  34. Preservation for … • A fixed term – 10 years? 20 years? – may be appropriate for much content • Give content an opportunity to prove its worth, as evidenced by someone’s commitment to pay for its subsequent preservation

  35. Embrace uncertainty • The discounted cash flow (DCF) approach is problematic on practical and theoretical grounds • Difficulty in the setting fixed values for rand d that realistically represent financial and technological trends over time • Stochastic modeling to determine the probability distribution of possible outcomes • C.f., David Rosenthal, FAST ‘12 http://blog.dshr.org/2012/02/fast-2012.html CNI Fall 2011 http://www.youtube.com/watch?v=_5lQxmyz3xY

  36. Understand the risks • Possible outcomes… • We overestimate our costs and collect too much • Fund a higher level of service • Refund some portion • We underestimate • Ask for additional funds • Lower service levels • De-accession content – but at least it was preserved up to that point and had a chance to prove its value, and gain an advocate

  37. Conclusions • Different customers have different funding capabilities • Flexibility in price models is important • Any price model is based on an idealization of the real world • Assumptions matter • Knowing all of the costs is distinct from a policy decision to recover all of those costs • If investment return and discount factor can be reliably projected, then standard DCF/NPV methods provide a reasonable prediction of long-term costs

  38. Looking for feedback • Thanks to our reviewers • Lisa Baird, UCOP • Raym Crow, SPARC • Todd Grappone, UCLA • Cliff Lynch, CNI • David Minor, UCSD/SDSC • Richard Moore, SDSC • Michael Mundrane, UC Berkeley • Jake Nadal, UCLA • David Rosenthal, LOCKSS • Mackenzie Smith • UC Council of University Librarians / Systemwide Operations and Planning Group • Candace Yano, UC Berkeley • Have we missed something in our analysis, logic, assumptions, math, etc? • Are the objections to the DCF-based analysis substantial enough to invalidate this approach? • Are there better forecasting methodologies? • Even if we don’t have a perfect model, we need to move forward now with a “good enough” model

  39. For more information Total Cost of Preservation: Cost Modeling for Sustainable Services http://wiki.ucop.edu/display/Curation/Cost+Modeling UC Curation Center http://www.cdlib.org/uc3 uc3@ucop.edu Stephen Abrams Mark Reyes Patricia Cruse Abhishek Salve Scott Fisher Joan Starr Erik Hetzner Tracy Seneca Greg Janée Carly Strasser John Kunze Marisa Strong Margaret Low Adrian Turner David Loy Perry Willett

More Related