1 / 22

Data Sourcing, Statistical Processing and Time Series Analysis

Data Sourcing, Statistical Processing and Time Series Analysis. Presented at EDAMBA summer school, Soreze (France) 23 July – 27 July 2009. An Example from Research into Hedge Fund Investments . ‘In the business world, the rearview mirror is always clearer than the windshield’

brooke
Download Presentation

Data Sourcing, Statistical Processing and Time Series Analysis

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Data Sourcing, Statistical Processing and Time Series Analysis Presented at EDAMBA summer school, Soreze (France) 23 July – 27 July 2009 • An Example from Research into Hedge Fund Investments

  2. ‘In the business world, the rearview mirror is always clearer than the windshield’ - Warren Buffett -

  3. Research Purpose • Developing accurate parametric pricing models for hedge funds and fund of hedge funds • Accounting for the special statistical properties of alternative investment funds • Providing practitioners and statisticians with a framework to assess, categorize and predict hedge fund investments

  4. Research Approach • Research Philosophy • Research Approach • Primary Data Positivistic, deductive research: Postulation of hypotheses that are tested via standard statistical procedures Empirical analysis: Interpreting the quality of pricing models on the basis of historical data External secondary data: Historic time series adjusted for data-bias effects

  5. Data Sourcing • DATA POOL

  6. Data Treatment • FACTOR ANALYSIS • DATA POOL • MODEL BUILDING • STATISTICAL CLUSTERING

  7. STATISTICAL SIGNIFICANCE

  8. Data Processing (1/2)

  9. Data Processing (2/2)

  10. Data Import Access Database Excel Pivot table report

  11. Access Database Management • Introduce Autonumber as primary keys • Define foreign keys for data queries • Define table relationships (one-to-many) • Build junction tables (many-to-many) • Write SQL queries to display relevant data • Integrate SQL in VBA code

  12. Why Access? • Avoiding duplicate entries • Cross-referencing data from various sources • Combining and aggregating different databases • Efficient storage due to relational data management • Queries allow for retrieval/display of specific data • Linked-in with Microsoft VBA and Excel (data displayable as Pivot table reports) • Searching for specific entries via SQL

  13. Data Validity • Consistency of performance history across different database providers • Degree of history-backfilling bias • Exclusion of defaulted funds/non-reporting funds from databases (survivorship bias) • Extent of infrequent or inconsistent pricing of assets (managerial bias)

  14. Data Bias • Survivorship • Self-Selection • Database • Instant History • Look-ahead Inclusion of graveyard funds Multiple databases Rolling-window observation / Incubation period

  15. Hedge Fund Categories (TASS)

  16. Statistical tests • Regression Alpha • Average Error term • Information Ratio • Normality (Chi-squared, JarqueBera) • Goodness of fit, phase-locking and collinearity (Akaike Information Criterion, Hannan-Schwartz) • Serial Correlation (Durbin-Watson, Portmanteau) • Non-stationarity (unit root)

  17. Comparative Analysis • Strategy 1 • Leverage • Strategy 2 • Leverage Unbalanced ANOVA (within and between treatments) t – test (leverage vs. no leverage) t – test (between strategies) t – test for equal means t – test for equal means t – test for equal means • Strategy 1 • No Leverage • Strategy 2 • No Leverage t – test for equal means

  18. Empirical Findings • The accuracy of pricing models could be significantly improved when accounting for special statistical properties of hedge funds (Non-normality, non-linearity) • Hedge fund performance can be attributed to location choice as well as trading strategy • A limited number of principal components explains a significant proportion of cross-sectional return variation

  19. Literature Review • Hedge Fund Linear Pricing Models • Sharpe Factor Model (Sharpe, 1992) • Constrained Regression (Otten, 2000) • Fama-French Factor Model (Fama, 1992) • Factor Component Analysis (Fung, 1997) • Simulation of Trading component (lookback straddle)

  20. Prediction Models

  21. Sources

More Related