[Next]:  Applied mathematical finance  
 [Up]:  Projects  
 [Previous]:  Projects  
 [Contents]   [Index] 


Subsections



Statistical data analysis

Collaborator: D. Belomestny, V. Essaoulova, A. Hutt, P. Mathé, D. Mercurio, H.-J. Mucha, J. Polzehl, V. Spokoiny

Cooperation with: F. Baumgart (Leibniz-Institut für Neurobiologie, Magdeburg), R. Brüggemann, Ch. Heyn, U. Simon (Institut für Gewässerökologie und Binnenfischerei, Berlin), P. Bühlmann, A. McNeil (ETH Zürich, Switzerland), C. Butucea (Université Paris 10, France), M.-Y. Cheng (National Taiwan University, Taipeh), A. Daffertshofer (Free University of Amsterdam, The Netherlands), A. Dalalyan (Université Paris 6, France), J. Dolata (Johann Wolfgang Goethe-Universität Frankfurt am Main), L. Dümbgen (University of Bern, Switzerland), J. Fan (Princeton University, USA), J. Franke (Universität Kaiserslautern), R. Friedrich (Universität Münster), F. Godtliebsen (University of Tromsø, Norway), H. Goebl, E. Haimerl (Universität Salzburg), A. Goldenshluger (University of Haifa, Israel), I. Grama (Université de Bretagne-Sud, Vannes, France), J. Horowitz (Northwestern University, Chicago, USA), B. Ittermann (Physikalisch-Technische Bundesanstalt (PTB), Berlin), A. Juditsky (Université de Grenoble, France), I. Molchanov (University of Bern, Switzerland), K.-R. Müller (Fraunhofer FIRST, Berlin), M. Munk (Max-Planck-Institut für Hirnforschung, Frankfurt am Main), S.V. Pereverzev (RICAM, Linz, Austria), H. Riedel (Universität Oldenburg), B. Röhl-Kuhn (Bundesanstalt für Materialforschung und -prüfung (BAM), Berlin), R. von Sachs (Université Catholique de Louvain, Belgium), A. Samarov (Massachusetts Institute of Technology, Cambridge, USA), M. Schrauf (DaimlerChrysler, Stuttgart), S. Sperlich (University Carlos III, Madrid, Spain), U. Steinmetz (Max-Planck-Institut für Mathematik in den Naturwissenschaften, Leipzig), P. Thiesen (Universität der Bundeswehr, Hamburg), G. Torheim (Amersham Health, Oslo, Norway), C. Vial (ENSAI, Rennes, France), Y. Xia (National University of Singapore, Singapore), S. Zwanzig (Uppsala University, Sweden)

Supported by: DFG: DFG-Forschungszentrum ``Mathematik für Schlüsseltechnologien'' (Research Center ``Mathematics for Key Technologies''), project A3; SFB 373 ``Quantifikation und Simulation Ökonomischer Prozesse'' (Quantification and simulation of economic processes), Humboldt-Universität zu Berlin; Priority Program 1114 ``Mathematische Methoden der Zeitreihenanalyse und digitalen Bildverarbeitung'' (Mathematical methods for time series analysis and digital image processing)

Description: The project Statistical data analysis focuses on the development, theoretical investigation and application of modern nonparametric statistical methods, designed to model and analyze complex data structures. WIAS has, with main mathematical contributions, obtained authority for this field, including its applications to problems in technology, medicine, and environmental research as well as risk evaluation for financial products.

Methods developed in the institute within this project area can be grouped into the following main classes.

1. Adaptive smoothing

(D. Belomestny, V. Essaoulova, A. Hutt, D. Mercurio, H.-J. Mucha, J. Polzehl, V. Spokoiny).

The investigation and development of adaptive smoothing methods have been driven by interesting problems from imaging and time series analysis. Applications to imaging include signal detection in functional Magnet Resonance Imaging (fMRI) and tissue classification in dynamic Magnet Resonance Imaging (dMRI) experiments, image denoising, analysis of images containing Poisson counts or binary information or the analysis of Positron Emission Tomography (PET) data.


Our approach for time series focuses on locally stationary time series models. These methods allow for abrupt changes of model parameters in time. Intended applications for financial time series include volatility modeling, volatility prediction, and risk assessment.


The models and procedures proposed and investigated at WIAS are based on two main approaches, the pointwise adaptation, originally proposed in [46] for estimation of regression functions with discontinuities, and adaptive weights smoothing, proposed in [33] in the context of image denoising.


The main idea of the pointwise adaptive approach is to search, in each design point, for the largest acceptable window that does not contradict to the assumed local model, and to use the data within this window to obtain local parameter estimates. This allows for estimates with nearly minimal variance under controlled bias.


The general concept behind adaptive weights smoothing is structural adaptation. The procedure attempts to recover the unknown local structure from the data in an iterative way while utilizing the obtained structural information to improve the quality of estimation. This approach possesses a number of remarkable properties like preservation of edges and contrasts and nearly optimal noise reduction inside large homogeneous regions. It is almost dimension free and is applicable to high-dimensional situations.


Both ideas have been investigated and applied in a variety of settings.

2. Dimension reduction

(J. Pohlzehl, V. Spokoiny).

Data sets from economy or finance are often high dimensional. Usually many characteristics of a firm or an asset are monitored without knowledge which characteristics are needed to answer specific questions. Data structures often do not allow for simple parametric models. Nonparametric statistical modeling of such data suffers from the curse of dimensionality problem (high-dimensional data are very sparse). Fortunately, in many cases structures in complex high-dimensional data live in low-dimensional, but usually unknown subspaces. This property can be used to construct efficient procedures to simultaneously identify and estimate the structure inherent to the data set. The most common models in this context are additive models, single- and multi-index models and partial linear models. These models focus on index vectors or dimension reduction spaces which allow to reduce the dimensionality of the data without essential loss of information. They generalize classic linear models and constitute a reasonable compromise between too restrictive linear and too vague pure nonparametric modeling.


Indirect methods of index estimation like the nonparametric least squares estimator, or nonparametric maximum likelihood estimator have been shown to be asymptotically efficient, but their practical applications are very restricted. The reason is that their evaluation leads to an optimization problem in a high-dimensional space, see [20]. In contrast, computationally straightforward direct methods like the average derivative estimator, or sliced inverse regression behave far from optimally, again due to the ``curse of dimensionality'' problem.


[17] developed a structural adaptive approach to dimension reduction using the structural assumptions of a single-index and multi-index model. The method allows for an asymptotically efficient estimation of the dimension reduction space and of the link function. [47] improves on these procedures for single- and multi-index models and generalizes it to the case of partially linear models and partially linear multi-index models.


[45] proposes a new method for partially linear models whose nonlinear component is completely unknown. The target of analysis is identification of regressors which enter in a nonlinear way in the model, and complete estimation of the model including slope coefficients of the linear component and the link function of the nonlinear component. The procedure allows for selecting the significant regression variables. As a by-product, a test that the nonlinear component is  M -dimensional for  M = 0, 1, 2,...  is developed. The proposed approach is fully adaptive to the unknown model structure and applies under mild conditions on the model. The only important assumption is that the dimensionality of the nonlinear component is relatively small. Theoretical results indicate that the procedure provides a prescribed level of the identification error and estimates the linear component with an accuracy of order  n-1/2 . A numerical study demonstrates a very good performance of the method even for small or moderate sample sizes.

3. Statistics for inverse problems

(P. Mathé, V. Spokoiny).

Ill-posed equations arise frequently in the context of inverse problems, where it is the aim to determine some unknown characteristics of a physical system from data corrupted by measurement errors.


The problem of reconstructing a planar convex set from noisy observations of its moments is considered in [12] . An estimation method based on pointwise recovering of the support function of the set is developed. We study intrinsic accuracy limitations in the shape-from-moments estimation problem by establishing a lower bound on the rate of convergence of the mean squared error. It is shown that the proposed estimator is near-optimal in the sense of the order. An application to tomographic reconstruction is discussed, and it is indicated how the proposed estimation method can be used for recovering edges from noisy Radon data. This constitutes a first step to adaptive estimation procedures for Positron Emission Tomography (PET).


For ill-posed problems it is often impossible to get sensible results unless special methods, such as Tikhonov regularization, are used. Work in this direction is carried out in collaboration with S.V. Pereverzev, RICAM Linz. We study linear problems where an operator A acts injectively and is compact in some Hilbert space, and the equation is disturbed by noise. Under a priori smoothness assumptions on the exact solution x, such problems can be regularized. Within the present paradigm, smoothness is given in terms of general source conditions, expressed through the operator A as x = $ \varphi$(A * A)v,  | v| $ \leq$ R, for some increasing function $ \varphi$$ \varphi$(0) = 0. This approach allows to treat regularly and severely ill-posed problems in the same way. The deterministic theory for such equations was developed in [23, 24], including discretization and adaptation to unknown source conditions. The statistical setup is more complicated. However, based on the seminal work by [31], we could also extend this, including several ill-posed problems as studied by [13, 50]. One is often not interested in the calculation of the complete solution x, but only in some functional, say, $ \langle$z, x$ \rangle$ of it, where z is given afore-hand. If this is the case, then the linear functional strategy, as proposed by Anderssen ([1]) is important. Previous analysis of this strategy is extended, as carried out in [25] to the present setup.


The analysis of ill-posed problems under general source conditions raises many new issues and bridges between approximation theory and interpolation theory in function spaces.


Natural inference problems for parameters in stochastic processes lead to ill-posed inverse problems. A first instance is the problem of nonparametric estimation of the weight measure a in the stochastic delay differential equation

dX(t) = $\displaystyle \Bigl($$\displaystyle \int_{{-r}}^{0}$X(t + u) da(u)$\displaystyle \Bigr)$ dt + $\displaystyle \sigma$ dW(t), 0$\displaystyle \le$t$\displaystyle \le$T,

where $ \sigma$ > 0 is constant and W denotes Brownian motion. In [40], it is shown that this inference problem for T$ \to$$ \infty$ is equivalent to an integral equation with stochastic errors in the kernel and in the right-hand side, which can be regularized and solved by the Galerkin method. An adaptive wavelet thresholding method is proposed in [41], which attains the minimax rates for classes of weight measures a with Lebesgue densities in Besov spaces. An algorithm based on the wavelet-Galerkin method for general ill-posed linear problems is proposed in [6]. The problem of estimating the length of the delay in models with continuous weight densities is treated in [42], where it is related to change-point detection in ill-posed settings.

A surprising fact is that in the classical scalar diffusion model

dX(t) = b(X(t)) dt + $\displaystyle \sigma$(X(t)) dW(t), 0$\displaystyle \le$t$\displaystyle \le$T,

ill-posedness arises due to the lack of continuous-time observations. Assuming that b has regularity s - 1 and $ \sigma$ has regularity s, estimators based on the low-frequency observations (X(n$ \Delta$))0$\scriptstyle \le$n$\scriptstyle \le$N with large N, but arbitrary $ \Delta$ > 0 are constructed in [11]. They attain the optimal minimax rate N-s/(s+3) for $ \sigma$ and N-(s-1)/(2s+3) for b. The significant loss compared to the situation of high-frequency or continuous-time observations is due to the loss of information about the continuous path properties. The estimators are based on spectral estimation of the underlying Markov transition generator and are modified to a larger model class in [43].


4. Monte-Carlo methods and related topics

(D. Belomestny, J. Polzehl, V. Spokoiny).

In cooperation with the project Applied mathematical finance a root-N consistent Monte Carlo estimator for a diffusion density ([28]) has been developed. The approach has been applied to an environmental problem ([3]) and extended to a large class of models for stochastic processes in discrete time ([29]). These models allow in particular for realistic estimation of ruin probabilities in finance.


In [2], new algorithms for the evaluation of American options using consumption processes are proposed. The approach is based on the fact that an American option is equivalent to a European option with a consumption process involved. A new method of sequential improvement of an initial approximation based on step-by-step interchanging between lower and upper bounds is developed. Various smoothing techniques are used to approximate the bounds in each step and hence to reduce the complexity of algorithm. The results of numerical experiments confirm efficiency of the algorithms proposed. Applications are intended within the project Applied mathematical finance.


Simulation-extrapolation-type estimators in errors-in-variables models are investigated in [38, 39]. These estimates generalize and improve proposals from [7, 48].

References:

  1. R.S. ANDERSSEN, The linear functional strategy for improperly posed problems, Inverse Probl., 77 (1986), pp. 11-30.

  2. D. BELOMESTNY, G.N. MILSTEIN, Monte Carlo evaluation of American options using consumption processes, in preparation.

  3. E. VAN DEN BERG, A.W. HEEMINK, H.X. LIN, J.G.M. SCHOENMAKERS, Probability density estimation in stochastic environmental models using reverse representations, Applied Mathematical Analysis Report no. 6, TU Delft, 2003, to appear in: J. Stoch. Environm. Research & Risk Assessment.

  4. T. CAI, B. SILVERMAN, Incorporating information on neighboring coefficients into wavelet estimation, Sankhy$ \bar{{a}}$, Ser. B, 63 (2001), pp. 127-148.

  5. M.-Y. CHENG, J. FAN, V. SPOKOINY, Dynamic nonparametric filtering with application to finance, in: Recent Advances and Trends in Nonparametric Statistics, M.G. Acritas, D.N. Politis, eds., Elsevier, Amsterdam, Oxford, Heidelberg, 2003, pp. 315-333.

  6. A. COHEN, M. HOFFMANN, M. REISS, Adaptive wavelet Galerkin methods for linear inverse problems, to appear in: SIAM J. Numer. Anal.

  7. J.R. COOK, L.A. STEFANSKI, Simulation-extrapolation estimation in parametric measurement error models, J. Am. Stat. Assoc., 89 (1994), pp. 1314-1328.

  8. R. FRIEDRICH, J. PEINKE, CH. RENNER, How to quantify deterministic and random influences on the statistics of the foreign exchange market, Phys. Rev. Lett., 84 (2000), 5224.

  9. M. GIURCANU, V. SPOKOINY, Confidence estimation of the covariance function of stationary and locally stationary processes, WIAS Preprint no. 726, 2002, submitted.

  10. M. GIURCANU, V. SPOKOINY, R. VON SACHS, Pointwise adaptive modeling of locally stationary time series, in preparation.

  11. E. GOBET, M. HOFFMANN, M. REISS, Nonparametric estimation of scalar diffusions based on low frequency data, to appear in: Ann. Statist.

  12. A. GOLDENSHLUGER, V. SPOKOINY, On the shape-from-moments problem and recovering edges from noisy Radon data, WIAS Preprint no. 802, 2002, Probab. Theory Related Fields, 128 (2004), pp. 123-140.

  13. G.K. GOLUBEV, R.Z. KHASMINSKI, A statistical approach to some inverse problems for partial differential equations, Probl. Peredachi Inf., 35 (1999), pp. 51-66.

  14. I. GRAMA, V. SPOKOINY, Tail index estimation by local exponential modeling, WIAS Preprint no. 819, 2003.

  15. I. GRAMA, J. POLZEHL, V. SPOKOINY, Adaptive estimation for varying coefficient generalized linear models, in preparation.

  16. W. HÄRDLE, H. HERWATZ, V. SPOKOINY, Time inhomogeneous multiple volatility modeling, J. Financial Econometrics, 1 (2003), pp. 55-95.

  17. M. HRISTACHE, A. JUDITSKY, J. POLZEHL, V. SPOKOINY, Structure adaptive approach for dimension reduction, Ann. Statist., 29 (2001), pp. 1537-1566.

  18. A. HUTT, H. RIEDEL, Analysis and modeling of quasi-stationary multivariate time series and their application to middle latency auditory evoked potentials, Physica D, 177 (2003), pp. 203-232.

  19. A. HUTT, A. DAFFERTSHOFER, U. STEINMETZ, Detection of mutual phase synchronization in multivariate signals and application to phase ensembles and chaotic data, Phys. Rev. E, 68 (2003), 036219.

  20. R.L. KLEIN, R.H. SPADY, An efficient semiparametric estimator for binary response models, Econometrica, 61 (1993), pp. 387-421.

  21. P. MATHÉ, The general functional strategy under general source conditions, submitted.

  22. P. MATHÉ, S.V. PEREVERZEV, Optimal error of ill-posed problems in variable Hilbert scales under the presence of white noise, manuscript.

  23.          , Discretization strategy for ill-posed problems in variable Hilbert scales, Inverse Probl., 19 (2003), pp. 1263-1277.

  24.          , Geometry of ill-posed problems in variable Hilbert scales, Inverse Probl., 19 (2003), pp. 789-803.

  25.          , Direct estimation of linear functionals from indirect noisy observations, J. Complexity, 18 (2002), pp. 500-516.

  26. D. MERCURIO, V. SPOKOINY, Statistical inference for time-inhomogeneous volatility models, to appear in: Ann. Statist.

  27.          , Estimation of time dependent volatility via local change point analysis, WIAS Preprint no. 904, 2004.

  28. G.N. MILSTEIN, J.G.M. SCHOENMAKERS, V. SPOKOINY, Transition density estimation for stochastic differential equations via forward-reverse representations, WIAS Preprint no. 680, 2001, Bernoulli, in print.
  29. G.N. MILSTEIN, J.G.M. SCHOENMAKERS, V. SPOKOINY, Forward-reverse representations for Markov chains, in preparation.

  30. H.-J. MUCHA, H.-G. BARTEL, ClusCorr98 -- Adaptive clustering, multivariate visualization, and validation of results, to appear in: Proceedings of the 27th Annual Conference of the GfKl, Springer, Berlin.

  31. M.S. PINSKER, Optimal filtration of square-integrable signals in Gaussian noise, Probl. Inf. Transm., 16 (1980), pp. 52-68.

  32. J. POLZEHL, V. SPOKOINY, Image denoising: Pointwise adaptive approach, Ann. Statist., 31 (2003), pp. 30-57.

  33.          , Adaptive weights smoothing with applications to image restoration, J.R. Stat. Soc., Ser. B, 62 (2000), pp. 335-354.

  34.          , Functional and dynamic Magnetic Resonance Imaging using vector adaptive weights smoothing, J.R. Stat. Soc., Ser. C, 50 (2001), pp. 485-501.

  35.          , Local likelihood modeling by adaptive weights smoothing, WIAS Preprint no. 787, 2002.

  36.          , Varying coefficient regression modeling by adaptive weights smoothing, WIAS Preprint no. 818, 2003.

  37.          , Adaptive estimation for a varying coefficient (E)GARCH model, in preparation.

  38. J. POLZEHL, S. ZWANZIG, On a symmetrized extrapolation estimator in linear errors-in-variables models, Comput. Stat. Data Anal., in print.

  39.          , On a comparison of different simulation extrapolation estimators in linear errors-in-variables models, U.U.D.M. Report no. 17, Uppsala University, 2003.

  40. M. REISS, Minimax rates for nonparametric drift estimation in affine stochastic delay differential equations, Stat. Inference Stoch. Process., 5 (2002), pp. 131-152.

  41.          , Adaptive estimation for affine stochastic delay differential equations, submitted.

  42.          , Estimation of the delay length in affine stochastic delay differential equations, submitted.

  43.          , Nonparametric volatility estimation on the real line from low-frequency observations, submitted.

  44. H. RISKEN, The Fokker-Planck Equation, Springer, Berlin, Heidelberg, 1996.

  45. A. SAMAROV, V. SPOKOINY, C. VIAL, Component identification and estimation in nonlinear high-dimensional regression models by structural adaptation, WIAS Preprint no. 828, 2003.

  46. V. SPOKOINY, Estimation of a function with discontinuities via local polynomial fit with an adaptive window choice, Ann. Statist., 26 (1998), pp. 1356-1378.

  47. V. SPOKOINY, Y. XIA, Effective dimension reduction by structural adaptation, in preparation.

  48. L.A. STEFANSKI, J.R. COOK, Simulation-extrapolation: The measurement error jackknife, J. Am. Stat. Assoc., 90 (1995), pp. 1247-1256.

  49. A. STURM, P. KÖNIG, Mechanisms to synchronize neuronal activity, Biol. Cyb., 84 (2001), pp. 153-172.

  50. A. TSYBAKOV, On the best rate of adaptive estimation in some inverse problems, C.R. Acad. Sci. Paris Sér. I Math., 330 (2000), pp. 835-840.



 [Next]:  Applied mathematical finance  
 [Up]:  Projects  
 [Previous]:  Projects  
 [Contents]   [Index] 

LaTeX typesetting by I. Bremer
2004-08-13