Cherkashina A. S., Matsuga O. M., Atanova M. Y.

Oles Honchar Dnipropetrovsk National University

INFORMATION TECHNOLOGY OF DISTRIBUTIONS RESTORATION BASED ON ROBUST PROCEDURES

During the processing and analysis of sample data, anomalous observations which values are very different from the others require considerable attention. These observations can affect the results of statistical research significantly. To reduce their impact, it is advisable to apply robust methods of mathematical statistics. Such techniques are studied well, but their software implementation in automated data processing systems is still insufficient.

The paper sets the goal to develop software for solving the task of distribution restoration формула, where формула – vector of distribution parameters, using robust methods.

Software’s kernel was created by a computational scheme of estimation the distribution parameters vector формула based on robust procedure [1]. The computational scheme can be presented as follows:

1.  Find an initial approximation of the estimate of the distribution parameters vector формула using the least squares method, namely, by means of minimization of the residual variance [2]

формула    (1)

where формула – linearized distribution function; формула – empirical value of linearized distribution function; формула – number of iteration; формула

2.  Set формула

3.  Using the current approximation of the estimate формула and the chosen weight function, find the weight

формулаформула,

where формулаформула – weight function; формулаq=1,2… – iteration number.

4.  Compute the following approximation of the estimate of the parameters vector формула by means of minimization of the functional (1), using the weights which were found in step 3.

5.  Check the condition формула. If this condition is satisfied, then we take формула for the desired evaluation of the distribution parameters vector. Otherwise, proceed to step 2.

The computational scheme was implemented to restoration of:

– normal, exponential, Weibull distributions;

– normal, exponential and Weibull spline-distributions with one node.

The following weight functions were used: Huber, Ramsey, Andrews, Tukey and Hampelya.

As a result, the software for distributions restoration by robust methods «RobustProcedurs» was created. Its testing was performed on the simulation data. Test results showed the adequacy of the computational scheme.

The created software «RobustProcedurs» based on the computational scheme forms the information technology of distributions restoration based on robust procedure. That ensures obtaining estimates which are relatively stable to statistical distribution of possible deviations from the theoretical.

The list of references:

1. Дрейпер Н. Прикладной регрессионный анализ: 3-е изд. / Н. Дрейпер, Г. Смит. – М.: Издательский дом «Вильямс», 2007. – 912 с.

2. Бабак В. П. Статистична обробка даних / В. П. Бабак, А. Я. Бі­ле­цький, О. П. Приставка, П. О. Приставка. – К.: МІВВЦ, 2001. – 388 с.