eng
Statistical Research and Training Center - Statistical Centre of Iran
Journal of Statistical Research of Iran JSRI
2538-5771
2538-5763
2006-09
3
1
1
22
article
Outlier Detection by Boosting Regression Trees
Nathalie Chèze
1
Jean-Michel Poggi
Jean-Michel.Poggi@math.u-psud.fr
2
A procedure for detecting outliers in regression problems is proposed. It is based on information provided by boosting regression trees. The key idea is to select the most frequently resampled observation along the boosting iterations and reiterate after removing it. The selection criterion is based on Tchebychev’s inequality applied to the maximum over the boosting iterations of the average number of appearances in bootstrap samples. So the procedure is noise distribution free. It allows to select outliers as particularly hard to predict observations. A lot of well-known bench data sets are considered and a comparative study against two well-known competitors allows to show the value of the method.
http://jsri.srtc.ac.ir/article-1-163-en.pdf
Boosting
CART
outlier
regression.
eng
Statistical Research and Training Center - Statistical Centre of Iran
Journal of Statistical Research of Iran JSRI
2538-5771
2538-5763
2006-09
3
1
23
46
article
Some Statistical Methods for Prediction of Athletic Records
R. Dargahi-Noubary
rnoubary@bloomu.edu
1
Prediction of the sports records has received a great deal of attention from researchers in different disciplines. This article reviews some of the methods developed by statisticians and offers few improvements. Specific methods discussed include trend analysis, tail modeling, and methods based on certain results of the theory of records for independent and identically distributed attempts. To make the latter theory applicable, and to account for factors affecting the records, adjustments are made to the data in the form of increase in participation or attempts. Models utilized for this purpose include geometric increase, logistic increase, and increase as a non-homogenous Poisson process. A method for prediction of ultimate record is also included together with demonstrating examples using data for men’s long jump and 400 meter run.
http://jsri.srtc.ac.ir/article-1-160-en.pdf
Sport records
prediction
trend analysis
tail modeling
theory of records
Poisson processes.
eng
Statistical Research and Training Center - Statistical Centre of Iran
Journal of Statistical Research of Iran JSRI
2538-5771
2538-5763
2006-09
3
1
47
61
article
A New Skew-normal Density
Maryam Sharafi
1
javad Behboodian
behboodian@stat.susc.ac.ir
2
We present a new skew-normal distribution, denoted by NSN($lambada$). We first derive the density and moment generating function of NSN($lambada$). The properties of SN($lambada$), the known skew-normal distribution of Azzalini, and NSN($lambada$) are compared with each other. Finally, a numerical example for testing about the parameter $lambada$ in NSN($lambada$) is given.
http://jsri.srtc.ac.ir/article-1-159-en.pdf
Skew-normal distribution
a new skew-normal distribution
moment generating function
skewness
kurtosis
testing hypothesis.
eng
Statistical Research and Training Center - Statistical Centre of Iran
Journal of Statistical Research of Iran JSRI
2538-5771
2538-5763
2006-09
3
1
63
74
article
A Randomness Test for Stable Data
Adel Mohammadpour
adl@aut.ac.ir
1
Ali Mohammad-Djafari
djafari@lss.supelec.fr
2
John P. Nolané
jpnolan@american.edu
3
In this paper, we propose a new method for checking randomness of non-Gaussian stable data based on a characterization result. This method is more sensitive with respect to non-random data compared to the well-known non-parametric randomness tests.
http://jsri.srtc.ac.ir/article-1-162-en.pdf
Stable distributions
randomness tests (test for i.i.d.)
characterization.
eng
Statistical Research and Training Center - Statistical Centre of Iran
Journal of Statistical Research of Iran JSRI
2538-5771
2538-5763
2006-09
3
1
75
90
article
A Comparative Review of Selection Models in Longitudinal Continuous Response Data with Dropout
Elaheh Vahidi-Asl
elahehva@yahoo.com
1
Mojtaba Ganjali
m-ganjali@sbu.ac.ir
2
Missing values occur in studies of various disciplines such as social sciences, medicine, and economics. The missing mechanism in these studies should be investigated more carefully. In this article, some models, proposed in the literature on longitudinal data with dropout are reviewed and compared. In an applied example it is shown that the selection model of Hausman and Wise (1979, Econometrica 47, pp. 455-473) and the shared parameter model of Follmann and Wu (1995, Biometrics 51, pp. 151-168), two of the most used models for longitudinal data with dropout in economics and medical researches, respectively, cannot sufficiently consider the relation between response variables and missing mechanism. In this paper, the Follmann and Wu’s (1995) dropout model is also generalized by adding a previous time outcome component to the model. Having modified this model, in the case of longitudinal data with two time periods, a general form of this model is obtained, which is able to consider all relations between response and missing mechanism. This is proven in an implicit way. A test for missing at random in the generalized Hechman model (Crouchley and Ganjali, 2002, Stat. Model. 2, pp. 39-62) is also introduced where one has to use $delta$-method to find the variance of the test statistic.
http://jsri.srtc.ac.ir/article-1-161-en.pdf
Longitudinal data
continuous response
missing values
selection bias
dropout
random effect model.
eng
Statistical Research and Training Center - Statistical Centre of Iran
Journal of Statistical Research of Iran JSRI
2538-5771
2538-5763
2006-09
3
1
91
101
article
On Classification of Bivariate Distributions Based on Mutual Information
Mohamed Habibullah
1
Mohammad Ahsanullah
ahsan@rider.edu
2
Among all measures of independence between random variables, mutual information is the only one that is based on information theory. Mutual information takes into account of all kinds of dependencies between variables, i.e., both the linear and non-linear dependencies. In this paper we have classified some well-known bivariate distributions into two classes of distributions based on their mutual information. The distributions within each class have the same mutual information. These distributions have been used extensively as survival distributions of two component systems in reliability theory.
http://jsri.srtc.ac.ir/article-1-158-en.pdf
Mutual information
entropy
survival distribution
bivariate di