Proceedings of the 1st International Conference on Statistics and Analytics, ICSA 2019, 2-3 August 2019, Bogor, Indonesia

Research Article

VARIABLE SELECTION IN ANALYZING LIFE INFANT BIRTH IN INDONESIA USING GROUP LASSO AND GROUP SCAD

Download576 downloads
  • @INPROCEEDINGS{10.4108/eai.2-8-2019.2290485,
        author={Ita  Wulandari and Khairil Anwar  Notodiputro and Bagus  Sartono},
        title={VARIABLE SELECTION IN ANALYZING LIFE INFANT BIRTH IN INDONESIA USING GROUP LASSO AND GROUP SCAD},
        proceedings={Proceedings of the 1st International Conference on Statistics and Analytics, ICSA 2019, 2-3 August 2019, Bogor, Indonesia},
        publisher={EAI},
        proceedings_a={ICSA},
        year={2020},
        month={1},
        keywords={lasso group selection group lasso group scad life infant birth},
        doi={10.4108/eai.2-8-2019.2290485}
    }
    
  • Ita Wulandari
    Khairil Anwar Notodiputro
    Bagus Sartono
    Year: 2020
    VARIABLE SELECTION IN ANALYZING LIFE INFANT BIRTH IN INDONESIA USING GROUP LASSO AND GROUP SCAD
    ICSA
    EAI
    DOI: 10.4108/eai.2-8-2019.2290485
Ita Wulandari1,*, Khairil Anwar Notodiputro2, Bagus Sartono2
  • 1: Statistics Departement, Polytechnic of Statistic STIS, Jl. Otista No. 64C, Jakarta 13330,Indonesia
  • 2: Statistics Departement, Bogor Agricultural University, Jl.Raya Dramaga Bogor, 16680, Indonesia
*Contact email: ita.wulandari@stis.ac.id

Abstract

Regression analysis often requires a selection of explanatory variables X1, X2, ... Xp so shrinkage coefficients can occur that can facilitate the interpretation of the regression equation obtained. In this context, the explanatory variable often has a grouping structure so that a more relevant problem is how to choose groups rather than individuals. Group LASSO and group SCAD are techniques for selecting groups of variables which in many works of literature appear to have advantages over LASSO. In this study, the percentage of live born children in the province of Bali, East Nusa Tenggara and other Indonesia provinces were analyzed and linked to the explanatory variables using the group LASSO and group SCAD methods. The classification of available explanatory variables is grouped based on the theory and results of previous studies. The results show that the best model is the group SCAD method with the smallest AIC, BIC and GCV values. Factors included in the model for Bali province are demographic factors, women's status, and autonomy and the economy. For East Nusa Tenggara province the factors that enter the model are demographics and economics, while generally for Indonesia the factors that are included in the model are demography, women's status, and autonomy and family planning.