Adapting Predictive Models for Cepheid Variable Star Classification Using Linear Regression and Maximum Likelihood Academic Article uri icon


  • AbstractWe describe an approach to automate the classification of Cepheid variable stars into two subtypes according to their pulsation mode. Automating such classification is relevant to obtain a precise determination of distances to nearby galaxies, which in addition helps reduce the uncertainty in the current expansion of the universe. One main difficulty lies in the compatibility of models trained using different galaxy datasets; a model trained using a training dataset may be ineffectual on a testing set. A solution to such difficulty is to adapt predictive models across domains; this is necessary when the training and testing sets do not follow the same distribution. The gist of our methodology is to train a predictive model on a nearby galaxy (e.g., Large Magellanic Cloud), followed by a model-adaptation step to make the model operable on other nearby galaxies. We follow a parametric approach to density estimation by modeling the training data (anchor galaxy) using a mixture of linear models. We then use maximum likelihood to compute the right amount of variable displacement, until the testing data closely overlaps the training data. At that point, the model can be directly used in the testing data (target galaxy).

published proceedings


author list (cited authors)

  • Gupta, K. D., Vilalta, R., Asadourian, V., & Macri, L.

citation count

  • 0

complete list of authors

  • Gupta, Kinjal Dhar||Vilalta, Ricardo||Asadourian, Vicken||Macri, Lucas

publication date

  • May 2015