Services on Demand
Journal
Article
Indicators
- Cited by SciELO
- Access statistics
Related links
- Cited by Google
- Similars in SciELO
- Similars in Google
Share
Revista Colombiana de Estadística
Print version ISSN 0120-1751
Rev.Colomb.Estad. vol.40 no.1 Bogotá Jan./June 2017
https://doi.org/10.15446/rce.v40n1.61779
http://dx.doi.org/10.15446/rce.v40n1.61779
1Universidad Nacional de Colombia, Facultad de Ciencias, Departamento Estadística, Bogotá, Colombia. PhD. Email: ecepedac@unal.edu.co
2Universidad Nacional de Colombia, Facultad de Ciencias, Departamento Matemáticas, Bogotá, Colombia. PhD (c). Email: mvcifuentesa@unal.edu.co
Overdispersion is a common phenomenon in count datasets, that can greatly affect inferences about the model. In this paper develop three joint mean and dispersion regression models in order to fit overdispersed data. These models are based on reparameterizations of the beta-binomial and negative binomial distributions. Finally, we propose a Bayesian approach to estimate the parameters of the overdispersion regression models and use it to fit a school absenteeism dataset.
Key words: Bayesian Approach, Beta-Binomial Distribution, Distribution, Gamma Distribution, Negative Binomial, Overdispersion, Poisson Distribution.
La sobredispersión es un fenómeno común en conjuntos de datos de conteo, que puede afectar en alto grado las inferencias relacionadas con el modelo. En este artículo desarrollamos tres modelos de regresión conjunta de media y dispersión para ajustar datos sobredispersos. Estos modelos se basan en reparameterizaciones de las distribuciones beta-binomial y binomial negativa. Finalmente, proponemos un enfoque Bayesiano para la estimación de los parámetros de los modelos de regresión sobredispersos y lo utilizamos para ajustar un conjunto de datos de ausentismo escolar.
Palabras clave: distribución beta-binomial, distribución binomial negativa, distribución de Poisson, distribución gamma, enfoque bayesiano, sobredispersión.
Texto completo disponible en PDF
References
1. , S. F. & Cribari-Neto, F. (2004), 'Beta Regression for Modelling Rates and Proportions', Journal of Applied Statistics(31-7), 799-815. [ Links ]
2. Breslow, N. (1984), 'Extra-Poisson variation in Log-Linear models', Journal of Applied Statistics 31, 38-44. [ Links ]
3. Cepeda-Cuervo, E. (2001), Modelagem da Variabilidade em Modelos Lineares Generalizados, Unpublished Math Ph.D. thesis, Mathematics Institute, Universidade Federal do Río de Janeiro. [ Links ]
4. Cepeda-Cuervo, E. & Achcar, J. (2009), 'Modelos de regresión heterocedásticos usando aproximación bayesiana', Revista Colombiana de Estadística 32(2), 267-287. [ Links ]
5. Cepeda-Cuervo, E. & Gamerman, D. (2005), 'Bayesian methodology for modeling parameters in the two parameter exponential family', Revista Estadística 57(168), 93-105. [ Links ]
6. Cepeda-Cuervo, E., Migon, H., Garrido, L. & Achcar, J. (2014), 'Generalized linear models with random effects in the two-parameter exponential family', Journal of Statistical Computation and Simulation 84(3), 513-525. [ Links ]
7. Collet, D. (1991), Modeling Binary Data, Chapman Hall, London. [ Links ]
8. Cox, D. (1983), 'Some remarks on overdispersion', Biometrika 70(1), 269-274. [ Links ]
9. Demétrio, C. & Hinde, J. (1998), 'Overdipersion: Models and estimation', Computational Statistics and Data Analysis 27, 151-170. [ Links ]
10. Demétrio, C., Kokonendji, C. & Zocchi, S. (2007), 'On Hinde-Demétrio regression models for overdispersed count data', Statistical Methodology 4, 277-291. [ Links ]
11. Jórgensen, B. (1997), The Theory of Dispersion Models, Chapman & Hall, London. [ Links ]
12. Lawless, J. (1987), 'Negative binomial regression model', Canadian Journal of Statistics 15(3), 209-225. [ Links ]
13. Margolin, B., Kaplan, N. & Zeiger, E. (1981), 'Statistical analysis of the Ames Salmonella microsome test', Proceedings of the National Academy of Sciences 76, 3779-3783. [ Links ]
14. McCullagh, P. & Nelder, J. (1989), Generalized Linear Models, Chapman Hall, London. [ Links ]
15. Quine, S. (1975), Achievement orientation of aboriginal and white Australian adolescents, Ph.D. Thesis, Australian National University, Australia. [ Links ]
16. Quintero-Sarmiento, A., Cepeda-Cuervo, E. & Núñez-Antón, V. (2012), 'Estimating infant mortality in Colombia: some overdispersion modeling approaches', Journal of Applied Statistics 39(5), 1011-1036. [ Links ]
17. Williams, D. (1975), 'The analysis of binary responses from toxicological experiments involving reproduction and teratogenicity', Biometrics 31(4), 949-952. [ Links ]
18. Williams, D. (1982), 'Extra-binomial Variation in Logistic linear Models', Journal of Applied Statistics 31, 144-148. [ Links ]
Este artículo se puede citar en LaTeX utilizando la siguiente referencia bibliográfica de BibTeX:
@ARTICLE{RCEv40n1a07,
AUTHOR = {Cepeda-Cuervo, Edilberto and Cifuentes-Amado, María Victoria},
TITLE = {{Double Generalized Beta-Binomial and Negative Binomial Regression Models}},
JOURNAL = {Revista Colombiana de Estadística},
YEAR = {2017},
volume = {40},
number = {1},
pages = {141-163}
}