Journal of System Simulation ›› 2020, Vol. 32 ›› Issue (11): 2226-2234.doi: 10.16182/j.issn1004731x.joss.19-FZ0510E

Previous Articles     Next Articles

Generalized Zero-inflated Binomial Distribution Model Aimed at Air Quality Data Analysis

Su Benyue1,3, Xu Pengpeng2,3, Sheng Min3,4   

  1. 1.School of Mathematics and Computer,Tongling University,Tongling 244061,China;
    2.School of Computer and Information,Anqing Normal University,Anqing 246133,China;
    3.The University Key Laboratory of Intelligent Perception and Computing of Anhui Province,Anqing 246133,China;
    4.School of Mathematics and Computational Science,Anqing Normal University,Anqing 246133,China
  • Received:2019-02-10 Revised:2019-09-08 Online:2020-11-18 Published:2020-11-17
  • About author:Su Benyue(1971-),Male,Wuhu,Ph.D.,professor,research direction:statistical computing and statistical pattern recognition.
  • Supported by:
    National Nature Science Foundation of China(11475003), Science and Technology Major Project of Anhui Province (18030901021), Anhui Provincial Department of Education outstanding top-notch talent-funded projects ( gxbjZD26)

Abstract: For the problem of the quality monitoring and counting of excessive gas emissions in chemical industry parks, a generalized zero-inflated binomial distribution model is constructed. Statistics show that the times of number of excessive gas emissions has a typical zero-inflated feature. The traditional zero-inflated Poisson model and negative binomial regression model and so on will underestimate the probability of zero inflation. A generalized zero-inflated binomial distribution model is constructed by extending the traditional binomial regression model to a more general form. This model satisfies the characteristic that the expectation is less than the variance, and better solves the problems of both over-dispersed and zero-inflated in excessive gas emissions. Experiments show that the generalized zero-flated binomial distribution model has a good fitting effect, strong adaptability and robustness.

Key words: count model, generalized binomial distribution, zero-inflated model, zero-inflated binomial regression model, air quality analysis

CLC Number: