A Bayesian approach to construct confidence intervals for comparing the rainfall dispersion in Thailand

View article
Environmental Science

Introduction

Natural phenomena can often be random events in statistics, and natural rainfall is an important one because it is directly related to the quality of life, agriculture, economic growth, and industry, among others. Rice is the main export product from Thai agriculture, so natural rainfall is a crucial water resource for farmers. However, climate change has incurred prolonged droughts, thereby decreasing agricultural and fishery yields, and conversely, caused violent flooding and health-related issues in Thailand (Marks, 2011). Farmers who consume approximately 70% of the country’s water supply are at the forefront of these impacts due to changes in rainfall amount (Marks, 2011). Thus, extreme rainfall variation is one of the main consequences of climate change and can result in drought or flooding, and ineffective water management can exacerbate both situations (Duangdai & Likasiri, 2017).

Thailand is divided into five regions: northern, northeastern, central, eastern, and southern. The north is mountainous and contains the sources of several important rivers, including the Mekong Basin (a principal river system of Thailand). Nan is one of the provinces in the northern region faced with heavy rainfall for around two weeks which caused landslides and flooding during the rainy season (mid-May to mid-October) (Hariraksapitak, Sriring & Navaratnam, 2018). Nicely & Counselor (2018) reported that approximately 60% of the northeastern region’s arable land has been turned over to total rice cultivation. Meanwhile, 5–10% of precipitation above the normal level occurred during May–July 2018 in the northern and northeastern regions, as reported by the Thai Meteorological Department (Nicely & Counselor, 2018). These situations have led us to consider estimating the rainfall fluctuation between the northern and northeastern regions of Thailand in terms of their ratio of variances during July 2018. It is of major importance to estimate the dispersion in weekly rainfall amount between the northern and northeastern regions in Thailand for the benefit of key industries, and the information gained could help in realizing climate change awareness. The information could be advantageous for the Thai government and other related organizations to realize and plan for solving or reducing the risk of environmental issues if a large variation in rainfall is known in advance. The results from normality plots (Fig. 1), histograms (Fig. 2), and applying the Akaike information criterion (AIC) show that the weekly natural rainfall data for both regions fit delta-lognormal distributions. Note that the log-transformed data of the positive rainfall amounts fit a normal distribution which is symmetrical.

Q–Q plots of log-transformation of non-zero records in (A) northern (B) northeastern areas.

Figure 1: Q–Q plots of log-transformation of non-zero records in (A) northern (B) northeastern areas.

Histogram plots of log-transformation of weekly positive rainfall records in (A) northern (B) northeastern areas.

Figure 2: Histogram plots of log-transformation of weekly positive rainfall records in (A) northern (B) northeastern areas.

Aitchison & Brown (1963) first introduced the delta-lognormal distribution for non-negative data containing zero values with the probability 0 < δ < 1; positive observations with the remainder of the probability 1 − δ follow a lognormal distribution and the zeros follow a binomial distribution with binomial proportion δ. The delta-lognormal distribution has been fitted for real-world examples in many research areas such as the environment (Owen & DeRouen, 1980; Hasan & Krishnamoorthy, 2018), fishery surveys (Pennington, 1983; Smith, 1988, 1990; Lo, Jacobson & Squire, 1992; Fletcher, 2008; Wu & Hsieh, 2014) and medicine (Zhou & Tu, 1999, 2000; Tian & Wu, 2006; Hasan & Krishnamoorthy, 2018).

In statistical inference, one of the parameters of interest is the variance (dispersion in environment) defined as the second central moment; the positive square root of the variance is called the standard deviation (Casella & Berger, 2002). To compare two independent populations, the ratio of their variances is the measurement of the variation between them with no difference resulting in a ratio equal to 1. Confidence intervals (CIs) have been applied to several distributions to estimate the variability in terms of variance and the ratio of variances. For example, Krishnamoorthy, Mathew & Ramachandran (2006) obtained CIs for lognormal variance based on the idea of the generalized confidence interval (GCI) which they applied to the variation in the air lead levels in 15 industrial facilities in a health hazard evaluation. Bebu & Mathew (2008) claimed that GCI was better than a modified signed log-likelihood ratio test for establishing CIs for the ratio of variances for bivariate lognormal distributions even when the sample sizes were small. Niwitpong (2017) showed that GCI performed well for the ratio of variances of lognormal distributions to solve the problems that occur when applying the traditional approach.

Casella & Berger (2002) argued that CIs can capture the parameter of interest better than point estimates. Although a few attempts have been conducted on CIs for the ratio of variances for some distributions, research on the delta-lognormal distribution has yet to be carried out. Clearly, there is a need for research that investigates and constructs CIs for the ratio of delta-lognormal variances using the highest posterior density based on normal-gamma prior (HPD-NG) and MOVER. These proposed CIs were compared with the existing HPD-based Jeffreys’ (HPD-Jef) and Jeffreys’ Rule (HPD-Rul) priors of Harvey & Van der Merwe (2012), GCI of Wu & Hsieh (2014) and fiducial GCI (FGCI) of Hasan & Krishnamoorthy (2018).

The organization of this paper is as follows. The concepts of all of the CIs for the ratio of variances in delta-lognormal distributions are elaborated in “Methods”. The simulation procedure and numerical results are reported in “Results.” In “Discussion,” the proposed methods are applied to real-world datasets to estimate the rainfall fluctuation between the northern and northeastern regions in Thailand. Last, we present a discussion and conclusions on the study outcomes.

Methods

Let Xij = (Xi1, Xi2, …, Xini); i = 1, 2 and j = 1, 2, …, ni be non-negative random samples draw from a delta-lognormal distribution with parameters μi, σ2i and δi, stand for Xij ∼ Δ(μi, σ2i, δi). For Xij = 0, the number of having zero ni(0)B(ni, δi) and ni = ni(0) + ni(1). For Xij > 0, XijLNi, σ2i) where Yij = ln(Xij)∼ Ni, σ2i). Aitchison & Brown (1963) defined the population variance of Xij as ωi=(1δi)exp(2μi+σi2)[exp(σi2)(1δi)]

The maximum likelihood estimates of δi, μi and σ2i are δ^i=ni(0)ni, μ^i=1ni(1)j=1ni(1)yij, and σ^i2=1ni(1)j=1ni(1)(yijμ^i)2, respectively. From Eq. (1), the ratio in delta-lognormal variances is obtained as θ=lnω1lnω2

where ωi is log-transformed as lnωi=ln(1δi)+(2μi+σi2)+ln[exp(σi2)(1δi)]. The methods for constructing CIs for θ are described as seen below.

GCI

On the ideas of GCI, the generalized pivotal quantity (GPQ) is necessary to satisfy the two requirements of Weerahandi (1993). The GPQ of δi based on VST was proposed by Wu & Hsieh (2014) as Rδi=sin2[arcsinδ^iWi2ni]where Wi=2ni(arcsinδ^iarcsinδi)dN(0,1). The GPQs of μi and σ2i are Rμi=μ^iZi(ni(1)1)σ^2/[ni(1)Ui]Rσi2=(ni(1)1)σ^i2/Uiwhich are presented by Krishnamoorthy & Mathew (2003). These GPQs lead to obtain the GPQ of θ as Rθ=Rlnω1Rlnω2where Rlnωi=ln(1Rδi)+(2Rμi+Rσi2)+ln[exp(Rσi2)(1Rδi)]. [Rθ(ζ/2),Rθ(1ζ/2)] becomes the 100(1− ζ)%GCI for θ. Algorithm 1 shows the steps to compute GCI as seen above.

(1) Generate WiN(0, 1), ZiN(0, 1) and Uiχni(1)12.
(2) Compute Rμi, Rσi2, and Rδi.
(3) Compute Rln ωi.
(4) Compute Rθi.
(5) Repeat 1–4 a number of times (say, m = 2,500).
(6) For the 2,500 times, compute (100 − ζ)%GCI for θ.

FGCI

The fiducial generalized pivotal quantity (FGPQ) was proved and defined by Hannig, Iyer & Patterson (2006). Hannig (2009) claimed that their generalized fiducial recipe has been developed as the GPQ concept, so their ideas are applied with generalized inference directly. Here Xij ∼ Δ(μi, σ2i, δi) is considered. Hasan & Krishnamoorthy (2018) showed the FGPQs of μi and σ2i as Tμi=μ^iZiσ^i2/(ni(1)Ui) Tσi2=σ^i2/Uiwhere ZiN(0, 1) and Ui ∼ χ2ni(1) 1/(ni(1) − 1) are independent random variables. Also, the FGPQs of δi was developed as T1 δibeta(ni(1) + 0.5, ni(0) + 0.5). By three FGPQs, the FGPQ of ln ωi is defined as Tlnωi=lnT1δi+(2Tμi+Tσi2)+ln[exp(Tσi2)T1δi], then the FGPQs of θ can be expressed as Tθ=Tlnω1Tlnω2which can establish the 100(1− ζ)%FGCI for θ that is [Tθ(ζ/2),Tθ(1ζ/2)]; Tθ(ζ) stands for (ζ)100th percentile of Tθ. Algorithm 2 details the computation of FGCI.

(1) Generate ZiN(0, 1) and Uiχni(1)12.
(2) Compute Tμi, Tσi2, and T1 δi.
(3) Compute Tln ωi.
(4) Compute Tθi.
(5) Repeat 1–4 a number of times (say, m = 2,500).
(6) For the 2,500 times, compute (100 − ζ)%FGCI for θ.

MOVER interval

This interval is expanded from CIs for delta-lognormal mean of Hasan & Krishnamoorthy (2018) to the ratio of delta-lognormal variances. Given the individual CIs for the parameters of interest γ, the MOVER concept is used to establish a CI for a linear combination of parameters, proposed by Krishnamoorthy & Oral (2017). Let γ=(γ1,γ2,...γp) be a p-dimensional parameter vector. The estimate of γi is γ^i;i=1,2,...,p that are independent. The MOVER interval for i=1pciγi is defined as CIMOVER=[i=1pciγ^ii=1pci2(γ^ili*)2,i=1pciγ^i+i=1pci2(γ^iui*)2 ]where li=li if ci > 0, ui if ci<0 and ui=ui if ci > 0, li if ci<0. Moreover, (li, ui) stands for the CI for γi. Here we let lnωi=ωi(1)+ωi(2)+ωi(3)=ln(1δi)+(2μi+σi2)+ln[exp(σ2)(1δi)]

Using μ^i,σ^i2,δ^i from a sample, lnω^i=ω^i(1)+ω^i(2)+ω^i(3) is obtained, and CIs for individual parameters ωi(1), ωi(2) and ωi(3) are also constructed. To begin with ωi(1), (100 − ζ)% CI-based Wilson was proposed by Wilson (1927), that is CIωi(1)=[lωi(1),uωi(1)]=ln[{(ni(1)+Zi,ζ22/2)(Zi,1ζ2ni(0)ni(1)ni+Zi,ζ224)}/(ni+Zi,ζ22)]where Zi=ni(1)ni(1δi)niδi(1δi)dN(0,1). The (100 − ζ)%CI for ωi(2) is developed from Zou, Taleban & Huo (2009), then CIωi(2)=[lωi(2),uω2(1)]=[ω^i(2)4σ^i2Ti2ni(1)+σ^i4(1ni(1)1χ1ζ/2,ni(1)12)2,ω^i(2)+4σ^2iTi2ni(1)+σ^i4(ni(1)1χζ/2,ni(1)121)2]where Ti=μ^iμiσi2/ni(1)dN(0,1) and χ2ni(1) 1 denoted as chi-square distribution with ni(1) − 1 degree of freedom. Next, the (100 − ζ)%CI for ωi(3) is proposed as CIωi(3)=[lωi(3),uωi(3)]where lωi(3)=ln[ω^i(3)exp{σ^i4(1ni(1)1χ1ζ/2,ni(1)12)2}+{Zi,1ζ2ni(0)ni(1)ni+Zi,ζ224ni+Zi,ζ22}2] uωi(3)=ln[ω^i(3)+exp{σ^i4(ni(1)1χζ/2,ni(1)121)2}+{Zi,1ζ2ni(0)ni(1)ni+Zi,ζ224ni+Zi,ζ22}2]

As mentioned above, the (100 − ζ)% MOVER interval for ln ωi can be written as CIlnωi=[Llnωi,Ulnωi]where Llnωi=(ω^i(1)+ω^i(2)+ω^i(3))(ω^i(1)lωi(1))2+(ω^i(2)lωi(2))2+(ω^i(3)lωi(3))2 Ulnωi=(ω^i(1)+ω^i(2)+ω^i(3))+(ω^i(1)uωi(1))2+(ω^i(2)uωi(2))2+(ω^i(3)uωi(3))2

(1) Generate ZiN(0, 1), TiN(0, 1) and Uiχni(1)12 are independent.
(2) Compute CIωi(1), CIωi(2) and CIωi(3).
(3) Compute CIln ωi and CIθ.
(4) Compute (100 − ζ)%MOVER for θ.

Therefore, the (100 − ζ)% MOVER interval for θ is based on Donner & Zou (2012), given by CIθ=[Lθ,Uθ]where Lθ=(lnω^1lnω^2)(lnω^1Llnω1)2+(Ulnω2lnω^2)2 Uθ=(lnω^1lnω^2)+(Ulnω1lnω^1)2+(lnω^2Llnω2)2

The following detail is used to compute the MOVER interval.

HPD credible interval

The HPD credible interval is a parameter estimate based on the posterior probability in Bayesian framework. Box & Tiao (1973) defined the HPD ideas consisting of two requirements: the set of value that contain 100(1 − ζ)% of the posterior distribution and the property that the density within the region is equal or greater than outside. The posterior density is unimodal and symmetric, the region based on their definition becomes to a equal-sided CI (ζ/2 and 1 − ζ/2 percentiles of the posterior density). This is clearly different between equal-sided CI and HPD credible interval if there is a highly skewed in the posterior density. Recently, HPDs based on beta and uniform priors were recommended to constructed for single mean and the difference between two delta-lognormal means proposed by Maneerat, Niwitpong & Niwitpong (2019). The HPD credible interval is then focused on our study. Recall that Xij > 0 be random variables from lognormal distribution with parameter κ=(μ1σ12μ2σ22); Yij = ln XijNi, σ2i). The Fisher information matrix of κ is I(κ)=diag[n1(1)σ12n1(1)2σ14n2(1)σ22n2(1)2σ24]. For Xij = 0, the number of zero values ni(0) be random sample of binomial distribution, denoted as B(nii). The likelihood function of Xij is P(x|ω)i=12δini(0)(1δi)ni(1)(σi2)ni(1)/2exp{12σi2j=1ni(1)(lnxiμi)2}which leads to obtain the Fisher information matrix of ω=(μ1σ12δ1μ2σ22δ2) I(ω)=diag[n1(1)σ12n1(1)2σ14n1δ1(1δ1)n2(1)σ22n2(1)2σ24n2δ2(1δ2)]

Here θ = ln ω1 − ln ω2 so that the HPD credible intervals based on different priors for θ are described.

Jeffreys’ prior

The Jeffreys’ prior for ω is defined as P(ω)Ji=12P(κ)JP(δ)i=12σi2[δi(1δi)]1/2where P(κ)JI(σ2) and P(δ)JI(δ). The prior Eq. (17) is combined with its likelihood Eq. (16) to obtain the posterior distributions of each parameters σ2i, μi, and δi. Firstly, the posterior density of ω is using Bayes’ theorem (Casella & Berger, 2002), given by P(ω|x)Ji=12δini(0)1/2(1δi)ni(1)1/2σi(ni(1)+2)exp{12σi2[(ni(1)1)σ^i2+ni(1)(μ^iμi)2]}

Obtain that P(σi2|x)Ji=12(σi2)(ni(1)12+1)exp{(ni(1)1)σ^i22σi2}which is the posterior density of σ2|x, as inverted gamma distribution, denoted as IG(ai, bi); ai = ri/2 and bi=riσ^i2/2; ri = ni(1) − 1. Given σ2i and x, one obtains that P(μi|σi2,x)Ji=12exp{ni(1)2σi2(μiμ^i)2}

This is the posterior density of μi2i, x, as normal N(μ^i,σi2/ni(1)). For δi, its posterior was proved as p(δi|x)Jδi(ni(0)+1/2)1(1δi)(ni(1)+1/2)1which is the beta distribution with ci and di, denoted as beta(ci, di); ci = ni(0) + 1/2 and di = ni(1) + 1/2.

Jeffreys’ Rule prior

According to Harvey & Van der Merwe (2012), the difference between Jeffreys and Jeffreys’ Rule priors is the posterior densities of σ2i and δi, while Jeffreys’ Rule prior is defined as P(ω)JRi=12σi3δi1/2(1δi)1/2

To obtain the joint posterior of ω denoted as P(ω|x)JR, the Jeffreys’ Rule prior Eq. (21) is combined with the likelihood Eq. (16). For σ2i, its posterior has been changed to inverted gamma distribution with shape parameter si/2 and scale parameter siσ^i2/2; si = ni(1) + 1. Then, the posterior distribution of μi given σ2i and x is changed because it depends on the posterior density of σ2i. Likewise, the posterior of δi becomes P(δ|x)JR = beta(ni(0) + 1/2, ni(1) + 3/2). Their posterior distributions represent to estimate own parameter, meanwhile P(σi2|xij)JR, Pi2i, xij)JR and Pi|xij)JR are independent. As a result, HPD credible interval for θ is computed based on Jeffreys’ Rule prior. The steps for establishing HPD-intervals based on two priors are detailed as Algorithm 4.

(1) Generate σ2*i denoted as the posterior distribution of σ2i based on priors:
 • Jeffreys’ prior: σi(J)2IG(ri/2,riσ^i2/2); ri = ni(1) − 1.
 • Jeffreys’ Rule prior: σi(JR)2IG(si/2,siσ^i2/2); si = ni(1) + 1.
 • NG prior: σ2*i(NG)IG2ii,ni(1), βi,ni(1)).
(2) Given σ2*i and x, generate μ*i depends on the following priors:
 • Jeffreys’ prior: μi(J)N(μ^i,σi(J)2/ni(1)).
 • Jeffreys’ Rule prior: μi(JR)N(μ^i,σi(JR)2/ni(1)).
 • NG prior: μi(NG)tdf(μi|μi,ni(1),βi,ni(1)/[αi,ni(1)ki,ni(1)]).
(3) Generate δ*i,
 • Jeffreys’ prior: δ*i(J)beta(ni(0) + 1/2,ni(1) + 1/2).
 • Jeffreys’ Rule prior: δ*i(JR)beta(ni(0) + 1/2,ni(1) + 3/2).
 • NG prior: δ*i(NG),ni(1)beta(ni(1) + di,ni(0) + di).
(4) Compute ω*i,
 • Jeffreys’ prior: ωi(J)=(1δi(J))exp(2μi(J)+σi(J)2)[exp(σi(J)2)(1δi(J))].
 • Jeffreys’ Rule prior: ωi(JR)=(1δi(JR))exp(2μi(JR)+σi(JR)2)[exp(σi(JR)2)(1δi(JR))].
 • NG prior: ωi(NG)=δi(NG),ni(1)exp(2μi(NG)+σi(NG)2)[exp(σi(NG)2)δi(NG),ni(1)].
(5) Compute θ* = ln ω*1 − ln ω*2 based on three priors.
(6) Repeat 1–5 a number of times (say, m = 2,500).
(7) For the 2500 times, compute (100 − ζ)%HPD interval for θ in each prior.

Normal-gamma prior

DeGroot (1970) defined the conjugate families for random sample of normal distribution. This is a necessary to develop credible interval based on Bayesian approach for the delta-lognormal variance. Using Theorem 1 of the conjugate families for random sample of normal distribution (DeGroot, 1970), assume that (Y = ln Xij; i = 1, 2 j = 1, 2, …, ni(1)) be a random sample from normal distribution with the mean μ = (μ1, μ2) and precision λ = (λ1, λ2); λi = 1/σ2i where XLN(μ, λ). The normal-gamma prior of τ = (μ, λ)′ is defined the marginal distributions between normal μiiN(μ, [kiλi] 1) and gamma λiGi, βi), given by P(τ)=P(λ;α(0),β(0))P(μ|λ;μ(0),k(0)λ)=i=12[βi(0)Γ(αi(0))λiαi(0)1exp{βi(0)λi}][(ki(0)λi)1/22πexp{kiλi2(μiμi(0))2}]i=12λαi(0)1/2exp{λi2[2βi(0)+ki(0)(μiμi(0))2]}

The likelihood can be written as P(y;τ)=i=12(2π)ni(1)/2λini(1)/2exp{λi2i=1ni(1)(yijμi)2}i=12λini(1)/2exp{λi2[i=1ni(1)(yijμ^i)2+ni(1)(μ^μi)2]}

The posterior distribution of τ can be derived as P(τ|y)=P(y;τ)P(τ)i=12λini(1)2exp{λi2[i=1ni(1)(yijμ^i)2+ni(1)(μ^μi)2]}λiαi(0)12exp{λβi(0)}exp{λi2[ki(0)(μiμi(0))2]}i=12λni(1)2+αi(0)12exp{λi[12i=1ni(1)(yijμ^i)2+βi(0)]}exp{λi2[ni(1)(μ^μi)2+ki(0)(μiμi(0))2]}i=12λiαi12exp{λi[12i=1ni(1)(yijμ^i)2+βi(0)]}exp{λi2[(ni(1)+ki(0))(μiμi)2+ni(1)ki(0)[μ^iμi(0)]2ni(1)+ki(0)]}i=12λiαi1exp{λi[βi(0)+12i=1ni(1)(yijμ^i)2+ni(1)ki(0)[μ^iμi(0)]22(ni(1)+ki(0))]}λi12exp{λi[ni(1)+ki(0)]2(μiμi)2}i=12λiαi1exp{λiβi}λi12exp{λiki2(μiμi)2}

This implied that τi|yNormal(μi|μi,[kiλ]1)Gamma(λi|αi,βi) where αi=αi(0)+ni(1)/2, βi=βi(0)+12i=1ni(1)(yijμ^i)2+ni(1)ki(0)[μ^iμi(0)]22(ni(1)+ki(0)), ki=ni(1)+ki(0), and μi=μi(0)ki(0)+ni(1)μ^ini(1)+ki(0). From Eq. (22), the normal-gamma prior of τ is defined as P(τ)i=12λi1which is normal-gamma distribution, denoted as NGi, λi|μ, ki(0) = 0, αi(0) = −1/2, βi(0) = 0 so that its posterior of τ is derived from Eq. (24) that becomes P(τ|y)i=12λini(1)121exp{λi2j=1ni(1)(yijμ^i)2}λi12exp{ni(1)λi2(μiμi)2}

This is NGi,ni(1), ki,ni(1), αi,ni(1), βi,ni(1)); μi,ni(1)=μ^i, ki,ni(1) = ni(1), αi,ni(1) = (ni(1) − 1)/2 and βi,ni(1)=12j=1ni(1)(yijμ^i)2. The marginal posterior of λi becomes P(λi|y)λiαi,ni(1)12exp{λiβi,ni(1)}exp{λiki,ni(1)2(μiμi,ni(1))2}dμiλiαi,ni(1)12exp{λiβi,ni(1)}2πλiki,ni(1)λiαi,ni(1)1exp{λiβi,ni(1)}which is λi|yGii,ni(1), βi,ni(1)). This can be implied that σ2i|yIG2ii,ni(1), βi,ni(1)). Let ρi=βi,ni(1)+ki,ni(1)2(μμi,ni(1))2 and ai = λiρ i, then dλi=1ρidai. Also, let bi=αi,ni(1)+12. From Eq. (26), the marginal posterior of μi|y is also obtained as P(μi|λi,y))λi(αi,ni(1)+12)1exp{λi[βi,ni(1)+ki,ni(1)2(μμi,ni(1))2]}dλiλibi1exp{λiρi}dλi(aiρi)bi11ρiexp(ai)daiρibiaibi1exp(ai)daiρibi[βi,ni(1)+ki,ni(1)2(μiμi,ni(1))2](αi,ni(1)+12)[1+12αi,ni(1)ki,ni(1)αi,ni(1)βi,ni(1)(μiμi,ni(1))2](αi,ni(1)+12)

Then, μi|ytdfii,ni(1)i,ni(1)/[αi,ni(1)ki,ni(1)]) where df = 2αi,ni(1) = ni(1) − 1. For δi,ni(1) = 1 − δi, Jin, Thulin & Larsson (2017) have investigated and attempted to find the power-divergence (PD) interval for δi estimated by Bayesian credible interval-based beta(d,d) prior. For focusing on this study, the beta(d,d) prior of δi,ni(1) is given by P(δi,ni(1))=Γ(2di)[Γ(di]2δi,ni(1)di1(1δi,ni(1))di1where di=(2+zζ/22)/6; zζ/2 be a random sample of standard normal distribution. It is combined with the likelihood function of δi,ni(1) such that its posterior of δi,ni(1) is then P(δi,ni(1))Γ(2di)[Γ(di]2δi,ni(1)(ni(1)+di)1(1δi,ni(1))(ni(0)+di)1which is beta distribution, denoted as beta(ni(1) + di,ni(0) + di). The posterior distributions of μi and σ2i based on NG prior and δi,ni(1) based on Jin, Thulin & Larsson (2017) are obtained, then HPD-NG credible interval for θ can be computed in Algorithm 4.

Results

The CIs proposed in this study are HPD-NG and MOVER. The former develops the NG prior to obtain the posterior density, while the latter is an extension of Hasan and Krishnamoorthy’s MOVER (Hasan & Krishnamoorthy, 2018). Both were compared with the existing CIs HPD-Jef and HPD-Rul of Harvey & Van der Merwe (2012), GCI of Wu & Hsieh (2014), and FGCI of Hasan & Krishnamoorthy (2018). Two simulation studies were conducted to show the aforementioned CI performances under equal and unequal sample sizes in different situations:

  • S1. The probability of additional zero δi is varied while μi = 3 and σ2i = 1.

  • S2. The mean μi and δi are varied while σ2i = 1.

S1 was designed and simulated to be consistent with the weekly rainfall datasets, as can be seen in the next section. The second one was constructed to indicate CI performance when both the mean μi and δi are changed, i.e. whether the numerical simulation is consistent with S1.

The coverage probability (CP) and relative average length (RAL, denoted as the ratio of the average lengths of a proposed CI to HPD-Rul) were used to assess the performances of the CIs using Monte Carlo simulations. For 5,000 simulation runs, GPQs and FGPQs were fixed at 2,500 at a nominal level of 0.95. In a comparison of the methods, the following criteria were used to judge the best-performing CI: a CP closer to or greater than the nominal level and the narrowest RAL of less than 1 and minimal. The steps of the simulation procedure are given in Algorithm 5.

(1) Generate Xij ∼ Δ(μi, σ2i, δi); i = 1, 2 and j = 1, 2, …, ni.
(2) Compute ni(0), ni(1), μ^i, σ^i2 and δ^i.
(3) Construct CIs based on the methods as follows:
 • GCI, FGCI and MOVER from Algorithms 1, 2 and 3, respectively.
 • HPD-Jef, HPD-Rul and HPD-NG from Algorithm 4.
(4) Repeat 1–3, a number of times, (say, M = 5,000). All CIs are obtained.
(5) Compute CPs and RALs with all CIs.

The findings are based on the simulation work as follows. For the ratio of variances, the numerical evaluation shows that for the method in the S1 scenario, MOVER provided good performance for a small difference between δi with equal small sample sizes (Table 1). Importantly, HPD-NG had good coverage (a CP greater than 0.95) and the shortest CIs for both equal and unequal medium sample sizes as well as a large difference in δi for unequal large sample sizes. Meanwhile, HPD-Rul’s performance satisfied the criteria for a small difference in δi and unequal large sample sizes. The results in Table 2 for case S2 indicate that for a small difference in δi, MOVER could meet the requirements for equal small sample sizes, while HPD-NG maintained the given target for both equal and unequal medium sample sizes, and unequal large sample sizes. Moreover, HPD-Rul performed well for a large difference in δi and unequal medium-to-large sample sizes. From the evidence of both situations, HPD-Jef gave the best CP in all cases even though its average lengths were mostly wider than HPD-NG. Meanwhile, GCI and FGCI performances provided average lengths that were broader than other methods in all situations.

Table 1:
CP and RAL performances of 95% CIs for θ: (μ1, μ2) = (3, 3); (σ12, σ22 ) = (1, 1).
(n1,n2) 12) CP RAL
HPD-Jef HPD-Rul HPD-NG GCI FGCI MOVER HPD-Jef HPD-Rul HPD-NG GCI FGCI MOVER
(15,15) (0.1,0.1) 99.8 99.7 99.8 99.7 99.6 94.4 1.095 [*] 1.152 1.109 1.108 0.796
(0.1,0.2) 99.8 99.7 99.8 99.5 99.5 94.7 1.104 [*] 1.179 1.122 1.120 0.819
(0.1,0.3) 99.8 99.4 99.8 99.3 99.3 94.0 1.115 [*] 1.206 1.135 1.134 0.847
(0.1,0.4) 99.8 99.4 99.9 99.4 99.4 94.3 1.137 [*] 1.273 1.166 1.166 0.934
(0.1,0.5) 99.9 99.6 99.9 99.6 99.6 95.5 1.160 [*] 1.344 1.201 1.200 1.045
(0.2,0.1) 99.8 99.6 99.8 99.5 99.5 93.5 1.105 [*] 1.179 1.121 1.120
(0.2,0.2) 99.9 99.6 99.9 99.6 99.6 94.0 1.114 [*] 1.205 1.131 1.130 0.836
(0.2,0.3) 99.7 99.4 99.8 99.3 99.3 94.8 1.122 [*] 1.228 1.139 1.139 0.856
(0.2,0.4) 99.9 99.5 99.9 99.5 99.4 94.3 1.146 [*] 1.296 1.171 1.171 0.943
(0.2,0.5) 99.8 99.3 99.9 99.3 99.4 94.5 1.170 [*] 1.373 1.209 1.208 1.072
(0.3,0.1) 99.8 99.5 99.8 99.4 99.4 94.2 1.114 [*] 1.204 1.133 1.131 0.845
(0.3,0.2) 99.7 99.6 99.8 99.5 99.5 94.5 1.123 [*] 1.230 1.142 1.141 0.861
(0.3,0.3) 99.8 99.2 99.8 99.2 99.2 94.2 1.131 [*] 1.253 1.151 1.149 0.877
(0.3,0.4) 99.8 99.5 99.9 99.3 99.3 95.2 1.153 [*] 1.320 1.179 1.178 0.967
(0.3,0.5) 99.9 99.5 99.9 99.5 99.5 95.8 1.177 [*] 1.393 1.210 1.210 1.070
(50,50) (0.1,0.1) 97.3 97.2 95.5 96.9 97.0 91.2 1.019 [*] 0.935 1.027 1.026
(0.1,0.2) 97.6 97.2 95.4 97.2 97.2 90.7 1.021 [*] 0.936 1.029 1.028
(0.1,0.3) 96.9 96.5 94.6 96.7 96.5 90.6 1.022 [*] 0.939 1.031 1.030
(0.1,0.4) 97.2 96.7 94.8 96.5 96.7 89.9 1.024 [*] 0.944 1.034 1.034
(0.1,0.5) 96.8 95.9 94.4 95.8 96.1 89.4 1.029 [*] 0.955 1.041 1.041
(0.2,0.1) 97.2 96.6 94.9 96.6 96.6 90.5 1.020 [*] 0.936 1.028 1.027
(0.2,0.2) 97.4 96.8 95.1 96.6 96.7 90.4 1.022 [*] 0.938 1.029 1.029
(0.2,0.3) 96.4 96.1 94.1 95.8 95.9 89.2 1.023 [*] 0.941 1.032 1.032
(0.2,0.4) 96.8 96.3 94.4 96.1 96.0 90.2 1.026 [*] 0.946 1.035 1.035
(0.2,0.5) 96.8 96.5 95.1 96.4 96.5 90.1 1.030 [*] 0.956 1.041 1.041
(0.3,0.1) 97.4 96.9 95.4 96.8 96.9 90.6 1.022 [*] 0.940 1.031 1.030
(0.3,0.2) 96.9 96.5 94.5 96.5 96.3 88.5 1.024 [*] 0.942 1.032 1.032
(0.3,0.3) 96.6 96.0 94.1 95.8 95.9 89.0 1.025 [*] 0.945 1.034 1.034
(0.3,0.4) 97.1 96.6 95.2 96.8 96.7 89.9 1.027 [*] 0.950 1.036 1.035
(0.3,0.5) 96.9 96.4 94.7 96.1 96.1 89.7 1.032 [*] 0.960 1.042 1.042
(30,50) (0.1,0.1) 98.1 97.8 96.8 97.7 97.8 90.8 1.030 [*] 0.964 1.042 1.042
(0.1,0.2) 98.3 97.9 96.7 97.8 97.7 90.6 1.030 [*] 0.966 1.043 1.042
(0.1,0.3) 97.8 97.7 96.2 97.1 97.1 89.3 1.032 [*] 0.968 1.044 1.042
(0.1,0.4) 97.7 97.4 96.0 97.1 97.1 90.1 1.033 [*] 0.972 1.043 1.042
(0.1,0.5) 97.6 97.2 96.3 97.0 97.1 89.7 1.036 [*] 0.981 1.046 1.046
(0.2,0.1) 98.0 97.7 96.6 97.4 97.5 90.9 1.033 [*] 0.971 1.047 1.046
(0.2,0.2) 97.8 97.4 96.3 97.3 97.3 90.2 1.034 [*] 0.972 1.047 1.046
(0.2,0.3) 97.9 97.4 96.4 97.2 97.2 89.9 1.034 [*] 0.974 1.046 1.046
(0.2,0.4) 98.1 97.6 96.4 97.3 97.3 89.6 1.035 [*] 0.978 1.047 1.046
(0.2,0.5) 97.6 97.5 96.2 97.0 96.9 89.5 1.039 [*] 0.987 1.050 1.049
(0.3,0.1) 98.1 97.6 96.4 97.3 97.3 89.8 1.036 [*] 0.981 1.054 1.053
(0.3,0.2) 97.7 97.1 96.0 96.7 96.9 89.6 1.037 [*] 0.981 1.053 1.053
(0.3,0.3) 97.6 97.1 95.6 96.7 96.8 89.6 1.038 [*] 0.984 1.053 1.052
(0.3,0.4) 97.4 97.0 95.8 96.7 96.7 88.8 1.039 [*] 0.988 1.053 1.052
(0.3,0.5) 97.7 97.3 96.2 97.1 97.0 89.6 1.043 [*] 0.997 1.056 1.055
(50,100) (0.1,0.1) 96.7 96.4 93.9 96.2 96.3 91.7 1.016 [*] 0.922 1.026 1.026
(0.1,0.2) 96.6 96.2 94.0 96.0 96.2 91.4 1.016 [*] 0.921 1.025 1.025
(0.1,0.3) 96.7 96.2 93.8 96.1 96.1 91.1 1.016 [*] 0.921 1.025 1.025
(0.1,0.4) 96.7 96.6 94.2 96.3 96.4 91.3 1.016 [*] 0.921 1.025 1.025
(0.1,0.5) 96.8 96.5 94.5 96.3 96.3 90.9 1.019 [*] 0.923 1.026 1.025
(0.2,0.1) 96.7 96.4 94.1 96.1 96.3 91.1 1.017 [*] 0.923 1.028 1.028
(0.2,0.2) 96.7 96.3 93.7 95.9 96.0 90.4 1.017 [*] 0.923 1.029 1.027
(0.2,0.3) 96.4 96.1 93.9 96.0 95.9 90.5 1.017 [*] 0.922 1.028 1.027
(0.2,0.4) 96.8 96.6 93.7 96.4 96.4 90.7 1.018 [*] 0.923 1.028 1.027
(0.2,0.5) 96.4 96.1 93.7 96.1 96.1 90.2 1.018 [*] 0.924 1.028 1.027
(0.3,0.1) 96.0 96.0 93.0 95.5 95.4 90.9 1.019 [*] 0.926 1.033 1.033
(0.3,0.2) 96.0 95.7 93.0 95.3 95.4 90.2 1.019 [*] 0.926 1.032 1.031
(0.3,0.3) 96.4 95.7 93.7 95.8 95.8 90.5 1.019 [*] 0.926 1.032 1.031
(0.3,0.4) 96.4 95.7 93.4 95.9 95.9 90.6 1.020 [*] 0.926 1.031 1.031
(0.3,0.5) 96.7 96.3 94.2 96.2 96.3 90.7 1.021 [*] 0.928 1.030 1.031
DOI: 10.7717/peerj.8502/table-1

Notes:

[*]: HPD-Rul satisfies the criteria.

Bold denotes the best-performing CI.

Table 2 :
CP and RAL performances of 95% CIs for θ: (σ21, σ22 ) = (1, 1).
(n1,n2) 12) 12) CP RAL
HPD-Jef HPD-Rul HPD-NG GCI FGCI MOVER HPD-Jef HPD-Rul HPD-NG GCI FGCI MOVER
(15,15) (0.1,0.1) (0,0) 99.8 99.5 99.8 99.5 99.5 93.7 1.096 [*] 1.154 1.112 1.110 0.797
(0,0.3) 99.8 99.4 99.6 99.4 99.3 93.7 1.096 [*] 1.155 1.111 1.108 0.798
(0,0.5) 99.9 99.6 99.9 99.7 99.7 93.9 1.095 [*] 1.153 1.111 1.108 0.796
(0,0.7) 99.7 99.5 99.7 99.6 99.6 94.4 1.095 [*] 1.153 1.111 1.109 0.798
(0,0.9) 99.8 99.6 99.7 99.4 99.5 93.8 1.096 [*] 1.154 1.111 1.110 0.797
(0.2,0.2) (0,0) 99.7 99.4 99.8 99.4 99.4 93.9 1.114 [*] 1.205 1.132 1.130 0.839
(0,0.3) 99.9 99.6 99.9 99.6 99.7 94.6 1.113 [*] 1.205 1.130 1.129 0.835
(0,0.5) 99.9 99.5 99.8 99.5 99.6 94.5 1.115 [*] 1.206 1.131 1.129 0.835
(0,0.7) 99.8 99.7 99.9 99.6 99.5 94.6 1.114 [*] 1.205 1.132 1.130 0.838
(0,0.9) 99.8 99.6 99.9 99.5 99.6 94.5 1.114 [*] 1.205 1.131 1.130 0.836
(0.4,0.4) (0,0) 99.7 99.2 99.9 99.2 99.3 94.6 1.174 [*] 1.382 1.200 1.199 1.038
(0,0.3) 99.8 99.4 99.9 99.2 99.2 94.7 1.173 [*] 1.380 1.201 1.199 1.033
(0,0.5) 99.9 99.5 99.9 99.5 99.5 95.2 1.175 [*] 1.384 1.202 1.200 1.033
(0,0.7) 99.9 99.4 99.9 99.5 99.5 95.1 1.174 [*] 1.383 1.198 1.199 1.029
(0,0.9) 99.9 99.5 99.9 99.5 99.5 95.5 1.174 [*] 1.383 1.201 1.199 1.038
(30,30) (0.1,0.1) (0,0) 98.8 98.3 97.7 98.3 98.4 90.8 1.037 [*] 0.986 1.046 1.044
(0,0.3) 98.8 98.4 97.7 98.3 98.2 90.8 1.037 [*] 0.986 1.047 1.045
(0,0.5) 98.9 98.6 97.9 98.3 98.4 91.0 1.037 [*] 0.986 1.047 1.045
(0,0.7) 98.7 98.2 97.7 98.1 98.0 90.5 1.037 [*] 0.987 1.047 1.045
(0,0.9) 98.8 98.4 98.0 98.5 98.4 91.6 1.036 [*] 0.986 1.046 1.045
(0.2,0.2) (0,0) 98.4 98.2 97.7 98.2 98.2 90.3 1.041 [*] 0.997 1.052 1.051
(0,0.3) 98.4 98.2 97.6 98.1 98.0 90.3 1.041 [*] 0.997 1.052 1.051
(0,0.5) 98.2 97.8 97.3 97.7 97.6 90.0 1.042 [*] 0.997 1.052 1.050
(0,0.7) 98.4 98.0 97.5 97.7 97.8 89.9 1.040 [*] 0.998 1.051 1.051
(0,0.9) 98.1 98.0 97.4 97.7 97.8 89.9 1.041 [*] 0.998 1.051 1.051
(0.4,0.4) (0,0) 98.1 97.8 97.3 97.4 97.3 89.6 1.058 [*] 1.042 1.070 1.071
(0,0.3) 98.2 97.8 97.6 97.7 97.7 90.3 1.060 [*] 1.042 1.072 1.072
(0,0.5) 98.2 97.7 97.3 97.4 97.4 89.9 1.058 [*] 1.042 1.072 1.071
(0,0.7) 98.4 97.7 97.4 97.5 97.5 89.5 1.059 [*] 1.041 1.071 1.070
(0,0.9) 98.2 97.7 97.2 97.4 97.3 89.8 1.058 [*] 1.041 1.070 1.071
(50,50) (0.1,0.1) (0,0) 97.1 96.8 95.1 96.7 96.8 90.1 1.020 [*] 0.936 1.028 1.027
(0,0.3) 97.3 96.9 95.3 97.0 96.8 91.0 1.020 [*] 0.935 1.028 1.027
(0,0.5) 97.7 97.2 95.3 97.1 97.1 90.5 1.019 [*] 0.934 1.027 1.026
(0,0.7) 97.4 97.1 95.2 97.0 96.9 90.9 1.019 [*] 0.935 1.027 1.026
(0,0.9) 96.8 96.4 94.8 96.2 96.3 89.9 1.020 [*] 0.936 1.028 1.027
(0.2,0.2) (0,0) 96.4 96.0 94.3 96.1 96.0 89.5 1.021 [*] 0.938 1.029 1.028
(0,0.3) 97.1 96.7 94.6 96.5 96.6 89.8 1.022 [*] 0.938 1.031 1.030
(0,0.5) 96.8 96.4 94.4 96.2 96.2 89.8 1.022 [*] 0.938 1.030 1.029
(0,0.7) 96.9 96.5 94.5 96.3 96.3 90.3 1.023 [*] 0.938 1.030 1.029
(0,0.9) 96.8 96.4 94.5 96.2 96.2 89.9 1.022 [*] 0.937 1.029 1.029
(0.4,0.4) (0,0) 96.6 96.3 94.4 96.0 95.9 89.1 1.029 [*] 0.954 1.038 1.038
(0,0.3) 96.8 96.4 94.4 96.0 96.1 88.9 1.029 [*] 0.954 1.038 1.038
(0,0.5) 96.9 96.4 94.6 95.9 96.2 89.3 1.029 [*] 0.954 1.038 1.038
(0,0.7) 96.7 96.0 94.3 95.7 95.8 89.2 1.029 [*] 0.955 1.038 1.038
(0,0.9) 97.0 96.3 94.7 96.1 96.1 89.2 1.030 [*] 0.955 1.039 1.038
(30,50) (0.1,0.1) (0,0) 98.4 98.0 96.8 97.8 97.7 90.8 1.029 [*] 0.965 1.042 1.041
(0,0.3) 98.3 98.1 97.0 97.8 97.9 90.5 1.029 [*] 0.965 1.042 1.041
(0,0.5) 98.5 98.1 97.0 97.7 97.7 91.2 1.030 [*] 0.965 1.043 1.041
(0,0.7) 98.2 97.8 96.5 97.5 97.4 89.9 1.029 [*] 0.964 1.042 1.040
(0,0.9) 98.1 97.8 96.7 97.7 97.7 90.0 1.030 [*] 0.964 1.042 1.041
(0.2,0.2) (0,0) 98.1 97.7 96.7 97.5 97.5 90.1 1.033 [*] 0.972 1.048 1.046
(0,0.3) 97.8 97.5 96.3 97.3 97.3 90.2 1.033 [*] 0.972 1.047 1.046
(0,0.5) 98.1 97.8 96.8 97.4 97.4 90.5 1.033 [*] 0.972 1.047 1.046
(0,0.7) 98.1 97.6 96.3 97.3 97.3 90.1 1.033 [*] 0.971 1.046 1.045
(0,0.9) 97.8 97.2 96.0 97.0 97.0 89.6 1.034 [*] 0.972 1.047 1.047
(0.4,0.4) (0,0) 97.4 97.1 96.3 96.7 96.9 89.6 1.046 [*] 1.003 1.063 1.062
(0,0.3) 97.6 97.3 96.4 96.8 96.8 89.7 1.046 [*] 1.004 1.063 1.062
(0,0.5) 98.2 97.4 96.6 97.2 97.1 89.5 1.046 [*] 1.003 1.062 1.062
(0,0.7) 97.5 97.1 95.9 96.6 96.6 88.7 1.046 [*] 1.004 1.063 1.062
(0,0.9) 97.6 96.9 96.0 96.6 96.7 88.5 1.046 [*] 1.004 1.063 1.063
(50,100) (0.1,0.1) (0,0) 96.7 96.3 94.1 96.4 96.2 91.6 1.016 [*] 0.921 1.026 1.025
(0,0.3) 96.7 96.5 94.3 96.3 96.3 92.3 1.016 [*] 0.922 1.026 1.025
(0,0.5) 96.7 96.5 93.9 96.3 96.1 92.0 1.015 [*] 0.921 1.026 1.025
(0,0.7) 96.5 96.3 93.8 96.0 96.0 92.0 1.016 [*] 0.922 1.026 1.025
(0,0.9) 97.1 96.7 94.2 96.6 96.5 92.1 1.016 [*] 0.921 1.025 1.025
(0.2,0.2) (0,0) 96.0 95.5 93.2 95.5 95.4 90.4 1.017 [*] 0.923 1.029 1.028
(0,0.3) 96.7 96.5 94.0 96.5 96.4 91.3 1.017 [*] 0.922 1.029 1.028
(0,0.5) 96.3 95.9 93.5 95.8 95.7 90.8 1.018 [*] 0.923 1.029 1.028
(0,0.7) 96.6 96.3 93.8 96.1 96.1 91.0 1.016 [*] 0.923 1.028 1.027
(0,0.9) 96.7 96.1 93.7 96.2 96.2 91.2 1.018 [*] 0.923 1.029 1.028
(0.4,0.4) (0,0) 96.1 95.5 93.2 95.4 95.4 89.5 1.023 [*] 0.932 1.035 1.035
(0,0.3) 96.4 95.8 93.0 95.8 95.7 89.3 1.023 [*] 0.933 1.036 1.036
(0,0.5) 96.3 95.6 93.2 95.7 95.6 89.3 1.023 [*] 0.933 1.036 1.036
(0,0.7) 96.3 96.0 93.6 96.0 95.9 90.1 1.023 [*] 0.932 1.035 1.036
(0,0.9) 96.2 95.8 93.7 95.8 95.9 89.4 1.024 [*] 0.933 1.036 1.037
DOI: 10.7717/peerj.8502/table-2

Notes:

[*]: HPD-Rul satisfies the criteria.

Bold denotes the best-performing CI.

An empirical application

Duangdai & Likasiri (2017) predicted the global temperature, and forest and seasonal rainfall amount in northern Thailand by various mathematical models. Their results indicate that during 1973–2008, the rainfall fluctuation during the rainy season was less than the summer rainfall, although the forest cover was higher in the summer than in the rainy season. Approximately 60% of the total arable land in the northeast has been turned over to rice cultivation (Nicely & Counselor, 2018). Importantly, both the northern and northeastern regions of Thailand are important agricultural areas where the planted rice is rain-fed and is the most valuable crop in the Lower Mekong Basin (Zhang et al., 2014). These findings led us to focus on the rainfall amounts in the northern and northeastern areas in Thailand, especially regarding rainfall variation in the rainy season. Importantly, a comparison of the rainfall in the northern and northeastern regions in terms of the ratio of variances was investigated and estimated with our proposed CIs. There were 272 substations in total for both regions to record rainfall measurements by the Thai Meteorological Center. The 272 rainfall observed values contained 40 of 62 (64.52%) and 145 of 210 (69.05%) positive records in the north (62 substations) and northeast (210 substations) areas during 2–8 July 2018, respectively. The remainder were zero observations for both sets.

To examine for normality, a Q–Q plot of the log-transformed positive rainfall amounts for the two areas are plotted in Fig. 1. Histograms also confirmed the fitted distributions of the northern and northeastern region rainfall records, as shown in Fig. 2. Moreover, Table 3 report the AIC results to check the fitted distribution of the positive rainfall observations for both areas. The results show that the positive rainfall for both had lognormal distributions, while the fact that the records contained zero values implies that both sets fit delta-lognormal distributions. The approximation of rainfall dispersion ratio between north and northeast areas is θ = −1.674 [ratio = exp(−1.674) = 0.187] where the variance (ω1, ω2) = (448.34, 2,390.93); the basic statistics are (n1, n2) = (62,210); (μ^1,μ^2)=(2.541,2.576); (σ^12,σ^22)=(0.886,1.576) and (δ^1,δ^2)=(0.355,0.309). From the results, the 95%CIs for exp (θ) given in Table 4 indicate that HPD-NG and MOVER were better for situation S1 [(μ12) = (3, 3); (σ12,σ22)=(1,1) and (δ12) = (0.3,0.3)].

Table 3:
Results of AIC for the nonzero rainfall records in north and northeast areas.
Regions AIC
Exponential Weibull Lognormal Normal t-distribution Cauchy Logistic
Northern 316.677 317.046 314.927 348.812 332.373 336.093 337.678
Northeastern 1,232.797 1,231.199 1,227.468 1,411.599 1,332.576 1,331.557 1,376.586

Note:

Bold denotes the lowest AIC.

Table 4:
95% CIs for the weekly rainfall ratio between north and northeast regions.
Methods 95% CIs for exp (θ) Length
Lower Upper
HPD-Jef 0.0455 0.9255 0.8800
HPD-Rul 0.0477 0.9239 0.8762
HPD-NG 0.0546 0.7793 0.7247
GCI 0.0464 0.9860 0.9396
FGCI 0.0652 1.1484 1.0832
MOVER 0.0512 0.5896 0.5384

As mentioned previously, the results can be interpreted as the rainfall variability in the northern region being less than the northeastern one during 2–8 July 2018, which implies that growing rice in the northeastern region was probably more affected than in the northern region because the former’s rain variation was larger. It is possible that this information could influence approximately 60% of the total arable area in the northeast. Note that Nicely & Counselor (2018) estimated 70% of rice production during the marketing year 2018–2019 cultivated under desirable weather conditions. This analysis might provide useful information to the Royal Thai Government to carry out effective water management. Our findings made a few realizations about natural disasters due to climate change during the rainy season in the northern and northeastern Thailand as well. The results are also in agreement with the simulation study ones in Table 1.

Discussion

The HPD-NG was proposed for constructing confidence intervals for comparing the rainfall dispersion in north and northeast regions. How to select the prior in this situation is that we found the CP performance of HPD depend on the posterior densities of σ2 and δ in the previous study. To our knowledge, we believed that μ and σ2 might be suitable for random variables of the normal and gamma distributions, respectively. This becomes the normal-gamma prior of (μ,σ2). For δ, it was motivated by Jeffreys’ prior, while a beta distribution was also developed and recommended by Jin, Thulin & Larsson (2017) so that beta (d,d) becomes a prior of δ in this study.

The difference between HPD-Jef and HPD-Rul is the posterior densities of σ2 and δ, although it can be seen from the results of the simulation study that HPD-Rul’s outcomes were agreement with Harvey & Van der Merwe (2012) when focusing on the ratio of delta-lognormal means. This implies that both HPD performances are dependent on the posterior of σ2 and δ. For the proposed HPD-NG, its prior of τ = (μ,λ); λ = 1/σ2 is the inverse of the Jeffreys’ prior, leading us to obtain the posterior distributions of μ and σ2 derived from the NG prior under the assumption that the mean μ has a normal distribution and the precision λ has a gamma distribution. After that, the posterior of σ2 has a inverse-gamma distribution with its parameters (αi,ni(1)i,ni(1)). Importantly, the parameter βi,ni(1) is different from the inverse-gamma based one on Jeffreys’ prior, resulting in a different posterior of μ as well. Likewise, the prior of δ was found and recommended by Jin, Thulin & Larsson (2017). For these reasons, the HPD-NG prior was developed to estimate the ratio between delta-lognormal variances. However, more research on this topic needs to be conduct to find an appropriate prior to obtain a better performance. Furthermore, MOVER could maintain its performance to satisfy the target criteria even with a small difference between the binomial proportions and small sample sizes. It is possible that an interval estimate for δ based on Wilson’s interval satisfies the criteria for small to moderate sample sizes, as confirmed by Donner & Zou (2011).

Conclusions

The purpose of this study was to develop CIs, namely HPD-NG and MOVER, for the ratio of delta-lognormal variances through Monte Carlo simulations. By way of comparison, both were examined to report their performance with the existing methods: HPD-Jef, HPD-Rul, GCI and FGCI. These proposed CIs were applied to estimate and compare the rainfall fluctuation between the northern and northeastern regions in Thailand in terms of the ratio of variances.

The findings of this study indicate that HPD-NG is the best and most recommended CI in situations S1 (with varied δ) for both equal and unequal medium sample sizes, and for a large difference between the binomial proportions for unequal large sample sizes. Both MOVER and HPD-Rul are the next best CIs recommended for a small difference between the binomial proportions with different sample sizes: MOVER for equal small sample sizes and HPD-Rul for unequal large sample sizes. For situation S2 where δ and μ are both varied, HPD-NG delivered the best performance for a small binomial proportion and both equal and unequal medium sample sizes as well as a small proportion of zeros for unequal large sample sizes. MOVER is also recommended for situations similar to situation S1, while HPD-Rul is recommended for a large proportion of zeros and unequal medium-to-large sample sizes. Although the CPs of HPD-Jef, GCI and FGCI results satisfied the criteria, their average lengths were wider than our proposed methods in all situations.

Note: In this paper, confidence intervals for the ratio variances of delta-lognormal are proposed, whereas, other submission proposed CIs for the difference between variances of delta-lognormal populations based on Herbert et al. (2011) and Krishnamoorthy, Lian & Mondal (2010). Although, the difference between them is that the former was considered to estimate the ratio of two delta-lognormal variances (ω12) using HPD-based normal gamma prior. The latter was presented HPDs based on other priors for the difference in delta-lognormal variances (ω1 − ω2) for large variational situations. According to Herbert et al. (2011) and Krishnamoorthy, Lian & Mondal (2010), the assessment of the range of distribution of (ω1 − ω2) can be estimated using CIs for (ω1 − ω2) such that it can be implied that there is the distributed range between (ω1 − ω2) and (ω12). Finally, both of random variables are used to check significantly difference in the two independent datasets. The next contrast between this submission and other one is the illustrated data that there are different dispersions and locations in each paper.

Supplemental Information

R code for all outputs in Tables 1–4.

DOI: 10.7717/peerj.8502/supp-1

Weekly rainfall records in northern and northeastern regions during 2–8 July 2018.

DOI: 10.7717/peerj.8502/supp-2
16 Citations   Views   Downloads