Heliyon (May 2024)
Linear modeling of zonal level crop production in Ethiopia
Abstract
Accounting for zonal-level variations and identifying factors that have linear effects on crop production help to make better decisions and plan new policies for effective crop production and food security. The main objective of this study is to identify potential subsets of covariates and estimate their linear effects on crop production. A linear mixed effects model (random--intercept) is used on agricultural sample survey data for Meher seasons from 2012/13 to 2019/20 to explore and identify the best variance-covariance structure for the longitudinal data on 90 zones with eight repeated observations and different sampling weights. The minimum, mean, and maximum crop production by farmers across the country are 1.616, 8.693, and 147.843 quintals, respectively, and about 98 % of farmers produced less than 25 quintals. There is a small rate of increase in mean and median crop production by farmers across the years, and the variability between zones is highest in the year 2019/20 and in the Somali region. The histogram, kernel density, and P–P plots suggested a common logarithm transformation on the crop production variable. Results from the data exploration and variance-covariance structure selection methods suggested a heterogeneous compound symmetry (CSH) structure. Covariates region, year, proportion of farmers who practice pure-agriculture and other-agriculture types, proportion of farmers who use any type of fertilizer, farmer's age, area used, farmer association crop production, indigenous seed used, improved seed used, UREA fertilizer used, other fertilizers used, and percentage of crop damaged are significant in linearly explaining/affecting log crop production, and among these area used, farmers association crop production, UREA fertilizer used, and indigenous seed used have relatively highest effect on log crop production. Zones Wolayita, North-Shewa (Am), West-Arsi, West-Welega, Dawro, and Guji are top/good performers while zones Southwest-Shewa, Waghimra, Guraghe, South-Omo, Keffa, North-Wello, South-Wello, and Eastern Tigray are bottom/poor performers in crop production. Model assumptions and influence diagnostics results suggested the linearity of the model and normality of random effects and residuals are not violated, even though some zones have influences on either model parameters, precisions of estimates of these parameters, and predicted values.