Comparison between data mining methods to assess calving diﬃculty in cattle

Zaborski*, Daniel; Proskura, Witold S; Grzesiak, Wilhelm; Zaborski*, Daniel; Proskura, Witold S; Grzesiak, Wilhelm

doi:10.17533/udea.rccp.v30n3a03

Services on Demand

Journal

Article

Indicators

Cited by SciELO
Access statistics

Revista Colombiana de Ciencias Pecuarias

Print version ISSN 0120-0690

Rev Colom Cienc Pecua vol.30 no.3 Medellín July/Sept. 2017

https://doi.org/10.17533/udea.rccp.v30n3a03

Original Articles

Comparison between data mining methods to assess calving diﬃculty in cattle^¹

Comparación entre métodos de minería de datos para evaluar la dificultad al parto en ganado

Comparação entre métodos de mineração de dados para avaliar a dificuldade no parto em bovinos

Daniel Zaborski*^*¹

Witold S Proskura¹

Wilhelm Grzesiak¹

¹Department of Ruminants Science, West Pomeranian University of Technology, Szczecin, Poland.

Abstract

Background:

Dystocia in cattle results in adverse consequences (increased calf morbidity and mortality, decreased fertility, and milk production, lower cow survival and reduced welfare) leading to considerable economic losses.

Objective:

To classify calvings in dairy cattle according to their diﬃculty using selected data mining methods (classiﬁcation and regression trees (CART), chi-square automatic interaction detection trees (CHAID) and quick, unbiased, eﬃcient, statistical trees (QUEST)), and to identify the most signiﬁcant factors aﬀecting calving diﬃculty. The results of data mining methods were compared with those of a more traditional generalized linear model (GLM).

Methods:

A total of 1,342 calving records of Polish Holstein- Friesian black-and-white heifers from four farms were used. Calving diﬃculty was divided into three categories (easy, moderate and diﬃcult).

Results:

The percentages of calvings correctly classiﬁed by CART, CHAID, QUEST, and GLM were as follows: 35.14, 18.92, 19.82, and 43.24% (easy), 68.70, 73.91, 81.74, and 41.74%

(moderate), and 77.27, 85.45, 73.64, and 81.82% (diﬃcult), respectively. The most important factors aﬀecting calving diﬃculty were bull’s rank (based on the mean calving diﬃculty score of its daughters), calving age, farm category (based on its mean milk yield) and calving season.

Conclusion:

All classiﬁcation models were satisfactory and could predict the class of calving diﬃculty.

Keywords: classification; dairy heifers; decision support systems; dystocia; electronic learning

Resumen

Antecedentes:

La distocia en el ganado resulta en consecuencias adversas (elevadas morbilidad y mortalidad de terneros, reducida fertilidad y producción de leche, menor supervivencia y bienestar de las vacas) que conllevan a pérdidas económicas considerables.

Objetivo:

Clasiﬁcar los partos del ganado lechero en función de su grado de diﬁcultad a través de métodos seleccionados de minería de datos (árboles de clasiﬁcación y de regresión (CART), detección automática de interacción chi-cuadrado (CHAID) y árboles estadísticos no sesgados y eﬁcientes (QUEST)) e identiﬁcar los factores más característicos de diﬁcultad al parto. Los resultados de los métodos de minería de datos se compararon con los del modelo lineal generalizado tradicional (GLM).

Métodos:

Se utilizaron 1.342 registros de parto de novillas de raza polaca Holstein-Friesian blanca y negra de cuatro explotaciones lecheras. La diﬁcultad de parto del ganado se dividió en tres categorías (fácil, moderado y difícil).

Resultados:

El porcentaje de partos correctamente clasiﬁcados por CART, CHAID, QUEST y GLM fue 35,14, 18,92, 19,82 y 43,24% (fácil), 68,70, 73,91, 81,74 y 41,74% (moderado), y 77,27, 85,45, 73,64

y 81,82% (difícil), respectivamente. Los factores más importantes de diﬁcultad de parto fueron el rango de toro (determinado sobre la base de diﬁcultad media de los partos de sus hijas), la edad al parto, la categoría de las ﬁncas (sobre la base del rendimiento medio de leche) y la temporada de parto.

Conclusión:

Todos los modelos de clasiﬁcación se caracterizaron como satisfactorios y podrían predecir la clase de diﬁcultad al parto.

Palabras clave: aprendizaje electrónico; clasificación; distocia; novillas lecheras; sistemas de soporte de decisiones

Resumo

Antecedentes:

A distócia em bovinos resulta em consequências adversas (aumento da morbidade e mortalidade dos bezerros, diminuição da fertilidade e da produção de leite, baixa sobrevivência da vaca e redução do bem-estar) levando a consideráveis perdas econômicas.

Objetivo:

Classiﬁcar os partos do gado leiteiro segundo o seu grau de diﬁculdade através dos métodos selecionados de data mining (árvores de classiﬁcação e regressão (CART), detecção automática de interação chi-quadrado (CHAID) e ârvores estatísticas eﬁcientes e rápidas e imparciais (QUEST)) e identiﬁcar os fatores mais importantes para a diﬁculdade nos partos. Os resultados dos métodos de data mining foram comparados com os resultados do modelo lineal generalizado (GLM) mais convencional.

Métodos:

Foram utilizados 1.342 registos de partos de novilhas da raça polaca Holstein-Frísia branca e preta de quatro fazendas. A diﬁculdade em um parto foi dividida em três categorias (fácil, média, difícil).

Resultados:

A percentagem de partos corretamente classiﬁcados através de CART, CHAID, QUEST e GLM foram de 35,14, 18,92, 19,82 e 43,24% (fácil), 68,70, 73,91, 81,74 e 41,74% (média) e 77,27, 85,45, 73,64 e 81,82% (difícil), respetivamente. Os fatores mais importantes de diﬁculdade no parto foram a classiﬁcação do touro (determinada com base na diﬁculdade média nos partos de suas ﬁlhas), a idade no momento de parto, a categoria de exploração leiteira (com base no rendimento médio de leite) e a temporada de parto.

Conclusão:

Todos os modelos de classiﬁcação destacaram-se por sua qualidade satisfatória e foram capazes de prever a categoria de diﬁculdade de um parto.

Palavras chave: aprendizagem electrónica; classificação; distócia; novilhas leiteiras; sistemas de apoio à decisão

Introduction

Dystocia in cattle results in many adverse consequences for the dam and its oﬀspring (^{Azizzadeh et al., 2012}; ^{Barrier et al., 2012}). These include increased calf morbidity and mortality, decreased fertility and milk production, low cow survival and reduced welfare (^{Mee et al., 2011}). There are also many direct and indirect factors aﬀecting the incidence of dystocia in cattle. The ﬁrst group comprises feto-pelvic disproportion, fetal malposition, vulvar or cervical stenosis, and uterine torsion, whereas the second group includes dam’s age at calving, gestation length, parity, body weight, and condition at service and calving, calf sex, sire, breed and strain, feeding, and climate, etc. (Mee, 2008). In order to prevent the occurrence of dystocia and alleviate its negative eﬀects, it would be desirable to develop prognostic methods capable of indicating animals with potential problems at calving, based on the above-mentione risk factors. One such approach involves the use of statistical methods, especially those from the ﬁeld of data mining. There are numerous data mining algorithms, some of which have already been applied to animal farming (^{Piwczyński et al., 2013}). Decision trees, belonging to this group of algorithms, are characterized by a relatively easy interpretation and implementation. However, each type of algorithm has some unique features which make it better or worse suited for certain tasks. Thus, it is advisable to compare the eﬀectiveness of several such methods in solving a given problem.

Therefore, the ﬁrst aim of our study was to classify calving diﬃculty in dairy heifers using three diﬀerent types of decision trees [classiﬁcation and regression trees (CART), chi-square automatic interaction detection trees (CHAID), and quick, unbiased, eﬃcient, statistical trees (QUEST)], and to compare the results of this classiﬁcation with those of a more traditional statistical method (i.e. a generalized linear model; GLM). The second aim was to identify the most signiﬁcant factors aﬀecting calving course.

Materials and methods

Ethical considerations

Since our study involved only the analysis of information records routinely collected on a farm by the farm management software (sire identiﬁcation number, farm number, calf sex, calving age, calving season, and calving diﬃculty score), the approval of the Local Ethics Committee on Animal Experimentation was not necessary.

Animals

A total of 1,342 calving records of Polish Holstein- Friesian black-and-white heifers from four farms located in the West Pomeranian Province were used for analysis. The records were collected between 2002 and 2013. The late-gestation heifers were housed under similar conditions on all four farms. They were moved to calving pens approximately two weeks before calving, where they remained until the end of the colostrum-feeding period. A single straw-bedded pen could accommodate two animals. Heifers were fed according to standard requirements. The calves were moved to the igloo boxes after being licked by their dams, so they did not stay with the heifers after calving. Subsequently, the heifers were included in the primipara group.

Data acquisition and editing

The original dataset comprised 1,656 calving records primarily obtained from the farm documentation via a National Milk Recording Scheme SYMLEK, but was subsequently reduced after editing for erroneous or incomplete data as well as outliers. A total of 314 (approximately 19%) records were removed from the initial dataset mainly because of their incompleteness (lack of values for the independent variables). Some records contained obvious errors, however, their correction was impossible and they were also removed from the dataset. Moreover, data were checked for the presence of outliers using the two-sided Tukey method (i.e. records with the values of the independent variables exceeding ± 1.5 x interquartile range -IQR- were deleted from the dataset). Each calving record consisted of the two continuous and three categorical predictors: X₁ - SIRE - the rank of the heifer’s sire (the bull that sired the heifer) determined based on the mean calving diﬃculty scores of its daughters (expressed as an ordinal variable with a rank of 1 indicating the sire with the easiest calvings); X₂ -CALA- heifer’s calving age (in months); X₃ -FARM- the category of the farm where the heifer was kept determined based on the farm average milk yield using the k-means clustering method (below 10,200 Kg milk -POOR or equal to or above 10,200 Kg milk -GOOD); X₄ -SEX- calf sex (only male or female, twins, and triplets were excluded from the analysis due to their low frequency of occurrence); X₅ -CALS- calving season with two categories (autumn-winter from October to March -AW and spring-summer from April to September-SS). The sire’s rank (SIRE) was derived in the following way: The daughters of each sire from each of the four farms were ﬁrst identiﬁed; then, their original calving diﬃculty scores were averaged; next, the sires were ordered according to an increasing mean calving diﬃculty score and the ranks were assigned on this basis (with a rank of 1 indicating the sire with the easiest calvings, and a rank of 107 indicating the sire with the most diﬃcult calving).

The dependent variable [calving diﬃculty (DIF)] was a calving diﬃculty category (easy, moderate, and diﬃcult). Originally, calvings were scored by experienced animal scientists employed on the farms on a ﬁve- (before 2006) or six-points (since 2006) scale, which was subsequently converted to an ordinal one with three levels: easy -an easy, spontaneous calving without any help from man; moderate-a calving requiring help from man or the use of mechanical equipment; diﬃcult -a calving requiring much more force than usual or veterinary intervention (including cesarean section and embryotomy) leading to damage to the dam or the calf. Abortions were excluded from the analysis.

The means and standard deviations of continuous independent variables are reported in Table 1 and the distributions of categorical variables are presented in Table 2. The whole data set of calving records (1,342) was partitioned into a training set (L) of 1,006 records (for preparing the CART, CHAID, QUEST, and GLM models) and a test set (T) of 336 records (for their veriﬁcation on new data, not used previously during model construction).

Model construction and evaluation

Of the numerous data mining algorithms, decision trees are characterized by a relatively fast construction process and easy interpretation of a ﬁnal model (^{Witten et al., 2011}). They are based on a “divide-and-conquer” approach to the problem of learning from a set of independent observations (cases). Individual nodes within the tree test particular attributes (predictors

Table 1 Means and standard deviations of continuous independent variables.

1Calving age. 2Sire’s rank based on the mean calving diﬃculty scores of its daughters (without units). SD: Standard deviation

Table 2 Distributions of categorical variables.

1Category of the farm where the heifer was kept based on its average milk yield (POOR: <10,200 Kg, GOOD: ≥ 10,200 Kg). 2Calving season. 3Calf sex. 4Calving diﬃculty

or independent variables), whereas terminal nodes (called “leaves”) indicate the class to which each observation reaching this node belongs (^{Witten et al., 2011}). In our study, three diﬀerent types of decision trees [classiﬁcation and regression trees (CART), chi- square automatic interaction detection (CHAID), and quick, unbiased, eﬃcient, statistical trees (QUEST)] were applied. The CART algorithm builds binary trees (with each parent node split into two child nodes) by the iterative checking of all possible values of the independent variables (predictors) in order to identify the one on which the split in a parent node will be based (the so-called splitter) as well as the cut-oﬀ point for the split so that the resulting child nodes contain the groups of cases as homogeneous as possible (^{Speybroeck, 2011}). The process is repeated until it is no longer possible to make additional splits, but the tree obtained in this way is frequently too complex and overﬁt to the training data and must be reduced in the so-called “pruning” step (^{Moisen, 2008}). In the case of CHAID, the splits are not limited to binary ones and the chi-square test is used to determine the best split at each stage of tree growing. Moreover, CHAID stops adding new nodes before overﬁtting occurs and makes direct use of only categorical independent variables so continuous (numerical) variables are ﬁrst discretized into separate intervals (^{Chang, 2007}). Finally, QUEST generates binary trees by merging classes into two groups before splitting and using quadratic discriminant analysis to determine the best split. As a result, two potential splitting points are obtained, from which the one closer to the mean value of the analyzed variable in a population of vectors belonging to one of the clusters is selected (^{Loh and Shih, 1997}).

In the development of CART, equal costs of misclassiﬁcation and the Gini index as a measure of node impurity were used. The a priori probability of class membership was estimated from the training sample. The stop criterion was the minimization of misclassification error with a minimal node size of 134 cases. Moreover, 10-fold cross-validation was used to ﬁnd the best tree structure understood as a compromise between the tree complexity and its quality. In the construction of the CHAID trees, a modiﬁcation of the standard algorithm was applied (i.e. exhaustive CHAID), which conducts a more thorough search for the predictor that yields the most signiﬁcant split (i.e. the merging of predictor categories is carried out until only two categories remain; ^{Hill and Lewicki, 2006}). When growing the exhaustive CHAID tree, the misclassiﬁcation costs and the minimal tree node size were like in the CART analysis, whereas the p-value for splitting was equal to 0.05. Moreover, the Bonferroni adjustment and the 10-fold cross validation were applied to ﬁnd the best model. The parameters for the last analyzed tree algorithm (QUEST) included: The apriori probability estimated from the training sample, equal costs of misclassification, minimization of misclassiﬁcation error as a stop criterion (minimal leaf size equal to 5, standard error rule equal to 1.0), the 10-fold cross-validation, and the p-value for split variable selection equal to 0.05.

Finally, the GLM model with an ordinal multinomial distribution for the dependent variable (calving diﬃculty score) and a logit link function was applied according to the following formula:

Where:

_^Yi = is the ith observation of the dependent variable (calving diﬃculty score).

j = is the calving category (easy, moderate, or diﬃcult).

_^θj = is the intercept for the _^jth category.

_^xi = is a vector of explanatory variables for the _^ith

observation.

β = is the corresponding set of regression parameters.

To assess the goodness of ﬁt of GLM, the deviance statistic (D) was calculated:

Where:

_^Lm = is the maximized log-likelihood for a given model.

_^Ls = is the log-likelihood for the saturated model (i.e. the most complex model for the selected distribution of the dependent variable and a link function).

The assumptions of GLM were also tested (i.e. the normal distribution of residuals, the lack of predictor collinearity, and outliers).

After growing the trees and estimating the GLM parameters, their classiﬁcation quality was evaluated on the L set. The proportions of correctly classiﬁed calvings from each of the three distinguished categories (easy, moderate, and difficult) as well as overall accuracy (the proportion of correctly classiﬁed cases from all classes) were calculated and the diﬀerences in these proportions were tested for statistical signiﬁcance using the McNemar test for dependent samples with the Bonferroni correction for multiple comparisons. Statistical signiﬁcance was set at p ≤ 0.05. Moreover, all types of models were veriﬁed on the independent T set to evaluate their ability to correctly predict calving diﬃculty class during their potential practical application. The proportions of correct classiﬁcations on the T set were again compared with the test for proportions. It should be added that the learning (or training) set (L) was used to build and train the tree models and to estimate the GLM parameters, whereas the test set (T) comprising new data (calvings), not seen previously by the models during their development, was used to verify their predictive capabilities. This results from the fact that the post hoc prediction is almost always too optimistic since the models are veriﬁed on the same data that were used for their construction. Consequently, a new data subset (the test set) separated from the whole dataset of records is necessary to objectively assess the a priori predictive performance of the model.

To complement the analysis of model performance, the cumulative gains charts were also plotted (based on the test set) to show the relationship between the gains (deﬁned as a proportion of correctly classiﬁed cases out of all the cases in the population belonging to a given category) and the considered sample size for the three types of classiﬁcation trees and GLM (^{Nisbet et al., 2009}). Model construction and evaluation was performed using Statistica 10 software (StatSoft Inc., Tulsa, OK, USA).

Identification of the most influential factors aﬀecting calving diﬃculty

At the last stage of our study, the most inﬂuential factors aﬀecting calving diﬃculty were identiﬁed based on the “importance analysis” available for the tree models and the Wald statistic for GLM.

Results

Model structure and evaluation

The layouts of the CART, CHAID, and QUEST trees are presented in Figures 1-3 and the estimated parameters of the GLM model are shown in Table 3. The value of the ratio of the deviance statistic to its respective degrees of freedom was 0.92. However, it should also be mentioned that not all the GLM assumptions were fulﬁlled. It was characterized by a signiﬁcant deviation from the normal distribution of residuals veriﬁed by the Shapiro-Wilk test (p ≤ 0.05).

Classiﬁcation results obtained on the L set using the four models are shown in Table 4. The only statistically signiﬁcant diﬀerence in accuracy on the L set existed between CART (61.53%) and GLM (57.26%). After the quality evaluation of the models, their predictive performance was veriﬁed on the independent T set. The diﬀerences in proportions observed on the L set were generally conﬁrmed on the T test (Table 4). No signiﬁcant diﬀerences in accuracy were recorded on the T test.

Finally, the cumulative gains charts plotted based on the T set are shown in Figure 4.

Identification of the most influential factors aﬀecting calving diﬃculty

The importance of individual factors aﬀecting the course of parturition identiﬁed by the tree models is presented in Figure 5 and the statistically signiﬁcant eﬀects for GLM are shown in Table 3.

Discussion

In the case of the CART and CHAID trees, the ﬁrst split was based on either the SIRE or FARM variable. The SIRE was also used for the ﬁrst split in the QUEST tree (Figures 1-3).

In the study by ^{Piwczyński et al. (2013}) on the use of CART and CHAID for the analysis of signiﬁcant predictors of calving diﬃculty in Polish Holstein- Friesian black-and-white cows, the ﬁrst division of the whole data set in the root node was based on lactation number. The two subsequent divisions were based on calf birth weight and the third one on pregnancy length and this variable was used for splitting twice (at the threshold values of 282 and 284 days, respectively). In the above-mentioned study, the last considered splitting variable was management system. It should be noted that although the CART and CHAID trees in our study and that by Piwczyński et al. (2013) utilized a similar set of independent variables, the ﬁnal structure of the resulting decision trees was somewhat diﬀerent. Obviously, some factors described by Piwczyński et al. (2013; such as lactation number) were not available in our study, which included only heifers.

Figure 1 Classiﬁcation and regression tree (CART) model for the classiﬁcation of calving. SIRE: Sire’s rank based on the mean calving diﬃculty scores of its daughters. FARM: Category of the farm where the animal was kept based on its mean milk yield (POOR: <10,200 Kg, GOOD: ≥ 10,200 Kg). Node labels are assigned according to the most numerous category.

Figure 2 Chi-square automatic interaction detection (CHAID) model for the classiﬁcation of calving. SIRE: Sire’s rank based on the mean calving diﬃculty scores of its daughters. FARM: Category of the farm where the animal was kept based on its mean milk yield (POOR: <10,200 Kg, GOOD: ≥ 10,200 Kg). CALS: Calving season (AW - autumn-winter, SS - spring-summer). Node labels are assigned according to the most numerous category.

Figure 3 Quick, unbiased, eﬃcient, statistical trees (QUEST) model for the classiﬁcation of calving (cases satisfying the splitting condition in a parent node go to its left child node). SIRE: Sire’s rank based on the mean calving diﬃculty scores of its daughters. FARM: Category of the farm where the animal was kept based on its mean milk yield (POOR: <10,200 Kg, GOOD: ≥ 10,200 Kg). Node labels are assigned according to the most numerous category.

Table 3 Estimated parameters of the generalized linear model (GLM).

1 Calving age. 2 Sire’s rank based on the mean calving diﬃculty scores of its daughters. 3 Category of the farm where the animal was kept based on its mean milk yield (POOR: <10,200 Kg, GOOD: ≥ 10,200 Kg). 4 Calving season. 5 Calf sex. Variables with p-values less than 0.05 are marked in bold

In the case of GLM, the value of the applied goodness-of-fit criterion (i.e. the deviance statistic relative to its degrees of freedom; 0.92) testiﬁed to the good overall quality of the constructed GLM model as the values of approximately 1.0 are considered to show a good ﬁt of the model to the training data (McCullagh and Nelder, 1989). However, since not all the assumptions of the GLM model (in principle required) were met, its application in some situations may not be fully recommended from a purely statistical point of view.

Table 4 Proportions of correctly classiﬁed calvings on the training and test sets.

a-d Values marked with diﬀerent superscript letters within a column (and a set) diﬀer signiﬁcantly (p ≤ 0.05). 1 Classiﬁcation and regression trees. 2 Chi-square automatic interaction detection. 3 Quick, unbiased, eﬃcient, statistical trees. 4 Generalized linear model. 5 Accuracy: Proportion of correctly classiﬁed cases from all classes.

Figure 4 Gains chart for individual calving categories: A= easy, B = moderate, C = diﬃcult. CART: Classiﬁcation and regression trees. CHAID: Chi-square automatic interaction detection. QUEST: Quick, unbiased, eﬃcient, statistical trees. GLM: Generalized linear model.

Figure 5 The importance of individual predictors of calving diﬃculty for the tree models. SIRE: Sire’s rank based on the mean calving diﬃculty scores of its daughters. FARM: Category of the farm based on its mean milk yield. CALA: Calving age. CALS: Calving season. SEX: Calf sex. CART: Classiﬁcation and regression trees. CHAID: Chi-square automatic interaction detection. QUEST: Quick, unbiased, eﬃcient, statistical trees.

As far as the model quality evaluated on the L set is concerned, CART and GLM were most eﬀective in classifying easy calvings (44.51 and 47.56% correctly indicated easy cases, respectively), whereas QUEST was most eﬃcient in predicting moderate calvings (81.99%). The CART and CHAID were also quite eﬀective in this respect (68.42 and 66.20%, respectively) compared with GLM, for which the proportion of correctly indicated moderate cases was the lowest (49.31%). The greatest number of diﬃcult calvings was properly classiﬁed by CHAID and GLM (82.02 and 76.34%, respectively), while the diagnosis made by CART and QUEST was signiﬁcantly less accurate (71.29 and 66.88%, respectively). In this context, CHAID and GLM would be preferable under conditions in which the highest dystocia detection rate is the priority. However, GLM was also able to properly indicate most easy calvings, which is advantageous from the farmer’s point of view, since the number of false alarms generated by the model would be the lowest in this case.

In general, the accuracy obtained on the L set for the three distinguished categories of calving course (approximately 60%) in our study was moderate (Table 4). It was similar to that (61.50%) reported by ^{Piwczyński et al. (2013}), who established four different categories of calving difficulty. It was also comparable to the accuracy (50 to 60.20%) recorded by ^{Johnson et al. (1988}), who studied the possibility of dystocia detection (with the ﬁve classes of calving diﬃculty) in Hereford heifers using discriminant function analysis. The accuracy reported by the aforementioned authors depended on the set of predictors included in the forecasting model and increased to approximately 85.50% with only two classes of calving ease. With the same number of distinguished delivery classes (dystocia vs eutocia), ^{Arthur et al. (2000}) obtained very similar accuracy (85.20 to 91.70%) using the same method as above for dystocia diagnosis in Angus heifers. This value was much higher than that in our study (Table 4), where the three classes of calving diﬃculty were considered. This comparison between different model types shows that the ﬁnal classiﬁcation accuracy depends to some extent on the number of categories of the dependent variable. The division into only two classes usually yields better results in terms of the number of correct classiﬁcations, but such a model loses some information on the possible calving course. Taking into account the real number of calving diﬃculty categories distinguished by the oﬃcial recording scheme in our country (which is six at present), an attempt was made in our study to more accurately indicate calving class.

After evaluating model quality, the predictive performance of individual decision trees and GLM was objectively veriﬁed on the independent T set, which was not used during the tree growing and GLM estimation stage and which could show the real ability of the models to properly predict calving categories during their potential practical application. The results obtained earlier on the L set were generally conﬁrmed on the T set. And so, GLM and CART were most accurate in predicting easy calvings (43.24 and 35.14%, respectively), whereas CHAID and QUEST were the most eﬀective classiﬁers for the moderate calvings (73.91 and 81.74%, respectively). The lowest ability to correctly indicate moderate calvings was exhibited by GLM (41.74%). The highest proportion of diﬃcult cases was properly diagnosed by CHAID and GLM (85.45 and 81.82%, respectively), whereas CART and QUEST were signiﬁcantly less successful in classifying this type of calvings (77.27 and 73.64%, respectively). In general, the accuracy on the T set in our study was moderate (Table 4), and it was approximately 10 to 30% lower than the values (72.60 to 90.30%) reported by ^{Arthur et al. (2000}), who investigated dystocia detection in Angus heifers. However, the better results presented by Arthur et al. (2000) can partially be attributed to the lower number of calving classes. Moreover, although the prediction models based on discriminant function analysis could accurately predict normal calvings (speciﬁcity in the range of 72.60 to 90.30%), their ability to properly predict dystocia in Angus heifers was much lower (sensitivity ranging from 0 to 40.00%). Finally, it was not possible to compare the results obtained on the independent test set in our study with those of ^{Piwczyński et al. (2013}) and ^{Johnson et al. (1988}) because they did not report the outcomes of the validation procedure.

On the other hand, a high proportion of correct classiﬁcations of dystocia cases (the diﬃcult class) in the heifer T data set (73.64 to 85.45%) in our study is especially noteworthy. This may make it possible for a farmer or herd manager to undertake appropriate measures in order to prevent adverse consequences of dystocia in a heifer. It is also important to consider that models with high sensitivity would be preferred under ﬁeld conditions as the misclassiﬁcation of an easy calving by the model is not so costly (additional labor associated with cow watching) as the misdiagnosis in the opposite direction (missing a dystocia case). However, it is also desired for the model to have possibly high speciﬁcity, as a large number of false alarms are troublesome for the farmer and decrease his trust in the system. We would also like to emphasize that the percentage of correctly diagnosed moderate calvings (i.e. those requiring help from man or the use of mechanical equipment) by decision trees in our study was relatively high (68.70 to 81.74%). In this respect, data mining models in the form of classiﬁcation trees turned out to be superior to GLM, for which this proportion was the lowest (41.74%).

Finally, the shape of the curves plotted on the cumulative gains charts revealed the relatively good performance of all the classiﬁers investigated (Figure 4). The closer the curve approaches the upper left corner of the graph [the (0, 1) point], the better the discriminative power of the model is. As can be seen in Figure 4, QUEST and GLM were characterized by slightly lower gains than CART and CHAID for easy calvings, but QUEST generated somewhat higher gains for moderate deliveries, for which GLM presented the worst results. However, the gains produced by all the classiﬁers were greatest for the diﬃcult category. Of the data mining models (three diﬀerent types of decision trees) used in our study, the best predictive performance was in general characteristic of CHAID, although it should be emphasized that there were not any signiﬁcant diﬀerences in the accuracy on the T set. Nevertheless, CHAID exhibited the highest proportion of correctly predicted diﬃcult calvings (dystocia) in heifers at a relatively large number of properly diagnosed moderate deliveries. Only its ability to accurately indicate easy calvings was lower (only approximately one-ﬁfth of all cases), which needs to be considered by the farmer if such a model is implemented in a farm.

The comparison of the data mining algorithms with a more traditional statistical method (i.e. the GLM model, used as a reference in our study) showed that both types of classiﬁers yielded comparable results. Somewhat larger differences were found for the easy and moderate category, but the overall accuracy was also very similar. Therefore, it is not possible to explicitly conﬁrm the superiority of data mining models (in the form of decision trees) over more traditional statistical techniques (GLM in this case) based on the prediction results of our study. However, parametric methods such as GLM require the fulﬁllment of various assumptions, from which not all were met in our study. Moreover, the structure of the classiﬁcation trees is more easily interpretable (even by non-experts) than the coeﬃcients of the GLM model, which facilitates the understanding of the investigated relationships between diﬀerent factors and calving course.

The second stage of our study was the identiﬁcation of the most influential factors affecting calving diﬃculty. The most important predictor for all the three decision tree types was SIRE. Also, CALA and FARM were found to considerably aﬀect the category of calving diﬃculty. In the case of GLM, the only signiﬁcant eﬀects were SIRE, FARM, and CALS.

The rank of the dam’s sire was based on the mean calving diﬃculty score of its daughters. The goal of including this predictor in the tree models and GLM was to take into account the genetic component of dystocia represented by the dam’s sire eﬀect. Although, it is not possible to directly include a sire eﬀect in the prediction model, it can be incorporated into it in a more general form (e.g. a rank), which orders sires according to the calving diﬃculty level experienced by their daughters. In a recent study by ^{Mee et al. (2011}), it was found that the relationship between predicted transmitting ability for maternal calving diﬃculty and the probability of assisted parturition depended on dam parity and calf sex. It was stronger for lower parities and male sex calves.

The next important factor was the category of the farm where the heifer was kept (FARM; Figure 5). As can be seen from Figures 1-3, the POOR category was associated with a markedly higher number of diﬃcult calvings. This relationship could have resulted from the worse husbandry conditions on the farm, including poorer control of diﬃcult calvings. However, this result is not entirely consistent with that reported by ^{Gröhn et al. (1990}), who investigated diﬀerent factors aﬀecting reproductive disorders in Finnish Ayrshires, and found that higher herd milk yield in the current lactation was associated with an increased risk of dystocia. On the other hand, the only herd-level factor included in the analysis of dystocia incidence in Irish Holstein-Friesians (^{Mee et al., 2011}; i.e. herd size), did not signiﬁcantly aﬀect the frequency of diﬃcult parturitions.

The last important predictor for decision trees was calving age (CALA; Figure 5). The greatest diﬀerence in dystocia occurrence is found between heifers and cows (^{Norman et al., 2010}; ^{Atashi et al., 2012}). Generally speaking, the optimal age at ﬁrst calving in dairy heifers is 22 to 24 months (^{Ghavi Hossein-Zadeh, 2013}), although a recent study on seasonally calving Holstein-Friesian heifers (^{Berry and Cromie, 2009}) suggested that this age should be 25 to 27 months with respect to calving ease. In the cited study, heifers calving at the age of 22 months had a higher risk of calving assistance than those calving at 24 months of age, whereas heifers calving at 25 to 27 and 35 months of age had a lower risk of such assistance compared with the animals calving at 24 months of age. Also, body weight at breeding or calving may aﬀect calving diﬃculty. It can be even a better predictor of dystocia than age at ﬁrst calving itself, but it is much more diﬃcult to be consistently recorded. As a result of greater growth rates, heifers currently calve for the ﬁrst time relatively earlier but with a high body weight. Consequently, calvings in such heifers are usually easier compared with those of their lighter herdmates. Finally, it should be emphasized that some authors (^{Hickey et al., 2007}; ^{Bazzi, 2010}; ^{Yıldız et al., 2011}) did not conﬁrm any signiﬁcant relationship between calving age and diﬃculty.

The other signiﬁcant predictor of calving diﬃculty identiﬁed by the GLM model was also calving season (CALS). It is generally considered that under European climatic conditions, calvings occurring in autumn and winter tend to be more diﬃcult than those in the spring-summer season, which may result from increased gestation length, calf birth weight, and stillbirth rate in the colder season and less intensive supervision of calvings and more physical exercises in summer (^{Mee et al., 2011}).

In conclusion, the tree classiﬁcation models obtained in our study showed promise in predicting individual classes of calving diﬃculty in dairy heifers; however, their further improvement would be necessary to obtain better accuracy. The most inﬂuential factors aﬀecting diﬃculty level included: The rank of the dam’s sire, calving age, and the yield category of a farm and calving season. Our study showed that decision trees (after improvement of their predictive performance) could be potentially applied as an accessory tool to aid farmer in making decisions concerning calving management, especially considering that the created rules are relatively simple and easily interpretable.

Acknowledgements

This work was supported by the Polish Ministry of Science and Higher Education (grant number 517- 01-028-3962/17).

Conﬂict of interest

The authors declare they have no conﬂicts of interest with regard to the work presented in this report

References

Arthur PF, Archer JA, Melville GJ. Factors inﬂuencing dystocia and prediction of dystocia in Angus heifers selected for yearling growth rate. Aust J Agric Res 2000; 51:147-154. [ Links ]

Atashi H, Zamiri MJ, Sayadnejad MB. The eﬀect of maternal inbreeding on incidence of twinning, dystocia, and stillbirth in Holstein cows of Iran. Iran J Vet Res 2012; 13:93-99. [ Links ]

Azizzadeh M, Shooroki HF, Kamalabadi AS, Stevenson MA. Factors aﬀecting calf mortality in Iranian Holstein dairy herds. Prev Vet Med 2012; 104:335-340. [ Links ]

Barrier AC, Ruelle E, Haskell MJ, Dwyer CM. Eﬀect of a diﬃcult calving on the vigor of the calf, the onset of maternal behaviour, and some behavioural indicators of pain in the dam. Prev Vet Med 2012; 103:248-256. [ Links ]

Bazzi H. Evaluation of non-genetic factors aﬀecting birth weight in Sistani cattle. J Anim Vet Adv 2010; 10:3095-3599. [ Links ]

Berry DP, Cromie AR. Associations between age at ﬁrst calving and subsequent performance in Irish spring calving Holstein- Friesian dairy cows. Livest Sci 2009; 123:44-54. [ Links ]

Chang C-L. A study of applying data mining to early intervention for developmentally delayed children. Expert Syst Appl 2007; 33:407-412. [ Links ]

Ghavi Hossein-Zadeh N. Eﬀect of dystocia on the productive performance and calf stillbirth in Iranian Holsteins. J Agric Sci Technol 2013; 16:69-78. [ Links ]

Gröhn Y, Erb HN, McCulloch CE, Saloniemi HS. Epidemiology of reproductive disorders in dairy cattle: Associations among host characteristics, disease, and production. Prev Vet Med 1990; 8:25-39. [ Links ]

Hickey JM, Keane MG, Kenny DA, Cromie AR, Amer PR, Veerkamp RF. Heterogeneity of genetic parameters for calving diﬃculty in Holstein heifers in Ireland. J Dairy Sci 2007; 90:3900-3908. [ Links ]

Hill T, Lewicki P. Statistics: Methods and applications. Tulsa (OK): StatSoft; 2006. [ Links ]

Johnson SK, Deutscher GH, Parkhurst A. Relationships of pelvic structure, body measurements, pelvic area, and calving diﬃculty. J Anim Sci 1988; 66:1081-1088. [ Links ]

Loh W-Y, Shih Y-S. Split selection methods for classiﬁcation trees. Stat Sin 1997; 7:815-840. [ Links ]

Mee JF. Prevalence and risk factors for dystocia in dairy cattle: A review. Vet J 2008; 176:93-101. [ Links ]

Mee JF, Berry DP, Cromie AR. Risk factors for calving assistance and dystocia in pasture based Holstein-Friesian heifers and cows in Ireland. Vet J 2011; 187:189-194. [ Links ]

Moisen GG. Classiﬁcation and regression trees. In: Jørgensen SE, Fath BD, editors. Encyclopedia of ecology. Oxford (UK): Elsevier; 2008. p. 582-588. [ Links ]

Nisbet R, Elder J, Miner G. Handbook of statistical analysis and data mining applications. Amsterdam, Boston (MA): Academic Press/Elsevier; 2009. [ Links ]

Norman HD, Hutchison JL, Miller RH. Use of sexed semen and its eﬀect on conception rate, calf sex, dystocia, and stillbirth of Holsteins in the United States. J Dairy Sci 2010; 93:3880-3890. [ Links ]

Piwczyński D, Nogalski Z, Sitkowska B. Statistical modeling of calving ease and stillbirths in dairy cattle using the classiﬁcation tree technique. Livest Sci 2013; 154:19-27. [ Links ]

Speybroeck N. Classiﬁcation and regression trees. Int J Public Health 2011; 57:243-246. [ Links ]

Witten IH, Frank E, Hall MA. Data mining practical machine learning tools and techniques. 3rd ed. Burlington (MA): Morgan Kaufmann Publishers, Inc.; 2011. [ Links ]

Yıldız H, Saat N, Simsek H. An investigation on body condition score, body weight, calf weight, and hematological proﬁle in crossbred dairy cows suﬀering from dystocia. Pak Vet J 2011; 31:125-128 [ Links ]

¹To cite this article: Zaborski D, Proskura WS, Grzesiak W. Comparison between data mining methods to assess calving diﬃculty in cattle. Rev Colomb Cienc Pecu 2017; 30:196-208.

Received: August 02, 2016; Accepted: April 16, 2017

* Corresponding author: Daniel Zaborski. Department of Ruminants Science, West Pomeranian University of Technology, Doktora Judyma 10, 71-466 Szczecin, Poland. Tel: +48914496813. E-mail: daniel.zaborski@zut.edu.pl

This is an open-access article distributed under the terms of the Creative Commons Attribution License