Application of classification trees-J48 to model the presence of roach (Rutilus rutilus) in rivers

In the present study, classification trees (CTs-J48 algorithm) were used to study the occurrence of roach in rivers in Flanders (Belgium). The presence/absence of roach was modelled based on a set of river characteristics. The predictive performance of the CTs models was assessed based on the percentage of Correctly Classified Instances (CCI) and Cohen's kappa statistics. To find the best model performance, a 3- fold cross validation techniques was applied on the dataset. The effect of Pruning Confidence Factors (PCFs) was examined on the reliability and model complexity. Based on the obtained... Mehr ...

Verfasser: Zarkami, R.
Dokumenttyp: Artikel
Erscheinungsdatum: 2011
Reihe/Periodikum: Caspian Journal of Environmental Sciences, Vol 9, Iss 2, Pp 189-198 (2011)
Verlag/Hrsg.: University of Guilan
Schlagwörter: Flanders / River basins / Roach / Occurrence / Classification trees (J48) / Ecological modelling / Environmental sciences / GE1-350 / Science / Q
Sprache: Englisch
Permalink: https://search.fid-benelux.de/Record/base-29470679
Datenquelle: BASE; Originalkatalog
Powered By: BASE
Link(s) : https://doaj.org/article/64e462386c6d40a881fd2846063f3450

In the present study, classification trees (CTs-J48 algorithm) were used to study the occurrence of roach in rivers in Flanders (Belgium). The presence/absence of roach was modelled based on a set of river characteristics. The predictive performance of the CTs models was assessed based on the percentage of Correctly Classified Instances (CCI) and Cohen's kappa statistics. To find the best model performance, a 3- fold cross validation techniques was applied on the dataset. The effect of Pruning Confidence Factors (PCFs) was examined on the reliability and model complexity. Based on the obtained results, the induced model could predict well the presence/absence of roach in the rivers. The highest overall means of two model performances showed that the models were reliable. When analyzing the ecological relevance of CTs, it seemed that the structural-habitat variables were more the main predictors than the water quality ones to predict the occurrence of roach in rivers. In particular, the distance from the source and width contributed more to the prediction of roach while among water quality variables, only electric conductivity was relatively important in this regard.