An analysis of the factors that influence sugarcane yield in Northern Argentina using classification and regression trees

被引:52
作者
Ferraro, Diego O. [1 ]
Rivero, Dario E. [1 ]
Ghersa, Claudio M. [1 ]
机构
[1] Univ Buenos Aires, CONICET, Fac Agron, Inst Invest Fisiol & Ecol Vinculadas Agr, Buenos Aires, DF, Argentina
关键词
Data mining; CART analysis; Cane yield variability; Non-parametric method; COMMERCIAL CANE SUGAR; PRODUCTION SYSTEMS; SOIL; MANAGEMENT; BRAZIL; PRODUCTIVITY; INFORMATION; VARIABILITY; SIMULATION; STABILITY;
D O I
10.1016/j.fcr.2009.02.014
中图分类号
S3 [农学(农艺学)];
学科分类号
0901 ;
摘要
Multi-location trials are commonly used to estimate the effects of different explanatory factors on crop yield. Conversely, the analysis of production databases could also be useful for exploring and understanding Such effects. These data require flexible and robust methods for dealing with multivariate, non-linear and unbalanced data structures, high-order interactions and missing values. In this paper, we explore the issue of crop yield explanation using a 5-year period (1999-2005) of Sugarcane (Saccharum officinarum L.) yield data from Northern Argentina. Using a data mining technique Such as classification and regression trees (CART) we show that farm membership (FARM) was among the main splitting factors for total cane per hectare (TCH) cluster variability. Crop class (AGE) was at the second level in the hierarchy and values of AGE higher than 2,5 splitted low and medium from the high TCH clusters. Sugarcane cultivar (VAR) was the most important explanatory factor regarding total sugar per hectare (TSH), and crop class (AGE) was second in importance. In this case, farm membership did not appear among the main splitting factors. The growth period duration, field area and precipitation did not show remarkable importance values for explaining final TCH and TSH values. By-year CART models also showed low values of importance of weather related variables across the years analyzed suggesting that other environmental conditions than precipitation is controlling yearly variation in sugar and cane yield (e.g. radiation, water-use efficiency or temperature regime). The CART analysis developed here is the first systematic analysis for explanatory factors of biomass and sugar content in Argentina's cane most productive region. However, we believe this methodology could be applicable for a wider geographic area and other sugarcane production regions as well as other cropping systems. Although regression trees provide less formal statistical inference, its results could be added as an additional analytical toot to traditional experimental analyses that use mixed models. Also, they could be useful for elaborating hypotheses and suggest mechanistic studies to test them. (C) 2009 Published by Elsevier B.V.
引用
收藏
页码:149 / 157
页数:9
相关论文
共 45 条
[1]   Tree regression analysis to determine effects of soil variability on sugarcane yields [J].
Anderson, DL ;
Portier, KM ;
Obreza, TA ;
Collins, ME ;
Pitts, DJ .
SOIL SCIENCE SOCIETY OF AMERICA JOURNAL, 1999, 63 (03) :592-600
[2]   Variability in regional wheat yields as a function of climate, soil and economic variables: Assessing the risk of confounding [J].
Bakker, MM ;
Govers, G ;
Ewert, F ;
Rounsevell, M ;
Jones, R .
AGRICULTURE ECOSYSTEMS & ENVIRONMENT, 2005, 110 (3-4) :195-209
[3]   Management effects on nitrogen recovery in a sugarcane crop grown in Brazil [J].
Basanta, MV ;
Dourado Neto, D ;
Reichardt, K ;
Bacchi, OOS ;
Oliveira, JCM ;
Trivelin, PCO ;
Timm, LC ;
Tominaga, TT ;
Correchel, V ;
Cássaro, FAM ;
Pires, LF ;
de Macedo, JR .
GEODERMA, 2003, 116 (1-2) :235-248
[4]  
Belsley D., 2005, REGRESSION DIAGNOSTI
[5]   Prospects for green cane harvesting and cane residue use in Brazil [J].
Braunbeck, O ;
Bauen, A ;
Rosillo-Calle, F ;
Cortez, L .
BIOMASS & BIOENERGY, 1999, 17 (06) :495-506
[6]  
Breiman L., 1984, BIOMETRICS, V40, P874, DOI 10.1201/9781315139470
[7]  
De'ath G, 2000, ECOLOGY, V81, P3178, DOI 10.1890/0012-9658(2000)081[3178:CARTAP]2.0.CO
[8]  
2
[9]   RESOURCE USE EFFICIENCY IN AGRICULTURE [J].
DEWIT, CT .
AGRICULTURAL SYSTEMS, 1992, 40 (1-3) :125-151
[10]   Ecosystem classification in a flat, highly fragmented region of Indiana, USA [J].
Dolan, BJ ;
Parker, GR .
FOREST ECOLOGY AND MANAGEMENT, 2005, 219 (2-3) :109-131