Spaceborne sensors, particularly Synthetic Aperture Radar (SAR), provide valuable tools for monitoring agricultural resources, improving yield predictions, and ensuring sustainable farming practices. In this research, we explore several venues to advance the use of SAR observatio
...
Spaceborne sensors, particularly Synthetic Aperture Radar (SAR), provide valuable tools for monitoring agricultural resources, improving yield predictions, and ensuring sustainable farming practices. In this research, we explore several venues to advance the use of SAR observation time series for agricultural and vegetation monitoring applications.
The first part of this research evaluates the added value of Sentinel-1 InSAR coherence time series for land cover classification, using an agricultural region in São Paulo, Brazil, as a case study. This region is characterized by a mixture of crops, pastures, and sugarcane plantations, all managed asynchronously. The findings demonstrate that incorporating InSAR coherence alongside SAR backscatter improves classification accuracy, particularly during the dry season when the distinctions between vegetation and bare soil are more pronounced. The research employed machine learning approaches to analyze pixel-level and field-level classifications using different sampling schemes. It highlights how multi-looking strategies can be adjusted to improve the accuracy of the classification outcomes in agricultural settings. This research highlights the usefulness of coherence data for the detection of events such as harvesting, offering valuable insights for more dynamic agricultural monitoring. The sensitivity of the coherence to agricultural changes leads to the observed improvement in Land Use Land Cover (LULC) mapping.
Forward models, or observation operators, are essential for the interpretation of radar observations and for the development of assimilation frameworks. In particular, in this research, we are interested in forward modeling the relation between crop bio-geophysical parameters, such as Leaf Area Index (LAI), Above Ground Biomass (AGB), and soil moisture, the inputs to our data-driven model, and radar observables, the outputs.
In the second part of this research, we integrate an existing crop growth model, the Decision Support System for Agrotechnology Transfer (DSSAT), with machine learning techniques to train a forward model to predict SAR observables over silage maize fields in The Netherlands across multiple years. Using crop growth models circumvents the dependency on limitedly available field measurements. When we use training and validation data from the same growth season, we obtain accurate predictions, with a mean absolute error (MAE) of less than 1.23 dB. Some of the field-to-field variability is accounted for by including the mean backscatter intensity during a few acquisitions before crop emergence. The obtained performance suggests the potential of using this approach to generate observation operators for data assimilation frameworks or for anomaly detection, supporting large-scale agricultural monitoring. However, the results also highlight one of the main challenges: the resulting data-driven model fails to generalize when presented with input bio-geophysical parameters that fall outside the regions of the parameter space spanned by the training dataset, as can happen, for example, during a drought period.
In the final part, we develop a physics-guided machine learning approach to address the limitations of data-driven models: lack of generalizability, tendency to overfitting, and reliance on extensive training data sets. We introduce physical constraints in an artificial neural network (ANN) in two ways. First, by modifying the loss function, used to train the ANN, by including a penalty for unphysical behavior, in particular by penalizing negative values of the partial derivative of the predicted backscatter intensity with respect to the surface soil moisture, since we assume this should always be positive. Second, by mirroring the architecture of the widely used Water Cloud Model (WCM) in the network topology. The added physical term to the loss function improves the ANN performance in all cases considered, with an R2 increase of 3 percentage points (p.p). The WCM-inspired model performs slightly worse when trained and tested with data from the same year, but it generalizes better, producing significantly better results for unseen conditions. In addition, the WCM-inspired model also produces individual contributions to the observed intensity, such as the surface-scattering component and the vegetation backscatter component.