Open Access Article
Dimitra-Danai
Varsou
*ab,
Aikaterini
Theodori
a,
Anastasios G.
Papadiamantis
bc,
Andreas
Tsoumanis
bc,
Dimitrios
Zouraris
bc,
Maria
Antoniou
bc,
Nikoletta-Maria
Koutroumpa
c,
Georgia
Melagraki
d,
Iseult
Lynch
e and
Antreas
Afantitis
*abcf
aNovaMechanics MIKE, Piraeus 18545, Greece. E-mail: varsou@novamechanics.com; afantitis@novamechanics.com
bEntelos Institute, Nicosia 2102, Cyprus
cNovaMechanics Ltd, Nicosia 1070, Cyprus
dDivision of Physical Sciences and Applications, Hellenic Military Academy, Vari 16672, Greece
eSchool of Geography, Earth and Environmental Sciences, University of Birmingham, B15 2TT Birmingham, UK
fDepartment of Pharmacy, Frederick University, Nicosia 1036, Cyprus
First published on 22nd December 2025
The application of nanomaterials as fertilizers, biostimulants, and pesticides has been emerging as a promising approach in recent years, aiming to support sustainable and precision agriculture, while simultaneously addressing the challenges of climate change, global population growth, and the search for alternative energy sources (biofuels). In this work, to computationally assess the effects of nanoparticles (NPs) on plant growth (encoded in terms of length of e.g., root, shoot or overall plant length), we performed extensive data curation and enrichment with atomistic descriptors of an existing NP–plant interactions database, ensuring high-quality data for the development of machine learning (ML) models. To address class imbalance, data augmentation techniques were applied. An autoML workflow was developed to optimise and evaluate seven ML algorithms for predicting the plant length response class following NP exposure. The optimised XGBoost model demonstrated superior predictive performance during external validation, achieving an accuracy of 85% and a balanced accuracy of 83%, and its applicability domain was clearly defined. One of the key advantages of the plant length response model is that it requires no experimental input data to generate predictions, thus facilitating virtual screening prior to implementation of controlled experimental setups. The curated dataset has been made findable, accessible, interoperable and reusable (FAIR) via the nanoPharos database (https://db.nanopharos.eu/Queries/Datasets.zul?datasetID=np31) and the XGBoost model was documented in a standardized QSAR model report format (QMRF) to enhance its usability and FAIRness and made available as a user-friendly web-application, CeresAI-nano, via the Enalos Cloud platform (https://enaloscloud.novamechanics.com/chiasma/agrinano/).
Environmental significanceThis study provides a significant advance for the environmental sustainability of nano-enabled agriculture by combining rigorous data curation, atomistic descriptor enrichment, and automated machine learning to predict plant length responses to nanoparticle (NP) exposure. Conventional assessment of NP–plant interactions requires long, resource-intensive experiments that often capture only part of a crop's growth cycle. By contrast, our publicly available curated dataset and predictive model (CeresAI-nano) enable rapid, reproducible, and FAIR (findable, accessible, interoperable, and reusable) virtual screening of nano-agrochemical treatments without the need for experimental input. This capability directly supports the safe- and sustainable-by-design (SSbD) development of new nano-fertilizers, biostimulants, and pesticides, helping stakeholders identify formulations that enhance plant growth while minimising adverse effects on soil health, biodiversity, and human exposure. These efforts accelerate environmentally responsible innovation, and contribute to the long-term resilience of agricultural systems under climate change pressures. |
Nanotechnology can play a key role in promoting more precise, sustainable, and resource-efficient, agricultural practices,5,7 mainly due to the intrinsic properties of nanoparticles (NPs), which stem from their small size, high surface area-to-volume ratio, and tuneable surface charge.2,9 NPs are widely used in nano-enabled agrochemicals to enhance plant nutrition and resilience, and support biotic and abiotic stress management, all while increasing productivity.2,6,7,10 For example, NPs are incorporated into fertilizers and pesticides,6,8,10 as they can serve as nano-carriers for controlled release of nutrients,3,5,7 target pests efficiently,5 protect plants against the toxic effects of heavy metals such as Cd,11 remediate or improve soil quality,3,8,10,12 and/or facilitate key biochemical processes in plants, such as photosynthesis.2,5,6,13 It should also be noted that NPs are more effective than conventional fertilizers.13–15 For example, Nekoukhou et al.16 showed that foliar application of ZnO NPs on dragonhead resulted in increased productivity compared to conventional fertilizer (ZnS) treatments. In addition, nano-sensors are being developed and integrated to monitor and evaluate crop growth, detect infections, assess soil conditions, and track pesticide penetration, thus, contributing to more efficient agricultural practices.3–5,7,13,17
However, as is often observed in emerging research areas, nano-agriculture is not free of risks. Apart from the beneficial impact of the nano-enabled agrichemicals, there is an ongoing discussion about the potential undesirable effects of NPs on crop growth, which can sometimes alter plant physiological and biochemical functions in ways that are contrary to expectations (e.g., plant growth inhibition, reduced micronutrient uptake, etc.).10 Nanotoxicity and the potential toxic effects of NPs on human health and the environment, highlight the need for further research, especially in terms of non-monotonic responses.7 Additionally, it should be considered that beyond the intentional application of NPs to plants and soils, they can also be released into the agroecosystems incidentally during their production, through the use of nano-enabled products, and disposal, through untreated wastewater discharge, landfills, and via other pathways.7,9,10,18 Crops cultured in NPs-contaminated soil or atmosphere, may uptake these through the roots or aerial parts19 and show reduced growth rates and quality as a result of NP-induced oxidative stress.20 The interaction between NP properties, soil systems and plant species is highly complex.5,6,8 The NPs undergo transformation in different systems (e.g., biomolecule corona formation),5 which poses a barrier in identifying the precise mechanisms through which NPs impact plant growth and development. In this context, the application of machine learning (ML) algorithms may serve as a powerful tool to model, decode, and predict NP-induced effects and shed light on the mechanisms of NPs uptake and translocation in plant tissues, including any transformations and their consequences.
ML and data-driven approaches, including quantitate structure–activity relationship (QSAR) models and read-across methodologies, have been already established in the nanoinformatics field to support the safe and sustainable-by-design (SSbD) development of novel NPs. These approaches aim to optimise NP functionality while eliminating potential adverse effects-even prior to their synthesis.5,21 Similarly, ML approaches can support SSbD of novel nano-agrichemicals, by predicting specific effects (e.g., oxidative stress and photosynthesis parameters, as well as the NPs uptake and accumulation in different plant tissues), in order to achieve optimal NPs properties that enhance plant growth and resilience, improve soil health, and reduce possible toxicological effects on human health and the environment.5
Artificial neural network (ANN) approaches have recently been employed to simulate NP–plant–soil interactions. For instance, Li et al.22 developed eight ANN models to predict the yield and quality responses of rice plant after soil exposure to selenium NPs at four different time intervals (30–120 days). The reported R2 values for the eight models were in the range of 0.71–0.89 during external validation. Wang et al.23 applied ANN algorithms to predict the transport factor (TF) and root concentration factor (RCF) of different types of NPs in plants grown in soil or hydroponic systems, achieving R2 values higher than 0.80 in all cases during external validation. ANNs algorithms have also been successfully used to model the uptake of cooccurring CeO2 NPs and Cd in Brassica napus L. cultivated in soil. Rossi et al.24 developed a multilayer perceptron ANN to predict Cd and Ce concentrations in plant roots and leaves based on plant characteristics, achieving R values greater than 0.90 on the test set in all cases.
Min et al.25 explored the effects of metal oxide NPs along with parameters such as crop species (vegetable, legume, fruit, cereal or other) and cultivation conditions on the heavy metal uptake by crops. They developed different ML models, among which the random forest algorithm achieved the highest performance, with an R2 of 0.62 on the test set for the prediction of the effect of metal oxide NPs on the heavy metal uptake and growth of crops expressed as a weighted response ratio (LnR+). Xu et al.12 investigated in silico the potential hazardous effects posed by NPs to soil microbial communities by training random forest models on NP and soil properties to predict microbial diversity using Richness and Shannon index. These models, evaluated via 10-fold cross-validation, achieved R2 values mostly exceeding 0.70. Li et al.6 also developed several ML models for prediction of the effects of carbon dots on plants (mostly lettuce, tomato, maize, wheat, cucumber, rice, mung beans, and soybean) including nutrient content, quality, growth indicators, photosynthesis, and antioxidant response. These models used features such as carbon dots and plant properties, environmental factors, and experimental conditions. The growth indicator endpoint was predicted with the highest accuracy using a random forest model achieving an R2 value of 0.62 in external validation. Yu et al.26 applied light gradient boosting machine (LightGBM) models to classify the relative metal/metalloid concentration (RMC) in maize seedlings following seed priming with fourteen different metal oxide NPs at varying concentrations. These models were developed using NP characteristics and two seedling parameters. Using ten different stratified train-test splits, the average accuracy on the test sets reached 0.76.
Great progress has thus been made in the integration of ML methods in nano-agriculture, however key challenges persist in model development. The lack of large-scale experimental studies about the effects of NPs on plant growth, the variations in experimental conditions, methods and protocols, as well as the non-systematic data coding in existing datasets created by different research groups, impede the creation of a uniform dataset that will allow the development of robust ML models. As a result, the earlier work mentioned mainly either addresses very specific nano-agriculture applications (only one type of NP or one crop species) or utilizes small datasets with limited attributes for model development. To overcome this, Deng et al.8 conducted an extensive literature review to identify NP–plant interactions studies and systematically extracted a wide range of relevant data, including NP characterization data, experimental conditions, plant properties, and responses for both treated and control groups. Their effort resulted in the compilation of data from 57 studies into 13 heterogeneous datasets, covering plant growth, photosynthesis, NP uptake, and antioxidant responses for 17 different NPs. This work represents a critical step toward overcoming current data limitations and enabling the development of more generalisable ML models in nano-agriculture. Subsequently, they developed random forest models and obtained predictions about the potential of nano-enabled agriculture across different global regions.8
Deng et al. made their dataset publicly available on GitHub, where key NP characteristics – such as NP type (e.g., carbide, metal, oxide, macromolecular compound), composition, and shape (e.g., granular, one- and two-dimensional, hollow) – are encoded. However, the absence of explicit information on the core composition of the NPs poses challenges for reproducibility and limits the full exploitation of the data. For instance, calculating atomistic descriptors requires information on the precise NP composition and shape, and deciphering the encoded data from existing datasets such as that compiled by Deng et al.,8 can be time-consuming. Furthermore, the lack of direct links within the dataset to the DOIs of the original publications from which data were extracted impedes quick verification and retrieval of additional details that may be essential for other types of studies, such as mechanistic investigations, meta-analyses, or regulatory risk assessment (e.g., for the calculation of atomistic descriptors the NP crystallographic phase is needed).
In the present study, building upon the dataset compiled by Deng et al.,8 we performed an extensive additional data curation and quality control process, which involved identifying the original publications and cross-checking all details and supplementing with additional information such as the aforementioned core composition and crystal phase information (where available), followed by data enrichment using computationally derived atomistic descriptors based on the elemental composition of the NPs. Our goal was to develop a ML model capable of predicting the effects of NP exposure on plant length (roots, shoots or total). To address data imbalance, we employed a synthetic data generation technique, combined with additional data manipulation strategies aimed at optimising the modelling workflow, which included rigorous data filtering and variable selection through an automated ML (autoML) framework. The resulting optimised models were validated according to the Organisation for Economic Co-operation and Development (OECD) principles for the validation of QSAR models for regulatory purposes, including the generation of a standardised report following the QSAR model reporting format (QMRF),27 and the final predictive model was implemented as a web-service and distributed via the Enalos Cloud platform. To ensure transparency and FAIRness of the data and model, curated data were made available through the nanoPharos database for full exploitation from the stakeholders.
Our analysis steps are presented schematically in Fig. 1. Prior to model development, data curation, management, and quality control (QC) of the original dataset took place based on the best practices for curating, cleaning, and processing literature-extracted data for meta-analysis purposes. A detailed workflow was established, with the aim to maximise the extraction of information, correct potential errors from the data curated in the source publication,8 perform data gap filling where possible, and enrich the dataset with additional key points from the original papers used for data extraction or with computational descriptors calculated in house.29,30
The curated dataset was later randomly partitioned into training, validation and test sets using a stratified sampling technique. Training and validation sets were used for model development and hyperparameters optimization, respectively, within the autoML scheme and the test set was used as a blind set for external validation.
| NP type | |
|---|---|
| Encoding | Description |
| Carbide | Number of carbon atoms |
| Metal | Number of metal atoms |
| Oxide | Number of oxygen atoms |
| Component 1 (Com1) | A r of Com1, e.g., for TiO2, Com1(Ti) = 47.87 |
| Component 2 (Com2) | A r of Com2, e.g., for TiO2, Com2(O) = 16.00 |
| Morphology | |
|---|---|
| Encoding | Description |
| Dim0 | Granular |
| Dim1 | One-dimensional |
| Dim2 | Two-dimensional |
| Hollow | Hollow |
To QC the data extracted by Deng et al.,8 each dataset row was matched to its corresponding original publication, e.g., rows 1–9 were matched to paper number 5 (ref. 31) in the literature list provided by the writers. All data reported by Deng et al.8 were QCed based on the original publications. Any inconsistencies between the original and the Deng et al. publications were highlighted and, where possible, corrected (see Table S2 in the SI for a full list of corrections made to the dataset). Corrections were made in the NP size (sizeTEM), purity, daytime and night temperature (DT and NT respectively), humidity, and illumination descriptors. Similarly, 52 rows from the original dataset were removed, as the size or shape of the NPs were not clearly mentioned in the original papers. Where possible, verification of the data was achieved based on previous studies; e.g., for rows 249–260 (ref. 32) of the original dataset, CuO NP shape and size were identified in a follow-up study by the same group.33 When included data could not be traced back to the original papers, the respective rows (n = 16, rows 216–231 of the original dataset) were removed. For this reason, the sizeDLS and zeta columns were removed as they contained the most data gaps, i.e., 73% and 66%, respectively.
On top of that, the full text for all the 57 original publications was studied and key points were highlighted or extra data extracted to enrich the dataset further. This included adding 12 rows with additional data from papers number 10 (ref. 34) and 45 (ref. 35). Furthermore, the total concentration (in mg L−1) column was added to include raw data about initial NP concentration. To avoid any bias or artificial enhancement, the original total content (TC (mg)) column was removed to avoid possible assumptions about initial volumes by Deng et al., as well. This led to the removal of 48 rows because the total concentration was not available and could not be calculated from the information in the original publication. For example, for rows 91–108 (ref. 36) concentration was reported in mg kg−1 of soil, but the initial soil mass was not provided, preventing conversion to mg L−1.
Following cleansing of the “Length” dataset, the positive values of the label “Length” were re-calculated and normalised based on the control value to extract a more interpretable output (originally positive values were normalised based on the experimental value while negative values were normalised based on the control value). In cases where raw length values were not directly available (e.g., provided in textual format), the “WebPlotDigitizer” tool37 was employed to extract these values from plots presented in the original referenced articles. The systematic error associated with the digitisation process was quantified using the relative absolute error, which was estimated to be less than 1% (see SI file for more details). Then, for the output variable of length a specific class was assigned, “positive” (for positive responses, i.e., increased length) or “negative” (for negative responses, i.e., reduced length) based on the NP impact compared to the controls. This was done to facilitate the development of a classification model, as preliminary regression models resulted in poor performance (R2 < 0.5). As a result, the 25 control data points were removed as they did not fall under the “positive”/“negative” classes.
The produced dataset, which consisted of 170 NPs and 26 features, was then enriched with a set of 53 atomistic descriptors of NPs in vacuum calculated with the Enalos NanoConstruct tool.29 The necessary information to perform the calculation of the atomistic descriptors is included in Table 2. The required crystallographic data were obtained from the Crystallography Open Database (COD).38 Calculations were performed for spherical NPs, as following processing and cleansing only spherical NPs remained in the dataset. For TiO2 NPs lacking crystallographic phase information, an anatase structure was assumed to prevent the exclusion of the relevant particles and preserve loss of samples from the dataset. Out of the 53 calculated descriptors, 10 provided not a number (NaN) values for some NPs and were excluded. When atomistic descriptor calculation was not possible, e.g., due to large size of the NP, the respective dataset rows (n = 57) were removed. This included polymeric and large carbon NPs, e.g., graphene sheets and nanotubes (rows 189–210 in the original dataset39). The final dataset used for model development, following cleansing and enrichment, contained 69 features and 113 rows, compared to the 28 features and 299 rows of the original one.
| NP core | Diameter [nm] | COD code of CIF file | FF using OPENKIM ID40 |
|---|---|---|---|
| Ag (ref. 31, 34 and 41) | 13.8, 20 | 1509146 | EAM_Dynamo_AcklandTichyVitek_1987v2_Ag_ |
| _MO_055919219575_000 | |||
| Graphene quantum dots (ref. 42) | 2.5 | 1200017 | DUNN_WenTadmor_2019v1_C_ |
| _MO_584345505904_000 | |||
| CuO (ref. 32 and 35) | 30, 40 | 1011148 | LJ_ElliottAkerson_2015_Universal_ |
| _MO_959249795837_003 | |||
| Fe3O4 (ref. 43) | 6.7 | 1011032 | EAM_Dynamo_AcklandTichyVitek_1987_Ag_ |
| _MO_212700056563_005 | |||
| SiO2 (ref. 44) | 15 | 9011493 | Sim_LAMMPS_Vashishta_BroughtonMeliVashishta_1997_SiO_ |
| _SM_422553794879_000 | |||
| TiO2 (ref. 45 and 46) | 6.5, 21 | 1010942 | Sim_LAMMPS_MEAM_ZhangTrinkle_2016_TiO_ |
| _SM_513612626462_000 | |||
| ZnO (ref. 47) | 25 | 1011258 | Sim_LAMMPS_ReaxFF_RaymandVanDuinBaudin_2008_ZnOH_ |
| _SM_449472104549_001 |
To effectively address both the small dataset size and the class imbalance problem, a synthetic data generation method was applied to populate the minority “positive” class. In this case an oversampling technique that has been successfully applied to similar modelling problems,51–53 the synthetic minority over-sampling technique (SMOTE),54 was selected to enhance data balancing without introducing unrealistic data points. In SMOTE, the new data points created correspond to a randomly chosen point along the line segments between a “positive”-labelled observation and one of its k-nearest neighbours (kNN). It should be noted that oversampling was applied exclusively to the training set to ensure that the calculated accuracy metrics are meaningful and express the model performance on real data. Equally significant was the normalisation of the data before applying a distance-based oversampling technique such as SMOTE. To this end, the training data were normalised using the z-score normalisation method prior to the application of SMOTE, and subsequently de-normalised to proceed with the rest of the feature engineering processes.
| ML methodologies | Hyperparameters | Search space [min, max; step] |
|---|---|---|
| Gradient boosted trees | Number of trees | [50, 100; 20] |
| Naïve Bayes | Default probability (threshold) | [0.004, 0.01; 0.0001] |
| Logistic regression | Step size | [0.01, 0.1; 0.01] |
| Decision tree | Minimum number of records per node | [2, 20; 2] |
| Random forest | Tree depth | [4, 8; 4] |
| Number of trees | [100, 200; 100] | |
| Minimum child node size | [10, 20; 5] | |
| Neural network | Number of hidden layers | [1, 2; 1] |
| Number of hidden neurons per layer | [5, 20; 10] | |
| XGBoost trees | Learning rate (eta) | [0.2, 0.3; 0.1] |
| Maximum depth | [5, 10; 5] |
| Class | Training set before oversampling | Training set after oversampling | Validation set | Test set |
|---|---|---|---|---|
| Positive | 0.26 | 0.50 | 0.27 | 0.25 |
| Negative | 0.74 | 0.50 | 0.73 | 0.75 |
Accuracy (ACC, eqn (1)) and balanced accuracy (BA, eqn (2)) metrics were calculated during the internal validation of the seven ML models within the autoML scheme. As the BA metric accounts for classes imbalance, it was selected to evaluate model performance, along with the ACC metric.
![]() | (1) |
![]() | (2) |
To thoroughly evaluate model predictivity, within the external validation of the selected model from the autoML, in addition to ACC and BA, the sensitivity (SEN, eqn (3)), specificity (SPE, eqn (4)), precision (PRE, eqn (5)), Matthews correlation coefficient (MCC, eqn (6)), and F1-score metrics (eqn (7)) were calculated.
![]() | (3) |
![]() | (4) |
![]() | (5) |
![]() | (6) |
![]() | (7) |
| thr = 〈d〉 + Zσ | (8) |
To build our predictive workflow the curated dataset was initially split into training (47 NP observations), validation (26 NP observations) and test (40 NP observations) sets following a stratified sampling technique to ensure equal classes representation between sets and with the original dataset. Training data were oversampled using SMOTE (with k = 5), and 23 data points belonging to the minority class (“positive”) were generated to correct class imbalance and increase the dataset's size (Table 4).
The oversampled training set was filtered to exclude noisy and intercorrelated descriptors, using a low-variance and a correlation filter, leading to the removal of 16 and 15 descriptors, respectively. To further refine the descriptor set, we calculated information gain scores for the remaining 38 features, retaining only those with non-zero values (19 descriptors). The variable representing total concentration was not subjected to this selection process, as it encodes critical experimental parameters and is considered as a priority driver of plant uptake,8 and was thus force-included in the final training set (Table 5). To eliminate redundancy that could introduce bias, the filtered dataset was reviewed for duplicate observations. Finally, categorical variables were one-hot-encoded, and numerical descriptors were standardised via z-score normalisation to ensure comparability across varying scales.
| Selected variable | Description | Information gain score |
|---|---|---|
| Photoperiod | Hours of plant exposure to light per day in hours per day | 0.419 |
| Species | Plant species (cucumber, bean, wheat, rice, tomato, maize) | 0.340 |
| Illumination | Illumination intensity in μmol m−2 s−1 | 0.284 |
| NT | Nighttime temperature in °C | 0.229 |
| Purity | % nanoparticle material purity | 0.225 |
| D24 | The average coordination parameter (5Ang) of all atoms | 0.198 |
| D12 | The average difference of the coordination parameter between core and shell atoms | 0.179 |
| D19 | The average coordination parameter (4Ang) of all atoms | 0.179 |
| D25 | The average coordination parameter (5Ang) of the core atoms | 0.179 |
| D26 | The average coordination parameter (5Ang) of the shell atoms | 0.179 |
| D37 | Lattice energy of NP in eV | 0.179 |
| Duration | NP treatment duration (time elapsed from the exposure of the plant to NPs to the measurement) in days | 0.179 |
| DT | Daytime temperature in °C | 0.160 |
| D4 | The average potential energy of all atoms in eV | 0.152 |
| GS | Growth stage of plant (germination, seedling, vegetative) | 0.057 |
| EP | NP exposure pathway (seeds, root, leaf) | 0.045 |
| Category | Plant carbon fixation metabolic pathway (C3 or C4 photosynthetic process) | 0.040 |
| MT | Measured tissue (root, shoot, plant) | 0.037 |
| Cultured | Cultivation method (medium, hydroponic, soil) | 0.008 |
| Total concentration | Total concentration of NP treatment in mg L−1 | Forced inclusion |
Using the 20 selected features, seven ML models were developed, and their hyperparameters were optimised through five-fold cross-validation (Table 6). To select the best-performing among these models, they were applied on the validation set (26 NPs), which was also normalised following the training normalisation function. The accuracy of the optimised models on the validation set is presented in Fig. 2. The XGBoost model yielded the highest ACC and BA values (0.77 and 0.71, respectively) and therefore, was selected as the final model for the prediction of the “Length” class.
| ML methodologies | Optimised hyperparameters |
|---|---|
| Gradient boosted trees | nrModels = 50 |
| Naïve Bayes | Threshold = 0.0044 |
| Logistic regression | stepSize = 0.05 |
| Decision tree | minNumberRecordsperNode = 16 |
| Random forest | maxLevels = 4, minNodesize = 20, nrModels = 200 |
| Neural network | Hiddenlayer = 1, nrhiddenneurons = 15 |
| XGBoost trees | Eta = 0.2, Max_depth = 5 |
Given the limited size of the training data, and to mitigate the risk of overfitting, we applied a permutation-based feature importance to the XGBoost model following its selection within autoML based on the inner validation performance, to further refine the descriptor set. More specifically, to assess the stability and relevance of each of the 20 descriptors of Table 5, permutation importance was calculated within a ten-fold cross-validation scheme using the XGBoost model. In each fold, individual descriptors in the validation subset were randomly shuffled, and the resulting drop in the predictive performance (compared to the baseline/unshuffled validation subset) was recorded. Normalisation was performed within the cross-validation loop to prevent data leakage. A feature was considered important if its mean permutation-induced performance drop across folds was greater than zero. Descriptors with consistently low importance were removed prior to final training. The important features that emerged from the feature permutation process were the total concentration, species, MT, cultured, category, duration, photoperiod and D12, and are depicted in Fig. 3. The final XGBoost model was retrained on the reduced descriptor set and subsequently validated.
The XGBoost model was rebuilt with the refined set of descriptors and was validated externally, by applying it to the 40 NPs of the test set, which preserved the original distribution of “positive” and “negative” classes. Test set descriptors were previously normalised with the training set z-score function. In Table 7 the confusion matrix of the test set is presented. Table 8 summarises the performance of the XGBoost model on the test set: the model demonstrates strong predictive capabilities and generalizability on unseen data with an ACC of 0.85 and a BA of 0.83.
| Actual/predicted class | Negative | Positive |
|---|---|---|
| Negative | 26 | 4 |
| Positive | 2 | 8 |
| Metrics | Values |
|---|---|
| ACC | 0.850 |
| BA | 0.833 |
| SEN | 0.867 |
| PRE | 0.929 |
| SPE | 0.800 |
| F1 | 0.897 |
| MCC | 0.630 |
To evaluate the stability of the model's predictive performance on the test set,62 a bootstrap resampling strategy was employed. The test set was resampled 1000 times with replacement, and predictions were produced by applying the XGBoost model. For each bootstrap sample, the BA value was calculated, and the mean, median, and standard deviation of these values were used to assess the stability and robustness of the model's performance. The results are summarised in Table 9 and visualised in Fig. S4. The distribution of the BA values is approximately symmetric (shows minimal negative skew), as also indicated by the proximity of the mean (0.830) and median (0.833). The low standard deviation (0.073) further supports the consistency and reliability of the model's predictions.
| BA | Values |
|---|---|
| Mean | 0.830 |
| Median | 0.833 |
| Standard deviation | 0.073 |
Further confirmation of the model's robustness comes from the poor predictive BA and MCCs (similar to random estimation accuracy) obtained with the y-randomised models as shown in Table 10, demonstrating that the model's learned classification rules are meaningful and not just random noise.
| Randomisation | BA | MCC |
|---|---|---|
| 1 | 0.467 | −0.061 |
| 2 | 0.467 | −0.061 |
| 3 | 0.467 | −0.061 |
| 4 | 0.533 | 0.067 |
| 5 | 0.467 | −0.058 |
| 6 | 0.533 | 0.067 |
| 7 | 0.417 | −0.146 |
| 8 | 0.383 | −0.204 |
| 9 | 0.550 | 0.087 |
| 10 | 0.567 | 0.118 |
The cross-validation results, summarised in Table 11, prove the model's stability to data inclusion–exclusion.
| Metrics | LOO | Ten-fold |
|---|---|---|
| ACC | 0.843 | 0.886 |
| BA | 0.843 | 0.886 |
| MCC | 0.686 | 0.777 |
The APD threshold was calculated equal to 5.071. As all test NPs domain values were below this limit value, the test set predictions are all considered reliable.
To support model FAIRification, the produced model has been FAIRified using the FAIR principles application to computational workflows guidelines.66 The main steps of model development are documented in this paper and in a straightforward manner through a standardized QMRF report,68 which is in line with the OECD's Guidelines for the Validation of QSAR Models.55 The QMRF is available as part of this paper's SI.
Apart from data and model development sharing, which enhances reproducibility, FAIR model dissemination, following the FAIR4RS principles,67 is equally important to promote and support future advancement of nano-enabled agriculture. For this purpose, the developed XGBoost model was transferred to the Isalos Analytics Platform,61 a user-friendly software tool for data manipulation and ML model development. This facilitates model deployment and sharing as a web service with a unique identifier, called CeresAI-nano, via the Enalos Cloud platform (https://enaloscloud.novamechanics.com/chiasma/agrinano/). The CeresAI-nano can be also accessed remotely using APIs (https://enaloscloud.novamechanics.com/chiasma/swagger-ui/).
In the CeresAI-nano application, users can complete the provided table with the required input descriptors (total concentration, species, MT, cultured, category, duration, photoperiod and D12) or upload the input data using the provided template file. The atomistic descriptors can be obtained using the NanoConstruct tool (https://enaloscloud.novamechanics.com/riskgone/nanoconstruct/), which is also freely available. Upon execution, the web application generates predictions for the length class of the input NP–plant treatments, and indicates their reliability based on the APD limits in tabular format. The results are available for download, enabling further analysis. A detailed user-guide is also available to facilitate the use of the CeresAI-nano. An overview of the graphical user interface is presented in Fig. 4.
The interactions between NPs and plants are characterised by a high level of complexity and ambiguity6,8 including the complexity and diversity of the environment-plant system24 and the intricate network of parameters controlling plant responses.2 The parameters studied to unravel the NP–plant interactions usually include (but are not limited to) NP composition, shape, and size, NP surface charge and modification, plant species, treatment concentration, exposure duration, and the application method or exposure route.1,7,13,18 Beyond these, additional factors may be considered that will provide a more comprehensive and realistic assessment of NP impacts in agriculture and the environment and will be incorporated in future in silico models. Such parameters may include genomic, proteomic, and metabolomic studies to assess long-term effects of NP–plant interactions,71 soil texture, pH, and organic matter content to unravel NPs fate and biotransformation (e.g., dissolution) due to plant–microbial–soil interactions.72 The factors influencing plant length, − and plant growth in general, are complex and sometimes not entirely known, and the presence of NPs can be either beneficial, promoting plant development (e.g., increasing root and shoot length) or detrimental, such as inhibiting root elongation. These effects depend on NP type and size, treatment concentration, and plant species (see also the following paragraphs).
For instance, it has been shown that identical Ag NPs treatments using different NP shapes result in significantly different root length responses of the Arabidopsis seedlings, with decahedral NPs giving maximum length increase compared to the control sample.73 Moreover, the existence of an applied concentration threshold74 is reported, beyond which plant growth is adversely affected. This threshold varies depending not only on the NP type, but also the plant species since the same NPs have been known to have opposite effects on different plants. While some NP treatments have the same effect both on the shoot and the root length, it has been reported that NPs can have mixed effects on plant species such as Coriandrum sativum L. and Solanum melongena L.75 Another example is the case of Ag NPs that enhance Zea mays plant growth but inhibit the root growth of Lolium multiform.76 Plant varieties also modulate the NP–plant interactions. For example, lignin content differences among sweetpotato varieties influence root length development following treatment with CuO NPs.77 It is evident that the complex biochemical and physiological interactions between NPs and plants, which remain largely unexplored and not yet fully understood, hinder the development of interpretable ML models capable of accurately predicting the quantitative plant length response. Indeed, there are also complex dynamics within plants as to whether to divert energy to below ground growth (root system) versus above ground growth (shoots and fruit), that are only beginning to be explored.78,79 Developing case-by-case models may offer a viable solution when dealing with “conflicting” NP–plant interaction effects, although this approach demands a more targeted data collection and curation approach. Consequently, given the reduced size of the dataset and the limited number of distinct NP–plant observations, it was proposed to simplify this inherently complex problem by categorising the “Length” labels as either “positive” or “negative” and developing a classification ML model. This qualitative approach is not just a forced compromise to facilitate the development of reliable classification models but can provide valuable predictions regarding the effects of NPs on plant growth and support the design of optimal NPs to maximize growth and yield. Furthermore, the proposed classification strategy could act as the initial step of an SSbD workflow, guiding the development of species-specific regression models for refined validation or for identifying hazardous concentration thresholds.
The “positive”/“negative” classification allows the development of simpler and more robust models, but the interpretation of these classes should be performed with caution. Yet, important conclusions can be drawn from the length response prediction for different parts of the plant, since root growth is related to the whole plant growth through an “allometry” relationship.80 Depending on particle size, dose, composition, and physicochemical characteristics, NPs have been shown to have both beneficial and detrimental impacts on the root length and overall plant growth and development.7,81,82 The output class (“positive” or “negative”) refers to the length of the whole plant, root, or shoot – as specified by the input variable and usually is considered as a beneficial effect.2 However, a “positive” length response does not necessarily correlate with improved yield-related outcomes. For example, the root length or the root/shoot ratio can be used as an indication of environmental stress. In nutrient-deficient soils or in desert environments, root length usually increases to aid the plant obtain the necessary resources in terms of nutrients80 and water,83 respectively. Furthermore, in the case of grain producing crops (such as wheat, rice, maize etc.) plant length plays a key role in yield production. Shorter plants (dwarf and semi-dwarf varieties) are in general less prone to lodging than taller ones, allowing in this way higher planting densities. In parallel, these varieties tend to allocate a greater proportion of resources to grain production rather than vegetative growth, resulting in increased yield potential. Nonetheless, excessive dwarfing may lead to decreased yield.84–86 Therefore, the optimization of plant length through precision agriculture is important to achieve higher yields and consequently support global food demands. In cases where yield related data are scarce, the final XGBoost model can be used as a tool to gain preliminary insights on the plant length effects (“positive” or “negative”) under NP exposure, prior to a refined experimental assessment to define the parameters that will optimize plant architecture. Thus, this simple classification of the effects of NPs on plant length can provide valuable insights into the optimal NPs treatment plan, provided that it is interpreted correctly in the context of the specific NP–plant application.
As previously discussed, the type of NPs, their composition, size, and the physicochemical properties derived from their type can either positively or negatively regulate root length.1,7,13,82,87 In our case, the used atomistic descriptor (D12) encodes information about NP composition, shape, size, and crystallinity into a single value. It should be noted that although the NanoConstruct tool also calculates D12 for ellipsoidal NPs, the predictions derived from the reported model should be interpreted with caution, as the model was trained specifically for spherical NPs. NanoConstruct is also limited to NPs below 60 nm, and cannot calculate these parameters for carbon-based NPs, either graphene and carbon nanotubes or polymeric NPs. While descriptors for nanosheets and nanotubes can be generated with the NanoTube Construct tool,88 these descriptors are not directly comparable with those from NanoConstruct, which can lead to data gaps. NP exposure concentration has a varying impact on root development: some NPs promote root and shoot elongation at low concentrations, but reduce root length at higher concentrations.1,13,18,82,87,89 In the final model, the exposure concentration factor is represented by the total concentration variable. Finally, the NP effect varies across plant species and can be both positive and negative.13,82,87 This is the case of Ag NPs,89 which, within the same concentration range, were found to promote root length in the case of barley and reduce it in the case of lettuce due to seed treatments.82 The plant species and category are also incorporated into our final model, as well as other features encoding experimental conditions (i.e., exposure duration, photoperiod, MT, and cultivation method).
To further interpret the influence of the selected numerical features on the final model's decision-making process, a SHAP (SHapley Additive exPlanations) analysis90 was performed in using Tree SHAP to illustrate how these descriptors influence the prediction probability, specifically for the “positive” class (increased plant length). In Fig. S5 the SHAP summary plot is depicted, reflecting the magnitude and direction of each descriptor's impact on the model's output when compared to the average prediction. Descriptors are ranked by decreasing importance corresponding to their mean absolute SHAP value. Positive SHAP values indicate a contribution towards the “positive” class probability (increased length compared to the control), and negative SHAP values are associated with an increase in the “negative” class probability. Photoperiod contributes most to the classification through a monotonous relationship with higher feature values (red dots) driving the prediction towards the “negative” class. Photoperiod plays a fundamental role in plant studies as it serves as a reliable environmental cue for seasonal timing. It is directly connected to photosynthesis, influencing carbon fixation and biomass accumulation, and regulates a wide range of growth-related, physiological, and stress-response processes, including flowering, branch formation and circadian clock synchronization.91–94 In our analysis, longer photoperiods expressed in hours per day tend to drive the prediction towards the reduced plant length response class under NP treatment compared to the control. Nonetheless, other growth parameters such as the dry or fresh weight, the leaf area, the number of leaves etc. after NP treatment were not included in this model to assess how the plant length is associated with overall biomass accumulation. Again, the synergistic effects of NPs, plant species, and experimental conditions should be further investigated to develop a more focused and optimised framework for precision agriculture. Based on the SHAP summary plot, lower NP total concentration values and exposure times mainly drive the prediction towards the “positive” class (increased plant length), which for the case of concentration is in line with our previous findings. The D12 atomistic descriptor is the lowest-ranked descriptor, with lower values showing a tendency to drive the prediction towards the “negative” class probability.
It can be concluded that highly influential factors for assessing plant length response are included in our model and, as they encode controllable experimental conditions (e.g., laboratory, mesocosms,95,96 greenhouse,97 or hydroponic setups,98 as opposed to in situ experiments and field-based data), the model can serve as a primary tool for evaluating plant length promotion or inhibition and therefore, saving time from long-time experimental cycles. In the study by Santos et al.,2 which conducted a meta-analysis of research on the effects of nanomaterials on plants from 2019 to 2022, the median duration of experiments involving plants cultured in soil was 49 days. This time span is relatively short, as it falls below the typical growth cycle of annual crops (90–120 days), meaning that many studies capture only part of the plant's development. In this context, predictive modelling offers a valuable shortcut and can complement experimental work, enabling researchers to generate insights more rapidly and efficiently than through conventional experiments alone.
ML can be integrated to nano-enabled agriculture applications to support the prediction of plant growth indicators and contribute to the design of safe nano-agrichemicals, as demonstrated by this work. In brief, based on data available on literature we performed a comprehensive and detailed curation, as quality data are the cornerstone for the development of reliable and interpretable ML models. Data were enriched with calculated atomistic NP descriptors, which were used alongside experimental conditions and plant characteristics as input to an autoML workflow for the prediction of the effects of NP exposure on plant length. To correct class imbalance, synthetic data were generated following the SMOTE methodology and seven ML methodologies were optimised and evaluated in terms of prediction accuracy. The XGBoost methodology was selected as the best performing one, yielding an accuracy of 85% and a balanced accuracy of 83% in external validation. Additional bootstrapping and y-randomisation tests demonstrated the stability of the predictions and the model's robustness.
The curated data and model are freely available and documented in a standardized report, aligning with the FAIR data principles, to ensure accessibility and reusability. The final model has been disseminated as a user-friendly web application to facilitate broader use by the interested stakeholders and to support applications in nano-enabled agriculture. The CeresAI-nano tool does not require experimental input, instead, it uses variables such as experimental conditions, NP core, size and purity, and plant details. This enables rapid virtual screening of possible NP–plant treatments to assess their plant length effects, saving time from long-term experiments. With the growing interest in nanomaterial applications in agriculture driving increased experimental assessment, a synergistic integration with ML approaches is essential to advance research efficiency. This integration relies on the systematic collection of data and metadata to support the development of robust and reliable predictive models that can later contribute to the understanding of the NP–plant interaction mechanisms.
| ACC | Accuracy |
| ANN | Artificial neural network |
| APD | Applicability domain |
| A r | Relative atomic mass |
| AutoML | Automated machine learning |
| BA | Balanced accuracy |
| CIF | Crystallographic information file |
| COD | Crystallography Open Database |
| DLS | Dynamic light scattering |
| DT | Daytime temperature |
| EP | Exposure pathway |
| FAIR | Findable, accessible, interoperable and reusable |
| FN | False negatives |
| FP | False positives |
| LightGBM | Light gradient boosting machine models |
| GS | Growth stage of plant |
| kNN | k-Nearest neighbours |
| LOO | Leave-one-out |
| MCC | Matthews correlation coefficient |
| ML | Machine learning |
| MT | Measured tissue |
| NP | Nanoparticle |
| NT | Night temperature |
| OECD | Organisation for Economic Co-operation and Development |
| QC | Quality control |
| QMRF | QSAR model report format |
| QSAR | Quantitate structure–activity relationship |
| PRE | Precision |
| RCF | Root concentration factor |
| RMC | Relative metal/metalloid concentration |
| SEN | Sensitivity |
| SHAP | SHapley Additive exPlanations |
| SMOTE | Synthetic minority over-sampling technique |
| SPE | Specificity |
| SSbD | Safe and sustainable-by-design |
| TC | Total content |
| TEM | Transmission electron microscopy |
| thr | threshold (APD) |
| TF | Transport factor |
| TN | True negatives |
| TP | True positives |
| XGBoost | Extreme gradient boosting |
Supplementary information (SI) is available. See DOI: https://doi.org/10.1039/d5en00897b.
| This journal is © The Royal Society of Chemistry 2026 |