Chenyue
Wang
ab,
Lei
Zhang
*ab,
Chuanyue
Chen
a,
Kaile
Dou
a,
Jinya
Zhang
a,
Chongyang
Li
c,
Michael
Gozin
*def,
Weibo
Zhao
*g,
Chunlin
He
*a and
Siping
Pang
*a
aSchool of Materials Science & Engineering, Beijing Institute of Technology, Beijing 100081, China. E-mail: zhanglei@bit.edu.cn; chunlinhe@bit.edu.cn; pangsp@bit.edu.cn
bState Key Laboratory of Explosion Science and Safety Protection, Beijing Institute of Technology, Beijing 100081, China
cCollege of Mechanical Engineering and Automation, Chongqing Industry Polytechnic College, Chongqing 401120, China
dSchool of Chemistry, Faculty of Exact Science, Tel Aviv University, Tel Aviv, 69978, Israel. E-mail: cogozin@gmail.com
eCenter for Advanced Combustion Science, Tel Aviv University, Tel Aviv 69978, Israel
fCenter for Nanoscience and Nanotechnology, Tel Aviv University, Tel Aviv 69978, Israel
gAlibaba Group, Beijing 100102, China. E-mail: wbzhao@pku.edu.cn
First published on 28th May 2025
Nitrogen and oxygen are the two most abundant elements in the atmosphere, yet stable compounds composed solely of these elements are relatively scarce. Conceiving novel stable nitrogen–oxygen compounds remains a formidable challenge for current experimental and theoretical research. In this study, we developed a sequential construction strategy to design 168 nitrogen–oxygen compounds with distinct structural innovation, followed by high-throughput quantum mechanical calculations with the highest possible accuracy. From the resulting 7820 structural and property parameters, we created a customized machine learning model that outperforms universal models in accuracy with 13.8% greater robustness across various data splits, achieving stable and high performance on small datasets. Data-driven analysis revealed the energy and electron-related characteristics as key factors in regulating thermodynamic stability, while physics-driven insights uncovered that electron delocalization and hyperstatic constraints fine-tune mechanical firmness. Among the designed nitrogen–oxygen compounds, 106 are expected to be more stable than the known compound N2O4, out of which 61 are expected to be even more stable than N2O5. Furthermore, their energy densities surpass those of all currently used nitrogen–oxygen oxidizers by 8.3–16.8%, highlighting our newly proposed compounds potential for use in rocket bipropellant systems. Our developed machine learning platform features a user-friendly graphical interface for easy assessment and may be of interest to researchers in other fields, including chemical industry and energy sectors.
In the context of energetic oxidizers, compounds such as N2O and N2O4 are widely used as oxidizers in rocket bipropellant systems, as they not only withstand relatively high temperatures before decomposition, but also release substantial energy when extracting electrons from fuels. On the one hand, dense oxygen enhances the electron-extracting capability, increasing energy release during exothermic reactions and generating greater external work, which can result in longer flight ranges at a given mass. For example, in principle, O4 doubles the specific impulse (Isp) in comparison with liquid oxygen (LOX, O2), reaching 500–700 seconds.24 On the other hand, nitrogen addition capitalizes on the significant energy difference between single/double N–N bonds in the reactants and the triple NN bonds in the products, further enhancing energy output.25 Recently, the thermostability of dinitramide (DN, –N(NO2)2) has been significantly improved, with a decomposition temperature (Td) of 243 °C.26 The successful synthesis of nitrogen-rich oxidizer trinitramide (TNA, N(NO2)3) can theoretically allow achieving the highest density impulse ever, 33% higher than that of the LOX.27 Therefore, conceiving novel stable nitrogen–oxygen compounds is a highly promising and innovative research direction oriented towards the introduction of conceptually new oxidizers for high-performance aerospace applications.
However, conceiving such novel stable nitrogen–oxygen compounds poses a formidable challenge for the current state-of-the-art experimental and theoretical research. Experimentally, the intrinsic nature of nitrogen and oxygen favors their separation, while the retro-synthetic analysis of nitrogen–oxygen compounds is extremely elusive. In comparison to the separated forms of dinitrogen (N2) and dioxygen (O2), nitrogen–oxygen compounds may exhibit significantly higher energy levels, due to induced strain, with N–N single bond energy being 50% higher, and O–O bond energy 41% higher, posing the thermodynamic difficulties. Achieving synthesis of such nitrogen–oxygen compounds may require special reaction conditions, such as high compression and laser/microwave irradiation, to provide the activation energy necessary to overcome barriers of approximately 50–70 kJ·mol−1,28 indicating kinetic challenges.24 It is possible that straightforward and systematic synthetic methodologies may not lead to the convenient preparation of the proposed nitrogen–oxygen compounds and “out-of-the-box” thinking is required. Theoretically, current computational algorithms for virtual compound generation exhibit evident inheritance characteristics, but the scarcity of available nitrogen–oxygen building blocks severely limits the diversity of combinations. Among the few virtually generated structures, unstable compounds vastly outnumber stable ones, further complicating the challenge to discover novel stable nitrogen–oxygen compounds.
Recent developments in high-throughput quantum mechanical calculations and machine learning methods, which have greatly enhanced the development of pharmaceuticals, catalysts, batteries, and other domains,29–34 show clear advantages in addressing our problem. These methods' application in structural design, performance prediction, and prospective candidate screening has accelerated the synthesis of new advanced energetic materials.35–42 However, universal machine learning models present limitations, when applied to these specialized materials, exhibiting fluctuating accuracy across different data splits.43–45 This highlights the need for advanced machine learning models specifically tailored to predict the stability of energetic materials.
In this work, we developed a sequential construction strategy to design 168 nitrogen–oxygen compounds, free from the constraint of structural analogy, which significantly enhances structural innovation. Subsequently, we conducted high-throughput quantum mechanical calculations with the highest possible accuracy. Utilizing the resulting 7820 structural and property parameters, we created a customized machine-learning model that demonstrated superior accuracy and robustness in predicting the stability of nitrogen–oxygen compounds. Our findings reveal that energy and electron-related characteristics are key in regulating thermodynamic stability, while electron delocalization and hyperstatic constraints fine-tune mechanical firmness. Among the designed nitrogen–oxygen compounds, 106 are expected to be more stable than N2O4, and 61 could be even more stable than N2O5. Furthermore, the energy densities surpass those of all currently used oxidizers by 8.3–16.8%, highlighting their potential for use in rocket propulsion.
Using the proposed sequential construction strategy, 168 nitrogen–oxygen compound structures were constructed (Fig. 1). With increased complexity, the three-, four-, five-, and six-membered cyclic fragments were progressively incorporated as building blocks. Each type of cyclic basic building blocks is connected either directly or by a bridge to other functional moieties, sequentially forming singular, fused, or linked skeletons, which exhibit more advanced arrangements. It is noted that acyclic compounds are constructed by direct combinations of functional groups with non-cyclic building blocks. The construction strictly adheres to chemical bonding principles, with three or five bonds for nitrogen and two for oxygen.
We note that although virtual molecule generation is particularly popular in literature, such computational algorithm-dependent approaches exhibit obvious inheritance characteristics. The innovative molecular structures usually account for the minority among the generated molecules, while the unstable molecule structures may take a vast majority. In contrast, the sequential construction strategy we proposed is free from the constraints of structural analogy and undoubtedly demonstrates significant structural innovation. One of our guiding principles is to assemble molecules from commonly stable cyclic building blocks with as few atoms as possible, connected either by thermodynamically favorable bridged linkers or through atomic fusion. Furthermore, the manual construction of the molecules is based on a comprehensive assessment of potentially stable structures, as shown by the quantum mechanical calculations in the later section. This design philosophy aimed to prioritize chemically sound structures with higher symmetry and potential stability, while minimizing the number of compounds required for high-accuracy quantum mechanical calculations. Moving forward, targeted generative models capable of exploring the full design space will significantly advance molecular design.
To evaluate the factors affecting the stability of the compounds, a high-throughput calculation flow was built to obtain energetic, electronic, bonding, and structural properties. The energy-related property was characterized by their energy gap, namely, the difference between the highest occupied molecular orbital (HOMO) and the lowest unoccupied molecular orbital (LUMO). The electronic property was measured by the dipole moment and the ratio of the positive average value to the negative average value (Vs+/Vs−). The bonding properties include Laplacian bond order for each chemical bond and the proportion of the O-connected bond. Laplacian bond order was calculated by integrating the Laplace function of the electron density in the bonding region using the Multiwfn package.47 The structure-related property was evaluated via bond lengths and bond angles between the constituent atoms for each of the studied compounds, bond types and numbers, as well as the corresponding compound size, namely, the distances from the molecule's centroid to the outermost oxygen atoms.
We developed a customized machine learning model to address the known accuracy problem associated with predicting the stability of energetic materials, particularly with small datasets. The main concept of the customized model is to evaluate the most popular universal models in dealing with classifying issues and weighted mix the best-performing ones, improving adaptability to nitrogen–oxygen compound datasets. The considered universal models include factorization machines (FM), logistic regression (LR), naive bayes (NB), support vector machine (SVM), random forest (RF), and gradient boosting decision trees (GBDT). Two main factors guided the model's development: prediction accuracy and robustness across different data splitting.
From the accuracy perspective, grid search was employed to optimize hyperparameters to improve the metrics, including Accuracy, Recall, and F1 score, as defined by
![]() | (1) |
![]() | (2) |
and
![]() | (3) |
From the robustness perspective, the accuracy of the customized model was evaluated against different data splitting and compared with the aforementioned six universal models. The training and testing data were divided 100 times in different ways, and the accuracy metrics were recorded at each cycle of model training, wherein the 5-fold cross-validation method was used to mitigate the overfitting or underfitting risk. Among the customized and six universal models, the one exhibiting the minimum difference between the corresponding highest and lowest metrics demonstrates the best robustness against data splitting, implying the highest adaptation to the small dataset of nitrogen–oxygen compounds. The hyper-parameters of the customized model are detailed in Table S1 in ESI.†
Preliminary exploration was conducted to choose different combinations of feature descriptors. The results suggested that the four feature descriptors of energy gap, dipole moment, Laplacian bond order, O-connected bond proportion present the best availability and relevance to the target property, generating the highest accuracy metrics in predicting nitrogen–oxygen compound's stability.
The target property of a designed structure is its stability, assessed from both thermodynamic and mechanical perspectives. From a thermodynamic perspective, stability is assessed by structural optimization, which is currently one of the most prevalent and reliable approaches.49–53 Compounds in which chemical bonds were fragmented upon structural optimization were classified as unstable, while those that remained intact were considered potentially stable. Compounds predicted to be thermodynamically stable could be considered feasible for synthesis. From a mechanical perspective, stability has been demonstrated to closely related to the firmness of the structure's individual bond,54,55 which can be quantitatively characterized by the Laplacian bond order (LBO).56,57 A compound is considered to possess mechanical stability if its minimum Laplacian bond order exceeds the threshold of 0.07. Mechanically stable compounds are considered as likely to be synthetically accessible, enabling potential applications.58
With both thermodynamic and mechanical stability meeting application requirements, the higher stored energy density enables greater external work generation and may potentially serve as energetic oxidizers in propellants for super-long-flight space exploration. The energy density was characterized by specific impulse and heat of formation. The specific impulse was calculated using EXPLO5 v6.05 package, defined as 59
Isp ≈ [kTc(1−PaM/Pc)/M]0.5, | (4) |
Notably, 12 of our designed compounds have been previously known or experimentally synthesized,2–13 namely, O2, NO, O3, NO2, N2O, N2O2, N2O3, N2O4, N2O3, N4O2, N4O, and N2O5. Their structures have been characterized by X-ray diffraction and 55 bonding values were reported, as illustrated by purple-filled frames in Fig. 1. When compared to the experimental results, our calculations yield a bond length correlation coefficient (R2) of 0.98 and a root mean square error (RMSE) of 0.04 Å. The bond angle R2 is 0.99 with an RMSE of 1.4°, as shown in Fig. 2(a). Additionally, 11 of our designed compounds have been previously calculated in other studies, with 56 corresponding bonding parameters reported in the literature.2–11 However, these prior studies employed somehow lower accuracy levels. Our comparative analysis shows an R2 value of 0.99 and an RMSE of 0.02 Å for bond length statistics, as well as an R2 value of 0.99 and an RMSE of 0.75° for bond angle statistics, as presented in Fig. 2(b). The minimal discrepancies between our calculation outcomes and the previously reported experimental/computational values strongly support our computational method's high accuracy and reliability.
![]() | ||
Fig. 2 Accuracy verification of quantum mechanical calculations by comparing with experimental/calculation values. Reproduction of 55 (a) bond length and (b) bond angle values for 12 synthesized nitrogen–oxygen compounds, as well as 56 values for 11 compounds by prior reported calculations. Concrete bond length and angle data are provided in the ESI.† |
Based on these findings, we created a customized model by leveraging the advantage of FM, SVM, and LR models in handling small datasets and outliers. To determine the optimal mixing weight, we started with an FM:
SVM
:
LR ratio of 5
:
0
:
5 and gradually adjusted the proportions. Inclusion of the SVM model significantly improved the metrics, and after extensive testing, the optimal ratio was found to be FM
:
SVM
:
LR = 4
:
3
:
3. This combination outperformed any individual universal model, achieving the best metrics: Accuracy = 94.9, F1 = 96.3, and Recall = 98.7, with corresponding average scores of 93.7, 95.6, and 98.0, respectively, as shown in Fig. 3(a and b).
The robustness of our customized model was evaluated by its Accuracy scores across different data splits, compared to those of six universal models, as shown in Fig. 3(c). Each model underwent 500 training cycles, with scores averaged over five-fold cross-validation. The index ΔAS, representing the accuracy fluctuation between the highest and lowest scores, was used to characterize robustness. As depicted in Fig. 3(c), the 100 data points of our customized model fluctuated closely around the average value, with the lowest ΔAS value of 2.81. This fluctuation is 34.5–280.0% smaller than those of the universal models NB, GBDT, and RF, and also surpassed its individual components, out-performing FM by 13.8%, SVM by 11.0%, and LR by 3.91%. Analysis of the error bar values, shown in black, reveals that the customized model has the smallest error bar, measuring only 0.0624, which reflects minimal fluctuation in performance across different data splits. More details are provided in Fig. S1 in the ESI.†
From above, our customized model surpasses universal models in both accuracy and robustness, achieving stable and high performance on small datasets of nitrogen–oxygen compounds. The model features a user-friendly graphical interface, making it easily accessible to researchers, as shown in Fig. S3 (ESI†).
Feature importance analysis was performed after machine learning. Through feature importance analysis, we identified the most influential factors in regulating nitrogen–oxygen compound stability: energy gap, dipole moment, Laplacian bond order, and proportion of O-connected bond, in decreasing order of importance. The energy gap and dipole moment emerged as the dominant factors, with 74% and 16% importance percentages, respectively. Fig. 4(b) illustrates the distributions of these two factors. For both thermodynamically stable and unstable categories, the scatter plots show distinct aggregations around different average values, which correspond to the peak positions of their respective normal distributions. The significant differences in the average values between these two categories allow us to determine the optimal energy gap and dipole moment ranges for stable nitrogen–oxygen compounds. Based on the first and third quartiles of the stable category, the optimal energy gap range is > 13.46 eV, with higher energy gaps implying more difficult electron transitions and greater stability. The pivotal role of the energy gap in determining material stability has been extensively supported in previous studies.62 Similarly, the optimal dipole moment is < 1.54 Debye, as lower dipole moments indicate more uniform electron distribution and greater stability in nitrogen–oxygen compounds.
As the bonding features depend on local environmental factors, they act to fine-tune the thermodynamic stability of nitrogen–oxygen compounds. As illustrated in Fig. S2 (ESI†), these bonding features present small fluctuations and do not exhibit clear threshold values like energy gap and dipole moment.
Applying the above-revealed mechanism to all 120 designed stable compounds established a physical model outlining firmness, as depicted in Fig. 5(b). Oxygen LP delocalization plays a central role in regulating firmness, with additional factors including the number of oxygen atoms, their proportion, bond order, and the overall dipole moment. These factors govern the minimum Laplacian bond order following a logarithm growth trend. A physical model for nitrogen–oxygen compound size was also established, as shown in Fig. 5(c). Compound size is defined as the radius from the molecule's centroid to the outermost oxygen atoms. LP delocalization remains the dominant regulatory factor, alongside the outermost oxygen atoms' number, proportion, and bond order. These factors regulate compound size according to a quadratic growth trend. The fitting parameters of these models are listed in Tables S2 and S3 in the ESI.† It's noteworthy that a larger compound size correlates with reduced firmness.
Notably, compounds a1–a12 exhibit exceptional firmness and smallest sizes, reasonably interpreting their successful synthesis and widespread application. Specifically, the experimentally validated compounds N2O (a5), and N2O5 (a12) exhibit bond firmness values of 1.49, and 0.23, respectively-all higher than the 0.07 of N2O4 (a8), which is widely used as an oxidizer in rocket propellants. Consistently, their experimentally measured decomposition temperatures are 600 °C, and 45 °C, respectively-all exceeding the 30 °C of N2O4 (a8). This consistency supports the alignment of our predicted mechanical stability with experimental trends. In particular, compound e15 exhibits a superior firmness of 1.40, which is comparable to the 1.49 firmness of the known stable compound N2O (a5). As shown in Table 1, compounds c14, d7, d18, and e14 also demonstrate notable firmness values of 0.54, 0.52, 0.51, and 0.45, respectively. The firmness of tetra-nitrogen dianion (c14) and pentazole anion (d18) has been predicted in our prior studies, and the way to stabilize them via acidic entrapment was thoroughly analyzed.64,65
Among the designed nitrogen–oxygen compounds, 106 exhibit greater firmness than N2O4 (a8) and 61 are firmer than N2O5 (a12), as shown in Fig. 5. These results suggest that the designed compounds, in particular those with small sizes, have the potential to withstand external heat or stress and maintain their structural integrity under varying conditions, indicating their suitability for practical applications.
Next, we evaluated the energy density of the stable compounds using Isp, which characterizes their capability of generating external work and launch distance of oxidizers when forming into propellant formulas. We considered the two most commonly used mixtures: formula (1), comprising 71% oxidizer by weight, 20% Al fuel, and 9% hydroxyl terminated polybutadiene (HTPB); and formula (2), comprising 73% oxidizer, 13% Al fuel, and 14% HTPB. Fig. 6 demonstrates that the specific impulses of e15, e14, and d7 generally surpass those of the current top-performing AP, RDX, HMX, and CL-20 oxidizers, in both formulations. The specific pulses of the three oxidizers range from 270.3 to 305.4 s, exceeding the best-performing CL-20 (249.6 s) by over 8.3%. Notably, e15 and e14 achieve specific impulses of 302.4 and 305.04 s, representing increases of 15.7% and 16.8%, respectively, compared to the most widely used oxidizer AP.
![]() | ||
Fig. 6 Energy density level of three stable nitrogen–oxygen compounds (e15, d7, and e14) compared to current top-performing oxidizers (AP, RDX, HMX, and CL-20). The energy density level is characterized by specific impulse when formulated into formula (1) (71 wt% oxidizer, 20 wt% Al, and 9 wt% HTPB) and formula (2) (73 wt% oxidizer, 13 wt% Al, and 14 wt% HTPB). |
Finally, by using three typical stable nitrogen–oxygen compounds-e15, e14, and d7-as examples, we demonstrate that our designed novel compounds exhibit exceptional firmness, allowing them to withstand ambient and extreme conditions. Moreover, their high specific impulses surpass those of all currently used oxidizers, predominantly forming into environmentally benign N2, CO2, and water products upon reaction with hydrocarbon fuels. This combination of high stability and high-energy density underscores their strong potential for application in rocket propellants, warranting the exploration of feasible synthetic routes despite the associated challenges. Notably, handling nitrogen–oxygen compounds could be risky, and researchers must undergo professional training and implement necessary protective measures to ensure safety.
Data-driven analysis revealed the energy and electron-related characteristics as key factors in regulating the thermodynamic stability of nitrogen–oxygen compounds, while physics-driven insights uncovered that electron delocalization and hyperstatic constraints fine-tune their mechanical firmness. In this context, more constraints enable the compound to maintain stability under external forces. Among the designed nitrogen–oxygen compounds, 106 are expected to be more stable than synthesized compound N2O4, and 61 are more stable than N2O5, allowing them to potentially withstand ambient and extreme storage and application conditions. Simultaneously, the novel compounds are predicted to possess high specific impulses up to 305.4 s, surpassing all currently used oxidizers. This high stability and high-energy density combination highlights their potential for rocket propulsion.
Footnote |
† Electronic supplementary information (ESI) available. See DOI: https://doi.org/10.1039/d5ta00267b |
This journal is © The Royal Society of Chemistry 2025 |