Moritz P.
Haag
and
Markus
Reiher
*
Laboratorium für Physikalische Chemie, ETH Zürich, Vladimir-Prelog-Weg 2, CH-8093 Zürich, Switzerland. E-mail: markus.reiher@phys.chem.ethz.ch; Fax: +41 44 633 15 94; Tel: +41 44 633 43 08
First published on 28th February 2014
Chemical reactivity of a set of reactants is determined by its potential (electronic) energy (hyper)surface. The high dimensionality of this surface renders it difficult to efficiently explore reactivity in a large reactive system. Exhaustive sampling techniques and search algorithms are not straightforward to employ as it is not clear which explored path will eventually produce the minimum energy path of a reaction passing through a transition structure. Here, the chemist's intuition would be of invaluable help, but it cannot be easily exploited because (1) no intuitive and direct tool for the scientist to manipulate molecular structures is currently available and because (2) quantum chemical calculations are inherently expensive in terms of computational effort. In this work, we elaborate on how the chemist can be reintroduced into the exploratory process within a virtual environment that provides immediate feedback and intuitive tools to manipulate a reactive system. We work out in detail how this immersion should take place. We provide an analysis of modern semi-empirical methods which already today are candidates for the interactive study of chemical reactivity. Implications of manual structure manipulations for their physical meaning and chemical relevance are carefully analysed in order to provide sound theoretical foundations for the interpretation of the interactive reactivity exploration.
Usually, the elucidation of reaction mechanisms starts with the set up of structures which are likely to be close to either the reactants, the products or the transition state structures. Subsequent optimisation of the electronic structure and of the atomic arrangement is computationally expensive, and it is by no means guaranteed that these optimisations produce the desired mechanism. A trial-and-error protocol has to be repeated until all potentially relevant reaction steps have been identified. In most cases the build-up of the starting structures is carried out in conventional molecule editors using a computer mouse and a three dimensional ball-and-stick representation of the structure.
Despite the remarkable progress in the development of fast algorithms for electronic structure calculations and the continuously increasing computer power, it is still a challenge to predict reaction mechanisms of molecular systems consisting of a few hundred atoms. The reason for this is the inherently high complexity induced in the potential energy surface by the large number of degrees of freedom of such systems. Already for an elementary reaction step, a transition-state search requires starting structures from which local optimisations can be initiated and such optimisations may easily fail. Then, only little additional information is provided on how to improve on such a failed search.
In addition, technical hurdles need to be overcome. Firstly, the build-up of three dimensional chemical structures with two dimensional input devices is a slow and cumbersome procedure. Secondly, manipulating the system and receiving the resulting response from the calculations are sequential and delayed processes that often require different computer programs. This separation is not only due to the electronic structure optimisation being computationally expensive, but also due to the fact that there exists currently no general implementation to have the manipulation of the system and the instantaneous presentation of the quantum chemical reactivity response in the same program.
Both issues can efficiently be resolved by having all the processes united in a single virtual environment. Such a virtual environment allows for a deeper immersion of the scientist into the problem. In addition to the visual presentation the complex output of quantum chemical calculations can be perceived more intuitively by addressing further human senses such as the haptic sense. The main advantage then is the seamless combination of manipulation and output presentation. It allows the chemist to exploit prior knowledge about the system and his/her ‘chemical intuition’ for the exploration of reaction mechanisms.
Studying complex phenomena in a more intuitive way is not entirely new to chemists and biologists. For molecular dynamics,1,2 protein docking3 and structural biology4 interactive approaches and even haptic interactions have been applied. The idea of an interactive build-up of molecular assemblies with continuous energy minimisation in the background for a quick assembly of reasonable structures has recently been implemented for parametrised analytical potentials and classical force fields in the molecule editors SAMSON5 and AVOGADRO.6
In 2007, we initiated a research program to explore chemical reaction mechanisms interactively with quantum chemical approaches. We started with the haptic exploration of interpolated Born–Oppenheimer potential energy surfaces within the framework of what we call Haptic Quantum Chemistry.7 During such a haptic exploration scientists are able to feel the quantum mechanical forces exerted on the atoms, when they are moved with an input device capable of providing force-feedback. We further extended this approach to larger systems and presented a protocol to refine potential energy surfaces guided by the haptic exploration.8 To be able to treat structural relaxation in complex systems we also investigated the possibility to calculate the necessary forces directly.9 Recently, we have investigated real-time reactivity exploration10 in the framework of the SAMSON environment, for which a fast semi-empirical method for structure optimisations of hydrocarbons is already available.11,12
In this work, we explore the physical meaning of externally controlled configurational changes in molecular assemblies. We work out the requirements for treating molecular assemblies in virtual environments and how much of the underlying physical theory can be recovered in an interactive exploration of chemical reactivity. The goal is to provide an intuitive and at the same time chemically meaningful experience of a reactive molecular system.
The interactive study of reaction mechanisms in a virtual environment as described in this work is envisaged as being part of an even larger framework comprising additional automatic or semi-automatic tools supporting the investigation of complete reaction networks. We, therefore, start in section 2 with a description of this bigger picture to clarify the context of the methodology described in this work. The main ingredient for an interactive study of chemical reactivity is a real-time availability of first-principles forces, which will be detailed in section 3. We continue then by dissecting the main components of a reactivity exploration in a virtual environment in section 5. In section 6 the manipulation and the presentation of molecular systems are discussed in detail. This is then followed by an analysis of the physical laws governing a molecular system in a virtual environment in section 7. We then conclude the work and provide an outlook at the end of this paper.
The components of this virtual reactivity laboratory are as follows: The terminal in green colour is a symbol for the visual interface at which the relevant data can be displayed to the operator. The display may be a simple screen that could be placed right next to a bench in a wet lab or it can be as advanced as a virtual reality cave. The ‘relevant data’ will be the reaction network consisting of nodes and links, which is undergoing a rolling improvement that does not depend on whether the operator is in front of the terminal or not. Instead, service programs take care of as many tedious tasks and decisions as possible. We call these service programs ‘jeannies’. They are supposed to run constantly in the background and independently take as many decisions as possible. The decisions taken are displayed to the user in order to give him or her the opportunity to steer these tasks when deemed necessary.
Information on the nodes and links of the reaction network are graphically displayed upon clicking on these entities. They will then open in a new frame or window on the terminal. The most important frame is the molecule editor which allows the investigation and manipulation of molecular structures to be taken from or eventually integrated into the reaction network. For the manipulation of molecular configurations, new jeannies are set up (denoted in grey and red in Fig. 1), which may fulfil highly system-dependent tasks — from supplementing heavy-atom only PDB structures with hydrogen atoms to the automated construction of reactive complexes, that might lead to new nodes in the reaction network, to inverse and rational design tools13,14 for the construction of totally new reactants and structures.
Many other advanced jeannies can be envisaged. For instance, a kinetic modeller that is able to solve large systems of coupled kinetic differential equations — analogously to algorithms developed in systems biology and bio-informatics — is decisive for an automatic evaluation of the reaction network. Crucial regions in the network can be identified in terms of their kinetic relevance and can then be refined with accurate quantum chemical methods. Also these refinement calculations can be automatically launched on a high-performance compute-cluster (though the operator may actively suppress them if he or she considers them not useful for the exploration).
To obtain the quantum mechanical response, the electronic structure has to be calculated first. The forces can then be obtained analytically from the results of this optimisation procedure. The response of the atomic structure is calculated by minimising the forces on the unmanipulated atoms. To provide the result in real-time there is an upper time limit for all steps. However, in most cases the electronic structure optimisation is the limiting factor. Depending on external factors — such as the details of the optional structure optimisation procedure, the user-defined speed of structure manipulation or the required adiabaticity of the manipulation — the limit can vary considerably. However, an electronic structure (and force) calculation requiring on the order of one hundred milliseconds will be sufficiently fast.
In order to make the electronic structure optimisation sufficiently fast, usually additional approximations have to be introduced. A general rule is: The faster the optimisation is, the more severe are the approximations. Thus, in addition to the time limit it is necessary to specify a lower limit for the quality. To do so we recall the goals of an instantaneous exploration of chemical reactivity in a virtual environment, namely to explore the potential energy surface and identify the structures corresponding to the reactants, products, and transition states of elementary reactions, keeping in mind that they are only a starting point for further refinement using the standard methodology developed in quantum chemistry. Hence, the potential energy surface, i.e., the electronic structure method employed to obtain it, should reproduce all main features of the exact surface. Thus, local minima and transition states should be detectable, while their absolute position and depths or heights are not required with high accuracy.8 Clearly, not in every situation are all of the features needed. A first exploration to familiarise with the molecular system can be of low quality and only later, parts of the surface are examined more accurately. More information on this iterative character of a haptic exploration of potential energy surfaces to continuously refine them can be found in ref. 8.
No matter what technique is chosen, interpolation allows one to achieve a very high accuracy of the potential energy surface and the forces because the raw data can be obtained with high accuracy ab initio methods. However, the application of interpolation is also limited. If the dimensionality of the hyper-surface increases, too many points have to be calculated and the interpolation becomes slow. In addition, if relaxation is allowed, the configuration space spanned by the manipulation is no longer unique but depends on the history of the exploration. Usually, addition or abstraction reactions involving only atoms or diatomic molecules in systems where relaxation does not play a major role are good candidates for the haptic exploration based on interpolation of pre-calculated surfaces.8
The simplest method of this family is the atom superposition and electron delocalization molecular orbital (ASED-MO) method,30 which is based on the extended Hückel method31 to treat the valence electrons of atoms. It applies atomic-repulsion corrections32 in order to obtain correct structures, which is the main deficiency of the extended Hückel method. This method has already been shown to be feasible for the interactive study of structures containing only carbon and hydrogen.11,12 It has the advantage of being non-iterative as the Fock operator does not depend on the molecular orbital coefficients owing to the severe approximation involved. Moreover, only a few simple integrals have to be evaluated.
The next more complex group of semi-empirical methods invokes the neglect of differential diatomic overlap (NDDO)33,34 and the modified neglect of diatomic overlap (MNDO)35,36 approximations. The PM637 and OMx38 family of methods are recent members of this group. They are based on the Hartree–Fock theory but apply approximations such as neglecting electron–electron interaction integrals and replacing many of the remaining ones by empirically adjusted parameters.
Also density functional tight-binding (DFTB) methods like the most recent DFTB339–41 approach are suitable semi-empirical methods for a fast and quite reliable force calculation. Although DFTB methods involve fit parameters they are not truly semi-empirical in a strict sense since all these parameters are determined from full Kohn–Sham density functional theory (DFT) calculations. As the name suggests DFTB methods can be understood as a DFT analogue to NDDO/MNDO-based methods.42
All these semi-empirical methods are electronic structure methods (in contrast to reactive force field approaches). From the electronic structure (eventually, from the orbitals) quantum chemical properties can be calculated. Furthermore, all apply a finite basis set expansion of the orbitals. Hence, the optimised molecular orbital coefficients provide a reasonable estimate for more elaborate electronic structure methods to refine the results of the exploration. The connection to more accurate methods is therefore straightforward.
We have explored this methodology taking DFT with a minimal basis set and ECPs and applying the usual integral screening and density-fitting techniques to accelerate the calculations. We have applied this minimal set-up first to calculate data points for surface interpolation8 and then to show that reasonable energies and forces can be calculated for small systems even in real time.9 Interestingly, such minimal approaches are experiencing a renaissance in quantum chemistry. The corrected small basis set Hartree–Fock method proposed by Sure and Grimme43 for the fast calculation of mass spectroscopy data takes Hartree–Fock theory as a starting point with a minimal basis set called MINIX. To minimise the errors electronic core potentials and three different atom pair-wise correction terms with pre-fitted parameters have been applied.
The recent development in the field of quantum chemistry to utilise the computer power offered by graphical processing units or other specially tailored hardware, will make first-principles force calculations possible with increasing accuracy. Elaborated screening techniques to avoid unnecessary integral evaluations and sophisticated algorithms to evaluate the remaining integrals will also contribute to this development.9
Our set of test molecules of different sizes and different elemental composition is shown in Fig. 2. The two alanine oligopeptides represent typical biological molecules. The system denoted as FeGP (iron-guanylylpyridinol) is a model for the active site of [Fe] hydrogenase, where the methylenetetrahydromethanopterin (HMPT) is a cofactor which we chose to model by structure e in Fig. 2. Both structures have been taken from ref. 52. The so-called compound A (cf. structure A in Fig. 3, of ref. 53) is a large organic molecule. The Schrock complex with an approaching N2 molecule54–60 is an example of a large transition metal complex employed in homogeneous nitrogen-fixation catalysis.
Fig. 2 Molecular systems used for the timings. (a) Alanine dimer, (b) Alanine trimer, (c) Schrock trisamidoamine Mo complex with N2, (d) FeGP, (e) model for HMPT. Structures a, b, e have been obtained from a DFTBA49,50 optimisation in Gaussian 0951 Rev. d.01. Structure d has been taken from ref. 52, structure c from ref. 8. |
All calculations started from the default molecular orbital (MO) coefficients guess provided by the respective program. The timings obtained are compiled in Table 1. Those molecules containing only C, N, O and H atoms required a similar number of iterations to reach self-consistency. However, a comparison of the required iterations for Ala3 and the FeGP model system shows that the iteration number has only a small influence on the execution time. Although the times measured here contain the overhead of reading and writing to disk, printing information to output files and setting up the SCF iterations, they are all within the time range of what we would require for real-time calculations. Only the very large Schrock complex requires several seconds to optimise an electronic structure, which is mainly due to its size.
AM1 | RM1 | PM3 | PM6 | PM6-D3 | PM7 | DFTB3 | |
---|---|---|---|---|---|---|---|
Ala2 | 248 ± 5 | 249 ± 9 | 249 ± 7 | 248 ± 9 | 251 ± 6 | 249 ± 10 | 344 ± 37 |
19 atoms | (14) | (14) | (14) | (14) | (14) | (15) | (11) |
C, H, N, O | |||||||
Ala3 | 249 ± 5 | 249 ± 6 | 250 ± 6 | 252 ± 5 | 252 ± 6 | 252 ± 5 | 342 ± 32 |
26 atoms | (14) | (14) | (14) | (14) | (14) | (14) | (13) |
C, H, N, O | |||||||
FeGP | – | – | – | 258 ± 6 | 259 ± 6 | 259 ± 5 | – |
29 atoms | (–) | (–) | (–) | (236) | (236) | (61) | (–) |
Fe, S, C, O, N, H | |||||||
HMPT | 255 ± 6 | 255 ± 4 | 255 ± 7 | 256 ± 4 | 257 ± 4 | 257 ± 5 | 354 ± 40 |
43 atoms | (14) | (14) | (17) | (20) | (20) | (15) | (12) |
C, H, N, O | |||||||
Compound A | 275 ± 21 | 276 ± 29 | 273 ± 6 | 289 ± 54 | 295 ± 63 | 279 ± 34 | 562 ± 32 |
84 atoms | (13) | (13) | (17) | (25) | (26) | (14) | (10) |
C, H, N, O | |||||||
Schrock-N2 | – | – | – | 9197 ± 498 | 9414 ± 574 | 5719 ± 103 | – |
280 atoms | (–) | (–) | (–) | (142) | (142) | (95) | (–) |
C, H, N, Mo |
The timings given here are only upper limits. In an implementation tailored to the needs of real-time quantum chemistry execution time can be reduced by restructuring the algorithms considering the following points.
• Separate all system specific initialisation and perform it at the beginning.
• Take MO coefficients from previous optimisations as a starting point for the next.
• Eliminate every output that is written to disc or to the console.
• Avoid any reading from or writing to disc.
• Perform as many steps as possible in-core.
• Choose convergence thresholds according to the resolution of the output device(s).
Since all methods investigated compare similarly in terms of providing real-time energies, they may be selected according to their accuracy, their ability of being systematically improvable, and to the possibility of including them in a subsystem approach. Because of its close connection to DFT the DFTB approach is well suited for subsystem approaches using DFT.
To investigate the semi-empirical and density-functional tight-binding methods for their ability to yield such qualitatively correct surfaces, we chose a rotor-like molecule with three anthracene blades that undergoes a rearrangement with a challenging plateau-like barrier and that has recently been studied in our group with more accurate first-principles methods.53 The energy profile from ref. 53 serves as a reference. The reference potential energy surface features two local minima (A and B) which are connected by two transition states (TS1 and TS2) with a shallow minimum in between (MinAB). For the corresponding structures see Fig. 3.
Fig. 3 Reference (starting) structures taken from ref. 53 for the reaction profiles in Fig. 4 and Table 2. |
Since every semi-empirical Hamiltonian produces a different (approximate) potential energy surface the energies have been calculated after a structure optimisation employing the respective electronic structure method (cf.Table 2). The NDDO-based calculations were performed with MOPAC2012.61,64 The DFTB3 structures were optimised in Gaussian09 d.0151 with the DFTBA49 method and the DFTB339–41 values were obtained from a single-point energy calculation on these structures with the DFTB+62 (v.1.2.2) program. The results are collected in Table 2 and selected energy profiles are shown in Fig. 4.
A | TS1 | MinAB | TS2 | B | ||
---|---|---|---|---|---|---|
BP86-D3/def2-TZVP53 | E | 0.0 | 20.1 | 19.6 | 20.5 | −3.7 |
AM1 | E | 0.0 | 17.3 | 0.4 | 22.0 | 0.4 |
RMSD | 0.118 | 0.354 | 1.334 | 0.265 | 0.506 | |
RM1 | E | 0.0 | 23.4 | 15.1 | 32.0 | 0.6 |
RMSD | 0.099 | 0.405 | 0.276 | 0.268 | 0.359 | |
PM3 | E | 0.0 | 16.2 | 13.7 | 18.3 | 0.4 |
RMSD | 0.135 | 0.314 | 0.392 | 0.194 | 0.612 | |
PM6 | E | 0.0 | 13.2 | 12.6 | 13.8 | 0.5 |
RMSD | 0.123 | 0.196 | 0.128 | 0.123 | 0.488 | |
PM6-D3 | E | 0.0 | 13.0 | 12.3 | 13.3 | −1.4 |
RMSD | 0.115 | 0.196 | 0.123 | 0.125 | 0.361 | |
PM7 | E | 0.0 | 18.3 | 12.1 | 30.9 | −8.3 |
RMSD | 0.085 | 0.405 | 0.237 | 0.273 | 0.090 | |
DFTBA | E | 0.0 | 21.8 | 0.0 | 18.6 | 0.3 |
RMSD | 0.122 | 0.731 | 1.199 | 0.194 | 0.353 | |
DFTB3 | E | 0.0 | 19.2 | 0.0 | 17.9 | 0.2 |
Fig. 4 Energy profile of the reaction A → TS1 → MinAB → TS2 → B relative to the energy of A of the corresponding method. Only a selection of the methods from Table 2 is shown. The reference BP86, DFT-D3, def2-TZVP profile has been taken from ref. 53. |
Except for AM1 all semi-empirical methods show the local minima and saddle points of the reference surface (cf.Table 2). As expected, the heights or depths vary from method to method but the overall structure of the energy profile is conserved throughout. The transition state structures deviate more from the reference structures than the local minima, as is expected for semi-empirical methods.48 The correct sign of the energy difference between product and reactant is only reproduced by PM6-D337,47 and PM7.48 This is caused by a strong dispersion interaction effect, which is explicitly treated in a post-SCF manner in PM6-D3 and is directly incorporated in the PM7 Hamiltonian.37,47,48 All NDDO-based methods have the first transition state energetically below the second one, as in the reference calculation.
PM6-D3 shows the best performance of all methods analysed. Although the heights of the TS1, MinAB and TS2 are significantly lower compared to the DFT reference the relative energies of the transition states and the middle minimum as well as the relative energy between product and reactant are reproduced. The best method for obtaining the correct structures of reactant and product (A and B) is PM7 with RMSD values below 0.1Å.
Interestingly, it was not possible to locate the stable minimum MinAB with the DFTB methods, but they reproduced the two transition states and the minima A and B quite well.
The overall performance of all methods studied here is astonishingly good, i.e. they perfectly meet the quality requirements stated in the beginning. The fact that the semi-empirical approaches without dispersion correction overestimate the stability of the MinAB structure is no drawback as the detection of a potential intermediate is feasible. Applied in an interactive chemical reactivity study these methods would show the desired features of the potential energy surface and would guide the user to the correct structures. The relatively small RMSD values with respect to the DFT reference (cf.Table 2) indicate that, if the user is able to correctly locate the minima, they will be perfectly suited as starting structures for a refinement employing a more accurate method.
In the case of intermolecular reactions, the starting point is a stable configuration with no intermolecular interactions, e.g., at the dissociation limit. This means that many orientational changes are possible without a change in potential energy and therefore without forces. The reactive fragments are oriented such that a suitable initial position is obtained to start the reactive exploration. From there the scientist may record all configurations visited during the exploration. The recording can be stopped and restarted several times which results in a multitude of different paths. Usually both, the initial and the end configuration, are local minima that are stored separately from the paths. During the exploration the chemist is guided by the forces and by the structural changes occurring as a result of the structure manipulations. The exploration is therefore driven by a combination of the operator's prior knowledge, the information provided through the forces, and the structural changes upon structural manipulations.
Such an exploration can be performed to study the shape of the potential energy surface in a localised region or to directly find possible reaction paths between already known stable configurations (local minima).
The structures, energies and gradients can be analysed and the distinct points with zero gradients can be extracted. They can be considered as nodes of a reaction network that emerges successively during an exploratory study. The paths between the structures with zero gradients (stationary points) connect the nodes as links. If they correspond to a minimum energy path, the associated transition state structure node can be marked as such. They are usually the main objective of the exploration. If, however, the stationary points are far apart in configuration space, no unique transition structure will be found for any connecting path, which can then be taken as a hint to search for additional local minimum structures in between (new nodes). This then produces a more fine-grained reaction network for which new connecting paths are defined. They are the starting point for further transition state searches—either by haptic exploration or by automatised protocols launched in the background.
The mechanical manipulation of molecular matter is called mechanochemistry. Theoretical work on mechanochemistry has been performed on ring opening reactions.76,77 In this example it has been shown how additional mechanical forces alter the potential energy profiles of reactions that involve the rupture of chemical bonds. A review on the theoretical concepts and the computational tools has recently been provided by Ribas-Arino and Marx.78
Suitable visual output devices for an efficient immersion are devices that are able to give a comprehensive and intuitive picture of the virtual world. The larger the visual devices are, the more they exclude the perception of reality around the user. Panoramic screens or head-mounted displays85–87 are best suited for this purpose. Rendering the atoms, bonds and all volumetric data with ray tracing is very helpful in this regard. Also current three dimensional presentation techniques like polarised or active shutter systems are beneficial, since they allow for a much easier perception of the spatial configuration of the molecular system.
To intensify the immersion by addressing the tactile sense requires the rendering of the occurring forces by haptic devices. Haptic pointer devices, for instance, can render forces in three to six dimensions. They can be employed to probe the forces acting on different sites of the molecular assembly. The scientist picks an atom and feels the resulting force response of the system. Also two or more haptic pointer devices can be utilised simultaneously to understand the interplay of forces. Such devices can be purchased at modest prices, which makes them attractive for virtual environments employed in chemical laboratories.
A more elaborate way to render forces and to address the human tactile sense are external skeletons. Since they render the force directly for each finger, they provide an even more natural experience for the tactile sense. However, these devices are still very expensive and currently not useful for a large scale deployment in chemical laboratories.
Additional data like force vectors on atoms not probed by a haptic device, the total energy, partial charges, isosurfaces of electron densities or molecular orbitals need to be seamlessly integrated as well. The most intuitive form to do so is to visualise them in three dimensions together with the molecular structure. Since these quantities as well as the structures change very fast during the manual exploration efficient implementations using graphical processing units and multi-core central processing units are needed.88,89
Global scalar quantities like the electronic energy could be rendered by addressing even more senses. For example, the auditory sense could be utilised for this purpose. High-frequency sounds could signal a configuration with very high energy and thus regions in configuration space that are not accessible in a given thermal setting. However, a change of background colour achieves the same and is certainly less annoying. Accordingly, the tactile sense remains the most important one besides human vision.
Input devices facilitate the human–computer interaction and therefore play an important role. In an immersive virtual environment they allow for a manipulation of the virtual objects. Therefore, the real-world position of the device is transferred into virtual-world coordinates of a virtual representation of the device. With that the virtual objects, in our case, atoms or atom groups, can be moved. A general requirement for devices to be suitable for immersion is that a physical movement of the operator is transferred directly into a movement in the virtual world. Transformations applied in between, e.g. to avoid range limitations of the physical device or to stabilise motions, need to be transparent for the operator, since the positions of virtual objects are controlled with them.
Since the forces resulting from such manipulations can be very sensitive with respect to position, a precise position control is most important. A lack of precision renders the control over the manipulation nontransparent which means that physical movements of the device are not consistent with the virtual movement. This also implies that the devices should not loose accuracy if the user quickly moves the devices. Fluctuations in the position due to instabilities caused by the user is also a reason for a less precise position. Especially freely movable input devices require attention to avoid unintentional small amplitude motions. The stabilisation procedure can take place already in the hardware, but also on the software side. Stabilisation can be efficiently achieved by scaling down the input movement.
The interaction with large molecular assemblies in a virtual environment is challenging because they are complex and dense (many objects in a small space). Moving in and interacting with such an environment should be done with devices that allow a manipulation directly in three dimensions. Input devices with less dimensions, like an ordinary computer mouse, allow only a two dimensional position control and are therefore less intuitive to handle. Such input devices acting in two dimensions require additional visual cues and sometimes also additional input with another device to be able to manipulate three dimensional objects.
Three dimensional input devices can be divided into two categories. To the first category belong devices in which a physical object like a ball or a pen is moved. The haptic pointer device, which we have been using in the framework of haptic quantum chemistry, features a pen connected via joints to a machine which calculates position and orientation and sends them to the computer. Such devices allow a relatively precise control over position in three dimensions (below 0.01mm according to manufacturer specifications90,91).
The second category contains devices which do not require to move a physical object. They directly capture the motion of the user's hand. Such motion sensing devices provide an even more natural form of three-dimensional input. Modern motion sensing devices can have an accuracy down to the sub-millimetre range.92 A problem with such devices is the detection of two fingers very close to each other. Hence, grabbing atoms with such devices is difficult, since it involves two fingers coming close to each other. Pointing at the atoms and moving them by just one finger is easier to implement, but is also less intuitive. The accuracy of the finger tip positions can be improved by wearing passive or active markers that make the detection easier.
Currently haptic pointer devices are more suitable since input and output can be combined in one such device.
In a straightforward implementation both forces would be exactly the same. However, the molecular forces are in the range of 1 nN, whereas the forces which can be felt by the user are on the order of 1 N. It is immediately clear that both forces cannot be the same but have to be transformed. This transformation needs to fulfil two requirements in order to be suitable for studying chemical reactivity. As a guide through the configuration space avoiding unphysical configurations the rendered forces need to be intuitive and smooth. Yet, the transformation between the molecular and the haptic force needs to be as transparent as possible, since from the force direction and magnitude the user infers the shape of the potential energy surface. The most transparent transformation is to multiply the molecular force by a constant factor to obtain the haptic force. This is how we transform the nanoscale molecular forces to a haptic force that is experienced by the user.
The shape of potential energy surfaces occurring in reactive chemical systems requires additional transformations. Shallow and deep minima, steep walls and flat hills may lie in close proximity on a reactive potential energy surface. This results in a wide range of forces that need to be rendered. Since the range of forces which can be rendered by the devices is limited and also the tactile sense has a limited resolution, the forces need to be scaled. For instance, in strongly repulsive regions huge forces quickly exceed the limits of the devices. Hence, the forces need to be scaled down. In almost flat regions, however, where shallow minima need to be detectable the force requires to be scaled up. A constant force scaling93 as mentioned before is therefore not suited. A variable gain scheme94 can account for both situations without the need to change the scaling manually.
The third reason why one needs to transform forces is more technical. To avoid instabilities and to compensate for slow simulation loops often frequency filters are applied. However, a too dominant filter can lead to an in-transparent force rendering. To obtain a stable control scheme usually the position is not directly controlled but the user applies an additional force that is incorporated into the simulation.95,96 This, however, hides the true forces acting on the manipulated atom from the user, which makes it unsuitable for studying chemical reactivity, where the forces transport a substantial part of the desired information.
A seamless transition from moving a molecule to probing reactivity is needed when atoms come closer and the forming or breaking of bonds becomes possible. The problem can be illustrated by considering the abstraction of an atom from a molecule versus just pulling at an atom to move the molecule when no further constraints are applied. Whether pulling at an atom results in an abstraction or just in a movement of the whole molecule depends on how fast the system responds in terms of the relaxation described at the beginning of this section. A manipulation within an infinitely fast relaxing system would correspond to a reversible adiabatic process. If the virtual environment is designed to mimic this scenario there would be no net change in energy and therefore also no reactivity. A non-adiabatic process is emulated when the relaxation rate is slower than the manipulation by the user. In this scenario moving a molecule by just pulling at an atom would not be possible without breaking a chemical bond.
This dilemma can be resolved in different ways depending on the limits imposed by the underlying force calculations: either by adjusting the relaxation rate or by adding additional constraints. The first is only possible if the relaxation (structure optimisation) is always faster than the manipulation. Then one can adjust the rate such that the users can switch between adiabatic and non-adiabatic manipulations by adjusting their manipulation speed. Pulling fast at an atom induces the abstraction, while pulling slowly just moves the molecule. Complementary to that would be to change the relaxation rate instead.
If an adjustment of the relaxation rate is not possible or unwanted, additional constraints can be introduced. Possible constraints are to fix atoms in space, preserve internal structures or pin the orientation and centre of mass of a molecule. The user can realise them employing additional input devices (e.g., one hand holds a metal complex in position and the other probes the reactivity of a possible ligand) or by keeping the corresponding degrees of freedom fixed during the relaxation procedure.
In both solutions employing additional input devices is the preferred way since then the constraints can easily be altered or removed if necessary. By contrast, changing the parameters of the relaxation procedure, e.g., changing the rate or the constraints, requires interruption of the exploration. Accordingly, this should only be done if the constraints are expected to remain constant during the exploration, e.g., when certain parts of the molecular assembly are known to be non-reactive.
A natural way of restriction is provided by force–feedback devices. The rendered force guides the user and allows only reasonable manipulations, since a rapid increase in energy is associated with a steep gradient pushing the user in the opposite direction. By this an artificial blocking of certain manipulations can be avoided.
Another possibility is to perform a very fast relaxation of the remaining degrees of freedom. The energy is thereby removed from the system so that the system always maintains a configuration with a reasonable energy. For the physical implications of such an energy removal see next section.
The dynamics of the electrons are not explicitly represented in the virtual environment, but they largely define the interactions between the atoms. This separated treatment of the electron and atom dynamics is based on the Born–Oppenheimer approximation.97 In fact, the electrons give rise to the potential in which the atoms move. The electronic energy is then defined by the electronic Schrödinger equation
ĤelΨel = EelΨel | (1) |
[nuc + Eel]Ψnuc = EnucΨnuc. | (2) |
The electronic energy Eel ({RI}) is a parametric function of the nuclear coordinates {RI}. A first-principles treatment of the electrons requires that Eel ({RI}) has to be calculated by solving equation (1). This has to be done at discrete configurations of the atoms.
Despite its calculation as a discrete function, Eel can be considered as a continuous function of all atom coordinates. We may define a continuous variable x in 3M dimensional space as
x = (R1, R2,…, RM)T with RI = (xI, yI, zI)T. | (3) |
Accordingly, Eel(x) is a hyper-surface in a 3M-dimensional Cartesian configuration space spanned by the coordinates of all atoms. There is a conservative force field connected to this surface. It describes the forces acting on each atom A and is given by the negative gradient gA of the potential energy Eel(x)
fA (x) = −gA with gA = ∇AEel(x). | (4) |
The gradients (or forces) can be calculated directly from the electronic structure. Only this analytic differentiation is sufficiently fast for immersive techniques.
From this setting it follows naturally that the molecular system has to be described classically with respect to the atom movements and quantum mechanically with respect to the electronic structure and derived properties—a standard and well-investigated approximation for mechanistic studies in complex molecular systems. Interestingly, this partition into a classical and a quantum mechanical description reveals another way to determine which information has to be presented visually and which information needs to be rendered by addressing other senses. The part described by classical mechanics is presented visually and the highly non-intuitive quantum behaviour of the electrons is expressed and presented as forces rendered to the tactile sense.
A fully quantum mechanical treatment of the atom movement would also be possible, but the visualisation of such a situation cannot be easily realised in a virtual environment where molecules are represented by ball-and-stick arrangements. Instead, nuclear probability distributions need to be considered, which are related to the molecular structures in the Born–Oppenheimer approximation.98,99 However, the tremendous computational demands for such a full quantum treatment are prohibitive, apart from the fact that the force on an atom is then no longer straightforwardly defined.
We assume that the collection of coordinates can be separated into different sets (subsystems). The first separation follows naturally from the way the user interacts with the system. For every manipulation we can define a set of coordinates s which are strictly defined by the manipulation (system coordinates) and the set of all remaining (environment) coordinates e,
(R1, R2,…, RM) → (s, e). | (5) |
Since the set of environment coordinates is not constrained by the manipulation, different ways to treat them are possible. They may be completely frozen, they may be subdivided into subsystems with reduced degrees of freedom or they may be optimised. Completely frozen inactive coordinates are only reasonable if they are not very much affected by the manipulation (detected by forces defined as gradient components with respect to these coordinates). Although the environment is frozen the active (system) part movement is not just ballistic, since interactions can still occur. Partly frozen coordinates (frozen subsystems) are useful, when some groups of atoms are expected to be internally rigid. The most general treatment is to optimise the inactive coordinates according to a certain protocol, which is discussed in the following section.
(6) |
Removing the excess energy from the free coordinates is an artificial dissipation of energy. If always the direction with minimum resistance is chosen, the path is equal to the so called minimum energy path between reactants and products.
The redistributed energy gives rise to a motion of the atoms along the unconstrained degrees of freedom. Apart from structure relaxation, this motion can also be treated by the usual methods of molecular dynamics (MD) and corresponds then to a steered BO-MD100,101 scheme similar to steered classical MD.102 Under certain conditions, the work performed by the operator is equal to the free energy change of an equilibrium process according to the Jarzynski equality.103
x(ξ) → f (ξ). | (7) |
If ξ describes a minimum energy path (MEP), there are only forces along the constrained coordinates and the forces in the (orthogonal) environment coordinates are reduced or zero by structure minimisation. Therefore, the work W along the MEP can be obtained from
(8) |
The technical implementation of the energy dissipation process, however, gives rise to a sensation of viscosity when a molecule is grabbed and pulled around in the virtual environment. The removal of energy by relaxing all unconstrained degrees of freedom requires a certain amount of time. This delays the relaxation and, hence, pulling at a molecule shows a resisting force. This resisting force leads to a sensation of friction as if the molecule would be dragged through a viscous liquid. It appears that this artifact is in fact useful, since the user naturally expects inertia when accelerating atoms of fragments, although the haptic exploration does not consider any mass-dependent reaction to the exerted force (action).
fh = C (|fm|)fm, | (9) |
As we have already discussed in our work on the haptic exploration of potential energy surfaces8 the requirement for the scaled surface is that it preserves the main features like the positions of minima and first-order saddle points. The steepness of walls or valleys is not that important if their relative depth or height is approximately preserved. Especially, schemes with a constant scaling factor93 for the forces (C = const) need to meet this requirement. Clearly, the reaction network explored in such a way can be refined by accurate quantum chemical methods in the background.
In variable gaining schemes94 the scaling factor is a function of the absolute value of the haptic force. To preserve the existence of all features allowing only a modification of their relative heights or depths, the function C is subject to some conditions. (1) C = 0 only at |fm| = 0 and nowhere else and (2) C must be strictly positive. Only then are no artificial saddle points created. With such a function, for instance, a shallow minimum is detectable but its depth may be exaggerated.
It is important to give the operator the freedom to change the scaling during an exploration. Since during an exploration all structures, gradients and energies can be recorded the true potential energy surface can be reconstructed afterwards.
However, since every constraint introduces additional forces, the force on a certain atom, can be significantly different with and without a constraint. If, for instance, some atom positions are fixed either by an additional input device or by just excluding them from the relaxation procedure, new constraints are introduced. These constraints in turn alter the behaviour of the system and can be regarded as additional forces. It is clear that the design of such constraints is a crucial ingredient for the manual exploration of reaction mechanisms.
In this work, we undertook the next step and elaborated on a new route of development in quantum chemistry towards ubiquitous computing that deeply immerses a chemist into his/her molecular target. This virtual environment poses new challenges for the organisation and presentation of tons of quantum chemical raw data. Generating alpha-numerical input files for complicated electronic structure calculations and processing the alpha-numerical output is one of the main obstacles prohibiting ubiquitous interactive computing in chemistry. This traditional, but pedestrian way of reactivity exploration is likely to die out, when ubiquitous computing, big-data handling, and immersive technology becomes available within a single implementation. Before such a virtual reactivity lab can be made available, a thorough analysis of its components is mandatory.
A central part is certainly how the quantum mechanical response is calculated in such an environment. The three main requirements for a suitable electronic structure method are: (1) it needs to be fast enough for a real-time execution, (2) it has to be based on the first principles of quantum mechanics for bond breaking and making and (3) the quality has to be such that the main features of the potential energy surface are detectable. As a first step one has to resort to approximate electronic structure methods. Hence, we discussed and evaluated contemporary semi-empirical methods suitable to study chemical reactivity in real time. With respect to both, time and accuracy, the semi-empirical methods investigated here permit an interactive study of chemical reactivity within a virtual environment. Combining the results of this work with the results we obtained in our previous studies7–9 we now have a range of methods at hand, which allow the calculation of real-time quantum mechanical responses. These methods range from interpolation techniques, semi-empirical methods up to minimal DFT and HF calculations.
Furthermore, we describe in detail here the design of a virtual environment tailored for the exploration of chemical reaction mechanisms and the subsequent study of reaction networks. We show how the quantities presented in the proposed virtual environment are connected to those manifest in a quantum mechanical description. The introduced approximations, which are necessary to allow for an interactive study of chemical reactivity have been analysed and the inherent limits have been worked out. We also discuss routes on how these limits can be partly alleviated for the incorporation of, e.g., tunnelling or entropic effects. The scaling of the forces (and therefore also of the potential energy surface) in a virtual environment has been analysed, too. Carefully chosen, they do not impair the physics of the system but can even enhance the experience. In fact, an interactive exploration of the Born–Oppenheimer potential energy surface given by the electronic energy can be carried out with a minimal quantum chemical model that recovers the rough position of minima and first-order saddle points and thus the specific structure of the surface (i.e., without introducing artifacts like spurious or missing stable intermediates).
The operator does not need to analyse any intermediate results. Instead, good candidates for stable structures are stored as nodes of a reaction network, which the operator sees emerging on the screen. Connecting lines of these nodes can be supplemented with a barrier height from an automated transition state search, which can be launched in the background by one of many service programs, which we have called ‘jeannies’ here. If the jeannie realises that there is no unique transition state, the connecting line will most likely not represent an elementary reaction step and this fact can be visualised. The jeannie may hand this information to another service program responsible for the optimisation of local minimum structures (within the structural interval defined by the two nodes in the reaction network). Since real-time quantum chemical data is a prerequisite for the whole virtual environment, the local-minimum and the transition-state searches do not need to be very efficient, but they need to be very stable. This poses a whole new set of constraints on such algorithms.
While the operator observes the expanding reaction network on the screen and can investigate structures on nodes as well as the connecting transition-state structure by a simple mouse click, he/she may choose to steer the construction of the emerging network. Certain branches of the network may thus be shut down and excluded from further exploration at will when deemed necessary. Accepted nodes may be refined by more accurate quantum chemical calculations, automatically launched in the background by another jeannie. While the accurate optimisation of a minimum structure at a node is certainly valuable for obtaining reliable structural information, the associated electronic energy is only of secondary importance. Instead, the energetical position of the node within the reaction network is of primary importance. In this systems-chemical view of complex chemical processes, a rolling kinetic modelling of all elementary reaction steps emerging in the network will determine, which nodes and links are most important. These are the ones for which more accurate quantum chemical calculations are then launched by the jeannie.
While we have already realised some of the components necessary for the chemical virtual reality lab, many ideas described here await their implementation and we will continue to finally provide such a tool to the chemistry and materials science communities. Currently, the chemical reactivity of hydrocarbons can already be explored in real time and reaction networks for these compounds can be mapped out.10 If the fully-fledged tool is completed, it will represent a new type of predictive and creative tool for chemical research in silico. Imagine a catalytic reaction mediated by some catalyst. Modelling a catalytic cycle is currently a very laborious and time-consuming undertaking. Because of the huge effort required, side reactions, of which there are orders of magnitude more than there are reaction steps in the catalytic cycle, are usually omitted. The virtual reality lab would be able to deal with side reactions, too. The data can be naturally absorbed into the components of the reaction network introducing new configurations to be constructed and finally added to the existing nodes. Natural reaction partners of the intermediates in the catalytic cycle are all molecules occurring in the cycle (including the intermediates themselves) as well as standard molecules from the environment like solvent molecules (especially water) and dioxygen. The plethora of structures needed for such a screening of potentially degradative reactions can again be set up and screened automatically by jeannies. Clearly, another useful ingredient for this endeavour is a jeannie that aids designing structures for reactive systems with specific functionality.13,14
Hence, the virtual environment envisaged here will be much more than a simple tool to control and manage quantum chemical calculation (as by standard graphical user interfaces to quantum chemistry program packages). Instead, it will be a new tool for truly creative work in chemistry that can challenge the work of experimental chemists.
This journal is © The Royal Society of Chemistry 2014 |