High-throughput screening of metal–organic frameworks for kinetic separation of propane and propene †

We apply molecular simulations to screen a database of reported metal–organic framework structures from the computation-ready, experimental (CoRE) MOF database to identify materials potentially capable of separating propane and propene by diffusion. We report a screening workflow that uses descriptor analysis, conventional molecular dynamics (MD), and Nudged Elastic Band (NEB) energy barrier calculations at both classical force field and Density Functional Theory (DFT) levels. For the first time, the effects of framework flexibility on guest transport properties were fully considered in a screening process and led to the identification of candidate MOFs. The hits identified by this proof-of-concept workflow include ZIF-8 and ZIF-67 previously shown to have large differences in propane and propene diffusivities as well as two other materials that have not been tested experimentally yet. This work emphasises the importance of taking into account framework flexibility when studying guest transport in porous materials, demonstrates the potential of the data-driven identification of high-performance materials and highlights the ways of improving the predictive power of the screening workflow.


Introduction
In the petrochemical industry, separating olefins from paraffins by cryogenic distillation is an important but energy demanding process. [1][2][3][4] Alternative methods are required to lower the energy consumption for key separations such as ethene/ethane and propene/propane separations. [5][6][7][8][9] Using porous materials such as metal organic frameworks (MOFs) is one of the most promising paths because of the wide range of structures with different properties that can be explored. [10][11][12][13][14][15][16][17][18] In fact, with over 90 000 thousand MOFs reported to date in the Cambridge Structural Database (CSD), 19 it is impractical to test all these materials for any given separation experimentally. There might be many known materials that have high performance for separations for which they have never been tested. This is an opportunity for computational chemistry to assess the performance of these materials in silico and identify best candidates for future experimental testing.
In this work, we focus on kinetic separation of propane and propene and assess the MOF materials present in the Computation-Ready, Experimental (CoRE) MOFs database that includes experimental guest-free structures of more than 4700 three-dimensional non-disordered MOFs. 20 This database provides a starting point for screening calculations. It was used in a number of separation studies [21][22][23][24][25][26][27][28] and was recently updated to include over 14 000 structures. 29 The early computational studies [21][22][23][24][25] of gas adsorption and separation using the CoRE MOF database were performed for rigid frameworks and were focused on calculating static thermodynamic properties like adsorption energies, Henry's coefficients and adsorption isotherms. These properties provide a measure of guest-framework interactions and can guide the selection and design of adsorbents for applications such as powder beds. However, the kinetic properties of gases are also relevant for designing molecular sieves for filtering applications. That is why more recent studies, 26,27 also based on the original CoRE MOF database, included molecular dynamics (MD) calculations of self-diffusion coefficient of guest molecules in rigid frameworks. There are two shortcomings to this approach. First, treating the framework as rigid does not allow the pore dimensions to change due to thermal fluctuation or because of the interaction with the guest. Second, the random thermal movement of the guest molecule can only access relatively low energy barriers and more advanced computational methods that bias the guest trajectory are required to assess experimentally relevant diffusion barriers. 30 These methods include metadynamics, 31 adaptive biasing force 32 and umbrella samplings 33,34 at a finite temperature while the nudged elastic band (NEB) method 35,36 allows for a relatively quick identification of the minimum energy path between two local minima.
The computer simulation studies that use flexible framework models combined with advanced computational tools show that taking framework flexibility into account is important to produce guest diffusivities observed experimentally. 37 In particular, Verploegh et al. showed that the flexible window aperture in ZIF-8 framework has a large effect on the diffusion of gases, including propane and propene, when their molecular sizes are comparable or smaller than the average window size, ultimately affecting the calculated diffusion free energy barriers. 33 These umbrella sampling calculations rely on predefining the reaction coordinate along the diffusion path through the pore window and can make accurate predictions when dedicated force field models are used. 38 Witman et al. conducted the first set of screening calculations using UFF-fixmetal (UFF-FM) force field to describe the flexibility of the structures from CoRE MOF database and its effect on the adsorptive separation of Xe/Kr mixtures. 28 To the best of our knowledge, there have been no large-scale screening studies of diffusion barriers in MOFs for any guest molecules and our work is the first attempt to do so.
The small difference in size and shape of propane and propene ( Fig. 1) makes it difficult to separate them by a sieving mechanism. The most often quoted values for kinetic diameters are 4.3 Å propane and 4.0 Å for propene, 39 while some reports suggest using a smaller value of 4.2 Å for propane. 37 There are several examples of MOFs that have been shown experimentally to have distinct kinetics for propane and propene. The examples include ZIF-8 37 (3.4 Å), ZIF-67 40 (3.3 Å) that are present in the CoRE MOF database and other materials that were reported more recently Zn(ox) 0.5 (trz) 10 (2.9 Å) and Zn(ox) 0.5 (atrz) 10 (2.6 Å), ELM-12 41 (4.0 Å) and KAUST-7 12 (2.7 Å) with the number in brackets indicating the pore window diameter. It is evident that in all cases, the size or the pore window is similar or smaller than the size of the guests and therefore the guest diffusion through such pore will require some flexibility of the pore window.
In the remainder of this paper, we present computational details of the screening workflow that assess diffusion barriers of propane and propene in MOFs where both the framework and guest are treated as flexible. We report the application of this workflow to CoRE MOF library of structures and discuss our findings.

Computational details
There are several challenges for high-throughput calculations of kinetic separations. First, similar to the screening for adsorptive separations, it is imperative to have a database of chemically correct host structures because missing or erroneous atoms will lead to incorrect identification of host properties. In this paper, we use the original CoRE MOF database (accessed form https://cmcp-group.github.io/CoRE-MOFs on 17/09/2017) without any further modification. Second, since the pore window flexibility is essential, we need a robust model to describe this flexibility for a chemically diverse set of MOFs present in the database. The UFF-FM force field was shown to reproduce the elastic properties of MOFs well and we use it to pre-screen the candidate structures before using more expensive but more accurate density functional theory (DFT) calculations. Third, we need an automated computational tool to calculate diffusion barriers. For this purpose we use the nudged elastic band (NEB) method 35,36 that is often used to calculate ion mobility in solids but has also been applied to study molecular diffusion. 42,43 Since the NEB method can calculate the barrier energy using either a force field or a DFT approach this will allow us to make a direct comparison between the UFF-FM and DFT data. The automatic generation of the NEB setup that involves the identification of a local minimum at each side of the pore window and the construction of the path between these two end points is one of the challenges we tackle in this paper. Scheme 1 summarises the details of our screening workflow that focuses on finding MOF structures that can separate propane and propene through sieving mechanism. At each step of the screening, we identify the most promising MOF structures to be considered at the next more computationally demanding step of screening, thus gradually reducing the number of candidates as indicated by the numbers of structures in Scheme 1.
Below we provide the details of the main four steps of the workflow.
Step 1. Simple geometric descriptors -pore and window size For a binary gas mixture, a perfect sieve would allow the diffusion of one component through but completely block the other component. If both components can readily diffuse through the material, we would discard such material. We used Zeo++ to calculate the window size, also referred to as the pore limiting diameter (PLD) or the largest free sphere D f . After excluding 501 MOFs marked as charged from the list of 4764 MOF structures present in CoRE MOF database we discarded MOFs with window diameter smaller than 2.5 Å and greater than 4.3 Å. This simple descriptor-based screening based on rigid framework geometries and using spherical probes reduced the number of options from 4263 to 2238. For the next step of the screening workflow, we used flexible framework models and atomistically accurate models of the guests.

Step 2. Molecular dynamics simulations of guest diffusion
To discard the systems that allow propane diffusion even on a short time scale, we used large-scale atomic/molecular massively parallel simulator (LAMMPS) to perform 5 ns conventional molecular dynamics (MD) simulation in the NVT ensemble. An elevated temperature of T = 400 K was used to accelerate the dynamics of guest diffusion in a flexible host while keeping the simulations numerically stable. A single propane or propene guest molecule was added in the middle of the pore cavity and the LAMMPS input files were generated from these CoRE MOF structures with the guest added using the lammps_interface code (accessed on https://github.com/ peteboyd/lammps_interface on 06/10/2017) developed by Boyd et al. 44 The electrostatic interactions were not included since both propane and propene are non-polar molecules and the effect of the electrostatics on diffusion barrier was expected to be relatively small. 45,46 In our MD simulations, all chemical bonds, angles and dihedrals were treated as flexible using either harmonic potentials or hybrid Fourier cosine/periodic potentials according to the UFF-FM force-field. This force field was previously used to study adsorption of Xe/Kr mixtures in flexible MOFs. 28 In our tests it performed similarly to the UFF4MOF 47 but had a wider coverage of metal atom chemistries.
Nudged elastic band (NEB) calculations of diffusion barrier. For the 670 MOF candidates that blocked propane in our conventional MD simulations, we calculated the diffusion energy barriers for both propane and propene using the NEB method. The NEB method provides the information about the flexibility of window aperture during the guest transport but does not take into account any entropic effects. Thonhauser et al. showed that the effect of vibrations and zero point energy was small in diffusion barriers found by NEB method. 42 We performed two rounds of NEB calculations: one using UFF-FM force field for 670 MOFs to shortlist MOFs that for diffusion barriers reproducibly show over 10 kJ mol À1 difference between propane and propene and the other round using Density Functional Theory (DFT) to more accurately assess the diffusion barriers for the shortlisted 23 candidate MOFs.
Step 3. Classical force field level of NEB We performed classical force-field-based NEB in LAMMPS by using the same UFF-FM potentials as those used in our MD simulations. However, the main challenge to set up NEB calculations for each MOF was to automatically generate the end points of the NEB paths. These are two host + guest structures with the guest molecule residing on either side of the pore window. To assist the location of the pore window, the Zeo++ developers modified the Zeo++ code so it can identify a Voronoi node that was close to the narrowest part of the channel. We found that in approximately 80% cases, the algorithm was able to identify two points, one on each side of the window, and in 20% cases when the pores shape was complex, these two points either coincided or belonged to two different windows.
For the cases when two distinct Voronoi nodes were identified near the same window, we used the vector connecting these two points as the direction of the channel and developed a code to orient and place the guest molecules at the opposite sides of the window. Specifically, four orientations (901 rotations) of the guest molecule located at one of three separations from the identified Voronoi nodes (1 Å, 2 Å, and 3 Å) were used. Thus we setup 12 NEB paths for propane, 12 NEB paths for propene where the CH 3 group passes through the window first, and 12 NEB paths where the propene CH 2 group passes through the window first. Each NEB path consisted of 32 replicas, which were created by placing the guest molecule along the linear path connecting the two NEB end points. Following the standard partitioning scheme recommended in LAMMPS, in each replica, the atoms were partitioned into NEB-atoms (the guest) and non-NEB-atoms (the framework). Only the NEB-atoms were connected by inter-replica spring and felt the forces from inter-replica atoms. Conceptually, the framework atoms provided a background potential for the guest atoms. The energy barrier for each MOF was calculated as the difference between the lowest maxima and the lowest minima from the entire set of tried NEB paths. Scheme 1 Schematic representation of the computational screening workflow with the numbers on the right indicating the number of structures that were passed to the next level of screening.
For quality control, we used two NEB setups: the first setup was better in identifying the lowest energy configurations but sometimes produced an unphysical diffusion path, while the second setup was better in finding the path through the pore window but worse in finding adsorption sites. In the first setup, we performed energy minimisation of the NEB end points before building the path that connects them. The stopping criterion for force was 0.1 kcal mol À1 Å À1 (maximum force on each atom), and the Hessian-free truncated Newton algorithm was used in LAMMPS. Once the two end points were optimized, the NEB path was created as a straight line between them. In some cases, when the minimized end points were reached, the guest molecules moved far from their initial position at the centre of the channel axis and the straight-line NEB path that connected the two end points would no longer pass through the pore window but through the pore wall next to the window. In other cases, after the energy minimisation both NEB end points could end up in the same pore cavity, not at the opposite sides of the window. Therefore, these NEB paths would not pass the narrowest part of the pore channel and would return a low energy barrier for moving the guest within a single pore cavity.
In the second setup, we created the NEB paths between the end points without their energy minimization. These end points configurations were optimized at the same time as when the NEB calculation was performed. Thus the second setup conceptually prevented the NEB path from hitting the pore wall. It also decreased the chances of both NEB end points moving to the same pore cavity. However, if these problems did not occur in the 1st setup, the energy barriers calculated using the 1st NEB setup were more accurate than those calculated using the 2nd NEB setup because the 1st NEB setup provided a better identification of adsorption sites -the minima found after the initial energy minimization in the 1st NEB setup were typically lower in energy than the end points in the 2nd NEB set up. We shortlisted all MOFs identified by the 1st NEB setup if the diffusion barriers identified by the 2nd NEB setup were within 30% difference and the propene diffusion barrier was not prohibitively high, i.e. less than 100 kJ mol À1 .
In all force field NEB calculations damped dynamics with a time step was 1 fs was used to stably optimise the system. The force tolerance for the NEB calculation was 0.5 kcal mol À1 Å À1 , with a spring constant of 1.0 kcal Å À2 .
Step 4. Density functional theory level of NEB We performed the density functional theory (DFT) level NEB in Vienna Ab initio Simulation Package (VASP 5.4.4) 48 for 23 candidates selected based on the energy barriers calculated by UFF-FM. All DFT calculations were performed with projector augmented wave (PAW) 49 pseudopotentials using periodic boundary conditions, and the optB86b-vdW functional which includes van der Waals correction and in general gave minimized structures close to experimental structures. 50 We allowed the volume and the shape of the primitive cell to change during the conjugate gradient structure optimisation, with a plane-wave energy cut-off 600 eV and stopping optimisation when the energy of subsequent ionic steps differed by less than 10 À6 eV. Gaussian smearing with a width of 0.1 eV was used. Most MOFs had a relatively large unit cell with each dimension greater than 10 Å, thus a k-point grid of 2 Â 2 Â 2 with G point sampling was sufficient for these systems.
For the NEB DFT calculations, we set the force tolerance to 0.05 eV Å À1 and used quasi-Newton energy minimization with a NEB spring constant of 5 eV Å À2 , and the climbing image NEB method. The replicas along the NEB paths were generated spanning the entire unit cell in order to explore all possible local minima that may be found along the pore. The energy barrier was defined as the difference between the highest and the lowest energy along the calculated minimum energy path. The number of replicas in each NEB path varied between 8 and 25 replicas, depending on the length and the complexity of the path and the initial distance between the guest molecules in each replica was a maximum of 1 Å.

Molecular dynamics simulations
From the 2238 shortlisted MOF structures with window size less than 4.3 Å we were able to perform 5 ns MD simulations of guest diffusion at 400 K for 1440 MOFs. The remaining 35% were unsuccessful either due to the errors incorporated in the input structures or due to misidentification of chemical bonds when converting CoRE MOF entries to LAMMPS input files. The results of the successful MD simulations are shown in Fig. 2. Each circle in Fig. 2 represents a MOF structure with its window diameter plotted along the x-axis and its pore diameter plotted along the y-axis while the colour of the circle represents the outcome of the single MD simulation. We considered the guest molecule being able to readily diffuse through the MOF if it travelled more than the size of the unit cell in at least one direction (x, y, or z) during the 5 ns MD simulation. Based on this criterion, we observed that both gases diffused to the neighbouring unit cell within 5 ns in 743 MOFs (blue circles), only propene diffused in 224 MOFs (green circles), only propane diffused in 27 MOFs (orange squares), and both guests remained in the same unit cell in 446 MOFs (red circles).
The distribution of coloured circles in Fig. 2 correlates well with the MOF window sizes. Most blue circles (both guests diffused) correspond to MOFs with a wide pore window (D f 4 3.8 Å), and most red circles (neither guest diffused) correspond to MOFs with a narrow pore window (D f o 3.0 Å). MOFs that only allowed propene diffusion (green circles) are concentrated in the window range of 3.5-3.8 Å. Some MOFs in Fig. 2 appear to be outliers from the common trends. For example, there are several MOFs with wide pores (D f 4 4.0 Å) that were deemed to block diffusion of one or both guests because the guest molecules were not able to find the window and pass though it during the short simulation time of 5 ns. The stochastic nature of guest diffusion was also the main reason for some systems (orange squares) showing propane diffusion but not propene. There are also some blue circles for MOFs with a narrow pore window (D f o 3 Å). For most of these MOFs the shape of the pore window is not circular and D f corresponds to the smallest limiting dimension while the pore is wider in the orthogonal direction. For some of the outliers it was found that the pore structure changed significantly because either UFF-FM force field did not provide an accurate description or the input structures from CoRE MOF database had missing atoms or erroneous atoms. For example, there are seven structures of ZIF-8 present in CoRE MOF database (those with CSD reference codes of OFERUN02, OFERUN03, FAWCEN, FAWCEN01, FAWCEN02, FAWCEN03, KAMZUV) and only one of them (OFERUN03) contains a correct number of hydrogen atoms.
For the next step of screening, we consider only the 670 MOFs that did not show propane diffusion at the short time scale of the MD simulation but might still allow its diffusion at the experimental time scales. For example, driven by random thermal motion, neither propane nor propene passed through the pore window of ZIF-8 in our 5 ns MD simulations, whereas extending simulation time to 50 ns showed a single hopping event for propene. That is why Nudged Elastic Band calculations were used to directly assess experimentally relevant diffusion barriers at the next step of the workflow.

Nudged elastic band (NEB)
Classical force field level of NEB. Fig. 3 shows the diffusion energy barriers calculated using NEB method with UFF-FM force field for the 670 shortlisted MOFs: the barriers for propane are along the x-axis and the barriers for propene are along the y-axis. Fig. 3(a) is for the NEB calculations with initial minimization of the end points (the 1st NEB setup, 545 successful runs) and Fig. 3(b) is for the NEB calculations without initial minimization of the end points (the 2nd NEB setup, 512 successful runs). The rationale for running two sets of NEB calculations was to test whether one of the setups is more preferable over the other. We found that the 1st NEB setup was better at identifying the energy minima while the 2nd setup produced NEB path passing through the pore window more reliably. Both setups suffered from common shortcomings outlined below.
First, in cases when the NEB end points were incorrectly assigned either to the same pore cavity or to two different channels in approximately 20% of the MOFs, the calculated diffusion barriers were respectively very low (B10 kJ mol À1 ) or very high (B1000 kJ mol À1 ) and did not correlate with the results of MD simulations. Second, in some cases when the NEB end points were initially located at opposite sides of the window, they moved during the initial energy optimisation (1st NEB setup) or during the NEB calculation (2nd NEB setup) and ended up in the same pore cavity. This happened more often in the 1st NEB setup than the 2nd and typically resulted in a low diffusion energy barrier. The structures with CSD reference codes ECUCIP, QUPDEL and AZILEA, shown in Fig. 3a and b, are the examples of MOFs for which the whole NEB trajectory (respectively for propene, propane and both guests) ended up in the same pore cavity when we used the 1st NEB setup, as indicated by a low E diff , but not in the 2nd NEB setup. Third, upon energy minimisation, some structures experienced large displacements of framework atoms due to the force field limitations. This often led to the closure of the pore window and a relatively high diffusion barrier. Fig. 4 shows all MOFs selected from Fig. 3(a) that consistently (within 30% difference) reproduce diffusion barriers for both guests in the 1st and 2nd NEB setups and have a calculated diffusion barrier for propene less than 100 kJ mol À1 . Among these 164 MOFs, 116 MOFs (all points under the solid black line in Fig. 4) have energy barriers for propane at least 10 kJ mol À1 higher than that for propene. For these MOFs the NEB calculations are mostly consistent with the MD simulations, as most green points in Fig. 4 correspond to low diffusion barriers for propene and most red points correspond to high diffusion barriers for propene.
We carefully inspected the 116 shortlisted structures to remove duplicates and chemically incorrect structures. From the list of 82 chemically correct and unique MOFs we selected a representative subset of 23 MOFs (marked by a black dot in Fig. 4) that had fewer than 400 atoms per unit cell and did not contain elements heavier than caesium except IFEPIT (Lanthanide MOF) for case study purposes.
DFT level of NEB. We performed DFT NEB calculations for 23 MOFs spanning different chemistries. The calculated NEB diffusion energy barriers from DFT NEB (dark red for propane and dark green for propene) and force field NEB (light red for propane and light green for propene) calculations are shown in Fig. 5 for comparison. According to the experimental studies, ZIF-8 and ZIF-67 are two materials that have been used for sieving propane and propene, 13,30,33,37 and were both shortlisted by our screening workflow. The diffusion barriers for ZIF-8 (OFERUN03) calculated with DFT are 42.6 kJ mol À1 for propane and 20.8 kJ mol À1 for propene and for ZIF-67 (GITTOT01) are 40.7 kJ mol À1 for propane and 23.6 kJ mol À1 for propene. For KAUST-7 MOF, that was not included into the CoRE MOF database, the calculated diffusion barriers are 51.1 kJ mol À1 for propane and 35.3 kJ mol À1 for propene. 51 Therefore, we identify a MOF to be a hit if the calculated diffusion energy barrier for propane is greater than 35 kJ mol À1 , while for propene it is at least 10 kJ mol À1 lower than that for propane and less than 40 kJ mol À1 to allow for propene diffusion. The high energy barriers calculated with UFF-FM for OPENIH and GUPJEG for propane were very promising, approximately 40 kJ mol À1 higher than propene. However, in the DFT calculations, only OPENIH ([Cu 2 (btc)(4,4 0 -bpt)]Á2H 2 O, where btc = 1,2,4,5-benzenetetracarboxylate acid and 4,4 0 -bpt = 1H-3,5-bis(4-pyridyl)-1,2,4-triazole), 52 the first new hit identified by the workflow, has a significant difference: 40.6 kJ mol À1 for propane and 28.7 kJ mol À1 for propene. The second new hit with DFT energy barriers that match our selection criteria Fig. 4 164 MOFs from the 1st NEB setup (coloured either red or green as in Fig. 3a) that had consistent diffusion energy barriers in both 1st NEB and 2nd NEB setups. There are 115 candidate MOFs under the black line the energy barrier for propane is at least 10 kJ mol À1 higher than for propene, 23 of which selected for DFT studies are marked by a black dot. is Ag-based MOF IJENER ([Ag 2 TMBD], where TMBD = tetrakis (methylthio)-1,4-benzenedicarboxylic acid) 53 with the calculated diffusion barriers 57.6 kJ mol À1 for propane and 36.5 kJ mol À1 for propene. All other systems had calculated diffusion barriers outside the filtering criteria but four of them were close and are worth mentioning. In the high energy barrier range, Ca, Sr-based VOTMAS (propane 57.6 kJ mol À1 and propene 41.3 kJ mol À1 ), and Co-based TAFCIO (propane 71.3 kJ mol À1 and propene 49.7 kJ mol À1 ) may be considered as promising candidates for propane/propene separation via diffusion at high pressure applications. In the low energy barrier range, Mn-based WOPDEL (propane 35.2 kJ mol À1 and propene 21.1 kJ mol À1 ) and two isostructural materials: Fe-based NIH-BEM02 (propane 25.4 kJ mol À1 and propene 14.3 kJ mol À1 ) and Mg-based XEHSEJ (propane 27.1 kJ mol À1 and propene 20.5 kJ mol À1 ), could be used for propane/propene separation at low pressure. It is interesting to note that the latter two materials are also isostructural to Co-based (RAVVOA01) and Zn-based (TEVZEA) MOFs that were shortlisted for the DFT calculations. Their Ni-based analogue was recently shown to separate acetylene (C 2 H 2 ) from carbon dioxide (CO 2 ) in breakthrough experiments. 54 Fig . 6 shows the minimum energy paths calculated using DFT NEB method for propane (the red circles) and propene (the green circles) for the two new hits, OPENIH and IJENER, that were identified to be most similar to ZIF-8 and ZIF-67. To better visualize the change in pore shape during guest transport in both MOFs, the snapshots of the structures at minimum and maximum energy regions are shown in Fig. 6 with the free pore volume shown in light blue. Both guest molecules have energy minima at the cavity regions at the ends of the NEB path whereas the barriers are observed in the window region. In both MOFs, propene experiences a lower diffusion energy barrier than propane. The clear difference between the two MOFs is that in structure OPENIH (Fig. 6a) the diffusion path is complex and contains additional minima corresponding to the guest residing in the pore window, while structure IJENER (Fig. 6b) has a single barrier for either guest. While window dimeter is smaller for IJENER (2.58 Å) than that for OPENIH (3.18 Å) in the absence of the guests, it is evident that IJENER has a more flexible window as it experiences a larger size change (from 2.58 Å to 3.03 Å for propene and to 3.10 Å for propane) than the window in OPENIH (from 3.18 Å to 3.33 Å for propene and to 3.39 Å for propane). In IJENER, the size of the window is controlled by the position of the methyl groups protruding into the 3-dimensional channel and bonded to sulphur atoms that in turn make two bonds with the framework and therefore are relatively mobile (Fig. 6b). In contrast, the 1-dimensional channel in OPENIH is composed from p-stacked 4,4 0 -bpt linkers and the mechanism for window opening to allow guest transport is via small displacements of the individual linkers within the stack. Both mechanisms are different from window opening in ZIFs where it is controlled by the tilt of the imidazolate-based linkers forming the window (Fig. 1d). The example of OPENIH, IJENER and ZIF MOFs reinforces the importance of using energetic considerations and a flexible host model to assess guest transport properties rather than solely relying on the average pore dimensions observed in diffraction experiments.
The correlations between UFF-FM force field and DFT energy barriers is shown in Fig. 7. The UFF-FM force field was exceptionally good at estimating energy barriers of Zeolitic Imidazolate Framework (ZIF) with pyrazole linkers in ZIF-8 (Zn-pyrazole) and ZIF-67 (Co-pyrazole). The energy barriers from UFF-FM and DFT were in line with previous computational studies and experiments of ZIF-8 55 and ZIF-67. 6,30 The force field energy barriers for HAJLIO (Fe-pyrazole) and DEGJIK (Cu-pyridyltetrazole) were also close to those in DFT calculations. In general, the UFF-FM force field was able to predict the energy barriers well for MOFs with short carboxylate, pyrazole, pyridine, bipyridine linkers and Secondary Building Unit (SBU) containing Zn, Cu, Co, and Fe metals (Fig. 8). The energy barriers calculated with the UFF-FM force field were within AE40% of the DFT data for propane in FASQUN, HAJLIO, ZIF-8, ZIF-67, WOPDEL, RAVVOA, NIHBEM, DEGJIK, JEJKEP, PEGBEK01, and TEVZEA. We found that the UFF-FM force field did not work well with heavy atoms compounds, such as Te, Re, Gd in IFEPIT for which it significantly overestimated the energy barriers for propane and underestimated for propene. Four out of 23 MOFs shortlisted for DFT (LIFWEE, LEVYOC, CITXUZ and EXOZEX) had a water molecule coordinated to the metal ion in the reported experimental structures that was removed in the CoRE MOF database leaving an open metal site behind. With the exception of EXOZEX structure, three out of these four MOFs show large diffusion barriers in DFT due to strong guest-host interaction not captured in the UFF-FM calculations. For EXO-ZEX and FIGQUJ containing flexible linkers (with respectively two and three sequential sp 3 carbon atoms), UFF-FM underestimated the energy barriers because of relatively low energy cost in UFF-FM to fold these linkers. The calculated energy barriers of alkaline-earth metals (Sr, Ca) based MOF, CSD reference code VOTMAS in force field based NEB is only half that of DFT. Alkali and alkaline-earth compounds have stronger ionic electrostatics effects than transition metals compounds 56 and the difference can be observed even with non-polar molecules. For example, for the four structures with the same diamond network formed by formate linker with different metals (Co in RAVVOA, 57 Fe in NIHBEM, 58 Zn in TEVZEA, 59 and Mg in XEHSEJ 60 ), only for XEHSEJ with Mg, an alkalineearth metal, we found that the UFF-FM underestimated the energy barrier by more than a factor of two compared to DFT. In most cases, UFF-FM underestimated propene energy barriers to a greater extent than those for propane when compared with DFT: only 3 out of 23 MOFs, the UFF-FM energy barriers for propene were significantly higher than DFT (Fig. 7b). Fig. 8 shows the energy barrier difference between propane and propene (DE DFT = E propane À E propene ) as a function of window sizes for the 23 MOFs shortlisted for DFT calculations. A large value of DE DFT is indicative of a stronger separation ability but cannot be used a single figure of merit as it does not take into account the absolute values of the diffusion barriers. For example, the calculated diffusion barrier for propene in CITXUZ is 87.5 kJ mol À1 which is too high for this material to be recommended as a good candidate for separation. The dashed line of best fit in Fig. 8 shows a weak correlation between DE DFT and the window size with a large scatter of  the data points around the line. This indicates that knowing only the window size of a particular MOF is insufficient to predict the difference in diffusion barriers, DE DFT . One needs to account for the flexibility of the framework and the details of guest-host interactions to calculate the diffusion barriers of the guest molecules. This validates our screening process that was carried out by considering flexible models of hosts and guests.

Conclusions
We have developed a workflow for high-throughput screening of MOFs for separating propane and propene by diffusion. In the heart of the workflow is the automated setup that identifies the position of the narrowest part of the pore channel, the window, and performs nudged elastic band calculations to identify the diffusion barrier for each guest molecule through this window. We computationally identified two materials, ZIF-8 and ZIF-67, that were previously shown to have large differences in propane and propene diffusivities as well as two other materials (CSD reference codes OPENIH and IJENER) that have not been tested experimentally for separating propane and propene yet.
We found that UFF-FM force field, compared to DFT method, was able to estimate diffusion energy barriers well for MOFs containing Fe, Cu, Co, and Zn metals in the SBUs and short pyrazole, carboxylate, pyridine, bipyridine linkers. The UFF-FM was not able to estimate the barriers energy for MOFs with long flexible linkers such as those containing two or more sequential sp 3 carbon atoms. While this work presents the first attempt to calculate diffusion barriers in a high-throughput screening approach, its predictive power can be improved by (1) improving the quality of the materials database, (2) improving the accuracy of pore window identification, (3) improving the accuracy of the force field calculations and (4) using computational biasing tools that include entropic effects.