 Open Access Article
 Open Access Article
      
        
          
            Stanley 
            Lo†
          
          
        
        
       *a, 
      
        
          
            Sterling G. 
            Baird†
*a, 
      
        
          
            Sterling G. 
            Baird†
          
          
        
       *bc, 
      
        
          
            Joshua 
            Schrier
*bc, 
      
        
          
            Joshua 
            Schrier
          
        
       d, 
      
        
          
            Ben 
            Blaiszik
d, 
      
        
          
            Ben 
            Blaiszik
          
        
       eg, 
      
        
          
            Nessa 
            Carson
eg, 
      
        
          
            Nessa 
            Carson
          
        
       f, 
      
        
          
            Ian 
            Foster
f, 
      
        
          
            Ian 
            Foster
          
        
       eg, 
      
        
          
            Andrés 
            Aguilar-Granda
eg, 
      
        
          
            Andrés 
            Aguilar-Granda
          
        
       h, 
      
        
          
            Sergei V. 
            Kalinin
h, 
      
        
          
            Sergei V. 
            Kalinin
          
        
       io, 
      
        
          
            Benji 
            Maruyama
          
        
      j, 
      
        
          
            Maria 
            Politi
io, 
      
        
          
            Benji 
            Maruyama
          
        
      j, 
      
        
          
            Maria 
            Politi
          
        
       k, 
      
        
          
            Helen 
            Tran
k, 
      
        
          
            Helen 
            Tran
          
        
       abl, 
      
        
          
            Taylor D. 
            Sparks†
abl, 
      
        
          
            Taylor D. 
            Sparks†
          
          
        
       *cm and 
      
        
          
            Alán 
            Aspuru-Guzik†
*cm and 
      
        
          
            Alán 
            Aspuru-Guzik†
          
          
        
       *abln
*abln
      
aDepartment of Chemistry, University of Toronto, 80 St George St, Toronto, ON M5S 3H6, Canada. E-mail: stanley.lo@mail.utoronto.ca; alan@aspuru.com
      
bAcceleration Consortium, University of Toronto, 80 St George St, Toronto, ON M5S 3H6, Canada. E-mail: sterling.baird@utoronto.ca
      
cDepartment of Materials Science and Engineering, University of Utah, Salt Lake City, UT 84108, USA. E-mail: sparks@eng.utah.edu
      
dDepartment of Chemistry and Biochemistry, Fordham University, The Bronx, New York 10458, USA
      
eUniversity of Chicago, Chicago, IL 60637, USA
      
fEarly Chemical Development, Pharmaceutical Sciences, R&D, AstraZeneca, Macclesfield SK10 2NA, UK
      
gArgonne National Laboratory, Lemont, IL 60439, USA
      
hFacultad de Química, Universidad Nacional Autónoma de México, Ciudad Universitaria, 04510 Ciudad de México, Mexico
      
iDepartment of Materials Science and Engineering, University of Tennessee, Knoxville, TN 37916, USA
      
jAir Force Research Laboratory, Materials and Manufacturing Directorate, Wright-Patterson AFB, OH 45433, USA
      
kDepartment of Chemical Engineering, University of Washington, Seattle, WA, USA
      
lDepartment of Chemical Engineering, University of Toronto, 80 St George St, Toronto, ON M5S 3H6, Canada
      
mChemistry Department, University of Liverpool, Liverpool L7 3NY, UK
      
nDepartment of Computer Science, University of Toronto, 40 St George St, Toronto, ON M5S 2E4, Canada
      
oPhysical Sciences Division, Physical and Computational Sciences Directorate, Pacific Northwest National Laboratory, Richland, WA 99354, USA
    
First published on 15th February 2024
This review proposes the concept of a “frugal twin,” similar to a digital twin, but for physical experiments. Frugal twins range from simple toy examples to low-cost surrogates of high-cost research systems. For example, a color-mixing self-driving laboratory (SDL) can serve as a low-cost version of a costly multi-step chemical discovery SDL. Frugal twins already provide hands-on experience for SDLs with low costs and low risks. They can also offer as test beds for software prototyping (e.g., optimization, data infrastructure), and a low barrier to entry for democratizing SDLs. However, there is room for improvement. The true value of frugal twins can be realized in three core areas. Firstly, hardware and software modularity; secondly, purpose-built design (human-inspired vs. hardware-centric vs. human-in-the-loop); and thirdly state-of-the-art (SOTA) software (e.g., multi-fidelity optimization). We also describe the ethical benefits and risks that come with the democratization of science through frugal twins. For future work, we suggest ideas for new frugal twins, SDL educational course outcomes, and a classification scheme for autonomy levels.
The concept of accelerated discovery via automation goes by several names, including SDLs,3,4 materials acceleration platforms,9 Lab 4.0,10–12 Internet of Laboratory Things,13–15 Robot Scientists,16 the Autonomous Research System (ARES),17 and autonomous experimentation systems.18 While each term has its own nuances, here we use the term SDL exclusively and interpret it as referring to autonomous research systems used to accelerate materials discovery without human intervention. It is important to note that for the rest of the article, automation refers to the use of technology to perform tasks with minimal human intervention, while autonomy implies the ability of a system to operate independently, making decisions and taking actions without human control.
SDLs that are used to solve societal challenges are considered to be materials acceleration for societal solutions (MASS) platforms.1 Such platforms need to be widely deployed and adopted if societal challenges are to be addressed. However, such “critical MASS” (in the words of Seifrid et al.1) will require lower costs, enhanced ability to reconfigure and expand, and a joint effort to make available easy-to-understand examples and systems for more advanced research tasks. Since the introduction of the concept of an artificial intelligence system to laboratory automation in 1985 by Isenhour,19 the development of SDLs has gained traction. However, there are only a handful of fully autonomous low-cost SDLs reported in the literature. Stach et al.18 provide a community perspective on SDLs in the context of academia, industry, government laboratories, and funding agencies, and supply a descriptive table of selected SDLs across a variety of applications including chemical vapor deposition,20 nanocrystals,21 flow-22 and vial-based23 chemistry, oil-in-water emulsions,24 additive manufacturing,25 thin films,7 quantum materials,26 and solid-state materials.27 Many review and perspective articles have already been contributed to the field,1,3,4,9,18,28–46 and a list of 25 recent low-cost SDLs is given in Table 1.
| Name | Field | Purpose | Costa | Ref. | 
|---|---|---|---|---|
| a Estimated costs in USD. Abbreviations: carbon nanotube (CNT); additive manufacturing (AM); Autonomous Research System (ARES); LEGO Low-cost Autonomous Science (LEGOLAS); Bayesian Optimization Bartender (BOB); metal–organic framework (MOF). | ||||
| Educational ARES | Education | 3D printing | 300 | 63 | 
| Additive manufacturing ARES | Mat. Sci. | 3D printing | 1000 | 17 | 
| Pioreactor for real-time … culture measurements | Biology | Cell growth | 250 | 64 | 
| Autonomous Research System (ARES) | Mat. Sci. | CNT growth | 5000 | 20 | 
| Closed-loop Spectroscopy Lab: Light-mixing | Education | Color opt. | 50 | 65 | 
| Bayesian Optimization Bartender (BOB) | Education | Color opt. | 200 | 66 | 
| Accelerate Synthesis of MOFs | Mat. Sci. | Crystallinity | 830 | 67 | 
| Evolution of oil droplets … | Chemistry | Evolution | 1000 | 68 | 
| A … robot for discovering … protocell behavior | Chemistry | Evolution | 1000 | 24 | 
| … a configurable 3D printed fluidic platform | Chemistry | Evolution | 2000 | 69 | 
| A microfluidic platform [for] chemical evolution | Chemistry | Evolution | 5000 | 70 | 
| Chemical synthesis robot for nanomaterials | Mat. Sci. | Morphology | 15 ![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 | 71 | 
| Cheap automated synthesis platform | Chemistry | Organic synth. | 450 | 72 | 
| Networking chemical robots | Chemistry | Organic synth. | 500 | 73 | 
| Autonomous … platform for … synthesis | Chemistry | Organic synth. | 10 ![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 | 74 | 
| “The Chemputer” | Chemistry | Organic synth. | 30 ![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 | 75 | 
| 3D printed [microfluidic] autonomous analyzer … | Chemistry | Photometry | 2050 | 76 | 
| High-Throughput [CdSe Nanocrystal Synthesis] | Chemistry | Quantum dots | 2000 | 59 | 
| Crystallization Robot | Mat. Sci. | Randomness | 3000 | 77 | 
| Scientific Inquiry in Middle Schools | Education | Titration | 250 | 47 | 
| LEGO Low-cost Autonomous Science (LEGOLAS) | Education | Titration | 300 | 48 | 
| Autonomous titration for chemistry classrooms | Education | Titration | 600 | 78 | 
| Automated pH Adjustment … | Education | Titration | 650 | 79 | 
| Automatic titrator for intro chemistry labs | Education | Titration | 934 | 80 | 
| Automatic titration for teaching chemistry | Education | Titration | 4160 | 81 | 
What sets our review apart from others is that we explicitly focus on low-cost SDLs, i.e., frugal twins of high-cost SDLs. We hope that this attention to the importance of low-cost SDLs will shift perspectives on the educational and research capabilities of low-cost systems and provide a common reference point for building new solutions.
The question of what is low- vs. high-cost is both a subjective and contextual problem. Monetary cost and space constraints are particularly apparent in educational settings, as indicated by the large fraction of educational SDLs specifically described as low-cost, under 1000 USD,47–49 and which occupy relatively small footprints. This is in part because the final objectives are often based on learning outcomes rather than specific research objectives.
In both contexts, there is a range between monetary costs that can be covered by business-as-usual “spare” monetary resources vs. costs that require dedicated support from grants and other funding sources. For example, the National Science Foundation currently places a threshold of 5000 USD to differentiate between consumables and equipment, above which a purchase must be “adequately justified” on a grant proposal. An example such as the Opentrons OT-2 platform (∼7500 USD starting cost) likely fits more clearly into the “dedicated support” category for many education-oriented systems and somewhere in-between “spare resources” and “dedicated support” for research tasks. Nevertheless, the context depends on a multitude of other factors including the specific research group, institution, country, and socioeconomic status. For example, the monetary amount a research group in a developed country considers low-cost will be significantly higher than what a local school in a developing country would consider low-cost due to practical reasons such as but not limited to lower amounts of funding, greater costs for delivery, unfair pricing, difficulty of foreign exchange, and priority to secure a livelihood.50–52
With an emphasis on chemistry and materials science applications and as part of a broader focus on MAPs and MASS, we walk through topics relevant to low-cost SDLs. First, we describe the development of “frugal twins” that capture the core principles of real-world systems at an education-friendly cost, and present areas where the community benefits from low-cost twins (Section 2). Next, we delineate how educational outcomes and autonomy can equip the next generation of scientists with industry-relevant skills (Section 3). Afterwards, we detail how modularity for hardware and software plays an important role in reducing redesign costs for future systems (Section 4.1). We also illustrate how using a hardware-centric approach when developing SDLs can reduce system complexity by leveraging existing hardware in unconventional ways in comparison to other design approaches (Section 4.2). Next, we highlight how discovery can be accelerated further through high-throughput and parallelized systems (Section 4.3.1). With the growth of cloud infrastructure, we show that cloud experimentation (similar to cloud computing, but for experiments) decentralizes hardware, computing, and domain expertise, reducing the barrier-of-entry for SDLs and enabling robust and efficient batch optimization (Section 4.3.4). Finally, we describe ideas for new frugal twins, suggest potential SDL course outcomes, and discuss how to classify autonomy levels in SDLs (Section 6). To encourage a continuing discussion, we also provide a list of public, community-driven discussions (Section 7).
Any frugal twin of an SDL is located within a trade-off spectrum between cost and research capabilities, with the balance between two factors determining its usefulness for particular education and research activities (Section 2.1). We show in Fig. 1 some illustrative examples of these trade-offs for materials science and chemistry, and a list in Table 1 of various low-cost SDLs.
|  | ||
| Fig. 1 Spectrum of frugal twin capability vs. cost trade-off. (A) From left to right: solid dispenser for colored wax,55 chocolate 3D printer (*)56 with a 3-point bend test,57 arc melter (*),58 metal 3D printer (**) and mechanical testing system for metals (**). (B) From left to right: liquid handling for dye mixing,55 automated titrator built from LEGO,48 Jubilee sonochemical synthesis platform used with a plate reader (*) for absorbance and fluorescence measurements,59–61 automated liquid handler (**) integrated directly with a plate reader (*),61,62 and Chemspeed integrated with an HPLC-MS/MS. Images marked with (*) were reproduced with permission under the Creative Commons Attribution license (CC-BY). (**) Marked images were rendered using ChatGPT 4.0. | ||
In the materials science experiment, the high-cost capability is to 3D print various metal alloys at extremely high temperatures, as can be accomplished, for example, by a metal 3D printer. As cost decreases, the capabilities of frugal twins stray further away from the high-cost capabilities (Fig. 1). The arc melter can form alloys at high temperatures, but cannot 3D print them. The next drop in cost renders the instrument only capable of toy problems: the 3D chocolate printer can form and 3D print various chocolate compositions. Lastly, the “Hello World” of a materials science SDL, at the lowest cost shown, is the solid dispenser for colored wax, capable of producing candle wax in customized colours.55
Likewise in the chemistry context, the high-cost capability of multi-step, multi-batch synthesis and characterization can be accomplished by a Chemspeed integrated with high-performance liquid chromatography coupled with mass spectrometry (HPLC-MS). At a significantly lower cost, the Opentrons OT-2 platform can perform single-step, multi-batch synthesis and limited characterization techniques using an integrated plate reader, focused primarily on biological applications.62,82 The next lowest in cost is the Jubilee system which can be adapted to perform sonochemical synthesis and used with an offline plate reader.59,60 The automated titrator built from LEGO, one step lower in cost than the Jubilee, can no longer perform synthesis but only multi-batch liquid dispensing, and uses a pH probe for characterization.48 Lastly, the cheapest SDL is a liquid handler for dye mixing, tasked with obtaining a customized color as characterized by a light sensor.65,66,73,83,84
We note that it may not always be possible to create a useful frugal twin for an SDL. For example, a large part of cutting-edge research relies on expensive analytical instrumentation to be able to obtain sufficient information about experiments. In the context of compound characterization, instruments such as nuclear magnetic resonance spectroscopy (NMR) and HPLC-MS apparatus can cost hundreds of thousands of dollars to acquire and operate. However, infrared radiation can be a cheaper alternative to expensive analytical techniques like the ones mentioned before for tasks such as in-line reaction monitoring.85,86 This can be sufficient for low-fidelity reaction monitoring but is incapable of unknown compound characterization. Sacrificing research capabilities for lower costs is sometimes infeasible depending on the task at hand. To perform robust unknown compound characterization, low-cost (≤10![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 USD) alternatives to NMR or HPLC-MS do not currently exist on the market.
000 USD) alternatives to NMR or HPLC-MS do not currently exist on the market.
Analogous to the trade-off between cost and research capabilities, there can be a trade-off between throughput and fidelity.38 For example, a benchtop NMR is lower cost (≥40![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 USD)87 and easily adapted to flow chemistry SDLs, but sacrifices measurement precision and accuracy. The cost/benefit analysis must consider the expected speedup in the rate of progress for a lower fidelity analysis tool and the cost from potential inaccuracies compared to the gold standard analysis tools.
000 USD)87 and easily adapted to flow chemistry SDLs, but sacrifices measurement precision and accuracy. The cost/benefit analysis must consider the expected speedup in the rate of progress for a lower fidelity analysis tool and the cost from potential inaccuracies compared to the gold standard analysis tools.
Preliminary evidence acquired from a low-cost SDL can serve as a proof of concept for solving an analogous research problem that can then justify the funding for a more capable high-cost SDL. The low-cost SDL may have lower accuracy and reliability, but still provide evidence of feasibility for the proposed research, as well as answering some relevant research questions. In addition, the low-cost SDL can act as a proxy for estimating the acceleration factor that an SDL can offer in comparison to manual experimentation.
An example that compellingly captures how a frugal twin can promote rapid prototyping and teach transferable skills to students in a low-risk setting is the MIK-I, a frugal twin of “The Machine”.72,88 The initial goal for researchers is to build “The Machine”.88 However, prior to assembling this SOTA research tool, they built MIK-I with approximately 450 USD (Fig. 2), the main purpose of MIK-I's creation being to familiarize the researchers with automated synthesis platforms. The frugal twin is designed to handle liquids of different physicochemical properties such as density, viscosity, and surface tension. However, when building MIK-I, liquid handling became an issue because the pumps needed to be calibrated differently for each liquid in the system. This problem gave students hands-on experience with an issue that would also occur with the SOTA research tool, which would allow them to solve the eventual problem more readily. To evaluate the scope of MIK-I, the researchers successfully performed C–C bond formation reactions widely used in organic chemistry such as the Claisen–Schmidt condensations, Suzuki–Miyaura coupling, Knoevenagel condensations, and Morita–Baylis–Hillman reactions in an automated fashion (Fig. 3).72
|  | ||
| Fig. 2 MIK-I, a low-cost automated synthesis workflow platform. (a) Peristaltic pumps controlled by a Raspberry Pi, (b) synthesis reactor, (c) reagent bottles.72 | ||
|  | ||
| Fig. 3 The scheme for a general crossed aldol condensation reaction as a proof of concept.72 | ||
|  | ||
| Fig. 4 Example of a titration setup that can be equipped with automation, voice activation, computer vision, high-throughput capabilities, and machine learning. Adapted with permission from ref. 80, 81, 90 and 91. Copyright 2016, 2019, 2021 American Chemical Society. Adapted with permission from ref. 79 under the Creative Commons Attribution license (CC-BY). Copyright Elsevier 2022. | ||
A programmable titrator can also support a variety of other educational tasks. Students can be tasked with developing their own automation methods for this previously manual procedure, a problem that is engaging, encourages critical thinking and provides additional opportunities for learning. Typically, students develop their own heuristics, such as adding large amounts of titrant at the start of the experiment and slowly reducing the addition of titrant until the endpoint is reached, with the goal of optimizing for efficiency and accuracy. An automated titrator can accelerate the pace at which students can quantify and test multiple titration strategies for optimal efficiency and/or accuracy.78
The applicability of skills acquired from educational settings to research and industry settings is critical,92 and modification of a titration experiment presents a direct example of this transferability. For instance, Pomberger et al.79 designed their titration apparatus with high-throughput batch samples, and active machine learning (ML) to model the pH response of multi-buffered polyprotic systems, a challenging yet important task for many chemical labs and industrial plants. For context, educational titration setups with a single-buffered system like those mentioned above can be accurately described by the Henderson–Hasselbalch equation;79 however, this does not hold for multi-buffered polyprotic systems.79 Although the multi-buffered polyprotic problem has greater complexity, students can learn to adapt solutions to fit their needs and work around the limitations. By exploiting the benefits of modularity (outlined in Section 4.1), students can choose from several optimization algorithms such as ML, proportional-integral-derivative control, and model predictive control.79 Although automated solutions improve efficiency and robustness, an educational apparatus should also provide the option for a student to be put back in the loop (i.e., manual mode) because it can provide the student with more direct interactions with the hardware.
For the light-mixing example, Baird and Sparks65 developed a system known as Closed-loop Spectroscopy Lab: Light-mixing (CLSLab:Light) as a teaching and prototyping platform that entails mixing the light from red, green, and blue light-emitting diodes (LEDs) (Fig. 5). The demo employs light rather than matter while retaining the principles of SDLs. Taking language from the software community, it is a “minimal working example” of an SDL. The primary benefits of this device relative to more costly, time-intensive, higher-footprint (and, of course, more chemistry-relevant) liquid handlers such as Opentrons OT-2,82 Sidekick,94 evoBOT,95 OpenLH,96 OTTO,97 and OpenWorkstation98 are that it costs under 100 USD, requires less than an hour of setup time, takes up minimal desk space, and does not require chemical consumables. While CLSLab:Light cannot provide experimental data directly relevant to materials discovery, its features make it a prime candidate for classroom settings, allowing each student or team to obtain hands-on experience. Additionally, the platform can be used to prototype concepts such as creating a network of geographically distant experiments and implementing advanced optimization topics such as batch (Section 4.3.1) and multi-fidelity optimization (Section 4.3.2). Over a dozen tutorials and examples for basic optimization, advanced optimization, device communication, and data ecosystems are given in the Closed-loop Spectroscopy Lab documentation.
|  | ||
| Fig. 5 The CLSLab:Light demo. (a) A summary schematic of CLSLab:Light. (b) An annotated image of the CLSLab:Light. (c) Was adapted with permission from ref. 65. Copyright Elsevier 2022. | ||
CLSLab:Light has also evolved as an example and suggestion of SDL best practices. The software is modular, and open-source. Build instructions83 and a video build tutorial are provided, with parts lists designed to be modular and robust to supply chain issues. Additional features of the CLSLab:Light platform that helps students to learn and implement best practices are summarized in Table 2.
| Topic | Pain point | Resources | CLSLab:Light | 
|---|---|---|---|
| a Detailed setup instructions for MQTT and MongoDB are provided in Baird and Sparks.83 | |||
| Version control | Keep detailed, accessible, and efficient snapshots of your code at any point in time | Git, GitHub | GitHub repo/history | 
| Project generator | Streamline setting up modular code for a new project while conforming to best practices | PyScaffold, cookiecutter-pypackage | PyScaffold and initial commit | 
| Python packages | Make installation and setup easier for users | PyPI (pip), Anaconda | PyPI via setup.cfg | 
| Unit tests | Catch bugs and ensure functionality | pytest | Tests folder | 
| Continuous integration | Regularly and automatically validate code, run tests, and publish new versions | GitHub actions | Actions via ci.yml | 
| Secure wireless communication | Safely communicate within and between software and hardware | MQTT | MQTTa host/client | 
| Data management | Store data that is “Findable, Accessible, Interoperable, Reusable” (FAIR) | MongoDB, SQL | MongoDBa main.py | 
| Installation-free notebook tutorials | Make it easy for users to learn, test, and adapt the functionality | Google Colab, Binder | Tutorials page | 
| Documentation web host | Host a website with your documentation for free | Readthedocs, GitHub pages | Readthedocs site | 
| Documentation builder | Package your documentation, tutorials, and API as web-friendly HTML files | Sphinx, Jekyll | Source files, conf.py | 
Baird and Sparks83 have explored the commercialization of CLSLab:Light as an at-cost kit, with two successful rounds of crowdfunding via the GroupGets platform (see Campaign #1112 and Campaign #1129), totalling 39 kits; many kits have already been used in classroom settings at the University of Toronto, Massachusetts Institute of Technology, and University of Chicago. For continuing discussion related to packaging open-source hardware as commercial kits, see Discussion #124.
CLSLab:Light has already seen success, but domain-specific communities (biology, chemistry, solid-state materials science) will benefit from their own minimal working examples. Baird and Sparks83 have explored extensions that adapt the instructive lessons from CLSLab:Light to other domains. For example, using the modular software and hardware components, Baird and Sparks83 extend the platform to a liquid-based color-matching task (Closed-loop Spectroscopy Lab: Liquid-mixing (CLSLab:Liquid)) which uses the prototypical example of mixing red, yellow, and blue food coloring dyes (Fig. 7).
The inherent simplicity of the color matching application as demonstrated in CLSLab:Light has also inspired others to employ it in other settings. For example, Ginsburg et al.84 have implemented a color matching application in the context of their workcell execution interface science factory architecture.93 It is designed with modular instrument interfaces and workflow specifications used to implement an application that connects an Opentrons OT-2 liquid handler, liquid replenishment robot, and camera station (see Fig. 6). The Globus platform is employed45 to link optimization algorithms running on remote computers and to publish results to a remote data portal.
In sharp contrast to chemistry applications, low-cost examples of SDLs for solid-state materials science are effectively non-existent. To address this gap, an idea for a solid-state materials science extension involving the melting and mixing of colored wax powders is described in Section 6.1.
|  | ||
| Fig. 6 A photograph and diagram of the robotic work cell (indicated by each blue box) used for a WEI-based color mixing experiment. The Sciclops picks up a 96-well plate from its plate storage towers and transfers it to its exchange location. The PF400 then transfers the plate to the Opentrons OT-2, which mixes the three target colors. When the liquid reservoirs in the system are empty, the custom robot, Barty, refills them by using peristaltic pumps. Once mixing is completed, the plate is transferred to the camera location to be imaged. The plate is then looped between the camera and the Opentrons OT-2 until the experiment is over. The empty work cells (i.e. blue boxes) provide additional space for the robotic platform to expand its capabilities, showcasing modular design. Reprinted from Ginsburg et al.84 | ||
|  | ||
| Fig. 7 The CLSLab:Liquid demo. (a) A summary schematic of CLSLab:Liquid. (b) An annotated image of the CLSLab:Liquid.55 | ||
Nevertheless, with a specific, unique, and focused research problem, Gutierrez et al.68 take advantage of the full control over the end-to-end design of a novel, custom-built chemorobotic platform. This system is capable of exploring a diverse range of oil-droplet formulations which was designed to improve the understanding of evolutionary dynamics. Many low-cost components such as a RepRap 3D printer, camera, Arduino microcontroller, and 3D printed parts are used to gain the desired functionality for this specific experimental task.68 Later, this robot was redesigned with a 3D printed arena for droplet mixing which could be easily transformed into different environments, adding a new independent variable to experimentation.69 With high-throughput experimentation and automation, it is not crucial for the robot to be extremely accurate or precise, due to the ease of performing multiple replicates to reduce the uncertainty of results. In this oil-droplet system, several replicates are performed and the uncertainty of each measurement is accounted for before drawing conclusions from general trends.69 Full control over the design of the experimental apparatus is invaluable for niche research problems.
The modular Geneva wheel platform engineered by Salley et al.103,104 is another example of a low-cost SDL designed end-to-end to leverage the advantages of low-cost components and custom parts. The Geneva wheel platform is capable of rotating 24 reactors using the Geneva mechanism and a stepper motor which enables it to run 24 parallel reactions. For every rotation, the necessary reagents are dispensed serially into a reactor with peristaltic pumps. Each reactor has a magnetic stirring module which stirs 24 reactions in parallel. In addition, the sampling and cleaning modules can move in x, y, and z directions along the platform frame which enables in-line measurements, sample extraction, transfer between vials, and cleaning of reactors to prevent cross-contamination.103 Due to its modular nature, it can be easily reconfigured for the synthesis of gold nanoparticles, polyoxometalates, or other coordination compounds.71,103–106 From this system, an important takeaway is that “automation can only be so cheap before significant frustration is experienced”.102 In this example, Salley et al.102 replace cheap aquarium pumps with motor-controlled stepper pumps, which offer better control and accuracy over liquid dispensing while still remaining affordable.
Although the “Chemputer” is not as low-cost as our other considerations, it is worth mentioning because of its end-to-end design for universal chemical synthesis. The Chemputer not only has custom 3D printed parts and low-level electronic components such as syringe pumps, but also interfaces with existing chemistry instruments that may already be in the lab such as hotplates, photoreactors, flow reactors, a rotary evaporator, benchtop NMR spectrometers, and in-line spectrometers (UV-Vis, infrared spectroscopy (IR) and electrospray ionization-MS) to perform organic synthesis and characterization.107–116 Given its wide range of research capabilities, the “Chemputer” can cost over 30![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 USD with a setup time of 1 week. Manzano et al.74 develop the “mini-Chemputer,” which reduces the barrier of entry from 30
000 USD with a setup time of 1 week. Manzano et al.74 develop the “mini-Chemputer,” which reduces the barrier of entry from 30![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 USD to 10
000 USD to 10![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 USD, and 1 week to 1 day of reported setup time. Having full control over the end-to-end design of this system enabled the Cronin group to develop both the Chemputer, and the low-cost, portable mini-Chemputer.
000 USD, and 1 week to 1 day of reported setup time. Having full control over the end-to-end design of this system enabled the Cronin group to develop both the Chemputer, and the low-cost, portable mini-Chemputer.
Another example of end-to-end design is the Jubilee platform created by Vasquez et al.60 at the University of Washington.60 Originally, Jubilee was designed for multi-tool fabrication tasks and more. Some examples of its intended application ranged from multi-head 3D printing to multi-pen plotting, and simple liquid handling through syringes. Jubilee presents a modular tool-changing design that accommodates user-created tools and beds (Fig. 8a).60 Politi et al.59 have demonstrated the use of this versatile, multi-tool platform configured for automated ultrasound application (Fig. 8c), along with an Opentrons OT-2 liquid-handling robot and a well-plate spectrometer for the synthesis of CdSe nanocrystals. In this example, the authors were able to test 625 unique sample conditions, in triplicate, in less than two months, ensuring repeatability and reducing uncertainty on the results. The components to build the Jubilee platform can be individually sourced from readily available and 3D printed materials or even purchased as a kit, for a total cost of ≤2000 USD. Furthermore, the project is fully open-hardware and open-source, resulting in a series of resources, from build instructions to an active Discord channel for informal communication, and requires no previous building skills, which significantly lowers the barrier to its implementation in materials research spaces. No modification of the off-the-shelf, commercially available sonicator was required and simple electronics allowed for instrument interfacing. There is currently no commercially available solution for automating single-point sonochemical processing, making this example a great demonstration of how SOTA technology can be easily democratized through “maker skills” (3D design and fabrication, electronics, and programming) and cheaper electronics. While successful, the study by Politi et al.59 relied on three different instruments to conduct the workflow. It is however possible to integrate all the synthesis, processing, and characterization tools onto the same Jubilee platform, given its automatic tool-changing capabilities, creating a closed-loop experimental system. Finally, it should also be noted that systems like Jubilee, which originated from the digital fabrication space, might require additional hardening and possible small materials adjustments before they can be fully trusted as science tools (Fig. 8).
|  | ||
| Fig. 8 (a) The blueprint design of the Jubilee system which can equip modular multi-headed tools. (b) Example of the Jubilee system dispensing liquids into a 96-well plate. (c) The workflow of adapting Jubilee into the automated sonochemical synthesis of nanocrystals. Adapted from ref. 60 with written permission from the authors under the Creative Commons Attribution license (CC-BY). Adapted from ref. 59 with permission from the Royal Society of Chemistry. | ||
First, the robots were able to communicate by uploading results to the cloud and screening for results from other robots via Twitter. This system prevents robots from duplicating others' reactions and allows them to explore more efficiently as a team. Using a network connection, multiple physically separated robots can be synchronized in real time. Caramelli et al.73 use a chemical oscillator based on the Belousov–Zhabotinsky (BZ) reaction to showcase real-time control performance. The oscillation period is synchronized in real time between robots with an uncertainty of 2 s.
Reproducibility in the context of parallelization is necessary for accurate data acquisition. In one experiment, the network of robots collaboratively explored the conditions for the crystallization of tungsten POM clusters. Crystallization is a stochastic process, which makes it challenging to determine its ideal conditions, particularly on small scale. Nevertheless, the network of robots found six sets of conditions that offered reproducibility between 11.8 and 50%, which may be deemed acceptable for a stochastic process on a small scale.
Lastly, success in gameplaying offers the insight that large amounts of data enabled by powerful computation can push ML models to reach superhuman performance.117 Highly robust and reproducible materials chemistry SDLs can generate large amounts of data with low-cost experimentation and parallelization. Caramelli et al.73 demonstrated that two robots can compete against each other in a well-defined game to discover novel colors in the context of an azo coupling reaction. The rules are simple: novel results are rewarded, and common results are punished. Each time that a loser emerges at the completion of a game, the loser can change strategies by redefining their reaction space. The goal of the gamification of such an experiment is for the model to develop an optimal strategy to maximize the objective without human guidance. The success of this simple experiment provides the groundwork for similar SDLs to solve more complex problems through a low-cost and parallelized approach.
Given that devices inevitably break down at times, incorporating modularity into SDLs reduces the time and cost of maintenance. If one component breaks, then only that small portion of the instrument needs to be repaired or replaced. In addition, with smaller modular parts, debugging is simplified since each individual component can be tested separately, quickly determining the points of failure.
An SDL should be composed of a core infrastructure capable of interchangeably adapting to domain-specific requirements such as but not limited to liquid handling, solid dispensing, and thin-film manufacturing. This is more cost-effective than building a fixed, domain-specific system capable of performing all the desired tasks for only one given type of experimentation. After the first discovery campaign is completed, the cost of redesigning an inflexible SDL for further work could be much higher than for a modular system. To reduce the redesign cost for future systems, we need to incorporate modularity at the early conception stage of building any SDL.
Sometimes even small design choices can provide significant advantages and flexibility for an automation platform. In this context, the Jubilee60 platform is a great example of hardware modularity. In fact, the platform was designed in an application-agnostic fashion where tools can be interchangeably loaded on the platform, which can then automatically pick them up and return them after their task is complete. All of this is accomplished through a locking mechanism that allows the tool to lock onto the central carriage and a tool template pattern which ensures constant tool location. Another advantage of Jubilee is its ability to host not only simple sample transfer tools, such as a liquid handling pipette or syringe, but also tools for processing or manipulation and subsequent characterization such as a sonicator.59 This is not possible with commercially available liquid-handling robotic platforms, which can only complete a limited set of tasks before the labware needs to be moved onto a different automation instrumentation. The flexibility of Jubilee, in fact, allows for rapid reconfiguration of the platform for various applications, such as the nanomaterials synthesis shown by Politi et al.59
In some scenarios, software development best practices have been applied to chemistry and materials informatics optimization and workflow orchestration packages. As a set of computer instructions (codebase) evolves and matures, it often involves organizing lines of code into distinct blocks (functions) that perform specific tasks, and then further organizing these blocks into categories or groups (classes and modules) to create a more structured and manageable system.
A practical example of this is Gryffin,120 a Bayesian optimization tool that supports continuous and categorical variables, physicochemical descriptors, and batch optimization. Gryffin is written in Python and uses a common structure called a class to organize its code using “object-oriented programming.” Object-oriented programming is a style of coding involving the creation and use of ‘objects’, which are self-contained pieces of code that can store information and perform tasks.
In the case of Gryffin, an “instance” (i.e., copy) of an object is created based on the Gryffin class, which is referred to as “object instantiation” in programming terms. This object can be customized by supplying information about the variables to be tuned and the objectives to be optimized. Once this object has been created, you can use its built-in functions (class methods) to perform various operations. For example, you can use the recommend function to get recommendations from Gryffin, or the build_surrogate function to build a surrogate model—a simplified representation of a more complex system.
Likewise, alab_management and Bluesky utilize classes. For example, alab_management offers base classes for devices and tasks. A user only needs to create a custom class for a specific device or task once that can be reused, making it unnecessary to copy-paste “boilerplate” code. Bluesky, designed with synchrotron facilities in mind, uses “motors” and “detectors” to clarify the difference between hardware that performs tasks based on inputs (e.g., temperature controllers, sample changers) and characterization hardware that produces research data (e.g., photodiodes, CCD cameras, spectrometers).121
While the hardware associated with low-cost SDLs may not be as performant as high-cost examples, the same SOTA software that is deployed on a high-cost SDL can be deployed to a low-cost SDL with minimal effort. This enables both rapid, low-risk prototyping (Section 2.2) and opportunities to integrate low-cost and high-cost experiments via multi-fidelity optimization (Section 4.3.2). A more general discussion of SOTA optimization with workflow orchestration tools and algorithms is given in Section 4.3.
|  | ||
| Fig. 10 Examples of human-in-the-loop vs. human-inspired vs. hardware-centric design. (a) Wiping a needle by hand vs. (b) wiping a needle using a cloth attached to a robot arm vs. (c) helical insertion into a sponge. Adapted from ref. 17 under the Creative Commons Attribution license (CC-BY). Copyright © 2021, this is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply. (d) Mixing liquids together in a traditional lab setting using manual pouring vs. (e) using a peristaltic with a digitally controlled stir plate vs. (f) leveraging a bidirectional peristaltic pump to perform both liquid transfer and mixing. | ||
For example, robots can be made to use existing, human-centric lab equipment without modification.23 However, without complex sensing capabilities such as computer vision, a hard-coded system is sensitive to slight perturbations in absolute positions and orientations. This often requires extensive routine calibration and is tedious to implement when integrating new scientific instrumentation. The introduction of computer vision to recognize particular objects can introduce greater flexibility but suffers from the larger startup cost of the vision algorithm and may not elegantly handle all possible situations. Additionally, glassware is an essential component of any chemistry lab, but it is incredibly challenging for computer vision to recognize transparent objects.122
An alternative that combines the benefits of hard-coded routines and complex computer vision decisions is to use fiducial systems such as AprilTags,123,124 which are used by Wang et al.125 and Xu et al.122 (Fig. 11). These can be thought of as QR codes or bar codes attached to pieces of equipment to help with relative positioning. However, the true value is not simply to identify hardware with unique IDs; the AprilTag detection software allows for computation of “the precise 3D position, orientation, and identity of the tags relative to the camera.” More recent work also enables flexible fiducial markers to be placed on circular, annular, and other shaped objects126 such as vials. Likewise, Krogius et al.126 demonstrate the use of nested, recursive layouts for high dynamic range. While there are challenges associated with mimicking human behavior, there remain excellent use cases for the human-inspired approach.
|  | ||
| Fig. 11 AprilTags, a type of fiducial marker, are affixed to a base plate to allow for accurate detection of its position and orientation (six degrees of freedom) relative to the camera. Reproduced from ref. 122 with permission from H. Xu, Y. R. Wang, S. Eppel, A. Aspuru-Guzik, F. Shkurti and A. Garg, arXiv, 2021, https://doi.org/10.48550/arXiv.2110.00087. | ||
In terms of low-cost SDLs, Deneault et al.17 provide a prudent example of leveraging the existing robotic setup (a 3-axis printer) and moving the syringe into and against a fixed sponge with a helical motion to clean the external surface of the syringe (Fig. 10c). When cleaning a syringe, a human might run it under water, wipe it with a cloth (Fig. 10a), put it in an ultrasonic cleaner, or replace the tip entirely. A robotic arm with human-inspired design could be equipped with a cloth to wipe the syringe tip (Fig. 10b), or remove the tip and place it in an ultrasonic cleaner. However, helical insertion into a sponge leverages existing equipment at a low cost. While it has limitations (e.g., how well is the syringe tip cleaned relative to more standard procedures; cross-contamination), it is an informative example of hardware-centric design. Another example is liquid handling that is dual-purposed for both dispensing and mixing, where mixing occurs by cycles of forward and reverse pumping to agitate the solution (Fig. 10f) instead of using a magnetic stir bar and stir plate (Fig. 10d and e).
By designing equipment with desired material states and processing conditions in mind, we create hardware that is time- and cost-efficient for autonomous experimentation. Especially in low-cost settings, we should try to do as much hardware-centric design as possible. This will both lower cost and require less equipment.
Here, we draw from the “Pareto principle,” described by Jana and Tiwari127 as a commonplace case where “80% of the outcomes are controlled or decided by 20% of the activities or factors. For example, 80% of the total profit is generated by 20% of the product categories, or 80% of the maintenance expenses are incurred by 20% of the machines.” Applying the Pareto principle, the last 20% of automation may require 80% of the total effort towards bringing full autonomy to an experiment. A common example is sample transfer between automated experimental modules, especially of solid materials or sample containers. For example, samples often need to be moved between synthesis and characterization equipment, such as the transfer of wellplates between an OT-2 robot and a plate reader in Vaddi et al.101
In the low-cost automation literature, there are many examples which incorporate automated modules while leaving experimental step(s) as human-in-the-loop because of high opportunity cost (i.e., the benefits that are lost when one makes a decision over an alternative – such as the lost opportunity for students to learn hands-on from running an experiment manually when it is automated), time constraints, and tasks where humans are naturally better than robots. Xie et al.67 automate the design and synthesis of metal–organic frameworks (MOFs) using Bayesian optimization (BO) and a RepRap 3D printer but leave humans to transfer the sample from the robot to the X-ray diffraction instrument. Since many of these complex characterization techniques are costly and designed for humans, the time and cost of building another robot to perform sample transfer exceed the benefits gained from automating every single task in the workflow for greater efficiency. Rodriguez et al.128 provide an excellent example of automating the most effective process steps such as synthesis (with an Opentrons OT-2 liquid handling robot), melting point determination, and electrochemical characterization for discovering new deep eutectic solvent electrolytes. Rodriguez et al.128 did not automate the processes of sample transfer or handling of existing equipment such as a dehydrator and vacuum oven because of the great opportunity cost.
In a similar vein, most of the experimentation in Salley et al.,103 Cao et al.,129 and Lachowski et al.130 is automated except for the characterization tools which include XRD, viscosity analysis, and UV-Vis spectroscopy, respectively. Conversely, Chen et al.131 develop a new low-cost system, RAMSAY-2, for automating the burdensome task of sample preparation for mass spectroscopy. It involves two robot arms which aliquot solutions, incubate the samples with the reagents, deliver the samples to the ion source of the mass spectrometer, and initiate data acquisition.131 This approach significantly accelerates the characterization workflow but is a non-trivial solution that requires substantial time and effort. It is also important to consider the opportunity cost of automating tasks that are trivial for humans but challenging for robots due to the consequential researcher time spent. Automation is most profoundly effective when researchers are freed from performing tedious, time-consuming, and repetitive tasks. Another opportunity cost is the amount of money required to acquire instruments that are already automated. For example, an automated differential scanning calorimetry (DSC) instrument can be purchased for ∼50![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 USD.132 However, Rodriguez et al.133 automate DSC with a low-cost system of 1080 USD, which can run samples in 15 minutes, with up to 96 samples at a time.133 A cost/benefit analysis of the different design approaches and associated opportunity costs remains necessary to automate any solution.
000 USD.132 However, Rodriguez et al.133 automate DSC with a low-cost system of 1080 USD, which can run samples in 15 minutes, with up to 96 samples at a time.133 A cost/benefit analysis of the different design approaches and associated opportunity costs remains necessary to automate any solution.
Rather than polarizing the community between fully autonomous vs. human-in-the-loop generalist setups, we believe it is wiser to meet in the middle and pair the tool to the task. This type of experimentation and exploration, enabled by low-cost frugal twins, can form a rich test bed in classroom settings. For example, students could be tasked with a design problem and divided into three groups: human-in-the-loop, human-inspired robotic design, and hardware-centric design. The students can present their experiences, learn from other groups, and discuss trade-offs between each approach: how many experiments could be performed within the first day for each group? Within the first week? This can be replicated for different experiments to solidify best practices related to autonomous system design and cross-pollinate seemingly disparate design approaches.
|  | ||
| Fig. 12 Illustration of a network of parallel chemical synthesis robots working towards a common optimization goal.73 Reproduced from ref. 73 with permission under the Creative Commons Attribution license (CC-BY). Copyright © 2018, Caramelli et al. | ||
While the batch optimization described earlier implies that all experiments within the batch need to be completed before moving on to the next one, the complementary topic of asynchronous optimization uses resources as soon as they become available. This is important when experimental runtimes can vary depending on the input parameters: thereby, equipment downtime is reduced. Whether using batch or asynchronous optimization, care must be taken so that redundant or low-value experiments are not suggested by considering either completed or in-progress experiments. Examples of methods that factor in-progress experiments into the optimization scheme include Monte Carlo-based joint acquisition optimization and models where predictions for in-progress experiments are sequentially added as “fantasy datapoints” before suggesting the next experiment in the batch (see Appendix F2 of Balandat et al.134).
Several examples of cloud-based SDLs exist.65,73,138–144 Many commercial solutions have a heavy focus on biology applications such as Emerald Cloud Lab,139 the former Lilly-Strateos lab,140 Culture Biosciences,141 and Arctoris.142 On the other hand, solid-state materials science cloud laboratories are effectively non-existent except for some minor capabilities of biology- and chemistry-focused labs. While existing cloud labs have primarily targeted industry users, a noteworthy example beginning to target academic users is CMU Cloud Lab.145–150 This is a partnership between Carnegie Mellon University and Emerald Cloud Labs to build a subscription-based, 40 million USD facility with over 200 types of scientific instrument. Unlike typical user research facilities, academic and industry users can conduct an end-to-end experimental workflow and acquire the results from anywhere around the world, 24/7, 365 days a year.145–150 Typically, a research group needs to secure funding for the reagents, cost of the instrument, and upkeep costs to perform an experiment. Armer et al.151 outline several systemic reasons for the lack of adoption of cloud-based science, such as the lack of initial cloud access to gain preliminary data for grant applications, the lack of cloud science grants in general, the lack of academic training, and the costs for a cloud lab subscription in addition to university facility expenses. To tackle some of these concerns, having an academic institution such as CMU build its own cloud labs will reduce some of the barriers of entry for academics to access high-cost scientific equipment.151 In addition, CMU Cloud Lab promotes open science, a recent movement that aims to enhance the transparency, accessibility, inclusivity, and credibility of scientific knowledge,152 where problems and results can be shared easily.
A platform such as CMU Cloud Lab typically requires extensive capital and expertise to develop onboarding, security, access restriction, priority queuing, and workflow orchestration protocols. It also relies on human-in-the-loop sample transfer between modules, necessitating full-time technicians to perform menial tasks. The costs associated with these infrastructure components inevitably get passed onto the user which can be prohibitive for educational settings and citizen science. Since low-cost SDLs operate at a smaller scale and the risks associated with data leakage and malicious threats are lower, they are a great platform for prototyping SDL infrastructure with low operational costs. For example, free, open-source tools may be implemented into low-cost SDLs, such as Bluesky for workflow orchestration,121 secure, encrypted IoT-style communication through platforms such as HiveMQ,83 and the Google Authentication application programming interface for security measures.121 By leveraging the advantages of rapid, low-risk prototyping benefits of SDL frugal twins described in Section 2.2, we envision a low-cost SDL cloud lab that can act as a test bed for research-grade cloud experimentation ecosystems, but with dramatically lower operational costs. See Discussion #62 and Discussion #91 from Section 7.
Additive manufacturing (i.e., 3D printing) is a natural place for citizen science, as it is low-cost, operationally fairly safe, easy to learn with the abundance of online resources, and adaptable to many different objectives. For example, Deneault et al.17 developed an SDL known as Additive Manufacturing Autonomous REsearch System (AM ARES) for optimizing the print parameters of several materials for additive manufacturing. This is a low-cost additive manufacturing SDL that uses a 300 USD commercial 3D printer with a custom syringe extruder, Raspberry Pi controllers and webcams, and software that will be released as open-source (Fig. 13). The authors use BO to guide the selection of 3D print parameters for latex caulk with silicone additives, attaining excellent extrusion properties after 100 iterations. In addition, AM ARES performed self-calibration for three different unknown source filaments, which resulted in better performance than default manufacturer specifications in an average of 15 experimental iterations. Although this system is robust, low-cost, and a stepping-stone for many to learn about SDLs, there is yet to be widespread adoption due to the lack of educational infrastructure such as open-source software, course materials, and a step-by-step build guide.
|  | ||
| Fig. 13 A simplified closed-loop workflow of the AM ARES platform. Reproduced from ref. 17 with permission under the Creative Commons Attribution license (CC-BY). Copyright © 2021. This is a U.S. government work and not under copyright protection in the U.S.; foreign copyright protection may apply. | ||
To address this problem, the project was extended between the US Air Force Research Laboratory and Airship Consulting to create ATHENA, an affordable AM ARES system with open-source software (ARES OS 2.0) and off-the-shelf hardware. This initiative aims to make SDLs and autonomous experimentation systems widely accessible in grade schools, trade schools, and universities. ARES OS 2.0 is a platform-agnostic, web-facing software framework for autonomous experimentation SDLs which takes much of the software development burden from the researcher. The goal is to provide a library of open-source modules for all to use and contribute back to the growing community, with the intent that “Anyone Can Download an Autonomous ‘Research Robot’”.155 ATHENA is an example of the movement towards low-cost autonomous experimentation systems/SDLs to improve access to citizen scientists and especially under-served communities through open-source software and low-cost systems.
We do not have the solution for safeguarding SDLs, but methods exist to make it harder for ill-intentioned people and organizations to engage in harmful behavior and easier for researchers to implement preventive strategies against the (accidental) synthesis of harmful substances. The key is to address this problem early, quickly, and judiciously through governance, regulations, standards, education, awareness, and self-adherence to ethical use.
There are valuable open source practices that can be learned and adapted to low-cost SDLs because there are potential risks associated with open sourcing, such as open access to hazardous information or datasets and the potential misuse of research tools. To mitigate these risks, a cultural shift towards open methodology and open review may help regulate the dissemination of malicious code, data, or materials.160 Creators of SDLs should also consider designs which mitigate misuses or failure modes which would endanger lives or property. For example, incorporating steps to assess the toxicity of autonomously generated substances can prevent the release of unknown toxic chemicals into the environment.161
To address the lack of solid-state materials science SDL demos, we propose a solid-based color-matching demo extension (Closed-loop Spectroscopy Lab: Solid-mixing (CLSLab:Solid)) that uses a low-cost mobile robot arm, mixtures of granulated colored wax powders (Fig. 14), and a halogen lamp. Similarly to moving from a light-mixing to a liquid-mixing demo (Section 3.1.2), the solid-mixing demo requires hardware and workflow changes. At the start of the experiment, a robotic arm will pick and place one tealight candle in a holder from a stacked array of holders in a storage array onto a motorized turntable. The turntable will then move the candle holder to a position beneath a funnel connected to red, yellow, and blue wax powder dispensers. The candle will then be positioned beneath a heat source (e.g., halogen lamp) to melt and convectively mix the wax, followed by color sensing using the same sensor as CLSLab:Light and CLSLab:Liquid. When the candle holder returns to its original position on the turntable, the robotic arm will pick it up and place it into a separate storage/waste area.
|  | ||
| Fig. 14 A summary schematic of the CLSLab:Solid demo, which is envisioned as a minimal working example for an inorganic solid-state SDL. Taking from the light- and liquid-based color-matching demos, the task is to find the optimal mixture of wax powders and processing conditions to reach a desired, solidified wax color. This demo incorporates more advanced features than other demos due to need to handle and characterize solid samples. The demo is intended to be have reasonable trade-offs between the monetary cost, the time required for setup, and the device footprint.55 | ||
Moving one step further is the idea of a “robot chocolatier.” Chocolate captures key materials science principles such as liquid phase transformations, bulk material characterization (as opposed to thin-film), and processing–structure–property (PSP) relationships. This robot chocolatier (RoboChocolatier) will reuse many components from CLSLab:Solid and add a do-it-yourself (DIY) tensile tester and a chocolate 3D printer such as the highly customizable Cocoa Press. Both CLSLab:Solid and RoboChocolatier act as toy examples for the more industry-relevant materials discovery task of additively manufactured metal alloys for aerospace and automotive applications. Again, as a recurring theme, they can serve as proofs of concept that can be used during prototyping and the preparation of grant proposals (Section 2). For a continuing discussion of solid-state materials science SDL demos, see Discussion #153.
Other topics that the community may consider exploring in the context of SDL frugal twins include other types of inorganic synthesis, battery formulations,162–164 batch chemical synthesis, semiconductor fabrication, polymer synthesis,165 artificial organ compatibility, mobile and fixed robotic arms, autonomous multi-agent systems,166 microfluidic devices,44 and closed-loop microscopy.167,168
We present in Table 3 suggestions for possible educational outcomes for hands-on experience, learning best practices, and using algorithms. Hands-on hardware and software development experience, brainstorming designs, and expertise in applying optimization algorithms are emphasized. We encourage the community to weigh in on and converge on a set of desired outcomes and skills necessary for successful SDL implementations. In future work, we plan to flesh out the details for creating a syllabus, course outline, and course content along with practical examples for teaching SDLs to students. Eventually, as the ecosystem matures, we envision higher education programs and degrees specific to SDLs for chemistry and materials science.
| Topic | Potential learning outcome | 
|---|---|
| Experience | Familiarize the concept of SDLs (hardware, algorithms, orchestration) | 
| Acquire hands-on and software development experience by setting up a toy demo | |
| Propose a design for a research-oriented SDL via a white paper | |
| Best practices | Identify SDL best practices (e.g., modularity, reproducibility, safety, documentation) | 
| Identify best practices for “cloud experimentation” (e.g., data transfer, storage) | |
| Identify best practices for ML (e.g., validation, prevention of data leakage) | |
| Algorithms | Compare and contrast three forms of experiment planning algorithms | 
| Test the complexity/efficiency trade-offs for advanced optimization | |
| Identify methods for incorporating domain knowledge | 
Once again, it is inevitable to mention the multi-tool motion platform developed at the University of Washington.60 The platform was designed with community development and customization as one of the project's aims. Its original design was inspired by the RepRap and maker movements, which have already generated an array of open-source hardware toolkits enabling flexible and extensible technologies for laboratory automation. This connection anticipates the co-development of tools configured for platforms such as Jubilee. These features also make the platform a great educational tool, as it provides a solution with a low-cost barrier and allows students, from most disciplines, to obtain skills for all steps of an experimental campaign in a single SDL platform. A successful example of this is the implementation of Jubilee into engineering design courses at the University of Hawai'i at Mānoa.
To make these categories conceptually and visually easy to understand, emoji can be used to represent whether a process is fully autonomous vs. one that requires manual intervention (Fig. 15). This type of classification is utilized in https://github.com/AccelerationConsortium/awesome-self-driving-labs as of 2022-08-08. For a discussion centered on these representations, see https://github.com/AccelerationConsortium/awesome-self-driving-labs/discussions/15. Autonomy levels could also include failure rate/tolerance, number of iterations without manual intervention, or use of physics-based simulations to supplement experiments.
![[thin space (1/6-em)]](https://www.rsc.org/images/entities/char_2009.gif) 000 USD.178 There are not yet demonstrations of frugal twins for high-cost SDLs in biology applications. However, we are aware of two frugal twin systems for cell growth. PioReactor is <300 USD open-source bioreactor which can control peristaltic pumps and temperature and monitor realtime optical density to optimize yeast, bacteria, and algae growth.64 Gerber et al.179 described the use of a LEGO Mindstorm EV3 kit (<400 USD) to build a liquid handling robot with a light sensor, and its use in a K-12 afterschool setting to perform experiments related to sterile transfer and determine optimal sucrose concentrations for yeast growth.179 Despite these early examples, frugal twins in biology remain an underexplored research direction.
000 USD.178 There are not yet demonstrations of frugal twins for high-cost SDLs in biology applications. However, we are aware of two frugal twin systems for cell growth. PioReactor is <300 USD open-source bioreactor which can control peristaltic pumps and temperature and monitor realtime optical density to optimize yeast, bacteria, and algae growth.64 Gerber et al.179 described the use of a LEGO Mindstorm EV3 kit (<400 USD) to build a liquid handling robot with a light sensor, and its use in a K-12 afterschool setting to perform experiments related to sterile transfer and determine optimal sucrose concentrations for yeast growth.179 Despite these early examples, frugal twins in biology remain an underexplored research direction.
      
    
    
      
      | Topic | Repository | Link | 
|---|---|---|
| All discussions | Self-driving-lab-demo | All discussions | 
| Data and access management | Self-driving-lab-demo | Category | 
| Demo extensions and design | Self-driving-lab-demo | Category | 
| Examples and tutorials | Self-driving-lab-demo | Category | 
| Scaling up SDLs | Self-driving-lab-demo | Category | 
| Packaging open-source hardware as commercial kits | Self-driving-lab-demo | Discussion #124 | 
| Experimental orchestration software | Self-driving-lab-demo | Discussion #64 | 
| Educational outcomes and homework problems | Self-driving-lab-demo | Discussion #186 | 
| Solid-state materials science demo | Self-driving-lab-demo | Discussion #153 | 
| Low-cost powder handling | Self-driving-lab-demo | Discussion #153 | 
| Roadmap for demo extensions | Self-driving-lab-demo | Discussion #77 | 
| A network of cloud-based experiments | Self-driving-lab-demo | Discussion #62 | 
| Classifying level of autonomy | Self-driving-lab-demo | Discussion #15 | 
| What is a self-driving lab? | Awesome-self-driving-labs | Discussion #32 | 
| AM ARES | Additive Manufacturing Autonomous REsearch System28,29 | 
| CLSLab:Light | Closed-loop Spectroscopy Lab: Light-mixing8,10–12,30 | 
| CLSLab:Liquid | Closed-loop Spectroscopy Lab: Liquid-mixing10,14,30 | 
| CLSLab:Solid | Closed-loop Spectroscopy Lab: Solid-mixing30,31 | 
| HPLC-MS | High-performance liquid chromatography coupled with mass spectrometry6 | 
| ML | Machine learning8,17,34 | 
| SDL | Self-driving laboratory1–3,5–8,10,12,14,17–20,23–25,27,28,30–36 | 
| SOTA | State-of-the-art1,6,7,15,17,20,24,34 | 
| Footnotes | 
| † These authors contributed equally to this work. | 
| ‡ See “bad actor” definition in the Cambridge Dictionary. | 
| This journal is © The Royal Society of Chemistry 2024 |