Last update: January 2019
How can AI help make discoveries on the bench in a chemistry lab? How can chemical systems help devise better AI algorithms? I explored those questions with the team I lead in the Cronin Group. The group cutting edge knowledge combined with my experience from the developmental sciences allowed us to bring innovative ideas to the field, including the use of active learning, curiosity-driven exploration, and human comparative studies.
After my PhD, I grew frustrated that most AI research was done on simulated environments. I wanted to see this technology used in the real world on systems we do not ourselves design nor understand - that is not even on robots.
The physical sciences seem fitting. In physics, chemistry, or biology, an important part of everyday research is to understand yet unknown systems. Scientists devise experiments and build instruments to query these systems with the aim to be able to model, and later predict, their evolution.
But because such systems are not yet fully understood, or are massively parallel, they cannot be simulated. There is no cheating with the physical sciences, you cannot speed up experiments, you cannot be 100% sure that you are controlling the right experimental variables, and you do not know the right answer. That is all the fun of it.
This is in sharp contrast with the simulation, big data, and controlled robotic problems AI algorithms are designed on and for. [1] If we ought to bring AI advances to experimental research in the labs, we need to drop the habit of thinking that experiments are free, unlimited, and well defined. That habit comes from the extraordinary power of computers and can sometime turn AI research into a parameter tuning and stacking game.
Interestingly, a subfield of robotics, called developmental robotics, imposes itself similar constraints. Their aim is to build a robot that can learn like a child. A child has real world constraints, it has limited time to interact with the world and he has no complete built-in knowledge of the world around him. From his own body, to modern objects like smartphones and bikes, a child has to learn it all in a few years by interacting with the world. To do so a child constantly devises his own experiments to test the world around him, much like a scientist in its lab.
As it turns out, I pursued my PhD under the supervision of Pierre-Yves Oudeyer, one of the pioneers of this field. Once I connected the dots, an ocean of research directions opened to bring principles from developmental robotics into the physical sciences. How can a laboratory robot be curious and devise its own questions about a new systems? How could it learn a hierarchy of skills to produce increasingly more complex molecules autonomously? How could such machines learn from and collaborate with scientists? Could maturational constraints and the concept of embodiment help leverage the exploration power of algorithms on natural systems?
Hence came the decision to join a chemistry lab for a postdoc.
Lee Cronin, a pioneer in the digitization of chemistry, welcomed me into his lab and gave me the responsibility of an internal team to explore these questions. The group had already published a work showcasing the robotically assisted optimization of oil-in-water droplets, a suitable system for our experimentation.
The team was composed of Lee Cronin (PI), Jonathan Grizou (me), Abhishek Sharma, Jan Szymanski, James W. Taylor, Juan Manuel Parrilla Gutierrez, Laurie J. Points, Vasilios Duros, Sergio Martin Marti, Graham Keenan, Kevin Donkers. A mix of chemists, computer scientists, and material scientists.
The main challenge resided in a good communication between team members. Chemists had to understand the limitation of computers in making sense of data, as well as the limitation of robots in manipulating things. Computer scientists (me above all) had to understand that chemistry is hard, requires time, and that not everything can be created at will. The team was fantastic at sharing their respective constraints which is the main factor of success in the projects we have undertaken.
As the first trained roboticist to join the group, I also took care setting up a culture of AI and robot building, as well as developing the right tools for chemists to use simply, safely, and robustly. These tools are available on the group GitHub account and further described on this page.
We developed a chemical robotic discovery assistant equipped with a curiosity algorithm (CA) that was set to explore the states a complex chemical system can exhibit searching for novelty. This differ from optimization scenario where a global target is defined beforehand.
By applying the CA-robot to the study of self-propelling multicomponent oil-in-water droplets, we are able to observe an order of magnitude more variety of droplet behaviours than possible with a random parameter search and given the same budget. We demonstrate that the CA-robot enabled the discovery of a sudden and highly specific response of droplets to slight temperature changes.
Once the temperature dependance was discovered, six modes of self-propelled droplets motion were identified and classified using a time-temperature phase diagram and probed using a variety of techniques including NMR, which allowed the design of a payload release system triggered by temperature. This work illustrates how target free search can significantly increase the rate of unpredictable observations leading to new discoveries with potential applications in formulation chemistry.
The discovery of new gigantic molecules formed by self-assembly and crystal growth is challenging as it combines two contingent events; first is the formation of a new molecule, and second its crystallization. We constructed an automated workflow that can probe both events and employed it for a new polyoxometalate cluster which has a trigonal-ring type architecture.
The synthesis and crystallization was probed using an active machine-learning algorithm to explore the crystallization space. The algorithm results were compared with those obtained by human experimenters and a random sampling approach.
A critical constraint was that we could only perform 10 experiments per day. We ran the active learning by batches of 10 experiments and performed 10 iterations for a total of 100 experiments for each method and repeat. Meaning that 10 days and significant reagent were needed just to validate the approach the first time. Our result showed that the active learning algorithmic search increases the crystallization prediction accuracy to 82.4±0.7% over 77.1±0.9% from human experimenters given the same experimental budget.
Intuition-enabled Machine Learning beats the Competition when Joint Human-Robot Teams per-form Inorganic Chemical Experiments. Duros, V., Grizou, J., Sharma, A., Mehr, S.H.M., Bubliauskas, A., Frei, P., Miras, H.N. and Cronin, L. (2019). Journal of chemical information and modeling. [pdf] [journal] [project]
Human vs Robots in the Discovery and Crystallization of Gigantic Polyoxometalates. Grizou, J., Duros, V., Xuan, W., Hosni, Z., Long, D.-L., Miras H., Cronin L. (2017). Angewandte Chemie 129.36: 10955-10960. [pdf] [SI] [code] [journal]
We developed a liquid-handling robot built with the aim of evolving the size and shape of gold nanoparticles (AuNPs) as a function of composition via an automated evolutionary process. The robot synthesises nanoparticles by mixing reagents in different ratios, placing them in vials and extraction to in-line UV-Vis analysis that is used to compute a fitness value.
The project aimed at exploring how we could reuse the result of genetic algorithm experiments (e.g. evolution of nano-spheres) to evolve more complex nanoparticles such as nano-rods. The diagram below illustrate this process.
After several cycles and the application of a genetic algorithm (GA), the fitness factor at each stage increased every generation leading towards the desired outcome. Our system could synthesise recursively gold nanospheres, gold nanorods (AuNRs), and arrow-headed gold nanorods with high reproducibility.
Our results show that the evolution of nanomaterials is possible in a fully automated fashion and has the potential to change the way AuNPs are studied and explored.
Evolution via natural selection is governed by the persistence and propagation of living things in an environment. The environment is important since it contributes in shaping evolution. Although evolution has been widely studied in a variety of fields from biology to computer science, still little is known about the impact of environmental changes on an artificial chemical evolving system outside of computer simulations.
We developed a fully automated 3D-printed chemorobotic fluidic system that is able to generate and select droplet protocells in real time while changing the surroundings where they undergo artificial evolution.
Compared to our previous work, the use of a 3D-printed device enable us to change the physical environment in which the droplet population evolves. The 3D printed platform also innovates as no moving parts are required to initiate, run, and clean droplet experiments.
Using such 3D-printed devices, we were able to change the physical environment in which the droplet population evolves. We thus explicitly studied how the genotype is modulated through a programmable environment to express its phenotype, in contrast to the more studied genotype to phenotype direct approach.
By successively evolving the droplets from one environment to another, we were able to observe disruption in their evolutionary trajectories, as well as adaptation of their genome.
Exploring and understanding the emergence of complex behaviors is difficult even in “simple” chemical systems since the dynamics can rest on a knife edge between stability and instability.
We study the dynamics of a oil droplets in an aqueous environment using an automated platform equipped with artificial intelligence. Compared to previous work, we increased the number of experimental parameters and allowed the robot to choose both the concentration of oils in the droplets and the concentration of surfactants in the aqueous phase. Our aim was to identify more droplet behaviors and understand better the mechanisms behind them.
To test the platform, we started by running evolutionary experiments and comparing them to our previous work.
By opening the parameters space to the chemical environment of the droplets, we were able to generate faster moving droplets. The recipes and data generated from this process were then used for physicochemical analysis, where traditional chemical analysis, machine learning, and archetypal droplet experiments are used to study the behavioral mechanisms and to predict droplet behaviors.
The data acquired were used to build predictive models of the system, of which no physical model, or only inaccurate models, were available.
Physical properties such as viscosity, surface tension, and density are shown to be related to behaviors, as well as to droplet behavioral niches, such as collective swarming. By dying droplets with phenolphthalein, we could observe the internal dynamics within the droplets and depending of their composition. For example, pentanol goes very pink, has rapid flows, and dissolves, while DEP only goes pink at the interface.
This work focus on the optimization of dynamic properties of droplets on a robotic platform. It was completed before I joined the Cronin group, and made me realize it became possible to apply a range of new exploration methods freshly developed in computer science research groups directly on physical systems in the lab.
They presented a liquid-handling robot built with the aim of investigating the properties of oil droplets as a function of composition via an automated evolutionary process. The robot makes the droplets by mixing four different compounds in different ratios and placing them in a Petri dish after which they are recorded using a camera and the behaviour of the droplets analysed using image recognition software to give a fitness value.
In separate experiments, the fitness function discriminates based on movement, division and vibration over 21 cycles, giving successive fitness increases. Analysis and theoretical modelling of the data yields fitness landscapes analogous to the genotype–phenotype correlations found in biological evolution.
Evolution of oil droplets in a chemorobotic platform. Gutierrez, J. M. P., Hinkley, T., Taylor, J. W., Yanev, K., & Cronin, L. (2014). Nature communications, 5, 5571. [pdf] [journal]
In 2017, we organized a workshop providing an extensive overview of this research. [pdf] [slides] [website]
We have developed a collection of software and hardware tools that has been made open source to help further research in this area. These tools are described on the open robotics project page and available on the Cronin group GitHub account.
A curious formulation robot enables the discovery of a novel protocell behavior. Grizou, J., Points, L. J., Sharma, A. & Cronin, L. (2020). Science Advances. [pdf] [doi] [robot design] [experiment code] [analysis code] [project]
A Nanomaterials Discovery Robot for the Darwinian Evolution of Shape Programmable Gold Nanoparticles. Keenan, G., Salley, D., Martín, S., Grizou, J., Sharma, A. and Cronin, L. (2020). Nature Communications. [pdf] [doi] [project]
Intuition-enabled Machine Learning beats the Competition when Joint Human-Robot Teams per-form Inorganic Chemical Experiments. Duros, V., Grizou, J., Sharma, A., Mehr, S.H.M., Bubliauskas, A., Frei, P., Miras, H.N. and Cronin, L. (2019). Journal of chemical information and modeling. [pdf] [journal] [project]
Artificial intelligence exploration of unstable protocells leads to predictable properties and discovery of collective behavior. Points, L. J., Taylor, J. W., Grizou, J., Donkers, K., & Cronin, L. (2018). PNAS - Proceedings of the National Academy of Sciences, 201711089. [pdf] [journal]
Adaptive artificial evolution of droplet protocells in a 3D-printed fluidic chemorobotic platform with configurable environments. Parrilla-Gutierrez, J. M., Tsuda, S., Grizou, J., Taylor, J., Henson, A., & Cronin, L. (2017). Nature communications, 8(1), 1144. [pdf] [journal]
The evolution of active droplets in chemorobotic platforms. Points, L. J., Grizou, J., Gutierrez, J. M. P., Taylor, J. W., & Cronin, L. (2017). Artificial Life Conference Proceedings 14. [pdf]
Human vs Robots in the Discovery and Crystallization of Gigantic Polyoxometalates. Grizou, J., Duros, V., Xuan, W., Hosni, Z., Long, D.-L., Miras H., Cronin L. (2017). Angewandte Chemie 129.36: 10955-10960. [pdf] [SI] [code] [journal]
Developmental robotics in a chemistry lab. Grizou, J., Points L.J., Cronin, L. (2017). Development and Learning and Epigenetic Robotics (ICDL-Epirob), 2017 Joint IEEE International Conferences on. [pdf] [slides] [website]
Evolution of oil droplets in a chemorobotic platform. Gutierrez, J. M. P., Hinkley, T., Taylor, J. W., Yanev, K., & Cronin, L. (2014). Nature communications, 5, 5571. [pdf] [journal]
The above work - done between 2015 and 2018 - is the application of ideas from a small subfield of machine learning and robotics, called developmental robotics (see my PhD lab), to an emerging field at the intersection of robotics, AI, and chemistry.
I believe the projects we showcased provide a good window into future applications of AI and robotics in formulation sciences. More specifically in R&D departments to help find new formulations for consumer goods and medical treatments, under the ever evolving constraints from law enforcement (forbidden compounds, environment preservation), resource exhaustion, and budget optimization. The research presented above can significantly reduce the time and resources needed for the development of new products. Especially useful will be machines that can explore rather than optimize and collaborate with human rather than replace them.
I also believe that constraints from working on physical systems in the labs can be beneficial to the field of AI. When the only solution to querying data is to perform time consuming and costly experiments. It forces AI practitioners to redefine their working methodology and might change the focus of what matters when designing new algorithms.
I am eager to see this research applied to outside the lab. Do not hesitate to get in touch to share ideas or discuss potential projects.
[1] There is good reasons for that, convenience and speed for fundamental research is one, but also because a lot of problems worth solving can be captured adequately by data and simulation. My point focus on hands-on experiments in laboratories.