Connecting the concepts of quantum state tomography and molecular representations for machine learning

Raul Ortega-Ochoa; Luis Mantilla Calderón; Juan Bernardo Perez Sanchez; Mohsen Bagherimehrab; Abdulrahman Aldossary; Tejs Vegge; Tonio Buonassisi; Alán Aspuru-Guzik

doi:10.1039/D5DD00484E

Connecting the concepts of quantum state tomography and molecular representations for machine learning

Raul Ortega-Ochoa,

†*^ab Luis Mantilla Calderón,†*^cd Juan Bernardo Perez Sanchez,

^cd Mohsen Bagherimehrab,^fc Abdulrahman Aldossary,^cd Tejs Vegge,

^ab Tonio Buonassisi^e and Alán Aspuru-Guzik

*^cdfghij

Author affiliations

* Corresponding authors

^a Department of Energy Conversion and Storage, Technical University of Denmark, Kongens Lyngby 2800, Denmark
E-mail: rauoc@dtu.dk

^b CAPeX Pioneer Center for Accelerating P2X Materials Discovery, DK 2800 Kgs. Lyngby, Denmark

^c Department of Computer Science, University of Toronto, 40 St George St., Toronto, ON M5S 2E4, Canada
E-mail: luis@cs.toronto.edu, alan@aspuru.com

^d Vector Institute for Artificial Intelligence, Schwartz Reisman Innovation Campus, W1140-108 College St., Toronto, ON M5G 0C6, Canada

^e Department of Mechanical Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA

^f Department of Chemistry, University of Toronto, 80 St. George St., Toronto, ON M5S 3H6, Canada

^g Department of Chemical Engineering & Applied Chemistry, University of Toronto, 200 College St., Toronto, ON M5S 3E5, Canada

^h Department of Materials Science & Engineering, University of Toronto, 184 College St., Toronto, ON M5S 3E4, Canada

ⁱ Acceleration Consortium, 700 University Ave., Toronto, ON M7A 2S4, Canada

^j NVIDIA, 431 King St. W #6th, Toronto, ON M5V 1K4, Canada

Abstract

Quantum state tomography has been widely used to reconstruct the quantum state of a system from a set of informationally-complete measurements. Obtaining enough information about, e.g., the wavefunction of a molecule allows its complete characterization. On the other hand, deep learning models have proven useful to perform molecular property prediction (forward design) and inverse design subject to property constraints within the approximate bounds of the data manifold, suggesting that their learned representations are reliable within the region of chemical compound space spanned by their training data. In this work, from the tomographic perspective, we argue that enforcing faithful prediction of an increasing number of diverse molecular descriptors from a shared learned representation progressively constrains the space of admissible internal explanations, driving the inter-alignment of models as they converge towards representation that can explain all observed properties. In the limit where the set of descriptors approaches information-completeness, this alignment drives the learned representations to states that can act, locally, as informationally-equivalent to the molecule's reduced quantum density matrix – a deep tomography. Under this lens, the generalization capabilities of a deep learning model, and the alignment among successful models, arise from unphysical or shortcut solutions becoming progressively incompatible as supervision approaches informational completeness.

Digital Discovery

Connecting the concepts of quantum state tomography and molecular representations for machine learning

Abstract

Article information

Download Citation

Permissions

Connecting the concepts of quantum state tomography and molecular representations for machine learning

Social activity

Search articles by author

Spotlight

Advertisements