5:00 PM - CH01.09.08
Enabling Real-Time Human/AI Collaboration During Data Intensive Synchrotron Light Source Studies with Constrained Matrix Factorization
Daniel Olds1,Phillip Maffettone1,Aidan Daly2
Brookhaven National Laboratory1,Flatiron Institute2
With the National Synchrotron Light Source II (NSLS-II) coming online in 2015 as the brightest source in the world, the imminent upgrades at the Advanced Photon Source (APS-U), Advanced Light Source (ALS-U), and Linear Coherent Light Source (LCLS-II), and advances in detector technology, the data generation rates at x-ray light sources are skyrocketing1. While such advances open the door to new high throughput and in situ studies, these data intensive studies make prompt analysis of the data difficult, leading to researchers often ‘flying blind’ while at the beamline, opening the door to mistakes or missed opportunities that are not revealed until weeks-to-months after the experiment completes2. To fully leverage the capabilities offered by advanced synchrotron light sources, new methods of analysis must be developed that can keep pace with such data intensive experiments.
We have developed a method of Constrained Matrix Factorization (CMF) which is both efficient and highly scalable for real-time analysis of beamline data3. Beyond the positivity constraint found in standard Non-Negative Matrix Factorization (NMF), our algorithm allows users to apply additional constraints to both the weights and components used to fit the data. In this way, researcher provided intuition or prior knowledge can be injected into the fitting procedure, producing more physically relevant and interpretable results. This process can be done dynamically and interactively during an experiment, providing model-free insights into the progress of a study and guiding scientists towards their research objectives. We will present the details of the method, as well as several examples of its use during recent in situ studies including temperature dependent studies of molten salts4 and decomposition studies of Metal Organic Frameworks (MOFs) during gas flow reactions5. We will also present how this and other recently develop AI-driven tools6,7 can be readily integrated with Bluesky8, the open source python data acquisition system initially developed at NSLS-II.
 Ratner, Daniel, et al. (2019). BES roundtable on producing and managing large scientific data with artificial intelligence and machine learning. DOE SC Office of Basic Energy Sciences.
 Olds, D. (2020). Synchrotron X-ray Diffraction for Energy and Environmental Materials: The Current Role and Future Directions of Total Scattering Beamlines in the Functional Material Scientific Ecosystem. Synchrotron Radiat. News, 33(5), 4-10.
 Maffettone, P.M., Daly, A.C., Olds, D., Appl. Phys. Rev (In Press).
 Li, Q. J., Sprouster, D., Zheng, G., Neuefeind, J. C., Braatz, A. D., Mcfarlane, J., Olds, D.,... & Khaykovich, B. (2021). Complex Structure of Molten NaCl–CrCl3 Salt: Cr–Cl Octahedral Network and Intermediate-Range Order. ACS Appl. Energy Mater, 4(4), 3044-3056.
 Ganesan, A., Purdy, S. C., Yu, Z., Bhattacharyya, S., Page, K., Sholl, D. S., & Nair, S. (2021). Controlled Demolition and Reconstruction of Imidazolate and Carboxylate Metal–Organic Frameworks by Acid Gas Exposure and Linker Treatment. Ind. Eng. Chem. Res.
 Maffettone, P. M., Banko, L., Cui, P., Lysogorskiy, Y., Little, M. A., Olds, D., ... & Cooper, A. I. (2021). Crystallography companion agent for high-throughput materials discovery. Nat. Comput. Sci., 1(4), 290-297.
 Banko, L., Maffettone, P. M., Naujoks, D., Olds, D., & Ludwig, A. (2021). Deep learning for visualization and novelty detection in large X-ray diffraction datasets. npj Comput Mater 7, 104.
 Campbell, S. I., Allan, D. B., Barbour, A. M., Olds, D., Rakitin, M. S., Smith, R., & Wilkins, S. B. (2021). Outlook for artificial intelligence and machine learning at the NSLS-II. Mach. learn.: sci. technol, 2(1), 013001.