1
|
Ahmed WW, Cao H, Xu C, Farhat M, Amin M, Li X, Zhang X, Wu Y. Machine learning assisted plasmonic metascreen for enhanced broadband absorption in ultra-thin silicon films. LIGHT, SCIENCE & APPLICATIONS 2025; 14:42. [PMID: 39779674 PMCID: PMC11711677 DOI: 10.1038/s41377-024-01723-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/20/2024] [Revised: 11/04/2024] [Accepted: 12/16/2024] [Indexed: 01/11/2025]
Abstract
We propose and demonstrate a data-driven plasmonic metascreen that efficiently absorbs incident light over a wide spectral range in an ultra-thin silicon film. By embedding a double-nanoring silver array within a 20 nm ultrathin amorphous silicon (a-Si) layer, we achieve a significant enhancement of light absorption. This enhancement arises from the interaction between the resonant cavity modes and localized plasmonic modes, requiring precise tuning of plasmon resonances to match the absorption region of the silicon active layer. To facilitate the device design and improve light absorption without increasing the thickness of the active layer, we develop a deep learning framework, which learns to map from the absorption spectra to the design space. This inverse design strategy helps to tune the absorption for selective spectral functionalities. Our optimized design surpasses the bare silicon planar device, exhibiting a remarkable enhancement of over 100%. Experimental validation confirms the broadband enhancement of light absorption in the proposed configuration. The proposed metascreen absorber holds great potential for light harvesting applications and may be leveraged to improve the light conversion efficiency of ultra-thin silicon solar cells, photodetectors, and optical filters.
Collapse
Affiliation(s)
- Waqas W Ahmed
- Division of Computer, Electrical and Mathematical Sciences and Engineering, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Haicheng Cao
- Division of Computer, Electrical and Mathematical Sciences and Engineering, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Changqing Xu
- Division of Computer, Electrical and Mathematical Sciences and Engineering, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Mohamed Farhat
- Division of Computer, Electrical and Mathematical Sciences and Engineering, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Muhammad Amin
- College of Engineering, Taibah University, Madinah, 42353, Saudi Arabia
| | - Xiaohang Li
- Division of Computer, Electrical and Mathematical Sciences and Engineering, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
- Division of Physical Science and Engineering, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Xiangliang Zhang
- Division of Computer, Electrical and Mathematical Sciences and Engineering, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia.
- Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, IN, 46556, USA.
| | - Ying Wu
- Division of Computer, Electrical and Mathematical Sciences and Engineering, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia.
- Division of Physical Science and Engineering, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia.
| |
Collapse
|
2
|
Landinez Borda EJ, Berard KO, Lopez A, Rubenstein B. Gaussian processes for finite size extrapolation of many-body simulations. Faraday Discuss 2024; 254:500-528. [PMID: 39282946 DOI: 10.1039/d4fd00051j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2024]
Abstract
Key to being able to accurately model the properties of realistic materials is being able to predict their properties in the thermodynamic limit. Nevertheless, because most many-body electronic structure methods scale as a high-order polynomial, or even exponentially, with system size, directly simulating large systems in their thermodynamic limit rapidly becomes computationally intractable. As a result, researchers typically estimate the properties of large systems that approach the thermodynamic limit by extrapolating the properties of smaller, computationally-accessible systems based on relatively simple scaling expressions. In this work, we employ Gaussian processes to more accurately and efficiently extrapolate many-body simulations to their thermodynamic limit. We train our Gaussian processes on Smooth Overlap of Atomic Positions (SOAP) descriptors to extrapolate the energies of one-dimensional hydrogen chains obtained using two high-accuracy many-body methods: coupled cluster theory and Auxiliary Field Quantum Monte Carlo (AFQMC). In so doing, we show that Gaussian processes trained on relatively short 10-30-atom chains can predict the energies of both homogeneous and inhomogeneous hydrogen chains in their thermodynamic limit with sub-milliHartree accuracy. Unlike standard scaling expressions, our GPR-based approach is highly generalizable given representative training data and is not dependent on systems' geometries or dimensionality. This work highlights the potential for machine learning to correct for the finite size effects that routinely complicate the interpretation of finite size many-body simulations.
Collapse
Affiliation(s)
| | - Kenneth O Berard
- Department of Chemistry, Brown University, Providence, Rhode Island 02912, USA.
| | - Annette Lopez
- Department of Physics, Brown University, Providence, Rhode Island 02912, USA
| | - Brenda Rubenstein
- Department of Chemistry, Brown University, Providence, Rhode Island 02912, USA.
| |
Collapse
|
3
|
Hou B, Wu J, Qiu DY. Unsupervised representation learning of Kohn-Sham states and consequences for downstream predictions of many-body effects. Nat Commun 2024; 15:9481. [PMID: 39488548 PMCID: PMC11531501 DOI: 10.1038/s41467-024-53748-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Accepted: 10/18/2024] [Indexed: 11/04/2024] Open
Abstract
Representation learning for the electronic structure problem is a major challenge of machine learning in computational condensed matter and materials physics. Within quantum mechanical first principles approaches, density functional theory (DFT) is the preeminent tool for understanding electronic structure, and the high-dimensional DFT wavefunctions serve as building blocks for downstream calculations of correlated many-body excitations and related physical observables. Here, we use variational autoencoders (VAE) for the unsupervised learning of DFT wavefunctions and show that these wavefunctions lie in a low-dimensional manifold within latent space. Our model autonomously determines the optimal representation of the electronic structure, avoiding limitations due to manual feature engineering. To demonstrate the utility of the latent space representation of the DFT wavefunction, we use it for the supervised training of neural networks (NN) for downstream prediction of quasiparticle bandstructures within the GW formalism. The GW prediction achieves a low error of 0.11 eV for a combined test set of two-dimensional metals and semiconductors, suggesting that the latent space representation captures key physical information from the original data. Finally, we explore the generative ability and interpretability of the VAE representation.
Collapse
Affiliation(s)
- Bowen Hou
- Department of Mechanical Engineering and Material Sciences, Yale University, New Haven, CT, 06511, USA
| | - Jinyuan Wu
- Department of Mechanical Engineering and Material Sciences, Yale University, New Haven, CT, 06511, USA
| | - Diana Y Qiu
- Department of Mechanical Engineering and Material Sciences, Yale University, New Haven, CT, 06511, USA.
| |
Collapse
|
4
|
Haldar S, Rahaman SS, Kumar M. Study of the Berezinskii-Kosterlitz-Thouless transition: an unsupervised machine learning approach. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2024; 36:415804. [PMID: 38941995 DOI: 10.1088/1361-648x/ad5d35] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/19/2023] [Accepted: 06/28/2024] [Indexed: 06/30/2024]
Abstract
The Berezinskii-Kosterlitz-Thouless (BKT) transition in magnetic systems is an intriguing phenomenon, and estimating the BKT transition temperature is a long-standing problem. In this work, we explore anisotropic classical Heisenberg XY and XXZ models with ferromagnetic exchange on a square lattice and antiferromagnetic exchange on a triangular lattice using an unsupervised machine learning approach called principal component analysis (PCA). The earlier PCA studies of the BKT transition temperature (TBKT) using the vorticities as input fail to give any conclusive results, whereas, in this work, we show that the proper analysis of the first principal component-temperature curve can estimateTBKTwhich is consistent with the existing literature. This analysis works well for the anisotropic classical Heisenberg with a ferromagnetic exchange on a square lattice and for frustrated antiferromagnetic exchange on a triangular lattice. The classical anisotropic Heisenberg antiferromagnetic model on the triangular lattice has two close transitions: theTBKTand Ising-like phase transition for chirality atTc, and it is difficult to separate these transition points. It is also noted that using the PCA method and manipulation of their first principal component not only makes the separation of transition points possible but also determines transition temperature.
Collapse
Affiliation(s)
- Sumit Haldar
- S. N. Bose National Centre for Basic Sciences, J D Block, Sector III, Salt Lake City, Kolkata 700106, India
| | - Sk Saniur Rahaman
- S. N. Bose National Centre for Basic Sciences, J D Block, Sector III, Salt Lake City, Kolkata 700106, India
| | - Manoranjan Kumar
- S. N. Bose National Centre for Basic Sciences, J D Block, Sector III, Salt Lake City, Kolkata 700106, India
| |
Collapse
|
5
|
Haga T. Machine learning analysis of dimensional reduction conjecture for nonequilibrium Berezinskii-Kosterlitz-Thouless transition in three dimensions. Phys Rev E 2024; 110:014137. [PMID: 39160920 DOI: 10.1103/physreve.110.014137] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Accepted: 07/01/2024] [Indexed: 08/21/2024]
Abstract
We investigate the recently proposed dimensional reduction conjecture in driven disordered systems using a machine learning technique. The conjecture states that a static snapshot of a disordered system driven at a constant velocity is equal to a space-time trajectory of its lower-dimensional pure counterpart. This suggests that the three-dimensional (3D) random field XY model exhibits the Berezinskii-Kosterlitz-Thouless transition when driven out of equilibrium. To verify the conjecture directly by observing configurations of the system, we utilize the capacity of neural networks to detect subtle features of images. Specifically, we train neural networks to differentiate snapshots of the 3D driven random field XY model from space-time trajectories of the two-dimensional pure XY model. Our results demonstrate that the network cannot distinguish between the two, confirming the dimensional reduction conjecture.
Collapse
|
6
|
Gil-Fuster E, Eisert J, Bravo-Prieto C. Understanding quantum machine learning also requires rethinking generalization. Nat Commun 2024; 15:2277. [PMID: 38480684 PMCID: PMC10938005 DOI: 10.1038/s41467-024-45882-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Accepted: 02/06/2024] [Indexed: 03/17/2024] Open
Abstract
Quantum machine learning models have shown successful generalization performance even when trained with few data. In this work, through systematic randomization experiments, we show that traditional approaches to understanding generalization fail to explain the behavior of such quantum models. Our experiments reveal that state-of-the-art quantum neural networks accurately fit random states and random labeling of training data. This ability to memorize random data defies current notions of small generalization error, problematizing approaches that build on complexity measures such as the VC dimension, the Rademacher complexity, and all their uniform relatives. We complement our empirical results with a theoretical construction showing that quantum neural networks can fit arbitrary labels to quantum states, hinting at their memorization ability. Our results do not preclude the possibility of good generalization with few training data but rather rule out any possible guarantees based only on the properties of the model family. These findings expose a fundamental challenge in the conventional understanding of generalization in quantum machine learning and highlight the need for a paradigm shift in the study of quantum models for machine learning tasks.
Collapse
Affiliation(s)
- Elies Gil-Fuster
- Dahlem Center for Complex Quantum Systems, Freie Universität Berlin, Berlin, Germany
- Fraunhofer Heinrich Hertz Institute, Berlin, Germany
| | - Jens Eisert
- Dahlem Center for Complex Quantum Systems, Freie Universität Berlin, Berlin, Germany.
- Fraunhofer Heinrich Hertz Institute, Berlin, Germany.
- Helmholtz-Zentrum Berlin für Materialien und Energie, Berlin, Germany.
| | - Carlos Bravo-Prieto
- Dahlem Center for Complex Quantum Systems, Freie Universität Berlin, Berlin, Germany.
| |
Collapse
|
7
|
Sayyad S, Lado JL. Transfer learning from Hermitian to non-Hermitian quantum many-body physics. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2024; 36:185603. [PMID: 38277690 DOI: 10.1088/1361-648x/ad22f8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 01/26/2024] [Indexed: 01/28/2024]
Abstract
Identifying phase boundaries of interacting systems is one of the key steps to understanding quantum many-body models. The development of various numerical and analytical methods has allowed exploring the phase diagrams of many Hermitian interacting systems. However, numerical challenges and scarcity of analytical solutions hinder obtaining phase boundaries in non-Hermitian many-body models. Recent machine learning methods have emerged as a potential strategy to learn phase boundaries from various observables without having access to the full many-body wavefunction. Here, we show that a machine learning methodology trained solely on Hermitian correlation functions allows identifying phase boundaries of non-Hermitian interacting models. These results demonstrate that Hermitian machine learning algorithms can be redeployed to non-Hermitian models without requiring further training to reveal non-Hermitian phase diagrams. Our findings establish transfer learning as a versatile strategy to leverage Hermitian physics to machine learning non-Hermitian phenomena.
Collapse
Affiliation(s)
- Sharareh Sayyad
- Max Planck Institute for the Science of Light, Staudtstraße 2, 91058 Erlangen, Germany
| | - Jose L Lado
- Department of Applied Physics, Aalto University, FI-00076 Aalto, Espoo, Finland
| |
Collapse
|
8
|
Mahlow F, Luiz FS, Malvezzi AL, Fanchini FF. Model-independent quantum phases classifier. Sci Rep 2023; 13:14411. [PMID: 37660190 PMCID: PMC10475057 DOI: 10.1038/s41598-023-33301-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Accepted: 04/11/2023] [Indexed: 09/04/2023] Open
Abstract
Machine learning has transformed science and technology. In this article, we present a model-independent classifier that uses the k-Nearest Neighbors algorithm to classify phases of a model for which it has never been trained. This is done by studying three different spin-1 chains with some common phases: the XXZ chains with uniaxial single-ion-type anisotropy, the bond alternating XXZ chains, and the bilinear biquadratic chain. We show that the algorithm trained with two of these models can, with high probability, determine phases common to the third one. This is the first step towards a universal classifier, where an algorithm can recognize an arbitrary phase without knowing the Hamiltonian, since it knows only partial information about the quantum state.
Collapse
Affiliation(s)
- F Mahlow
- Faculty of Sciences, UNESP-São Paulo State University, 17033-360, Bauru, SP, Brazil.
| | - F S Luiz
- Faculty of Sciences, UNESP-São Paulo State University, 17033-360, Bauru, SP, Brazil
| | - A L Malvezzi
- Faculty of Sciences, UNESP-São Paulo State University, 17033-360, Bauru, SP, Brazil
| | - F F Fanchini
- Faculty of Sciences, UNESP-São Paulo State University, 17033-360, Bauru, SP, Brazil
| |
Collapse
|
9
|
Ng KK, Huang CY, Lin FL. Berezinskii-Kosterlitz-Thouless transition from neural network flows. Phys Rev E 2023; 108:034104. [PMID: 37849170 DOI: 10.1103/physreve.108.034104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2022] [Accepted: 07/27/2023] [Indexed: 10/19/2023]
Abstract
We adopt the neural network (NN) flow method to study the Berezinskii-Kosterlitz-Thouless (BKT) phase transitions of the two-dimensional q-state clock model with q≥4. The NN flow consists of a sequence of the same units that proceed with the flow. This unit is a variational autoencoder trained by the data of Monte Carlo configurations in unsupervised learning. To gauge the difference among the ensembles of Monte Carlo configurations at different temperatures and the uniqueness of the ensemble of NN-flow states, we adopt the Jensen-Shannon divergence (JSD) as the information-distance measure "thermometer." This JSD thermometer compares the probability distribution functions of the mean spin value of two ensembles of states. Our results show that the NN flow will flow an arbitrary spin state to some state in a fixed-point ensemble of states. The corresponding JSD of the fixed-point ensemble takes a unique profile with peculiar features, which can help to identify the critical temperatures of BKT phase transitions of the underlying Monte Carlo configurations.
Collapse
Affiliation(s)
- Kwai-Kong Ng
- Department of Applied Physics, Tunghai University, Taichung 40704, Taiwan
| | - Ching-Yu Huang
- Department of Applied Physics, Tunghai University, Taichung 40704, Taiwan
| | - Feng-Li Lin
- Department of Physics, National Taiwan Normal University, Taipei 11677, Taiwan
| |
Collapse
|
10
|
Richter-Laskowska M, Kurpas M, Maśka MM. Learning by confusion approach to identification of discontinuous phase transitions. Phys Rev E 2023; 108:024113. [PMID: 37723704 DOI: 10.1103/physreve.108.024113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 07/21/2023] [Indexed: 09/20/2023]
Abstract
Recently, the learning by confusion (LbC) approach has been proposed as a machine learning tool to determine the critical temperature T_{c} of phase transitions without any prior knowledge of its even approximate value. The method has been proven effective, but it has been used only for continuous phase transitions, where the confusion results only from deliberate incorrect labeling of the data. However, in the case of a discontinuous phase transition, additional confusion can result from the coexistence of different phases. To verify whether the confusion scheme can also be used for discontinuous phase transitions, we apply the LbC method to three microscopic models, the Blume-Capel, the q-state Potts, and the Falicov-Kimball models, which undergo continuous or discontinuous phase transitions depending on model parameters. With the help of a simple model, we predict that the phase coexistence present in discontinuous phase transitions can indeed make the neural network more confused and thus decrease its performance. However, numerical calculations performed for the models mentioned above indicate that other aspects of this kind of phase transition are more important and can render the LbC method even less effective. Nevertheless, we demonstrate that in some cases the same aspects allow us to use the LbC method to identify the order of a phase transition.
Collapse
Affiliation(s)
- Monika Richter-Laskowska
- University of Silesia, Katowice, Poland and Łukasiewicz Research Network Krakow Institute of Technology, The Centre for Biomedical Engineering, Zakopiańska 73, Kraków 30-418, Poland
| | - Marcin Kurpas
- Institute of Physics, University of Silesia in Katowice, 41-500 Chorzów, Poland
| | - Maciej M Maśka
- Institute of Theoretical Physics, Wrocław University of Science and Technology, Wrocław, Poland
| |
Collapse
|
11
|
Staats M, Thamm M, Rosenow B. Boundary between noise and information applied to filtering neural network weight matrices. Phys Rev E 2023; 108:L022302. [PMID: 37723807 DOI: 10.1103/physreve.108.l022302] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2022] [Accepted: 07/11/2023] [Indexed: 09/20/2023]
Abstract
Deep neural networks have been successfully applied to a broad range of problems where overparametrization yields weight matrices which are partially random. A comparison of weight matrix singular vectors to the Porter-Thomas distribution suggests that there is a boundary between randomness and learned information in the singular value spectrum. Inspired by this finding, we introduce an algorithm for noise filtering, which both removes small singular values and reduces the magnitude of large singular values to counteract the effect of level repulsion between the noise and the information part of the spectrum. For networks trained in the presence of label noise, we indeed find that the generalization performance improves significantly due to noise filtering.
Collapse
Affiliation(s)
- Max Staats
- Institut für Theoretische Physik, Universität Leipzig, Brüderstrasse 16, 04103 Leipzig, Germany
| | - Matthias Thamm
- Institut für Theoretische Physik, Universität Leipzig, Brüderstrasse 16, 04103 Leipzig, Germany
| | - Bernd Rosenow
- Institut für Theoretische Physik, Universität Leipzig, Brüderstrasse 16, 04103 Leipzig, Germany
| |
Collapse
|
12
|
Lee DB, Yoon HG, Park SM, Choi JW, Chen G, Kwon HY, Won C. Super-resolution of magnetic systems using deep learning. Sci Rep 2023; 13:11526. [PMID: 37460591 DOI: 10.1038/s41598-023-38335-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2023] [Accepted: 07/06/2023] [Indexed: 07/20/2023] Open
Abstract
We construct a deep neural network to enhance the resolution of spin structure images formed by spontaneous symmetry breaking in the magnetic systems. Through the deep neural network, an image is expanded to a super-resolution image and reduced to the original image size to be fitted with the input feed image. The network does not require ground truth images in the training process. Therefore, it can be applied when low-resolution images are provided as training datasets, while high-resolution images are not obtainable due to the intrinsic limitation of microscope techniques. To show the usefulness of the network, we train the network with two types of simulated magnetic structure images; one is from self-organized maze patterns made of chiral magnetic structures, and the other is from magnetic domains separated by walls that are topological defects of the system. The network successfully generates high-resolution images highly correlated with the exact solutions in both cases. To investigate the effectiveness and the differences between datasets, we study the network's noise tolerance and compare the networks' reliabilities. The network is applied with experimental data obtained by magneto-optical Kerr effect microscopy and spin-polarized low-energy electron microscopy.
Collapse
Affiliation(s)
- D B Lee
- Department of Physics, Kyung Hee University, Seoul, 02447, South Korea
- Department of Battery-Smart Factory, Korea University, Seoul, 02841, South Korea
| | - H G Yoon
- Department of Physics, Kyung Hee University, Seoul, 02447, South Korea
| | - S M Park
- Department of Physics, Kyung Hee University, Seoul, 02447, South Korea
| | - J W Choi
- Center for Spintronics, Korea Institute of Science and Technology, Seoul, 02792, South Korea
| | - G Chen
- National Laboratory of Solid State Microstructures and Department of Physics, Nanjing University, Nanjing, 210093, China
- Collaborative Innovation Center of Advanced Microstructures, Nanjing, 210093, China
| | - H Y Kwon
- Center for Spintronics, Korea Institute of Science and Technology, Seoul, 02792, South Korea.
| | - C Won
- Department of Physics, Kyung Hee University, Seoul, 02447, South Korea.
| |
Collapse
|
13
|
Song JU, Choi K, Oh SM, Kahng B. Exploring nonlinear dynamics and network structures in Kuramoto systems using machine learning approaches. CHAOS (WOODBURY, N.Y.) 2023; 33:073148. [PMID: 37486666 DOI: 10.1063/5.0153229] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Accepted: 07/03/2023] [Indexed: 07/25/2023]
Abstract
Recent advances in machine learning (ML) have facilitated its application to a wide range of systems, from complex to quantum. Reservoir computing algorithms have proven particularly effective for studying nonlinear dynamical systems that exhibit collective behaviors, such as synchronizations and chaotic phenomena, some of which still remain unclear. Here, we apply ML approaches to the Kuramoto model to address several intriguing problems, including identifying the transition point and criticality of a hybrid synchronization transition, predicting future chaotic behaviors, and understanding network structures from chaotic patterns. Our proposed method also has further implications, such as inferring the structure of neural networks from electroencephalogram signals. This study, finally, highlights the potential of ML approaches for advancing our understanding of complex systems.
Collapse
Affiliation(s)
- Je Ung Song
- CTP and Department of Physics and Astronomy, Seoul National University, Seoul 08826, Korea
| | - Kwangjong Choi
- CTP and Department of Physics and Astronomy, Seoul National University, Seoul 08826, Korea
| | - Soo Min Oh
- Wireless Information and Network Sciences Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
- Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - B Kahng
- Center for Complex Systems and KI for Grid Modernization, Korea Institute of Energy Technology, Naju 58217, Korea
| |
Collapse
|
14
|
Liu YJ, Smith A, Knap M, Pollmann F. Model-Independent Learning of Quantum Phases of Matter with Quantum Convolutional Neural Networks. PHYSICAL REVIEW LETTERS 2023; 130:220603. [PMID: 37327416 DOI: 10.1103/physrevlett.130.220603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Accepted: 05/16/2023] [Indexed: 06/18/2023]
Abstract
Quantum convolutional neural networks (QCNNs) have been introduced as classifiers for gapped quantum phases of matter. Here, we propose a model-independent protocol for training QCNNs to discover order parameters that are unchanged under phase-preserving perturbations. We initiate the training sequence with the fixed-point wave functions of the quantum phase and add translation-invariant noise that respects the symmetries of the system to mask the fixed-point structure on short length scales. We illustrate this approach by training the QCNN on phases protected by time-reversal symmetry in one dimension, and test it on several time-reversal symmetric models exhibiting trivial, symmetry-breaking, and symmetry-protected topological order. The QCNN discovers a set of order parameters that identifies all three phases and accurately predicts the location of the phase boundary. The proposed protocol paves the way toward hardware-efficient training of quantum phase classifiers on a programmable quantum processor.
Collapse
Affiliation(s)
- Yu-Jie Liu
- Technical University of Munich, TUM School of Natural Sciences, Physics Department, 85748 Garching, Germany
- Munich Center for Quantum Science and Technology (MCQST), Schellingstrasse 4, 80799 München, Germany
| | - Adam Smith
- School of Physics and Astronomy, University of Nottingham, Nottingham NG7 2RD, United Kingdom
- Centre for the Mathematics and Theoretical Physics of Quantum Non-Equilibrium Systems, University of Nottingham, Nottingham NG7 2RD, United Kingdom
| | - Michael Knap
- Technical University of Munich, TUM School of Natural Sciences, Physics Department, 85748 Garching, Germany
- Munich Center for Quantum Science and Technology (MCQST), Schellingstrasse 4, 80799 München, Germany
| | - Frank Pollmann
- Technical University of Munich, TUM School of Natural Sciences, Physics Department, 85748 Garching, Germany
- Munich Center for Quantum Science and Technology (MCQST), Schellingstrasse 4, 80799 München, Germany
| |
Collapse
|
15
|
Agrawal D, Del Maestro A, Johnston S, Ostrowski J. Group-equivariant autoencoder for identifying spontaneously broken symmetries. Phys Rev E 2023; 107:054104. [PMID: 37329019 DOI: 10.1103/physreve.107.054104] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2022] [Accepted: 04/04/2023] [Indexed: 06/18/2023]
Abstract
We introduce the group-equivariant autoencoder (GE autoencoder), a deep neural network (DNN) method that locates phase boundaries by determining which symmetries of the Hamiltonian have spontaneously broken at each temperature. We use group theory to deduce which symmetries of the system remain intact in all phases, and then use this information to constrain the parameters of the GE autoencoder such that the encoder learns an order parameter invariant to these "never-broken" symmetries. This procedure produces a dramatic reduction in the number of free parameters such that the GE-autoencoder size is independent of the system size. We include symmetry regularization terms in the loss function of the GE autoencoder so that the learned order parameter is also equivariant to the remaining symmetries of the system. By examining the group representation by which the learned order parameter transforms, we are then able to extract information about the associated spontaneous symmetry breaking. We test the GE autoencoder on the 2D classical ferromagnetic and antiferromagnetic Ising models, finding that the GE autoencoder (1) accurately determines which symmetries have spontaneously broken at each temperature; (2) estimates the critical temperature in the thermodynamic limit with greater accuracy, robustness, and time efficiency than a symmetry-agnostic baseline autoencoder; and (3) detects the presence of an external symmetry-breaking magnetic field with greater sensitivity than the baseline method. Finally, we describe various key implementation details, including a quadratic-programming-based method for extracting the critical temperature estimate from trained autoencoders and calculations of the DNN initialization and learning rate settings required for fair model comparisons.
Collapse
Affiliation(s)
- Devanshu Agrawal
- Department of Industrial and Systems Engineering, University of Tennessee, Knoxville, Tennessee 37996, USA
| | - Adrian Del Maestro
- Department of Physics and Astronomy, University of Tennessee, Knoxville, Tennessee 37996, USA
- Min H. Kao Department of Electrical Engineering and Computer Science, University of Tennessee, Knoxville, Tennessee 37996, USA
- Institute for Advanced Materials and Manufacturing, University of Tennessee, Knoxville, Tennessee 37996, USA
| | - Steven Johnston
- Department of Physics and Astronomy, University of Tennessee, Knoxville, Tennessee 37996, USA
- Institute for Advanced Materials and Manufacturing, University of Tennessee, Knoxville, Tennessee 37996, USA
| | - James Ostrowski
- Department of Industrial and Systems Engineering, University of Tennessee, Knoxville, Tennessee 37996, USA
| |
Collapse
|
16
|
Rahaman SS, Haldar S, Kumar M. Machine learning approach to study quantum phase transitions of a frustrated one dimensional spin-1/2 system. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2023; 35:115603. [PMID: 36599166 DOI: 10.1088/1361-648x/acb030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Accepted: 01/04/2023] [Indexed: 06/17/2023]
Abstract
Frustration-driven quantum fluctuation leads to many exotic phases in the ground state (GS) and the study of these quantum phase transitions is one of the most challenging areas of research in condensed matter physics. We study a frustrated HeisenbergJ1-J2model of spin-1/2 chain with nearest exchange interactionJ1and next nearest exchange interactionJ2using the principal component analysis (PCA) which is an unsupervised machine learning technique. In this method most probable spin configurations (MPSCs) of GS and first excited state (FES) for differentJ2/J1are used as the input in PCA to construct the covariance matrix. The 'quantified principal component'p1(J2/J1)of the largest eigenvalue of the covariance matrix is calculated and it is shown that the nature and variation ofp1(J2/J1)can accurately predict the phase transitions and degeneracies in the GS. Thep1(J2/J1)calculated from the MPSC of GS only exhibits the signature of degeneracies in the GS, whereas,p1(J2/J1)calculated from the MPSC of FES captures the gapless spin liquid (GSL)-dimer phase transition as well as all the degeneracies of the model system. We show that the jump inp1(J2/J1)of FES atJ2/J1≈0.241, indicates the GSL-dimer phase transition, whereas its kinks give the signature of the GS degeneracies. The scatter plot of the first two principal components of FES shows distinct band formation for different phases. The MPSCs are obtained by using an iterative variational method (IVM) which gives the approximate eigenvalues and eigenvectors.
Collapse
Affiliation(s)
- Sk Saniur Rahaman
- S. N. Bose National Centre for Basic Sciences, Block JD, Sector III, Salt Lake, Kolkata 700106, India
| | - Sumit Haldar
- S. N. Bose National Centre for Basic Sciences, Block JD, Sector III, Salt Lake, Kolkata 700106, India
| | - Manoranjan Kumar
- S. N. Bose National Centre for Basic Sciences, Block JD, Sector III, Salt Lake, Kolkata 700106, India
| |
Collapse
|
17
|
Thamm M, Staats M, Rosenow B. Random matrix analysis of deep neural network weight matrices. Phys Rev E 2022; 106:054124. [PMID: 36559497 DOI: 10.1103/physreve.106.054124] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2022] [Accepted: 10/14/2022] [Indexed: 06/17/2023]
Abstract
Neural networks have been used successfully in a variety of fields, which has led to a great deal of interest in developing a theoretical understanding of how they store the information needed to perform a particular task. We study the weight matrices of trained deep neural networks using methods from random matrix theory (RMT) and show that the statistics of most of the singular values follow universal RMT predictions. This suggests that they are random and do not contain system specific information, which we investigate further by comparing the statistics of eigenvector entries to the universal Porter-Thomas distribution. We find that for most eigenvectors the hypothesis of randomness cannot be rejected, and that only eigenvectors belonging to the largest singular values deviate from the RMT prediction, indicating that they may encode learned information. In addition, a comparison with RMT predictions also allows to distinguish networks trained in different learning regimes-from lazy to rich learning. We analyze the spectral distribution of the large singular values using the Hill estimator and find that the distribution cannot in general be characterized by a tail index, i.e., is not of power-law type.
Collapse
Affiliation(s)
- Matthias Thamm
- Institut für Theoretische Physik, Universität Leipzig, Brüderstrasse 16, 04103 Leipzig, Germany
| | - Max Staats
- Institut für Theoretische Physik, Universität Leipzig, Brüderstrasse 16, 04103 Leipzig, Germany
| | - Bernd Rosenow
- Institut für Theoretische Physik, Universität Leipzig, Brüderstrasse 16, 04103 Leipzig, Germany
| |
Collapse
|
18
|
Dborin J, Wimalaweera V, Barratt F, Ostby E, O'Brien TE, Green AG. Simulating groundstate and dynamical quantum phase transitions on a superconducting quantum computer. Nat Commun 2022; 13:5977. [PMID: 36216839 PMCID: PMC9550817 DOI: 10.1038/s41467-022-33737-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 09/30/2022] [Indexed: 11/15/2022] Open
Abstract
The phenomena of quantum criticality underlie many novel collective phenomena found in condensed matter systems. They present a challenge for classical and quantum simulation, in part because of diverging correlation lengths and consequently strong finite-size effects. Tensor network techniques that work directly in the thermodynamic limit can negotiate some of these difficulties. Here, we optimise a translationally invariant, sequential quantum circuit on a superconducting quantum device to simulate the groundstate of the quantum Ising model through its quantum critical point. We further demonstrate how the dynamical quantum critical point found in quenches of this model across its quantum critical point can be simulated. Our approach avoids finite-size scaling effects by using sequential quantum circuits inspired by infinite matrix product states. We provide efficient circuits and a variety of error mitigation strategies to implement, optimise and time-evolve these states.
Collapse
Affiliation(s)
- James Dborin
- London Centre for Nanotechnology, University College London, Gordon St., London, WC1H 0AH, UK
| | - Vinul Wimalaweera
- London Centre for Nanotechnology, University College London, Gordon St., London, WC1H 0AH, UK
| | - F Barratt
- Department of Physics, University of Massachusetts, Amherst, MA, 01003, USA
| | - Eric Ostby
- Google Quantum AI, 80636, Munich, Germany
| | | | - A G Green
- London Centre for Nanotechnology, University College London, Gordon St., London, WC1H 0AH, UK.
| |
Collapse
|
19
|
Sarder MTH, Medhi A. Feed-forward neural network based variational wave function for the fermionic Hubbard model in one dimension. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2022; 34:375901. [PMID: 35772394 DOI: 10.1088/1361-648x/ac7d85] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2022] [Accepted: 06/30/2022] [Indexed: 06/15/2023]
Abstract
We explore the suitability of a feed-forward neural network (FNN) to represent the ground state of the fermionic Hubbard model in one dimension (1D). We consider the model at half-filling, represent the ground state wave function in terms of an FNN and optimize it using the variational Monte Carlo (VMC) method. The results are compared with the exact Bethe Ansatz solution. We find that for lattice sizes which give a 'filled-shell' condition for the non-interacting Fermi sea wave function, a simple FNN performs very well at all values of Hubbard interactionU. For lattice sizes where this condition is not obtained, the simple FNN fails and we find a modified network with a 'sign' component (sFNN) to work in such cases. On the flip side, though we find the FNN to be successful in providing an unbiased variational wave function for the fermionic many-body system in 1D, the computational cost for the wave function scales up rapidly with lattice size which limits its applicability.
Collapse
Affiliation(s)
- Md Tahir Hossain Sarder
- Indian Institute of Science Education and Research Thiruvananthapuram, Kerala, 695551, India
| | - Amal Medhi
- Indian Institute of Science Education and Research Thiruvananthapuram, Kerala, 695551, India
| |
Collapse
|
20
|
Dawid A, Huembeli P, Tomza M, Lewenstein M, Dauphin A. Hessian-based toolbox for reliable and interpretable machine learning in physics. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2022. [DOI: 10.1088/2632-2153/ac338d] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Abstract
Machine learning (ML) techniques applied to quantum many-body physics have emerged as a new research field. While the numerical power of this approach is undeniable, the most expressive ML algorithms, such as neural networks, are black boxes: The user does neither know the logic behind the model predictions nor the uncertainty of the model predictions. In this work, we present a toolbox for interpretability and reliability, agnostic of the model architecture. In particular, it provides a notion of the influence of the input data on the prediction at a given test point, an estimation of the uncertainty of the model predictions, and an extrapolation score for the model predictions. Such a toolbox only requires a single computation of the Hessian of the training loss function. Our work opens the road to the systematic use of interpretability and reliability methods in ML applied to physics and, more generally, science.
Collapse
|
21
|
Zou E, Long E, Zhao E. Learning a compass spin model with neural network quantum states. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2022; 34:125802. [PMID: 34915457 DOI: 10.1088/1361-648x/ac43ff] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 12/16/2021] [Indexed: 06/14/2023]
Abstract
Neural network quantum states provide a novel representation of the many-body states of interacting quantum systems and open up a promising route to solve frustrated quantum spin models that evade other numerical approaches. Yet its capacity to describe complex magnetic orders with large unit cells has not been demonstrated, and its performance in a rugged energy landscape has been questioned. Here we apply restricted Boltzmann machines (RBMs) and stochastic gradient descent to seek the ground states of a compass spin model on the honeycomb lattice, which unifies the Kitaev model, Ising model and the quantum 120° model with a single tuning parameter. We report calculation results on the variational energy, order parameters and correlation functions. The phase diagram obtained is in good agreement with the predictions of tensor network ansatz, demonstrating the capacity of RBMs in learning the ground states of frustrated quantum spin Hamiltonians. The limitations of the calculation are discussed. A few strategies are outlined to address some of the challenges in machine learning frustrated quantum magnets.
Collapse
Affiliation(s)
- Eric Zou
- Department of Physics and Astronomy, George Mason University, Fairfax, Virginia 22030, United States of America
| | - Erik Long
- Department of Physics and Astronomy, George Mason University, Fairfax, Virginia 22030, United States of America
| | - Erhai Zhao
- Department of Physics and Astronomy, George Mason University, Fairfax, Virginia 22030, United States of America
| |
Collapse
|
22
|
Rodimkov Y, Bhadoria S, Volokitin V, Efimenko E, Polovinkin A, Blackburn T, Marklund M, Gonoskov A, Meyerov I. Towards ML-Based Diagnostics of Laser-Plasma Interactions. SENSORS (BASEL, SWITZERLAND) 2021; 21:6982. [PMID: 34770288 PMCID: PMC8588203 DOI: 10.3390/s21216982] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/04/2021] [Revised: 10/07/2021] [Accepted: 10/19/2021] [Indexed: 11/16/2022]
Abstract
The power of machine learning (ML) in feature identification can be harnessed for determining quantities in experiments that are difficult to measure directly. However, if an ML model is trained on simulated data, rather than experimental results, the differences between the two can pose an obstacle to reliable data extraction. Here we report on the development of ML-based diagnostics for experiments on high-intensity laser-matter interactions. With the intention to accentuate robust, physics-governed features, the presence of which is tolerant to such differences, we test the application of principal component analysis, data augmentation and training with data that has superimposed noise of gradually increasing amplitude. Using synthetic data of simulated experiments, we identify that the approach based on the noise of increasing amplitude yields the most accurate ML models and thus is likely to be useful in similar projects on ML-based diagnostics.
Collapse
Affiliation(s)
- Yury Rodimkov
- Department of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhni Novgorod, Russia; (Y.R.); (V.V.)
| | - Shikha Bhadoria
- Department of Physics, University of Gothenburg, SE-41296 Gothenburg, Sweden; (S.B.); (T.B.); (M.M.)
| | - Valentin Volokitin
- Department of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhni Novgorod, Russia; (Y.R.); (V.V.)
- Mathematical Center, Lobachevsky University, 603950 Nizhni Novgorod, Russia
| | - Evgeny Efimenko
- Institute of Applied Physics of the Russian Academy of Sciences, 603950 Nizhni Novgorod, Russia;
| | | | - Thomas Blackburn
- Department of Physics, University of Gothenburg, SE-41296 Gothenburg, Sweden; (S.B.); (T.B.); (M.M.)
| | - Mattias Marklund
- Department of Physics, University of Gothenburg, SE-41296 Gothenburg, Sweden; (S.B.); (T.B.); (M.M.)
| | - Arkady Gonoskov
- Department of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhni Novgorod, Russia; (Y.R.); (V.V.)
- Department of Physics, University of Gothenburg, SE-41296 Gothenburg, Sweden; (S.B.); (T.B.); (M.M.)
| | - Iosif Meyerov
- Department of Mathematical Software and Supercomputing Technologies, Lobachevsky University, 603950 Nizhni Novgorod, Russia; (Y.R.); (V.V.)
- Mathematical Center, Lobachevsky University, 603950 Nizhni Novgorod, Russia
| |
Collapse
|
23
|
Ren XY, Han RS, Chen L. Learning impurity spectral functions from density of states. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2021; 33:495601. [PMID: 34500441 DOI: 10.1088/1361-648x/ac2533] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Accepted: 09/09/2021] [Indexed: 06/13/2023]
Abstract
Using numerical renormalization group calculation, we construct a dataset with 100 K samples, and train six different neural networks for the prediction of spectral functions from density of states (DOS) of the host material. We find that a combination of gated recurrent unit (GRU) network and bidirectional GRU (BiGRU) performances the best among all the six neural networks. The mean absolute error of the GRU + BiGRU network can reach 0.052 and 0.043 when this network is evaluated on the original dataset and two other independent datasets. The average time of spectral function predictions from machine learning is on the scale of 10-5-10-6that of traditional impurity solvers for Anderson impurity model. This investigation pave the way for the application of recurrent neural network and convolutional neural network in the prediction of spectral functions from DOSs in machine learning solvers of magnetic impurity problems.
Collapse
Affiliation(s)
- Xing-Yuan Ren
- Mathematics and Physics Department, North China Electric Power University, Beijing, 102206, People's Republic of China
| | - Rong-Sheng Han
- Mathematics and Physics Department, North China Electric Power University, Beijing, 102206, People's Republic of China
| | - Liang Chen
- Mathematics and Physics Department, North China Electric Power University, Beijing, 102206, People's Republic of China
| |
Collapse
|
24
|
Abstract
Chemical compound space (CCS), the set of all theoretically conceivable combinations of chemical elements and (meta-)stable geometries that make up matter, is colossal. The first-principles based virtual sampling of this space, for example, in search of novel molecules or materials which exhibit desirable properties, is therefore prohibitive for all but the smallest subsets and simplest properties. We review studies aimed at tackling this challenge using modern machine learning techniques based on (i) synthetic data, typically generated using quantum mechanics based methods, and (ii) model architectures inspired by quantum mechanics. Such Quantum mechanics based Machine Learning (QML) approaches combine the numerical efficiency of statistical surrogate models with an ab initio view on matter. They rigorously reflect the underlying physics in order to reach universality and transferability across CCS. While state-of-the-art approximations to quantum problems impose severe computational bottlenecks, recent QML based developments indicate the possibility of substantial acceleration without sacrificing the predictive power of quantum mechanics.
Collapse
Affiliation(s)
- Bing Huang
- Faculty
of Physics, University of Vienna, 1090 Vienna, Austria
| | - O. Anatole von Lilienfeld
- Faculty
of Physics, University of Vienna, 1090 Vienna, Austria
- Institute
of Physical Chemistry and National Center for Computational Design
and Discovery of Novel Materials (MARVEL), Department of Chemistry, University of Basel, 4056 Basel, Switzerland
| |
Collapse
|
25
|
Driskell G, Lederer S, Bauer C, Trebst S, Kim EA. Identification of Non-Fermi Liquid Physics in a Quantum Critical Metal via Quantum Loop Topography. PHYSICAL REVIEW LETTERS 2021; 127:046601. [PMID: 34355923 DOI: 10.1103/physrevlett.127.046601] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Accepted: 06/23/2021] [Indexed: 06/13/2023]
Abstract
Non-Fermi liquid physics is ubiquitous in strongly correlated metals, manifesting itself in anomalous transport properties, such as a T-linear resistivity in experiments. However, its theoretical understanding in terms of microscopic models is lacking, despite decades of conceptual work and attempted numerical simulations. Here we demonstrate that a combination of sign-problem-free quantum Monte Carlo sampling and quantum loop topography, a physics-inspired machine-learning approach, can map out the emergence of non-Fermi liquid physics in the vicinity of a quantum critical point (QCP) with little prior knowledge. Using only three parameter points for training the underlying neural network, we are able to robustly identify a stable non-Fermi liquid regime tracing the fans of metallic QCPs at the onset of both spin-density wave and nematic order. In particular, we establish for the first time that a spin-density wave QCP commands a wide fan of non-Fermi liquid region that funnels into the quantum critical point. Our study thereby provides an important proof-of-principle example that new physics can be detected via unbiased machine-learning approaches.
Collapse
Affiliation(s)
- George Driskell
- Department of Physics, Cornell University, Ithaca, New York 14853, USA
| | - Samuel Lederer
- Department of Physics, Cornell University, Ithaca, New York 14853, USA
| | - Carsten Bauer
- Institute for Theoretical Physics, University of Cologne, 50937 Cologne, Germany
| | - Simon Trebst
- Institute for Theoretical Physics, University of Cologne, 50937 Cologne, Germany
| | - Eun-Ah Kim
- Department of Physics, Cornell University, Ithaca, New York 14853, USA
| |
Collapse
|
26
|
Käming N, Dawid A, Kottmann K, Lewenstein M, Sengstock K, Dauphin A, Weitenberg C. Unsupervised machine learning of topological phase transitions from experimental data. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2021. [DOI: 10.1088/2632-2153/abffe7] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Abstract
Identifying phase transitions is one of the key challenges in quantum many-body physics. Recently, machine learning methods have been shown to be an alternative way of localising phase boundaries from noisy and imperfect data without the knowledge of the order parameter. Here, we apply different unsupervised machine learning techniques, including anomaly detection and influence functions, to experimental data from ultracold atoms. In this way, we obtain the topological phase diagram of the Haldane model in a completely unbiased fashion. We show that these methods can successfully be applied to experimental data at finite temperatures and to the data of Floquet systems when post-processing the data to a single micromotion phase. Our work provides a benchmark for the unsupervised detection of new exotic phases in complex many-body systems.
Collapse
|
27
|
Liu J, Cao G, Zhou Z, Liu H. Screening potential topological insulators in half-Heusler compounds via compressed-sensing. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2021; 33:325501. [PMID: 33001860 DOI: 10.1088/1361-648x/abba8d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/22/2020] [Accepted: 09/22/2020] [Indexed: 06/11/2023]
Abstract
Ternary half-Heusler compounds with widely tunable electronic structures, present a new platform to discover topological insulators (TIs). Due to time-consuming computations and synthesis procedures, the identification of new TIs is however a rough task. Here, we adopt a compressed-sensing approach to rapidly screen potential TIs in half-Heusler family, which is realized via a two-dimensional descriptor that only depends on the fundamental properties of the constituent atoms. Beyond the finite training data, the proposed descriptor is employed to screen many new half-Heusler compounds, including those with integer and fractional stoichiometry, and a larger number of possible TIs are predicted.
Collapse
Affiliation(s)
- Jianghui Liu
- Key Laboratory of Artificial Micro- and Nano-Structures of Ministry of Education and School of Physics and Technology, Wuhan University, Wuhan 430072, People's Republic of China
| | - Guohua Cao
- Key Laboratory of Artificial Micro- and Nano-Structures of Ministry of Education and School of Physics and Technology, Wuhan University, Wuhan 430072, People's Republic of China
| | - Zizhen Zhou
- Key Laboratory of Artificial Micro- and Nano-Structures of Ministry of Education and School of Physics and Technology, Wuhan University, Wuhan 430072, People's Republic of China
| | - Huijun Liu
- Key Laboratory of Artificial Micro- and Nano-Structures of Ministry of Education and School of Physics and Technology, Wuhan University, Wuhan 430072, People's Republic of China
| |
Collapse
|
28
|
Shen J, Li W, Deng S, Zhang T. Supervised and unsupervised learning of directed percolation. Phys Rev E 2021; 103:052140. [PMID: 34134215 DOI: 10.1103/physreve.103.052140] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2021] [Accepted: 05/12/2021] [Indexed: 11/07/2022]
Abstract
Machine learning (ML) has been well applied to studying equilibrium phase transition models by accurately predicating critical thresholds and some critical exponents. Difficulty will be raised, however, for integrating ML into nonequilibrium phase transitions. The extra dimension in a given nonequilibrium system, namely time, can greatly slow down the procedure toward the steady state. In this paper we find that by using some simple techniques of ML, non-steady-state configurations of directed percolation (DP) suffice to capture its essential critical behaviors in both (1+1) and (2+1) dimensions. With the supervised learning method, the framework of our binary classification neural networks can identify the phase transition threshold, as well as the spatial and temporal correlation exponents. The characteristic time t_{c}, specifying the transition from active phases to absorbing ones, is also a major product of the learning. Moreover, we employ the convolutional autoencoder, an unsupervised learning technique, to extract dimensionality reduction representations and cluster configurations of (1+1) bond DP. It is quite appealing that such a method can yield a reasonable estimation of the critical point.
Collapse
Affiliation(s)
- Jianmin Shen
- Key Laboratory of Quark and Lepton Physics (MOE) and Institute of Particle Physics, Central China Normal University, Wuhan 430079, China
| | - Wei Li
- Key Laboratory of Quark and Lepton Physics (MOE) and Institute of Particle Physics, Central China Normal University, Wuhan 430079, China.,Max-Planck-Institute for Mathematics in the Sciences, 04103 Leipzig, Germany
| | - Shengfeng Deng
- Key Laboratory of Quark and Lepton Physics (MOE) and Institute of Particle Physics, Central China Normal University, Wuhan 430079, China
| | - Tao Zhang
- Key Laboratory of Quark and Lepton Physics (MOE) and Institute of Particle Physics, Central China Normal University, Wuhan 430079, China
| |
Collapse
|
29
|
Tula T, Möller G, Quintanilla J, Giblin SR, Hillier AD, McCabe EE, Ramos S, Barker DS, Gibson S. Machine learning approach to muon spectroscopy analysis. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2021; 33:194002. [PMID: 33545697 DOI: 10.1088/1361-648x/abe39e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Accepted: 02/05/2021] [Indexed: 06/12/2023]
Abstract
In recent years, artificial intelligence techniques have proved to be very successful when applied to problems in physical sciences. Here we apply an unsupervised machine learning (ML) algorithm called principal component analysis (PCA) as a tool to analyse the data from muon spectroscopy experiments. Specifically, we apply the ML technique to detect phase transitions in various materials. The measured quantity in muon spectroscopy is an asymmetry function, which may hold information about the distribution of the intrinsic magnetic field in combination with the dynamics of the sample. Sharp changes of shape of asymmetry functions-measured at different temperatures-might indicate a phase transition. Existing methods of processing the muon spectroscopy data are based on regression analysis, but choosing the right fitting function requires knowledge about the underlying physics of the probed material. Conversely, PCA focuses on small differences in the asymmetry curves and works without any prior assumptions about the studied samples. We discovered that the PCA method works well in detecting phase transitions in muon spectroscopy experiments and can serve as an alternative to current analysis, especially if the physics of the studied material are not entirely known. Additionally, we found out that our ML technique seems to work best with large numbers of measurements, regardless of whether the algorithm takes data only for a single material or whether the analysis is performed simultaneously for many materials with different physical properties.
Collapse
Affiliation(s)
- T Tula
- School of Physical Sciences, University of Kent, Park Wood Rd, Canterbury CT2 7NH, United Kingdom
| | - G Möller
- School of Physical Sciences, University of Kent, Park Wood Rd, Canterbury CT2 7NH, United Kingdom
| | - J Quintanilla
- School of Physical Sciences, University of Kent, Park Wood Rd, Canterbury CT2 7NH, United Kingdom
| | - S R Giblin
- School of Physics and Astronomy, Cardiff University, Cardiff CF24 3AA, United Kingdom
| | - A D Hillier
- ISIS Facility, STFC Rutherford Appleton Laboratory, Chilton, Didcot Oxon, OX11 0QX, United Kingdom
| | - E E McCabe
- School of Physical Sciences, University of Kent, Park Wood Rd, Canterbury CT2 7NH, United Kingdom
| | - S Ramos
- School of Physical Sciences, University of Kent, Park Wood Rd, Canterbury CT2 7NH, United Kingdom
| | - D S Barker
- School of Physical Sciences, University of Kent, Park Wood Rd, Canterbury CT2 7NH, United Kingdom
- School of Physics and Astronomy, University of Leeds, Leeds, LS2 9JT, United Kingdom
| | - S Gibson
- School of Physical Sciences, University of Kent, Park Wood Rd, Canterbury CT2 7NH, United Kingdom
| |
Collapse
|
30
|
Azizi A, Pleimling M. A cautionary tale for machine learning generated configurations in presence of a conserved quantity. Sci Rep 2021; 11:6395. [PMID: 33737630 PMCID: PMC7973807 DOI: 10.1038/s41598-021-85683-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2020] [Accepted: 03/04/2021] [Indexed: 11/09/2022] Open
Abstract
We investigate the performance of machine learning algorithms trained exclusively with configurations obtained from importance sampling Monte Carlo simulations of the two-dimensional Ising model with conserved magnetization. For supervised machine learning, we use convolutional neural networks and find that the corresponding output not only allows to locate the phase transition point with high precision, it also displays a finite-size scaling characterized by an Ising critical exponent. For unsupervised learning, restricted Boltzmann machines (RBM) are trained to generate new configurations that are then used to compute various quantities. We find that RBM generates configurations with magnetizations and energies forbidden in the original physical system. The RBM generated configurations result in energy density probability distributions with incorrect weights as well as in wrong spatial correlations. We show that shortcomings are also encountered when training RBM with configurations obtained from the non-conserved Ising model.
Collapse
Affiliation(s)
- Ahmadreza Azizi
- Department of Physics, Virginia Tech, Blacksburg, VA, 24061-0435, USA. .,Center for Soft Matter and Biological Physics, Virginia Tech, Blacksburg, VA, 24061-0435, USA.
| | - Michel Pleimling
- Department of Physics, Virginia Tech, Blacksburg, VA, 24061-0435, USA.,Center for Soft Matter and Biological Physics, Virginia Tech, Blacksburg, VA, 24061-0435, USA.,Academy of Integrated Science, Virginia Tech, Blacksburg, VA, 24061-0563, USA
| |
Collapse
|
31
|
Mototake YI. Interpretable conservation law estimation by deriving the symmetries of dynamics from trained deep neural networks. Phys Rev E 2021; 103:033303. [PMID: 33862698 DOI: 10.1103/physreve.103.033303] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2020] [Accepted: 02/08/2021] [Indexed: 11/07/2022]
Abstract
Understanding complex systems with their reduced model is one of the central roles in scientific activities. Although physics has greatly been developed with the physical insights of physicists, it is sometimes challenging to build a reduced model of such complex systems on the basis of insight alone. We propose a framework that can infer hidden conservation laws of a complex system from deep neural networks (DNNs) that have been trained with physical data of the system. The purpose of the proposed framework is not to analyze physical data with deep learning but to extract interpretable physical information from trained DNNs. With Noether's theorem and by an efficient sampling method, the proposed framework infers conservation laws by extracting the symmetries of dynamics from trained DNNs. The proposed framework is developed by deriving the relationship between a manifold structure of a time-series data set and the necessary conditions for Noether's theorem. The feasibility of the proposed framework has been verified in some primitive cases in which the conservation law is well known. We also apply the proposed framework to conservation law estimation for a more practical case, that is, a large-scale collective motion system in the metastable state, and we obtain a result consistent with that of a previous study.
Collapse
Affiliation(s)
- Yoh-Ichi Mototake
- The Institute of Statistical Mathematics, Tachikawa, Tokyo 190-8562, Japan
| |
Collapse
|
32
|
Tang H, Shi R, He TS, Zhu YY, Wang TY, Lee M, Jin XM. TensorFlow solver for quantum PageRank in large-scale networks. Sci Bull (Beijing) 2021; 66:120-126. [PMID: 36654218 DOI: 10.1016/j.scib.2020.09.009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Revised: 07/24/2020] [Accepted: 08/31/2020] [Indexed: 01/20/2023]
Abstract
Google PageRank is a prevalent algorithm for ranking the significance of nodes or websites in a network, and a recent quantum counterpart for PageRank algorithm has been raised to suggest a higher accuracy of ranking comparing to Google PageRank. The quantum PageRank algorithm is essentially based on quantum stochastic walks and can be expressed using Lindblad master equation, which, however, needs to solve the Kronecker products of an O(N4) dimension and requires severely large memory and time when the number of nodes N in a network increases above 150. Here, we present an efficient solver for quantum PageRank by using the Runge-Kutta method to reduce the matrix dimension to O(N2) and employing TensorFlow to conduct GPU parallel computing. We demonstrate its performance in solving quantum stochastic walks on Erdös-Rényi graphs using an RTX 2060 GPU. The test on the graph of 6000 nodes requires a memory of 5.5 GB and time of 223 s, and that on the graph of 1000 nodes requires 226 MB and 3.6 s. Compared with QSWalk, a currently prevalent Mathematica solver, our solver for the same graph of 1000 nodes reduces the required memory and time to only 0.2% and 0.05%. We apply the solver to quantum PageRank for the USA major airline network with up to 922 nodes, and to quantum stochastic walk on a glued tree of 2186 nodes. This efficient solver for large-scale quantum PageRank and quantum stochastic walks would greatly facilitate studies of quantum information in real-life applications.
Collapse
Affiliation(s)
- Hao Tang
- Center for Integrated Quantum Information Technologies (IQIT), School of Physics and Astronomy and State Key Laboratory of Advanced Optical Communication Systems and Networks, Shanghai Jiao Tong University, Shanghai 200240, China; CAS Center for Excellence and Synergetic Innovation Center in Quantum Information and Quantum Physics, University of Science and Technology of China, Hefei 230026, China
| | - Ruoxi Shi
- Center for Integrated Quantum Information Technologies (IQIT), School of Physics and Astronomy and State Key Laboratory of Advanced Optical Communication Systems and Networks, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Tian-Shen He
- Center for Integrated Quantum Information Technologies (IQIT), School of Physics and Astronomy and State Key Laboratory of Advanced Optical Communication Systems and Networks, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Yan-Yan Zhu
- School of Physical Science, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Tian-Yu Wang
- Center for Integrated Quantum Information Technologies (IQIT), School of Physics and Astronomy and State Key Laboratory of Advanced Optical Communication Systems and Networks, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Marcus Lee
- Department of Physics, Cambridge University, Cambridge CB3 0HE, UK
| | - Xian-Min Jin
- Center for Integrated Quantum Information Technologies (IQIT), School of Physics and Astronomy and State Key Laboratory of Advanced Optical Communication Systems and Networks, Shanghai Jiao Tong University, Shanghai 200240, China; CAS Center for Excellence and Synergetic Innovation Center in Quantum Information and Quantum Physics, University of Science and Technology of China, Hefei 230026, China.
| |
Collapse
|
33
|
Balabanov O, Granath M. Unsupervised interpretable learning of topological indices invariant under permutations of atomic bands. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2020. [DOI: 10.1088/2632-2153/abcc43] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Abstract
Multi-band insulating Bloch Hamiltonians with internal or spatial symmetries, such as particle-hole or inversion, may have topologically disconnected sectors of trivial atomic-limit (momentum-independent) Hamiltonians. We present a neural-network-based protocol for finding topologically relevant indices that are invariant under transformations between such trivial atomic-limit Hamiltonians, thus corresponding to the standard classification of band insulators. The work extends the method of ‘topological data augmentation’ for unsupervised learning introduced (2020 Phys. Rev. Res.
2 013354) by also generalizing and simplifying the data generation scheme and by introducing a special ‘mod’ layer of the neural network appropriate for Z
n
classification. Ensembles of training data are generated by deforming seed objects in a way that preserves a discrete representation of continuity. In order to focus the learning on the topologically relevant indices, prior to the deformation procedure we stack the seed Bloch Hamiltonians with a complete set of symmetry-respecting trivial atomic bands. The obtained datasets are then used for training an interpretable neural network specially designed to capture the topological properties by learning physically relevant momentum space quantities, even in crystalline symmetry classes.
Collapse
|
34
|
Bachtis D, Aarts G, Lucini B. Mapping distinct phase transitions to a neural network. Phys Rev E 2020; 102:053306. [PMID: 33327125 DOI: 10.1103/physreve.102.053306] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2020] [Accepted: 10/22/2020] [Indexed: 11/07/2022]
Abstract
We demonstrate, by means of a convolutional neural network, that the features learned in the two-dimensional Ising model are sufficiently universal to predict the structure of symmetry-breaking phase transitions in considered systems irrespective of the universality class, order, and the presence of discrete or continuous degrees of freedom. No prior knowledge about the existence of a phase transition is required in the target system and its entire parameter space can be scanned with multiple histogram reweighting to discover one. We establish our approach in q-state Potts models and perform a calculation for the critical coupling and the critical exponents of the ϕ^{4} scalar field theory using quantities derived from the neural network implementation. We view the machine learning algorithm as a mapping that associates each configuration across different systems to its corresponding phase and elaborate on implications for the discovery of unknown phase transitions.
Collapse
Affiliation(s)
- Dimitrios Bachtis
- Department of Mathematics, Swansea University, Bay Campus, SA1 8EN, Swansea, Wales, United Kingdom
| | - Gert Aarts
- Department of Physics, Swansea University, Singleton Campus, SA2 8PP, Swansea, Wales, United Kingdom
| | - Biagio Lucini
- Department of Mathematics, Swansea University, Bay Campus, SA1 8EN, Swansea, Wales, United Kingdom.,Swansea Academy of Advanced Computing, Swansea University, Bay Campus, SA1 8EN, Swansea, Wales, United Kingdom
| |
Collapse
|
35
|
Lidiak A, Gong Z. Unsupervised Machine Learning of Quantum Phase Transitions Using Diffusion Maps. PHYSICAL REVIEW LETTERS 2020; 125:225701. [PMID: 33315426 DOI: 10.1103/physrevlett.125.225701] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/26/2020] [Accepted: 10/15/2020] [Indexed: 06/12/2023]
Abstract
Experimental quantum simulators have become large and complex enough that discovering new physics from the huge amount of measurement data can be quite challenging, especially when little theoretical understanding of the simulated model is available. Unsupervised machine learning methods are particularly promising in overcoming this challenge. For the specific task of learning quantum phase transitions, unsupervised machine learning methods have primarily been developed for phase transitions characterized by simple order parameters, typically linear in the measured observables. However, such methods often fail for more complicated phase transitions, such as those involving incommensurate phases, valence-bond solids, topological order, and many-body localization. We show that the diffusion map method, which performs nonlinear dimensionality reduction and spectral clustering of the measurement data, has significant potential for learning such complex phase transitions unsupervised. This method may work for measurements of local observables in a single basis and is thus readily applicable to many experimental quantum simulators as a versatile tool for learning various quantum phases and phase transitions.
Collapse
Affiliation(s)
- Alexander Lidiak
- Department of Physics, Colorado School of Mines, Golden, Colorado 80401, USA
| | - Zhexuan Gong
- Department of Physics, Colorado School of Mines, Golden, Colorado 80401, USA
- National Institute of Standards and Technology, Boulder, Colorado 80305, USA
| |
Collapse
|
36
|
Bedolla E, Padierna LC, Castañeda-Priego R. Machine learning for condensed matter physics. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2020; 33:053001. [PMID: 32932243 DOI: 10.1088/1361-648x/abb895] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/30/2020] [Accepted: 09/15/2020] [Indexed: 06/11/2023]
Abstract
Condensed matter physics (CMP) seeks to understand the microscopic interactions of matter at the quantum and atomistic levels, and describes how these interactions result in both mesoscopic and macroscopic properties. CMP overlaps with many other important branches of science, such as chemistry, materials science, statistical physics, and high-performance computing. With the advancements in modern machine learning (ML) technology, a keen interest in applying these algorithms to further CMP research has created a compelling new area of research at the intersection of both fields. In this review, we aim to explore the main areas within CMP, which have successfully applied ML techniques to further research, such as the description and use of ML schemes for potential energy surfaces, the characterization of topological phases of matter in lattice systems, the prediction of phase transitions in off-lattice and atomistic simulations, the interpretation of ML theories with physics-inspired frameworks and the enhancement of simulation methods with ML algorithms. We also discuss in detail the main challenges and drawbacks of using ML methods on CMP problems, as well as some perspectives for future developments.
Collapse
Affiliation(s)
- Edwin Bedolla
- División de Ciencias e Ingenierías, Universidad de Guanajuato, Loma del Bosque 103, 37150 León, Mexico
| | - Luis Carlos Padierna
- División de Ciencias e Ingenierías, Universidad de Guanajuato, Loma del Bosque 103, 37150 León, Mexico
| | - Ramón Castañeda-Priego
- División de Ciencias e Ingenierías, Universidad de Guanajuato, Loma del Bosque 103, 37150 León, Mexico
| |
Collapse
|
37
|
Kottmann K, Huembeli P, Lewenstein M, Acín A. Unsupervised Phase Discovery with Deep Anomaly Detection. PHYSICAL REVIEW LETTERS 2020; 125:170603. [PMID: 33156639 DOI: 10.1103/physrevlett.125.170603] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/30/2020] [Revised: 07/22/2020] [Accepted: 09/24/2020] [Indexed: 06/11/2023]
Abstract
We demonstrate how to explore phase diagrams with automated and unsupervised machine learning to find regions of interest for possible new phases. In contrast to supervised learning, where data is classified using predetermined labels, we here perform anomaly detection, where the task is to differentiate a normal dataset, composed of one or several classes, from anomalous data. As a paradigmatic example, we explore the phase diagram of the extended Bose Hubbard model in one dimension at exact integer filling and employ deep neural networks to determine the entire phase diagram in a completely unsupervised and automated fashion. As input data for learning, we first use the entanglement spectra and central tensors derived from tensor-networks algorithms for ground-state computation and later we extend our method and use experimentally accessible data such as low-order correlation functions as inputs. Our method allows us to reveal a phase-separated region between supersolid and superfluid parts with unexpected properties, which appears in the system in addition to the standard superfluid, Mott insulator, Haldane-insulating, and density wave phases.
Collapse
Affiliation(s)
- Korbinian Kottmann
- ICFO-Institut de Ciencies Fotoniques, The Barcelona Institute of Science and Technology, Av. Carl Friedrich Gauss 3, 08860 Castelldefels (Barcelona), Spain
| | - Patrick Huembeli
- ICFO-Institut de Ciencies Fotoniques, The Barcelona Institute of Science and Technology, Av. Carl Friedrich Gauss 3, 08860 Castelldefels (Barcelona), Spain
| | - Maciej Lewenstein
- ICFO-Institut de Ciencies Fotoniques, The Barcelona Institute of Science and Technology, Av. Carl Friedrich Gauss 3, 08860 Castelldefels (Barcelona), Spain
- ICREA, Pg. Llus Companys 23, 08010 Barcelona, Spain
| | - Antonio Acín
- ICFO-Institut de Ciencies Fotoniques, The Barcelona Institute of Science and Technology, Av. Carl Friedrich Gauss 3, 08860 Castelldefels (Barcelona), Spain
- ICREA, Pg. Llus Companys 23, 08010 Barcelona, Spain
| |
Collapse
|
38
|
Zhang YH, Zheng PL, Zhang Y, Deng DL. Topological Quantum Compiling with Reinforcement Learning. PHYSICAL REVIEW LETTERS 2020; 125:170501. [PMID: 33156669 DOI: 10.1103/physrevlett.125.170501] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/10/2020] [Revised: 09/08/2020] [Accepted: 09/14/2020] [Indexed: 06/11/2023]
Abstract
Quantum compiling, a process that decomposes the quantum algorithm into a series of hardware-compatible commands or elementary gates, is of fundamental importance for quantum computing. We introduce an efficient algorithm based on deep reinforcement learning that compiles an arbitrary single-qubit gate into a sequence of elementary gates from a finite universal set. It generates near-optimal gate sequences with given accuracy and is generally applicable to various scenarios, independent of the hardware-feasible universal set and free from using ancillary qubits. For concreteness, we apply this algorithm to the case of topological compiling of Fibonacci anyons and obtain near-optimal braiding sequences for arbitrary single-qubit unitaries. Our algorithm may carry over to other challenging quantum discrete problems, thus opening up a new avenue for intriguing applications of deep learning in quantum physics.
Collapse
Affiliation(s)
- Yuan-Hang Zhang
- Center for Quantum Information, IIIS, Tsinghua University, Beijing 100084, People's Republic of China
- Department of Physics, University of California, San Diego, California 92093, USA
| | - Pei-Lin Zheng
- International Center for Quantum Materials, Peking University, Beijing 100871, China
- School of Physics, Peking University, Beijing 100871, China
| | - Yi Zhang
- International Center for Quantum Materials, Peking University, Beijing 100871, China
- School of Physics, Peking University, Beijing 100871, China
| | - Dong-Ling Deng
- Center for Quantum Information, IIIS, Tsinghua University, Beijing 100084, People's Republic of China
- Shanghai Qi Zhi Institute, 41th Floor, AI Tower, No. 701 Yunjin Road, Xuhui District, Shanghai 200232, China
| |
Collapse
|
39
|
Bachtis D, Aarts G, Lucini B. Extending machine learning classification capabilities with histogram reweighting. Phys Rev E 2020; 102:033303. [PMID: 33075969 DOI: 10.1103/physreve.102.033303] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Accepted: 08/20/2020] [Indexed: 11/07/2022]
Abstract
We propose the use of Monte Carlo histogram reweighting to extrapolate predictions of machine learning methods. In our approach, we treat the output from a convolutional neural network as an observable in a statistical system, enabling its extrapolation over continuous ranges in parameter space. We demonstrate our proposal using the phase transition in the two-dimensional Ising model. By interpreting the output of the neural network as an order parameter, we explore connections with known observables in the system and investigate its scaling behavior. A finite-size scaling analysis is conducted based on quantities derived from the neural network that yields accurate estimates for the critical exponents and the critical temperature. The method improves the prospects of acquiring precision measurements from machine learning in physical systems without an order parameter and those where direct sampling in regions of parameter space might not be possible.
Collapse
Affiliation(s)
- Dimitrios Bachtis
- Department of Mathematics, Swansea University, Bay Campus, SA1 8EN, Swansea, Wales, United Kingdom
| | - Gert Aarts
- Department of Physics, Swansea University, Singleton Campus, SA2 8PP, Swansea, Wales, United Kingdom
| | - Biagio Lucini
- Department of Mathematics, Swansea University, Bay Campus, SA1 8EN, Swansea, Wales, United Kingdom.,Swansea Academy of Advanced Computing, Swansea University, Bay Campus, SA1 8EN, Swansea, Wales, United Kingdom
| |
Collapse
|
40
|
Pawlowski JM, Urban JM. Reducing autocorrelation times in lattice simulations with generative adversarial networks. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2020. [DOI: 10.1088/2632-2153/abae73] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Abstract
Short autocorrelation times are essential for a reliable error assessment in Monte Carlo simulations of lattice systems. In many interesting scenarios, the decay of autocorrelations in the Markov chain is prohibitively slow. Generative samplers can provide statistically independent field configurations, thereby potentially ameliorating these issues. In this work, the applicability of neural samplers to this problem is investigated. Specifically, we work with a generative adversarial network (GAN). We propose to address difficulties regarding its statistical exactness through the implementation of an overrelaxation step, by searching the latent space of the trained generator network. This procedure can be incorporated into a standard Monte Carlo algorithm, which then permits a sensible assessment of ergodicity and balance based on consistency checks. Numerical results for real, scalar φ
4-theory in two dimensions are presented. We achieve a significant reduction of autocorrelations while accurately reproducing the correct statistics. We discuss possible improvements to the approach as well as potential solutions to persisting issues.
Collapse
|
41
|
Yu S, Piao X, Park N. Machine learning identifies scale-free properties in disordered materials. Nat Commun 2020; 11:4842. [PMID: 32973187 PMCID: PMC7519134 DOI: 10.1038/s41467-020-18653-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2019] [Accepted: 08/28/2020] [Indexed: 11/23/2022] Open
Abstract
The vast amount of design freedom in disordered systems expands the parameter space for signal processing. However, this large degree of freedom has hindered the deterministic design of disordered systems for target functionalities. Here, we employ a machine learning approach for predicting and designing wave-matter interactions in disordered structures, thereby identifying scale-free properties for waves. To abstract and map the features of wave behaviors and disordered structures, we develop disorder-to-localization and localization-to-disorder convolutional neural networks, each of which enables the instantaneous prediction of wave localization in disordered structures and the instantaneous generation of disordered structures from given localizations. We demonstrate that the structural properties of the network architectures lead to the identification of scale-free disordered structures having heavy-tailed distributions, thus achieving multiple orders of magnitude improvement in robustness to accidental defects. Our results verify the critical role of neural network structures in determining machine-learning-generated real-space structures and their defect immunity.
Collapse
Affiliation(s)
- Sunkyu Yu
- Photonic Systems Laboratory, Department of Electrical and Computer Engineering, Seoul National University, Seoul, 08826, Korea
- Intelligent Wave Systems Laboratory, Department of Electrical and Computer Engineering, Seoul National University, Seoul, 08826, Korea
| | - Xianji Piao
- Photonic Systems Laboratory, Department of Electrical and Computer Engineering, Seoul National University, Seoul, 08826, Korea
| | - Namkyoo Park
- Photonic Systems Laboratory, Department of Electrical and Computer Engineering, Seoul National University, Seoul, 08826, Korea.
| |
Collapse
|
42
|
Nakamura T. Machine learning as an improved estimator for magnetization curve and spin gap. Sci Rep 2020; 10:14201. [PMID: 32848170 PMCID: PMC7449974 DOI: 10.1038/s41598-020-70389-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Accepted: 07/23/2020] [Indexed: 11/09/2022] Open
Abstract
The magnetization process is a very important probe to study magnetic materials, particularly in search of spin-liquid states in quantum spin systems. Regrettably, however, progress of the theoretical analysis has been unsatisfactory, mostly because it is hard to obtain sufficient numerical data to support the theory. Here we propose a machine-learning algorithm that produces the magnetization curve and the spin gap well out of poor numerical data. The plateau magnetization, its critical field and the critical exponent are estimated accurately. One of the hyperparameters identifies by its score whether the spin gap in the thermodynamic limit is zero or finite. After checking the validity for exactly solvable one-dimensional models we apply our algorithm to the kagome antiferromagnet. The magnetization curve that we obtain from the exact-diagonalization data with 36 spins is consistent with the DMRG results with 132 spins. We estimate the spin gap in the thermodynamic limit at a very small but finite value.
Collapse
Affiliation(s)
- Tota Nakamura
- Faculty of Engineering, Shibaura Institute of Technology, 307 Fukasaku, Minuma, Saitama, 337-8570, Japan.
| |
Collapse
|
43
|
Jung H, Yethiraj A. Phase behavior of continuous-space systems: A supervised machine learning approach. J Chem Phys 2020; 153:064904. [DOI: 10.1063/5.0014194] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Affiliation(s)
- Hyuntae Jung
- Theoretical Chemistry Institute and Department of Chemistry, University of Wisconsin, Madison, Wisconsin 53706, USA
| | - Arun Yethiraj
- Theoretical Chemistry Institute and Department of Chemistry, University of Wisconsin, Madison, Wisconsin 53706, USA
| |
Collapse
|
44
|
Koch EDM, Koch ADM, Kastanos N, Cheng L. Short-sighted deep learning. Phys Rev E 2020; 102:013307. [PMID: 32795065 DOI: 10.1103/physreve.102.013307] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2020] [Accepted: 06/15/2020] [Indexed: 11/07/2022]
Abstract
A theory explaining how deep learning works is yet to be developed. Previous work suggests that deep learning performs a coarse graining, similar in spirit to the renormalization group (RG). This idea has been explored in the setting of a local (nearest-neighbor interactions) Ising spin lattice. We extend the discussion to the setting of a long-range spin lattice. Markov-chain Monte Carlo (MCMC) simulations determine both the critical temperature and scaling dimensions of the system. The model is used to train both a single restricted Boltzmann machine (RBM) network, as well as a stacked RBM network. Following earlier Ising model studies, the trained weights of a single-layer RBM network define a flow of lattice models. In contrast to results for nearest-neighbor Ising, the RBM flow for the long-ranged model does not converge to the correct values for the spin and energy scaling dimension. Further, correlation functions between visible and hidden nodes exhibit key differences between the stacked RBM and RG flows. The stacked RBM flow appears to move toward low temperatures, whereas the RG flow moves toward high temperature. This again differs from results obtained for nearest-neighbor Ising.
Collapse
Affiliation(s)
- Ellen de Mello Koch
- School of Electrical and Information Engineering, University of the Witwatersrand, Wits 2050, South Africa
| | - Anita de Mello Koch
- School of Electrical and Information Engineering, University of the Witwatersrand, Wits 2050, South Africa
| | - Nicholas Kastanos
- School of Electrical and Information Engineering, University of the Witwatersrand, Wits 2050, South Africa
| | - Ling Cheng
- School of Electrical and Information Engineering, University of the Witwatersrand, Wits 2050, South Africa
| |
Collapse
|
45
|
Blücher S, Kades L, Pawlowski JM, Strodthoff N, Urban JM. Towards novel insights in lattice field theory with explainable machine learning. Int J Clin Exp Med 2020. [DOI: 10.1103/physrevd.101.094507] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
|
46
|
Löpez CA, Vesselinov VV, Gnanakaran S, Alexandrov BS. Unsupervised Machine Learning for Analysis of Phase Separation in Ternary Lipid Mixture. J Chem Theory Comput 2019; 15:6343-6357. [PMID: 31476122 DOI: 10.1021/acs.jctc.9b00074] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
47
|
Suhail Y, Cain MP, Vanaja K, Kurywchak PA, Levchenko A, Kalluri R, Kshitiz. Systems Biology of Cancer Metastasis. Cell Syst 2019; 9:109-127. [PMID: 31465728 PMCID: PMC6716621 DOI: 10.1016/j.cels.2019.07.003] [Citation(s) in RCA: 252] [Impact Index Per Article: 42.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2018] [Revised: 04/29/2019] [Accepted: 06/28/2019] [Indexed: 12/12/2022]
Abstract
Cancer metastasis is no longer viewed as a linear cascade of events but rather as a series of concurrent, partially overlapping processes, as successfully metastasizing cells assume new phenotypes while jettisoning older behaviors. The lack of a systemic understanding of this complex phenomenon has limited progress in developing treatments for metastatic disease. Because metastasis has traditionally been investigated in distinct physiological compartments, the integration of these complex and interlinked aspects remains a challenge for both systems-level experimental and computational modeling of metastasis. Here, we present some of the current perspectives on the complexity of cancer metastasis, the multiscale nature of its progression, and a systems-level view of the processes underlying the invasive spread of cancer cells. We also highlight the gaps in our current understanding of cancer metastasis as well as insights emerging from interdisciplinary systems biology approaches to understand this complex phenomenon.
Collapse
Affiliation(s)
- Yasir Suhail
- Department of Biomedical Engineering, University of Connecticut Health Center, Farmington, CT, USA; Cancer Systems Biology @ Yale (CaSB@Yale), Yale University, West Haven, CT, USA
| | - Margo P Cain
- Department of Cancer Biology, MD Anderson Cancer Center, Houston, TX, USA
| | - Kiran Vanaja
- Cancer Systems Biology @ Yale (CaSB@Yale), Yale University, West Haven, CT, USA
| | - Paul A Kurywchak
- Department of Cancer Biology, MD Anderson Cancer Center, Houston, TX, USA
| | - Andre Levchenko
- Cancer Systems Biology @ Yale (CaSB@Yale), Yale University, West Haven, CT, USA
| | - Raghu Kalluri
- Department of Cancer Biology, MD Anderson Cancer Center, Houston, TX, USA
| | - Kshitiz
- Department of Biomedical Engineering, University of Connecticut Health Center, Farmington, CT, USA; Cancer Systems Biology @ Yale (CaSB@Yale), Yale University, West Haven, CT, USA.
| |
Collapse
|
48
|
Hu W, Zhang Y, Li L. Study of the Application of Deep Convolutional Neural Networks (CNNs) in Processing Sensor Data and Biomedical Images. SENSORS 2019; 19:s19163584. [PMID: 31426516 PMCID: PMC6718995 DOI: 10.3390/s19163584] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/19/2019] [Revised: 08/09/2019] [Accepted: 08/15/2019] [Indexed: 12/01/2022]
Abstract
The fast progress in research and development of multifunctional, distributed sensor networks has brought challenges in processing data from a large number of sensors. Using deep learning methods such as convolutional neural networks (CNN), it is possible to build smarter systems to forecasting future situations as well as precisely classify large amounts of data from sensors. Multi-sensor data from atmospheric pollutants measurements that involves five criteria, with the underlying analytic model unknown, need to be categorized, so do the Diabetic Retinopathy (DR) fundus images dataset. In this work, we created automatic classifiers based on a deep convolutional neural network (CNN) with two models, a simpler feedforward model with dual modules and an Inception Resnet v2 model, and various structural tweaks for classifying the data from the two tasks. For segregating multi-sensor data, we trained a deep CNN-based classifier on an image dataset extracted from the data by a novel image generating method. We created two deepened and one reductive feedforward network for DR phase classification. The validation accuracies and visualization results show that increasing deep CNN structure depth or kernels number in convolutional layers will not indefinitely improve the classification quality and that a more sophisticated model does not necessarily achieve higher performance when training datasets are quantitatively limited, while increasing training image resolution can induce higher classification accuracies for trained CNNs. The methodology aims at providing support for devising classification networks powering intelligent sensors.
Collapse
Affiliation(s)
- Weijun Hu
- College of Engineering, Swansea University, Bay Campus, Swansea SA1 8EN, UK.
| | - Yan Zhang
- School of Physics, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Lijie Li
- College of Engineering, Swansea University, Bay Campus, Swansea SA1 8EN, UK.
| |
Collapse
|
49
|
Zhang Y, Mesaros A, Fujita K, Edkins SD, Hamidian MH, Ch'ng K, Eisaki H, Uchida S, Davis JCS, Khatami E, Kim EA. Machine learning in electronic-quantum-matter imaging experiments. Nature 2019; 570:484-490. [PMID: 31217587 DOI: 10.1038/s41586-019-1319-8] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2018] [Accepted: 04/08/2019] [Indexed: 11/09/2022]
Abstract
For centuries, the scientific discovery process has been based on systematic human observation and analysis of natural phenomena1. Today, however, automated instrumentation and large-scale data acquisition are generating datasets of such large volume and complexity as to defy conventional scientific methodology. Radically different scientific approaches are needed, and machine learning (ML) shows great promise for research fields such as materials science2-5. Given the success of ML in the analysis of synthetic data representing electronic quantum matter (EQM)6-16, the next challenge is to apply this approach to experimental data-for example, to the arrays of complex electronic-structure images17 obtained from atomic-scale visualization of EQM. Here we report the development and training of a suite of artificial neural networks (ANNs) designed to recognize different types of order hidden in such EQM image arrays. These ANNs are used to analyse an archive of experimentally derived EQM image arrays from carrier-doped copper oxide Mott insulators. In these noisy and complex data, the ANNs discover the existence of a lattice-commensurate, four-unit-cell periodic, translational-symmetry-breaking EQM state. Further, the ANNs determine that this state is unidirectional, revealing a coincident nematic EQM state. Strong-coupling theories of electronic liquid crystals18,19 are consistent with these observations.
Collapse
Affiliation(s)
- Yi Zhang
- Department of Physics, Cornell University, Ithaca, NY, USA
| | - A Mesaros
- Department of Physics, Cornell University, Ithaca, NY, USA.,Laboratoire de Physique des Solides, Université Paris-Sud, CNRS, Orsay, France
| | - K Fujita
- Condensed Matter Physics and Materials Science Department, Brookhaven National Laboratory, Upton, NY, USA
| | - S D Edkins
- Department of Physics, Cornell University, Ithaca, NY, USA.,Department of Applied Physics, Stanford University, Stanford, CA, USA
| | - M H Hamidian
- Department of Physics, Cornell University, Ithaca, NY, USA.,Department of Physics, Harvard University, Cambridge, MA, USA
| | - K Ch'ng
- Department of Physics and Astronomy, San Jose State University, San Jose, CA, USA
| | - H Eisaki
- National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan
| | - S Uchida
- National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan.,Department of Physics, University of Tokyo, Tokyo, Japan
| | - J C Séamus Davis
- Department of Physics, Cornell University, Ithaca, NY, USA.,Condensed Matter Physics and Materials Science Department, Brookhaven National Laboratory, Upton, NY, USA.,Department of Physics, University College Cork, Cork, Ireland.,Clarendon Laboratory, University of Oxford, Oxford, UK
| | - Ehsan Khatami
- Department of Physics and Astronomy, San Jose State University, San Jose, CA, USA
| | - Eun-Ah Kim
- Department of Physics, Cornell University, Ithaca, NY, USA.
| |
Collapse
|
50
|
Walters M, Wei Q, Chen JZY. Machine learning topological defects of confined liquid crystals in two dimensions. Phys Rev E 2019; 99:062701. [PMID: 31330643 DOI: 10.1103/physreve.99.062701] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2019] [Indexed: 04/20/2023]
Abstract
Supervised machine learning can be used to classify images with spatially correlated physical features. We demonstrate the concept by using the coordinate files generated from an off-lattice computer simulation of rodlike molecules confined in a square box as an example. Because of the geometric frustrations at high number density, the nematic director field develops an inhomogeneous pattern containing various topological defects as the main physical feature. We describe two machine-learning procedures that can be used to effectively capture the correlation between the defect positions and the nematic directors around them and hence classify the topological defects. First is a feedforward neural network, which requires the aid of presorting the off-lattice simulation data in a coarse-grained fashion. Second is a recurrent neural network, which needs no such sorting and can be directly used for finding spatial correlations. The issues of when to presort a simulation data file and how the network structures affect such a decision are addressed.
Collapse
Affiliation(s)
- Michael Walters
- Department of Physics and Astronomy, University of Waterloo, Waterloo, Ontario, Canada, N2L 3G1
| | - Qianshi Wei
- Department of Physics and Astronomy, University of Waterloo, Waterloo, Ontario, Canada, N2L 3G1
| | - Jeff Z Y Chen
- Department of Physics and Astronomy, University of Waterloo, Waterloo, Ontario, Canada, N2L 3G1
| |
Collapse
|