1. Yu H, Zhao X, Dong D, Chen C. Hamiltonian Identification via Quantum Ensemble Classification. IEEE Transactions on Neural Networks and Learning Systems 2024; 35:11261-11275. [PMID: 37030784] [DOI: 10.1109/tnnls.2023.3258622]
Abstract
Identifying the Hamiltonian of an unknown quantum system is a critical task in quantum information. In this article, we propose a systematic Hamiltonian identification approach via quantum ensemble multiclass classification (HI-QEMC). The approach is implemented as a three-step iterative refinement process: parameter interval guess, verification, and judgment. In the guess step, the parameter interval is divided into several sub-intervals and the true Hamiltonian parameter is guessed to lie in one of them. In the verification step, cross verification is applied to check the accuracy of the guess. In the judgment step, an adaptive interval judgment (AIJ) algorithm determines the sub-interval containing the true Hamiltonian parameter. Numerical results on two typical quantum systems, i.e., two-level and three-level quantum systems, demonstrate the effectiveness and superior performance of the proposed approach for quantum Hamiltonian identification.
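The guess/verify/judge loop amounts to an adaptive interval search. A minimal sketch, with a hypothetical classifier oracle standing in for the quantum ensemble classifier (the oracle, sub-interval count, and tolerance are our illustrative assumptions, not the paper's implementation):

```python
# Sketch of the interval-refinement loop: "classify" plays the role of the
# quantum ensemble classifier, voting for the sub-interval most consistent
# with the measurement data. Here it is simulated with the true parameter.

def refine_interval(lo, hi, classify, n_sub=4, tol=1e-6, max_iter=100):
    """Shrink [lo, hi] around the parameter the classifier keeps selecting."""
    for _ in range(max_iter):
        if hi - lo < tol:
            break
        width = (hi - lo) / n_sub
        # Guess: split into sub-intervals; Judge: the classifier picks one.
        subs = [(lo + i * width, lo + (i + 1) * width) for i in range(n_sub)]
        k = classify(subs)
        lo, hi = subs[k]
    return lo, hi

# Toy stand-in oracle: picks the sub-interval whose midpoint is nearest
# the (normally unknown) true parameter.
true_param = 0.7312
oracle = lambda subs: min(range(len(subs)),
                          key=lambda i: abs((subs[i][0] + subs[i][1]) / 2 - true_param))
lo, hi = refine_interval(0.0, 1.0, oracle)
```

With four sub-intervals the interval shrinks by a factor of four per iteration, so ten iterations already localize the parameter below the 1e-6 tolerance.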
2. Reuer K, Landgraf J, Fösel T, O'Sullivan J, Beltrán L, Akin A, Norris GJ, Remm A, Kerschbaum M, Besse JC, Marquardt F, Wallraff A, Eichler C. Realizing a deep reinforcement learning agent for real-time quantum feedback. Nat Commun 2023; 14:7138. [PMID: 37932251] [PMCID: PMC10628214] [DOI: 10.1038/s41467-023-42901-3]
Abstract
Realizing the full potential of quantum technologies requires precise real-time control on time scales much shorter than the coherence time. Model-free reinforcement learning promises to discover efficient feedback strategies from scratch without relying on a description of the quantum system. However, developing and training a reinforcement learning agent able to operate in real-time using feedback has been an open challenge. Here, we have implemented such an agent for a single qubit as a sub-microsecond-latency neural network on a field-programmable gate array (FPGA). We demonstrate its use to efficiently initialize a superconducting qubit and train the agent based solely on measurements. Our work is a first step towards the adoption of reinforcement learning for the control of quantum devices and, more generally, of any physical device requiring low-latency feedback.
Affiliation(s)
- Kevin Reuer
  - Department of Physics, ETH Zurich, CH-8093, Zurich, Switzerland
  - Quantum Center, ETH Zurich, CH-8093, Zurich, Switzerland
- Jonas Landgraf
  - Max Planck Institute for the Science of Light, Staudtstraße 2, 91058, Erlangen, Germany
  - Physics Department, University of Erlangen-Nuremberg, Staudtstraße 5, 91058, Erlangen, Germany
- Thomas Fösel
  - Max Planck Institute for the Science of Light, Staudtstraße 2, 91058, Erlangen, Germany
  - Physics Department, University of Erlangen-Nuremberg, Staudtstraße 5, 91058, Erlangen, Germany
- James O'Sullivan
  - Department of Physics, ETH Zurich, CH-8093, Zurich, Switzerland
  - Quantum Center, ETH Zurich, CH-8093, Zurich, Switzerland
- Liberto Beltrán
  - Department of Physics, ETH Zurich, CH-8093, Zurich, Switzerland
  - Quantum Center, ETH Zurich, CH-8093, Zurich, Switzerland
- Abdulkadir Akin
  - Department of Physics, ETH Zurich, CH-8093, Zurich, Switzerland
  - Quantum Center, ETH Zurich, CH-8093, Zurich, Switzerland
- Graham J Norris
  - Department of Physics, ETH Zurich, CH-8093, Zurich, Switzerland
  - Quantum Center, ETH Zurich, CH-8093, Zurich, Switzerland
- Ants Remm
  - Department of Physics, ETH Zurich, CH-8093, Zurich, Switzerland
  - Quantum Center, ETH Zurich, CH-8093, Zurich, Switzerland
- Michael Kerschbaum
  - Department of Physics, ETH Zurich, CH-8093, Zurich, Switzerland
  - Quantum Center, ETH Zurich, CH-8093, Zurich, Switzerland
- Jean-Claude Besse
  - Department of Physics, ETH Zurich, CH-8093, Zurich, Switzerland
  - Quantum Center, ETH Zurich, CH-8093, Zurich, Switzerland
- Florian Marquardt
  - Max Planck Institute for the Science of Light, Staudtstraße 2, 91058, Erlangen, Germany
  - Physics Department, University of Erlangen-Nuremberg, Staudtstraße 5, 91058, Erlangen, Germany
- Andreas Wallraff
  - Department of Physics, ETH Zurich, CH-8093, Zurich, Switzerland
  - Quantum Center, ETH Zurich, CH-8093, Zurich, Switzerland
- Christopher Eichler
  - Department of Physics, ETH Zurich, CH-8093, Zurich, Switzerland
  - Physics Department, University of Erlangen-Nuremberg, Staudtstraße 5, 91058, Erlangen, Germany
3. Ma H, Dong D, Ding SX, Chen C. Curriculum-Based Deep Reinforcement Learning for Quantum Control. IEEE Transactions on Neural Networks and Learning Systems 2023; 34:8852-8865. [PMID: 35263262] [DOI: 10.1109/tnnls.2022.3153502]
Abstract
Deep reinforcement learning (DRL) has been recognized as an efficient technique for designing optimal strategies for complex systems without prior knowledge of the control landscape. To achieve fast and precise control of quantum systems, we propose a novel DRL approach that constructs a curriculum consisting of a set of intermediate tasks defined by fidelity thresholds, where the tasks in a curriculum can be statically determined before the learning process or dynamically generated during it. By transferring knowledge between successive tasks and sequencing tasks according to their difficulty, the proposed curriculum-based DRL (CDRL) method enables the agent to focus on easy tasks in the early stage, then move on to difficult tasks, and eventually approach the final task. Numerical comparison with traditional methods [the gradient method (GD), the genetic algorithm (GA), and several other DRL methods] demonstrates that CDRL achieves improved control performance for quantum systems and also provides an efficient way to identify optimal strategies with few control pulses.
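The static-curriculum idea can be sketched as a schedule of increasing fidelity thresholds plus knowledge transfer between tasks. The threshold schedule and the `train_task` interface below are our illustrative assumptions, not the paper's implementation:

```python
# Hypothetical sketch of a fidelity-threshold curriculum: the agent trains
# against easier intermediate fidelity targets before attempting the final one.

def make_static_curriculum(final_fidelity=0.999, n_tasks=4):
    """Intermediate fidelity thresholds, easiest first.

    Interpolates geometrically in the infidelity 1 - F, so early tasks
    are much easier than the final one.
    """
    infid = 1.0 - final_fidelity
    return [1.0 - infid ** ((i + 1) / n_tasks) for i in range(n_tasks)]

def train_with_curriculum(train_task, thresholds):
    """Solve each intermediate task, transferring the policy forward."""
    policy = None
    for f_min in thresholds:
        # Each stage warm-starts from the previous stage's policy.
        policy = train_task(f_min, warm_start=policy)
    return policy
```

For a 0.999 final target this yields thresholds near 0.82, 0.97, 0.994, 0.999, so the agent always faces a reachable next goal.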
4. Konar D, Bhattacharyya S, Panigrahi BK, Behrman EC. Qutrit-Inspired Fully Self-Supervised Shallow Quantum Learning Network for Brain Tumor Segmentation. IEEE Transactions on Neural Networks and Learning Systems 2022; 33:6331-6345. [PMID: 33983887] [DOI: 10.1109/tnnls.2021.3077188]
Abstract
Classical self-supervised networks suffer from convergence problems and reduced segmentation accuracy due to forceful termination. Quantum neural network models are often described by qubits, or bi-level quantum bits. In this article, a novel self-supervised shallow learning network exploiting a sophisticated three-level, qutrit-inspired quantum information system, referred to as the quantum fully self-supervised neural network (QFS-Net), is presented for automated segmentation of brain magnetic resonance (MR) images. The QFS-Net model comprises three layers of qutrits interconnected through parametric Hadamard gates using an eight-connected second-order neighborhood-based topology. The nonlinear transformation of the qutrit states allows the underlying quantum neural network to encode the quantum states, enabling faster self-organized counter-propagation of these states between the layers without supervision. The proposed QFS-Net model is extensively validated on The Cancer Imaging Archive (TCIA) dataset collected from the Nature repository. The experimental results are compared with state-of-the-art supervised models (U-Net and URes-Net architectures) and with the self-supervised QIS-Net model and its classical counterpart. The results show promising segmentation outcomes for tumor detection in terms of dice similarity and accuracy, with minimal human intervention and computational resources. QFS-Net is also evaluated on natural gray-scale images from the Berkeley segmentation dataset, where it likewise yields promising segmentation outcomes, demonstrating its robustness.
5. Gao Y, Wang X, Yu N, Wong BM. Harnessing deep reinforcement learning to construct time-dependent optimal fields for quantum control dynamics. Phys Chem Chem Phys 2022; 24:24012-24020. [PMID: 36128792] [DOI: 10.1039/d2cp02495k]
Abstract
We present an efficient deep reinforcement learning (DRL) approach to automatically construct time-dependent optimal control fields that enable desired transitions in dynamical chemical systems. Our DRL approach gives impressive performance in constructing optimal control fields, even for cases that are difficult to converge with existing gradient-based approaches. We provide a detailed description of the algorithms and hyperparameters as well as performance metrics for our DRL-based approach. Our results demonstrate that DRL can be employed as an effective artificial intelligence approach to efficiently and autonomously design control fields in quantum dynamical chemical systems.
Affiliation(s)
- Yuanqi Gao
  - Department of Electrical and Computer Engineering, University of California-Riverside, Riverside, CA, USA
- Xian Wang
  - Department of Physics and Astronomy, University of California-Riverside, Riverside, CA, USA
- Nanpeng Yu
  - Department of Electrical and Computer Engineering, University of California-Riverside, Riverside, CA, USA
- Bryan M Wong
  - Department of Chemical and Environmental Engineering, Materials Science and Engineering Program, Department of Chemistry, and Department of Physics and Astronomy, University of California-Riverside, Riverside, CA, USA
6. Wei Q, Ma H, Chen C, Dong D. Deep Reinforcement Learning With Quantum-Inspired Experience Replay. IEEE Transactions on Cybernetics 2022; 52:9326-9338. [PMID: 33600343] [DOI: 10.1109/tcyb.2021.3053414]
Abstract
In this article, a novel training paradigm inspired by quantum computation is proposed for deep reinforcement learning (DRL) with experience replay. In contrast to the traditional experience replay mechanism in DRL, the proposed DRL with quantum-inspired experience replay (DRL-QER) adaptively chooses experiences from the replay buffer according to the complexity and the number of replays of each experience (also called a transition), to achieve a balance between exploration and exploitation. In DRL-QER, transitions are first formulated in quantum representations, and then a preparation operation and a depreciation operation are performed on them. The preparation operation reflects the relationship between the temporal-difference errors (TD-errors) and the importance of the experiences, while the depreciation operation ensures the diversity of the transitions. Experimental results on Atari 2600 games show that DRL-QER outperforms state-of-the-art algorithms such as DRL-PER and DCRL on most of these games with improved training efficiency, and that it is also applicable to memory-based DRL approaches such as double-network and dueling-network architectures.
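A simplified rendering of the sampling mechanism, with our own stand-ins for the preparation and depreciation operations (the amplitude formula and decay factor are illustrative, not the paper's operators): each transition carries an amplitude that grows with its TD-error and shrinks each time it is replayed, and sampling follows the squared amplitudes, Born-rule style.

```python
import math
import random

class QERBuffer:
    """Replay buffer where sampling probability follows a squared 'amplitude'."""

    def __init__(self, decay=0.9):
        self.data = []      # entries: [transition, |TD-error|, replay count]
        self.decay = decay

    def add(self, transition, td_error):
        self.data.append([transition, abs(td_error), 0])

    def _amplitude(self, td, replays):
        # Preparation: amplitude grows with |TD-error| (importance).
        # Depreciation: each replay shrinks it, keeping the buffer diverse.
        return math.sqrt(td + 1e-3) * self.decay ** replays

    def sample(self):
        weights = [self._amplitude(td, n) ** 2 for _, td, n in self.data]
        i = random.choices(range(len(self.data)), weights=weights)[0]
        self.data[i][2] += 1    # record the replay for depreciation
        return self.data[i][0]
```

High-TD transitions dominate early sampling, but depreciation steadily hands probability mass back to rarely replayed ones.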
7. A quantum system control method based on enhanced reinforcement learning. Soft Comput 2022. [DOI: 10.1007/s00500-022-07179-5]
8. Konar D, Bhattacharyya S, Dey S, Panigrahi BK. Optimized activation for quantum-inspired self-supervised neural network based fully automated brain lesion segmentation. Appl Intell 2022. [DOI: 10.1007/s10489-021-03108-5]
9. Optimizing quantum annealing schedules with Monte Carlo tree search enhanced with neural networks. Nat Mach Intell 2022. [DOI: 10.1038/s42256-022-00446-y]
10. Bolens A, Heyl M. Reinforcement Learning for Digital Quantum Simulation. Physical Review Letters 2021; 127:110502. [PMID: 34558930] [DOI: 10.1103/physrevlett.127.110502]
Abstract
Digital quantum simulation on quantum computers provides the potential to simulate the unitary evolution of any many-body Hamiltonian with bounded spectrum by discretizing the time evolution operator through a sequence of elementary quantum gates. A fundamental challenge in this context originates from experimental imperfections, which critically limits the number of attainable gates within a reasonable accuracy and therefore the achievable system sizes and simulation times. In this work, we introduce a reinforcement learning algorithm to systematically build optimized quantum circuits for digital quantum simulation upon imposing a strong constraint on the number of quantum gates. With this we consistently obtain quantum circuits that reproduce physical observables with as little as three entangling gates for long times and large system sizes up to 16 qubits. As concrete examples we apply our formalism to a long-range Ising chain and the lattice Schwinger model. Our method demonstrates that digital quantum simulation on noisy intermediate scale quantum devices can be pushed to much larger scale within the current experimental technology by a suitable engineering of quantum circuits using reinforcement learning.
Affiliation(s)
- Adrien Bolens
  - Max-Planck-Institut für Physik komplexer Systeme, Nöthnitzer Straße 38, 01187 Dresden, Germany
- Markus Heyl
  - Max-Planck-Institut für Physik komplexer Systeme, Nöthnitzer Straße 38, 01187 Dresden, Germany
11. Haug T, Mok WK, You JB, Zhang W, Eng Png C, Kwek LC. Classifying global state preparation via deep reinforcement learning. Machine Learning: Science and Technology 2021. [DOI: 10.1088/2632-2153/abc81f]
Abstract
Quantum information processing often requires the preparation of arbitrary quantum states, such as all the states on the Bloch sphere for two-level systems. While numerical optimization can prepare individual target states, it lacks the ability to find general control protocols that can generate many different target states. Here, we demonstrate global quantum control by preparing a continuous set of states with deep reinforcement learning. The protocols are represented by neural networks, which automatically group them into similar types; this could be useful for finding classes of protocols and extracting physical insights. As an application, we generate arbitrary superposition states for the electron spin in complex multi-level nitrogen-vacancy centers, revealing classes of protocols characterized by specific preparation timescales. Our method could help improve the control of near-term quantum computers, quantum sensing devices, and quantum simulations.
12. Konar D, Bhattacharyya S, Gandhi TK, Panigrahi BK. A Quantum-Inspired Self-Supervised Network model for automatic segmentation of brain MR images. Appl Soft Comput 2020. [DOI: 10.1016/j.asoc.2020.106348]
13. Wang Z, Li HX, Chen C. Reinforcement Learning-Based Optimal Sensor Placement for Spatiotemporal Modeling. IEEE Transactions on Cybernetics 2020; 50:2861-2871. [PMID: 30892267] [DOI: 10.1109/tcyb.2019.2901897]
Abstract
A reinforcement learning-based method is proposed for optimal sensor placement in the spatial domain for modeling distributed parameter systems (DPSs). First, a low-dimensional subspace, derived by Karhunen-Loève decomposition, is identified to capture the dominant dynamic features of the DPS. Second, a spatial objective function is proposed for the sensor placement. This function is defined in the obtained low-dimensional subspace by exploiting the time-space separation property of distributed processes, and in turn aims at minimizing the modeling error over the entire time and space domain. Third, the sensor placement configuration is mathematically formulated as a Markov decision process (MDP) with specified elements. Finally, the sensor locations are optimized by learning the optimal policies of the MDP according to the spatial objective function. Experimental results on a simulated catalytic rod and a real snap curing oven system demonstrate the feasibility and efficiency of the proposed method in solving combinatorial optimization problems such as optimal sensor placement.
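The placement-as-decision-process framing can be illustrated with a hypothetical rendering (our simplification, not the paper's MDP or learned policy): the state is the set of chosen locations, an action adds one candidate location, and the reward is the drop in a user-supplied modeling-error functional. A greedy one-step-lookahead policy then reads:

```python
# Illustrative sensor placement as sequential decisions; "model_error" is an
# assumed black-box functional standing in for the paper's spatial objective.

def place_sensors(candidates, n_sensors, model_error):
    """Greedy one-step-lookahead policy over the placement decision process."""
    chosen = []
    for _ in range(n_sensors):
        remaining = [c for c in candidates if c not in chosen]
        # Reward of action c = error(chosen) - error(chosen + [c]).
        best = max(remaining,
                   key=lambda c: model_error(chosen) - model_error(chosen + [c]))
        chosen.append(best)
    return chosen

# Toy error functional: how far target points on a line sit from their
# nearest chosen sensor (1.0 penalty when no sensor is placed yet).
targets = [0.1, 0.5, 0.9]
err = lambda S: sum(min((abs(t - s) for s in S), default=1.0) for t in targets)
placed = place_sensors([i / 10 for i in range(11)], 2, err)
```

A learned policy would replace the greedy choice, but the state, action, and reward structure is the same.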
14. Wang Z, Li HX, Chen C. Incremental Reinforcement Learning in Continuous Spaces via Policy Relaxation and Importance Weighting. IEEE Transactions on Neural Networks and Learning Systems 2020; 31:1870-1883. [PMID: 31395556] [DOI: 10.1109/tnnls.2019.2927320]
Abstract
In this paper, a systematic incremental learning method is presented for reinforcement learning in continuous spaces where the learning environment is dynamic. The goal is to incrementally adjust the previously learned policy whenever the environment changes. To improve adaptability to an ever-changing environment, we propose a two-step solution incorporated into the incremental learning procedure: policy relaxation and importance weighting. First, the behavior policy is relaxed to a random one in the initial learning episodes to encourage proper exploration in the new environment; this alleviates the conflict between new information and existing knowledge for better long-term adaptation. Second, we observe that episodes receiving higher returns are more in line with the new environment and hence contain more new information. During parameter updating, we therefore assign higher importance weights to episodes that contain more new information, encouraging the previous optimal policy to adapt faster to one that fits the new environment. Empirical studies on continuous control tasks with varying configurations verify that the proposed method adapts significantly faster to various dynamic environments than the baselines.
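The importance-weighting step can be sketched as follows; the softmax-over-returns weighting and scalar-parameter update are our illustration of the idea, not the paper's exact scheme:

```python
import math

def episode_weights(returns, temperature=1.0):
    """Softmax weights: higher-return episodes (more in line with the new
    environment) contribute more to the update."""
    m = max(returns)                      # shift for numerical stability
    exps = [math.exp((r - m) / temperature) for r in returns]
    z = sum(exps)
    return [e / z for e in exps]

def weighted_update(theta, gradients, returns, lr=0.1):
    """Gradient step where each episode's gradient is importance-weighted."""
    ws = episode_weights(returns)
    return theta + lr * sum(w * g for w, g in zip(ws, gradients))
```

Episodes with equal returns receive equal weight, and the temperature controls how sharply high-return episodes dominate the update.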
15. Al-Dabooni S, Wunsch DC. Online Model-Free n-Step HDP With Stability Analysis. IEEE Transactions on Neural Networks and Learning Systems 2020; 31:1255-1269. [PMID: 31251198] [DOI: 10.1109/tnnls.2019.2919614]
Abstract
Building on the powerful temporal-difference learning method with trace-decay parameter λ, TD(λ), this paper presents a novel n-step adaptive dynamic programming (ADP) architecture that combines TD(λ) with regular TD learning to solve optimal control problems in fewer iterations. In contrast with the backward-view learning of TD(λ), which requires an extra eligibility-trace parameter updated at the end of each episode (offline training), the new design uses forward-view learning, updated at each time step (online training), without the eligibility-trace parameter, and applies to various problems without mathematical models. The new design is therefore called the online model-free n-step action-dependent (AD) heuristic dynamic programming architecture, NSHDP(λ). NSHDP(λ) has three neural networks: a critic network (CN) with regular one-step TD [TD(0)], a CN with n-step TD [or TD(λ)] learning, and an actor network (AN). Because forward-view learning does not require eligibility traces associated with each state, the NSHDP(λ) architecture has low computational cost and is memory efficient. Furthermore, stability is proven for NSHDP(λ) under certain conditions using Lyapunov analysis to obtain the uniformly ultimately bounded (UUB) property. We compare the results with the performance of HDP and traditional action-dependent HDP(λ) [ADHDP(λ)] for different λ values. A complex nonlinear system and a 2-D maze problem are the two simulation benchmarks in this paper; a third, an inverted-pendulum benchmark, is presented in the supplementary material. NSHDP(λ) performance is examined and compared with other ADP methods.
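The forward-view target that removes the eligibility-trace bookkeeping can be written as a λ-weighted mixture of n-step returns. The sketch below is a generic textbook form of that target, not the paper's full dual-critic NSHDP(λ) architecture:

```python
# Forward-view n-step and λ-returns. "values" holds V(s_0..s_T), one entry
# more than "rewards"; gamma is the discount and lam the trace-decay weight.

def n_step_return(rewards, values, t, n, gamma):
    """G_{t:t+n} = r_t + γ r_{t+1} + ... + γ^n V(s_{t+n})."""
    horizon = min(n, len(rewards) - t)
    g = sum(gamma ** k * rewards[t + k] for k in range(horizon))
    if t + horizon < len(values):
        g += gamma ** horizon * values[t + horizon]   # bootstrapped tail
    return g

def lambda_return(rewards, values, t, gamma, lam, n_max):
    """Forward-view λ-return: (1-λ) Σ_{n<n_max} λ^{n-1} G_{t:t+n},
    with the remaining weight λ^{n_max-1} on the longest return."""
    total = sum((1 - lam) * lam ** (n - 1) * n_step_return(rewards, values, t, n, gamma)
                for n in range(1, n_max))
    total += lam ** (n_max - 1) * n_step_return(rewards, values, t, n_max, gamma)
    return total
```

Setting λ = 0 recovers the one-step TD(0) target and λ = 1 the full n_max-step return, which is the mixture both critic networks draw on.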
16. Al-Dabooni S, Wunsch DC. An Improved N-Step Value Gradient Learning Adaptive Dynamic Programming Algorithm for Online Learning. IEEE Transactions on Neural Networks and Learning Systems 2020; 31:1155-1169. [PMID: 31247567] [DOI: 10.1109/tnnls.2019.2919338]
Abstract
In problems with complex dynamics and challenging state spaces, the dual heuristic programming (DHP) algorithm has been shown theoretically and experimentally to perform well. It was recently extended by an approach called value gradient learning (VGL), inspired by a version of temporal-difference (TD) learning that uses eligibility traces. Eligibility traces apply an exponential decay to older observations with a decay parameter λ. This approach is known as TD(λ), and its DHP extension is known as VGL(λ), where VGL(0) is identical to DHP. VGL has demonstrated convergence and other desirable properties, but it is primarily useful for batch learning. Online learning requires an eligibility-trace workspace matrix, which the batch version of VGL does not. Since online learning is desirable for many applications, it is important to remove this computational and memory impediment. This paper introduces a dual-critic version of VGL, called n-step VGL (NSVGL), that does not need the eligibility-trace workspace matrix, thereby allowing online learning; moreover, the combination of critic networks allows the NSVGL algorithm to learn faster. The first critic is similar to DHP and is adapted based on TD(0) learning, while the second critic is adapted based on a gradient of n-step TD(λ) learning. Both networks are combined to train an actor network. The combination of feedback signals from both critic networks reaches an optimal decision faster than traditional adaptive dynamic programming (ADP) by mixing current information with event history. Convergence proofs are provided: gradients of one- and n-step value functions are monotonically nondecreasing and converge to the optimum. Two simulation case studies demonstrate the superior performance of NSVGL.
17. Sommer C, Asjad M, Genes C. Prospects of reinforcement learning for the simultaneous damping of many mechanical modes. Sci Rep 2020; 10:2623. [PMID: 32060483] [PMCID: PMC7021687] [DOI: 10.1038/s41598-020-59435-z]
Abstract
We apply adaptive feedback for the partial refrigeration of a mechanical resonator, i.e., with the aim of simultaneously cooling the classical thermal motion of more than one vibrational degree of freedom. The feedback is obtained from a neural-network-parametrized policy trained via a reinforcement learning strategy to choose the correct sequence of actions from a finite set in order to simultaneously reduce the energy of many modes of vibration. The actions are realized either as optical modulations of the spring constants in the so-called quadratic optomechanical coupling regime or as radiation-pressure-induced momentum kicks in the linear coupling regime. As a proof of principle, we numerically illustrate efficient simultaneous cooling of four independent modes with an overall strong reduction of the total system temperature.
Affiliation(s)
- Christian Sommer
  - Max Planck Institute for the Science of Light, Staudtstraße 2, D-91058, Erlangen, Germany
- Muhammad Asjad
  - Max Planck Institute for the Science of Light, Staudtstraße 2, D-91058, Erlangen, Germany
- Claudiu Genes
  - Max Planck Institute for the Science of Light, Staudtstraße 2, D-91058, Erlangen, Germany
  - Department of Physics, University of Erlangen-Nuremberg, Staudtstraße 2, D-91058, Erlangen, Germany
18. Fu Q, Yang Z, Lu Y, Wu H, Hu F, Chen J. Variational Bayesian Exploration-Based Active Sarsa Algorithm. Int J Pattern Recogn 2019. [DOI: 10.1142/s0218001419510054]
Abstract
We propose an improved variational Bayesian exploration-based active Sarsa (VBE-ASAR) algorithm that balances the exploration-exploitation dilemma and accelerates convergence. First, during learning, a variational Bayesian method measures the information gain, which is used as an exploration factor to construct an internal reward function for heuristic exploration. Second, before learning, transfer learning is used to initialize the value function in order to improve exploration performance, where a bisimulation metric measures the distance between states from the source and target MDPs. Finally, we apply the proposed algorithm to the cliff-walking problem and compare it with the Sarsa, Q-learning, VFT-Sarsa, and Bayesian Sarsa (BS) algorithms. Experimental results show that VBE-ASAR learns faster.
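The internal-reward construction can be sketched as Sarsa with an intrinsic exploration bonus. The count-based bonus below is our simple surrogate for the variational-Bayesian information gain, which likewise shrinks as a state-action pair becomes familiar:

```python
import math
from collections import defaultdict

class BonusSarsa:
    """Tabular Sarsa whose target adds an intrinsic exploration bonus."""

    def __init__(self, alpha=0.5, gamma=0.9, beta=1.0):
        self.Q = defaultdict(float)       # action values Q(s, a)
        self.counts = defaultdict(int)    # visit counts per (s, a)
        self.alpha, self.gamma, self.beta = alpha, gamma, beta

    def intrinsic(self, s, a):
        # Surrogate "information gain": decays as (s, a) is visited more.
        self.counts[(s, a)] += 1
        return self.beta / math.sqrt(self.counts[(s, a)])

    def update(self, s, a, r, s2, a2):
        # Internal reward = external reward + exploration bonus.
        target = r + self.intrinsic(s, a) + self.gamma * self.Q[(s2, a2)]
        self.Q[(s, a)] += self.alpha * (target - self.Q[(s, a)])
```

Early on the bonus dominates and pulls the agent toward rarely tried actions; as counts grow the update reduces to plain Sarsa on the external reward.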
Affiliation(s)
- Qiming Fu, Zhengxia Yang, You Lu, Hongjie Wu, Fuyuan Hu, Jianping Chen (all authors)
  - Institute of Electronics and Information Engineering, Suzhou University of Science and Technology, Suzhou, Jiangsu 215009, P. R. China
  - Jiangsu Key Laboratory of Intelligent Building Energy Efficiency, Suzhou University of Science and Technology, Suzhou, Jiangsu 215009, P. R. China
  - Suzhou Key Laboratory of Mobile Networking and Applied Technologies, Suzhou University of Science and Technology, Suzhou, Jiangsu 215009, P. R. China
19. Al-Dabooni S, Wunsch D. The Boundedness Conditions for Model-Free HDP(λ). IEEE Transactions on Neural Networks and Learning Systems 2019; 30:1928-1942. [PMID: 30418923] [DOI: 10.1109/tnnls.2018.2875870]
Abstract
This paper provides a stability analysis for a model-free action-dependent heuristic dynamic programming (HDP) approach with an eligibility-trace long-term prediction parameter λ. HDP(λ) learns from more than one future reward. Eligibility traces have long been popular in Q-learning; this paper proves and demonstrates that they are also worthwhile with HDP. We prove the uniformly ultimately bounded (UUB) property under certain conditions. Previous work presents a UUB proof for traditional HDP [HDP(λ = 0)]; we extend the proof to nonzero λ. Using Lyapunov stability, we demonstrate the boundedness of the estimation errors of the critic and actor neural networks as well as of the learning-rate parameters. Three case studies demonstrate the effectiveness of HDP(λ). The first considers the trajectories of a nonlinear system with an internal reinforcement signal; we compare the results with the performance of HDP and traditional temporal difference [TD(λ)] for different λ values. The second is a single-link inverted pendulum, where we compare HDP(λ) with regular HDP at different noise levels. The third is a 3-D maze-navigation benchmark, compared with state-action-reward-state-action (SARSA), Q(λ), HDP, and HDP(λ). All simulation results illustrate that HDP(λ) performs competitively; the contribution is thus not only UUB but also useful in comparison with traditional HDP.
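The eligibility-trace mechanism that HDP(λ) borrows is easiest to see in its tabular form. The sketch below is a generic textbook TD(λ) update with accumulating traces (the paper applies the same decay idea to critic networks, not a table):

```python
from collections import defaultdict

def td_lambda_episode(episode, V, alpha=0.1, gamma=0.95, lam=0.8):
    """Backward-view TD(λ) over one episode.

    episode: list of (state, reward, next_state) transitions.
    V: mapping from state to value estimate (mutated in place).
    """
    e = defaultdict(float)                      # eligibility traces
    for s, r, s2 in episode:
        delta = r + gamma * V[s2] - V[s]        # one-step TD error
        e[s] += 1.0                             # accumulate trace for s
        for x in list(e):
            V[x] += alpha * delta * e[x]        # credit recently visited states
            e[x] *= gamma * lam                 # decay all traces
    return V
```

A reward at the end of the episode thus propagates credit to earlier states in a single pass, which is exactly why HDP(λ) can learn from more than one future reward.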
|
20
|
Mehta P, Wang CH, Day AGR, Richardson C, Bukov M, Fisher CK, Schwab DJ. A high-bias, low-variance introduction to Machine Learning for physicists. PHYSICS REPORTS 2019; 810:1-124. [PMID: 31404441 PMCID: PMC6688775 DOI: 10.1016/j.physrep.2019.03.001] [Citation(s) in RCA: 203] [Impact Index Per Article: 40.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
Machine Learning (ML) is one of the most exciting and dynamic areas of modern research and application. The purpose of this review is to provide an introduction to the core concepts and tools of machine learning in a manner easily understood and intuitive to physicists. The review begins by covering fundamental concepts in ML and modern statistics such as the bias-variance tradeoff, overfitting, regularization, generalization, and gradient descent before moving on to more advanced topics in both supervised and unsupervised learning. Topics covered in the review include ensemble models, deep learning and neural networks, clustering and data visualization, energy-based models (including MaxEnt models and Restricted Boltzmann Machines), and variational methods. Throughout, we emphasize the many natural connections between ML and statistical physics. A notable aspect of the review is the use of Python Jupyter notebooks to introduce modern ML/statistical packages to readers using physics-inspired datasets (the Ising Model and Monte-Carlo simulations of supersymmetric decays of proton-proton collisions). We conclude with an extended outlook discussing possible uses of machine learning for furthering our understanding of the physical world as well as open problems in ML where physicists may be able to contribute.
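As a companion to the review's coverage of gradient descent (an illustrative sketch, not material from the review; the quadratic loss is an assumed example):

```python
def gradient_descent(grad, x0, lr=0.1, steps=100):
    """Plain gradient descent: repeatedly step against the loss gradient.
    `grad` is the gradient function of the loss being minimized."""
    x = x0
    for _ in range(steps):
        x = x - lr * grad(x)
    return x

# Minimize f(x) = (x - 3)^2, whose gradient is 2*(x - 3); the minimum is x = 3.
x_min = gradient_descent(lambda x: 2.0 * (x - 3.0), x0=0.0)
```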
Affiliation(s)
- Pankaj Mehta
- Department of Physics, Boston University, Boston, MA 02215, USA
- Ching-Hao Wang
- Department of Physics, Boston University, Boston, MA 02215, USA
- Marin Bukov
- Department of Physics, University of California, Berkeley, CA 94720, USA
- David J Schwab
- Initiative for the Theoretical Sciences, The Graduate Center, City University of New York, 365 Fifth Ave., New York, NY 10016
|
21
|
Day AGR, Bukov M, Weinberg P, Mehta P, Sels D. Glassy Phase of Optimal Quantum Control. PHYSICAL REVIEW LETTERS 2019; 122:020601. [PMID: 30720331 DOI: 10.1103/physrevlett.122.020601] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2018] [Indexed: 06/09/2023]
Abstract
We study the problem of preparing a quantum many-body system from an initial to a target state by optimizing the fidelity over the family of bang-bang protocols. We present compelling numerical evidence for a universal spin-glasslike transition controlled by the protocol time duration. The glassy critical point is marked by a proliferation of protocols with close-to-optimal fidelity and with a true optimum that appears exponentially difficult to locate. Using a machine learning (ML) inspired framework based on the manifold learning algorithm t-distributed stochastic neighbor embedding, we are able to visualize the geometry of the high-dimensional control landscape in an effective low-dimensional representation. Across the transition, the control landscape features an exponential number of clusters separated by extensive barriers, which bears a strong resemblance to replica symmetry breaking in spin glasses and random satisfiability problems. We further show that the quantum control landscape maps onto a disorder-free classical Ising model with frustrated nonlocal, multibody interactions. Our work highlights an intricate but unexpected connection between optimal quantum control and spin glass physics, and shows how tools from ML can be used to visualize and understand glassy optimization landscapes.
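As an illustration only (a toy single-qubit stand-in, not the many-body system studied in the Letter), a sketch of scoring bang-bang protocols by fidelity; the field values, step count, and initial/target states are assumptions:

```python
import itertools
import math

def step_unitary(hx, hz, dt):
    """Closed-form 2x2 unitary exp(-i (hx*sigma_x + hz*sigma_z) dt)."""
    a = math.sqrt(hx * hx + hz * hz)
    c, s = math.cos(a * dt), math.sin(a * dt)
    nx, nz = hx / a, hz / a
    return [[c - 1j * s * nz, -1j * s * nx],
            [-1j * s * nx, c + 1j * s * nz]]

def apply(U, psi):
    return [U[0][0] * psi[0] + U[0][1] * psi[1],
            U[1][0] * psi[0] + U[1][1] * psi[1]]

def fidelity(protocol, dt=0.5):
    """Fidelity |<1|psi(T)>|^2 for a bang-bang protocol hx(t) in {-4, +4}
    with a fixed hz = 1, starting from |0>."""
    psi = [1.0 + 0j, 0.0 + 0j]
    for bang in protocol:
        psi = apply(step_unitary(4.0 * bang, 1.0, dt), psi)
    return abs(psi[1]) ** 2

# Exhaustively score all 2^8 bang-bang protocols of 8 steps; for many-body
# systems this enumeration is infeasible, which is where the glassy
# landscape analysis of the Letter comes in.
scores = {p: fidelity(p) for p in itertools.product((-1, 1), repeat=8)}
best = max(scores.values())
```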
Affiliation(s)
- Alexandre G R Day
- Department of Physics, Boston University, 590 Commonwealth Avenue, Boston, Massachusetts 02215, USA
- Marin Bukov
- Department of Physics, University of California, Berkeley, California 94720, USA
- Phillip Weinberg
- Department of Physics, Boston University, 590 Commonwealth Avenue, Boston, Massachusetts 02215, USA
- Pankaj Mehta
- Department of Physics, Boston University, 590 Commonwealth Avenue, Boston, Massachusetts 02215, USA
- Dries Sels
- Department of Physics, Boston University, 590 Commonwealth Avenue, Boston, Massachusetts 02215, USA
- Department of Physics, Harvard University, 17 Oxford Street, Cambridge, Massachusetts 02138, USA
- Theory of Quantum and Complex Systems, Universiteit Antwerpen, B-2610 Antwerpen, Belgium
|
22
|
Dunjko V, Briegel HJ. Machine learning & artificial intelligence in the quantum domain: a review of recent progress. REPORTS ON PROGRESS IN PHYSICS. PHYSICAL SOCIETY (GREAT BRITAIN) 2018; 81:074001. [PMID: 29504942 DOI: 10.1088/1361-6633/aab406] [Citation(s) in RCA: 112] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Quantum information technologies, on the one hand, and intelligent learning systems, on the other, are both emergent technologies that are likely to have a transformative impact on our society in the future. The respective underlying fields of basic research-quantum information versus machine learning (ML) and artificial intelligence (AI)-have their own specific questions and challenges, which have hitherto been investigated largely independently. However, in a growing body of recent work, researchers have been probing the question of the extent to which these fields can indeed learn and benefit from each other. Quantum ML explores the interaction between quantum computing and ML, investigating how results and techniques from one field can be used to solve the problems of the other. Recently we have witnessed significant breakthroughs in both directions of influence. For instance, quantum computing is finding a vital application in providing speed-ups for ML problems, critical in our 'big data' world. Conversely, ML already permeates many cutting-edge technologies and may become instrumental in advanced quantum technologies. Aside from quantum speed-up in data analysis, or classical ML optimization used in quantum experiments, quantum enhancements have also been (theoretically) demonstrated for interactive learning tasks, highlighting the potential of quantum-enhanced learning agents. Finally, works exploring the use of AI for the very design of quantum experiments and for performing parts of genuine research autonomously have reported their first successes. Beyond the topics of mutual enhancement-exploring what ML/AI can do for quantum physics and vice versa-researchers have also broached the fundamental issue of quantum generalizations of learning and AI concepts. This deals with questions of the very meaning of learning and intelligence in a world that is fully described by quantum mechanics.
In this review, we describe the main ideas, recent developments and progress in a broad spectrum of research investigating ML and AI in the quantum domain.
Collapse
Affiliation(s)
- Vedran Dunjko
- Institute for Theoretical Physics, University of Innsbruck, Innsbruck 6020, Austria
- Max Planck Institute of Quantum Optics, Garching 85748, Germany
|
23
|
Masuyama N, Loo CK, Seera M, Kubota N. Quantum-Inspired Multidirectional Associative Memory With a Self-Convergent Iterative Learning. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2018; 29:1058-1068. [PMID: 28182559 DOI: 10.1109/tnnls.2017.2653114] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Quantum-inspired computing is an emerging research area that has significantly improved the capabilities of conventional algorithms. In particular, quantum-inspired Hopfield associative memory (QHAM) has demonstrated quantum information processing in neural structures. This has resulted in an exponential increase in storage capacity while explaining the extensive memory, and it has the potential to illustrate the dynamics of neurons in the human brain from a quantum mechanics perspective, although the application of QHAM is limited to autoassociation. In this paper, we introduce a quantum-inspired multidirectional associative memory (QMAM) with a one-shot learning model, and a QMAM with a self-convergent iterative learning model (IQMAM), both based on QHAM. The self-convergent iterative learning enables the network to progressively develop a resonance state from inputs to outputs. Simulation experiments demonstrate the advantages of QMAM and IQMAM, especially their stability and recall reliability.
|
24
|
Zhang P, Shen H, Zhai H. Machine Learning Topological Invariants with Neural Networks. PHYSICAL REVIEW LETTERS 2018; 120:066401. [PMID: 29481246 DOI: 10.1103/physrevlett.120.066401] [Citation(s) in RCA: 46] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/06/2017] [Revised: 12/04/2017] [Indexed: 06/08/2023]
Abstract
In this Letter we train neural networks in a supervised manner to distinguish different topological phases in the context of topological band insulators. After training with Hamiltonians of one-dimensional insulators with chiral symmetry, the neural network can predict their topological winding numbers with nearly 100% accuracy, even for Hamiltonians with larger winding numbers that are not included in the training data. These results show, remarkably, that the neural network can capture the global and nonlinear topological features of quantum phases from local inputs. By opening up the neural network, we confirm that the network does learn the discrete version of the winding number formula. We also make a couple of remarks regarding the role of the symmetry and the opposite effect of regularization techniques when applying machine learning to physical systems.
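For reference, a minimal implementation of the discrete winding number formula the abstract refers to: accumulate the wrapped angle increments of the Hamiltonian vector (h_x(k), h_y(k)) around the Brillouin zone. The SSH-like test Hamiltonian and parameter values below are illustrative assumptions, not taken from the Letter:

```python
import math

def winding_number(hx, hy):
    """Discrete winding number of (hx[i], hy[i]) sampled around the
    Brillouin zone: sum the angle increments, each wrapped to (-pi, pi],
    and divide by 2*pi."""
    n = len(hx)
    total = 0.0
    for i in range(n):
        a1 = math.atan2(hy[i], hx[i])
        a2 = math.atan2(hy[(i + 1) % n], hx[(i + 1) % n])
        d = a2 - a1
        while d <= -math.pi:      # wrap the increment into (-pi, pi]
            d += 2 * math.pi
        while d > math.pi:
            d -= 2 * math.pi
        total += d
    return round(total / (2 * math.pi))

# SSH-like chiral model h(k) = (t1 + cos k, sin k): winding 1 when t1 < 1.
ks = [2 * math.pi * i / 200 for i in range(200)]
w_topo = winding_number([0.5 + math.cos(k) for k in ks], [math.sin(k) for k in ks])
w_triv = winding_number([1.5 + math.cos(k) for k in ks], [math.sin(k) for k in ks])
```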
Affiliation(s)
- Pengfei Zhang
- Institute for Advanced Study, Tsinghua University, Beijing 100084, China
- Huitao Shen
- Department of Physics, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
- Hui Zhai
- Institute for Advanced Study, Tsinghua University, Beijing 100084, China
- Collaborative Innovation Center of Quantum Matter, Beijing 100084, China
|
25
|
Palittapongarnpim P, Wittek P, Zahedinejad E, Vedaie S, Sanders BC. Learning in quantum control: High-dimensional global optimization for noisy quantum dynamics. Neurocomputing 2017. [DOI: 10.1016/j.neucom.2016.12.087] [Citation(s) in RCA: 42] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
|
26
|
Wu C, Qi B, Chen C, Dong D. Robust Learning Control Design for Quantum Unitary Transformations. IEEE TRANSACTIONS ON CYBERNETICS 2017; 47:4405-4417. [PMID: 27705875 DOI: 10.1109/tcyb.2016.2610979] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Robust control design for quantum unitary transformations has been recognized as a fundamental and challenging task in the development of quantum information processing due to unavoidable decoherence or operational errors in the experimental implementation of quantum operations. In this paper, we extend the systematic methodology of the sampling-based learning control (SLC) approach with a gradient flow algorithm for the design of robust quantum unitary transformations. The SLC approach first uses a "training" process to find an optimal control strategy robust against certain ranges of uncertainties. Then a number of randomly selected samples are tested and the performance is evaluated according to their average fidelity. The approach is applied to three typical examples of robust quantum transformation problems: robust quantum transformations in a three-level quantum system, in a superconducting quantum circuit, and in a spin chain system. Numerical results demonstrate the effectiveness of the SLC approach and show its potential applications in various implementations of quantum unitary transformations.
|
27
|
Iwata K. Extending the Peak Bandwidth of Parameters for Softmax Selection in Reinforcement Learning. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2017; 28:1865-1877. [PMID: 27187974 DOI: 10.1109/tnnls.2016.2558295] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Softmax selection is one of the most popular methods for action selection in reinforcement learning. Although various recently proposed methods may be more effective with full parameter tuning, implementing a complicated method that requires the tuning of many parameters can be difficult. Thus, softmax selection is still worth revisiting, considering the cost savings of its implementation and tuning. In fact, this method works adequately in practice with only one parameter appropriately set for the environment. The aim of this paper is to improve the variable setting of this method to extend the bandwidth of good parameters, thereby reducing the cost of implementation and parameter tuning. To achieve this, we take advantage of the asymptotic equipartition property in a Markov decision process to extend the peak bandwidth of softmax selection. Using a variety of episodic tasks, we show that our setting is effective in extending the bandwidth and that it yields a better policy in terms of stability. The bandwidth is quantitatively assessed in a series of statistical tests.
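For reference, a minimal sketch of temperature-controlled softmax (Boltzmann) action selection as described above; the Q-values and temperature are illustrative, and this is the basic one-parameter method the paper refines rather than the paper's extended setting:

```python
import math
import random

def softmax_select(q_values, tau, rng=random):
    """Sample an action with probability proportional to exp(Q/tau).
    tau is the single tunable parameter: high tau gives near-uniform
    exploration, low tau gives near-greedy exploitation."""
    m = max(q_values)                        # subtract max for numerical stability
    weights = [math.exp((q - m) / tau) for q in q_values]
    total = sum(weights)
    probs = [w / total for w in weights]
    r, acc = rng.random(), 0.0
    for action, p in enumerate(probs):       # inverse-CDF sampling
        acc += p
        if r < acc:
            return action
    return len(q_values) - 1                 # guard against rounding

random.seed(0)
picks = [softmax_select([1.0, 2.0, 0.5], tau=0.5) for _ in range(2000)]
```

With these Q-values and tau = 0.5, the middle action dominates but the others are still sampled, which is the exploration/exploitation trade-off the temperature controls.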
|
28
|
Chen C, Dong D, Qi B, Petersen IR, Rabitz H. Quantum Ensemble Classification: A Sampling-Based Learning Control Approach. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2017; 28:1345-1359. [PMID: 28113872 DOI: 10.1109/tnnls.2016.2540719] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
Quantum ensemble classification (QEC) has significant applications in discrimination of atoms (or molecules), separation of isotopes, and quantum information extraction. However, quantum mechanics forbids deterministic discrimination among nonorthogonal states. The classification of inhomogeneous quantum ensembles is very challenging, since there exist variations in the parameters characterizing the members within different classes. In this paper, we recast QEC as a supervised quantum learning problem. A systematic classification methodology is presented by using a sampling-based learning control (SLC) approach for quantum discrimination. The classification task is accomplished via simultaneously steering members belonging to different classes to their corresponding target states (e.g., mutually orthogonal states). First, a new discrimination method is proposed for two similar quantum systems. Then, an SLC method is presented for QEC. Numerical results demonstrate the effectiveness of the proposed approach for the binary classification of two-level quantum ensembles and the multiclass classification of multilevel quantum ensembles.
|
29
|
Wei Q, Song R, Yan P. Data-Driven Zero-Sum Neuro-Optimal Control for a Class of Continuous-Time Unknown Nonlinear Systems With Disturbance Using ADP. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS 2016; 27:444-458. [PMID: 26292346 DOI: 10.1109/tnnls.2015.2464080] [Citation(s) in RCA: 67] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
This paper is concerned with a new data-driven zero-sum neuro-optimal control problem for continuous-time unknown nonlinear systems with disturbance. According to the input-output data of the nonlinear system, an effective recurrent neural network is introduced to reconstruct the dynamics of the nonlinear system. Considering the system disturbance as a control input, a two-player zero-sum optimal control problem is established. Adaptive dynamic programming (ADP) is developed to obtain the optimal control under the worst case of the disturbance. Three single-layer neural networks, including one critic and two action networks, are employed to approximate the performance index function, the optimal control law, and the disturbance, respectively, for facilitating the implementation of the ADP method. Convergence properties of the ADP method are developed to show that the system state will converge to a finite neighborhood of the equilibrium. The weight matrices of the critic and the two action networks are also convergent to finite neighborhoods of their optimal ones. Finally, simulation results show the effectiveness of the developed data-driven ADP method.
|
30
|
Application of emotion affected associative memory based on mood congruency effects for a humanoid. Neural Comput Appl 2015. [DOI: 10.1007/s00521-015-2102-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|