Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Xu X, Zuo L, Huang Z. Reinforcement learning algorithms with function approximation: Recent advances and applications. Inf Sci (N Y) 2014;261:1-31. [DOI: 10.1016/j.ins.2013.08.037] [Citation(s) in RCA: 114] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

For:	Xu X, Zuo L, Huang Z. Reinforcement learning algorithms with function approximation: Recent advances and applications. Inf Sci (N Y) 2014;261:1-31. [DOI: 10.1016/j.ins.2013.08.037] [Citation(s) in RCA: 114] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Number

Cited by Other Article(s)

Alahi MEE, Sukkuea A, Tina FW, Nag A, Kurdthongmee W, Suwannarat K, Mukhopadhyay SC. Integration of IoT-Enabled Technologies and Artificial Intelligence (AI) for Smart City Scenario: Recent Advancements and Future Trends. SENSORS (BASEL, SWITZERLAND) 2023;23:s23115206. [PMID: 37299934 DOI: 10.3390/s23115206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 05/24/2023] [Accepted: 05/29/2023] [Indexed: 06/12/2023]

Abstract

As the global population grows, and urbanization becomes more prevalent, cities often struggle to provide convenient, secure, and sustainable lifestyles due to the lack of necessary smart technologies. Fortunately, the Internet of Things (IoT) has emerged as a solution to this challenge by connecting physical objects using electronics, sensors, software, and communication networks. This has transformed smart city infrastructures, introducing various technologies that enhance sustainability, productivity, and comfort for urban dwellers. By leveraging Artificial Intelligence (AI) to analyze the vast amount of IoT data available, new opportunities are emerging to design and manage futuristic smart cities. In this review article, we provide an overview of smart cities, defining their characteristics and exploring the architecture of IoT. A detailed analysis of various wireless communication technologies employed in smart city applications is presented, with extensive research conducted to determine the most appropriate communication technologies for specific use cases. The article also sheds light on different AI algorithms and their suitability for smart city applications. Furthermore, the integration of IoT and AI in smart city scenarios is discussed, emphasizing the potential contributions of 5G networks coupled with AI in advancing modern urban environments. This article contributes to the existing literature by highlighting the tremendous opportunities presented by integrating IoT and AI, paving the way for the development of smart cities that significantly enhance the quality of life for urban dwellers while promoting sustainability and productivity. By exploring the potential of IoT, AI, and their integration, this review article provides valuable insights into the future of smart cities, demonstrating how these technologies can positively impact urban environments and the well-being of their inhabitants.

Collapse

Hasanzadeh A, Hamblin MR, Kiani J, Noori H, Hardie JM, Karimi M, Shafiee H. Could artificial intelligence revolutionize the development of nanovectors for gene therapy and mRNA vaccines? NANO TODAY 2022;47:101665. [PMID: 37034382 PMCID: PMC10081506 DOI: 10.1016/j.nantod.2022.101665] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]

Affiliation(s)

Akbar Hasanzadeh Cellular and Molecular Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran 1449614535, Iran
Michael R Hamblin Laser Research Centre, Faculty of Health Science, University of Johannesburg, Doornfontein 2028, South Africa Radiation Biology Research Center, Iran University of Medical Sciences, Tehran, Iran
Jafar Kiani Oncopathology Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran Department of Molecular Medicine, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran, Iran
Hamid Noori Cellular and Molecular Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran 1449614535, Iran
Joseph M. Hardie Division of Engineering in Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, 02139 USA
Mahdi Karimi Cellular and Molecular Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran Department of Medical Nanotechnology, Faculty of Advanced Technologies in Medicine, Iran University of Medical Sciences, Tehran 1449614535, Iran Oncopathology Research Center, Iran University of Medical Sciences, Tehran 1449614535, Iran Research Center for Science and Technology in Medicine, Tehran University of Medical Sciences, Tehran 141556559, Iran Applied Biotechnology Research Centre, Tehran Medical Science, Islamic Azad University, Tehran 1584743311, Iran
Hadi Shafiee Division of Engineering in Medicine, Department of Medicine, Brigham and Women’s Hospital, Harvard Medical School, Boston, MA, 02139 USA

Collapse

Gao X, Chao F, Zhou C, Ge Z, Yang L, Chang X, Shang C, Shen Q. Error controlled actor-critic. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.08.079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

Xiong K, Zhou P, Wei C. Autonomous Navigation of Unmanned Aircraft Using Space Target LOS Measurements and QLEKF. SENSORS (BASEL, SWITZERLAND) 2022;22:s22186992. [PMID: 36146339 PMCID: PMC9503636 DOI: 10.3390/s22186992] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Revised: 09/10/2022] [Accepted: 09/13/2022] [Indexed: 06/01/2023]

Nievas N, Pagès-Bernaus A, Bonada F, Echeverria L, Abio A, Lange D, Pujante J. A Reinforcement Learning Control in Hot Stamping for Cycle Time Optimization. MATERIALS 2022;15:ma15144825. [PMID: 35888292 PMCID: PMC9322736 DOI: 10.3390/ma15144825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Revised: 06/23/2022] [Accepted: 07/10/2022] [Indexed: 11/16/2022]

Fathi A, Masoudi SF. Combining CNN and Q-learning for increasing the accuracy of lost gamma source finding. Sci Rep 2022;12:2644. [PMID: 35173217 PMCID: PMC8850423 DOI: 10.1038/s41598-022-06326-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2021] [Accepted: 01/24/2022] [Indexed: 11/30/2022] Open

Ai M, Xie Y, Tang Z, Zhang J, Gui W. Deep learning feature-based setpoint generation and optimal control for flotation processes. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.07.060] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]

Alharbi A, Alyami H, M P, Rauf HT, Kadry S. Intelligent scaling for 6G IoE services for resource provisioning. PeerJ Comput Sci 2021;7:e755. [PMID: 34805508 PMCID: PMC8576555 DOI: 10.7717/peerj-cs.755] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Accepted: 09/30/2021] [Indexed: 06/13/2023]

A reinforcement learning approach to distribution-free capacity allocation for sea cargo revenue management. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.04.092] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Sami H, Otrok H, Bentahar J, Mourad A. AI-Based Resource Provisioning of IoE Services in 6G: A Deep Reinforcement Learning Approach. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT 2021. [DOI: 10.1109/tnsm.2021.3066625] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Balkenius C, Tjøstheim TA, Johansson B, Wallin A, Gärdenfors P. The Missing Link Between Memory and Reinforcement Learning. Front Psychol 2020;11:560080. [PMID: 33362625 PMCID: PMC7758424 DOI: 10.3389/fpsyg.2020.560080] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Accepted: 11/16/2020] [Indexed: 11/16/2022] Open

Deng C, Ji X, Rainey C, Zhang J, Lu W. Integrating Machine Learning with Human Knowledge. iScience 2020;23:101656. [PMID: 33134890 PMCID: PMC7588855 DOI: 10.1016/j.isci.2020.101656] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023] Open

Towards safe reinforcement-learning in industrial grid-warehousing. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2020.06.010] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]

Konda R, La HM, Zhang J. Decentralized Function Approximated Q-Learning in Multi-Robot Systems For Predator Avoidance. IEEE Robot Autom Lett 2020. [DOI: 10.1109/lra.2020.3013920] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

LIN YAHUI, CHIU SHAOWEN, LIN YINGCHE, LIN CHIENCHUNG, PAN LUNGKWANG. INVERSE PROBLEM ALGORITHM APPLICATION TO SEMI-QUANTITATIVE ANALYSIS OF 272 PATIENTS WITH ISCHEMIC STROKE SYMPTOMS: CAROTID STENOSIS RISK ASSESSMENT FOR FIVE RISK FACTORS. J MECH MED BIOL 2020. [DOI: 10.1142/s0219519420400217] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]

Abstract This study proposes the inverse problem algorithm (IPA) with five risk factors applied to the semi-quantitative analysis of carotid stenosis 272 patients with suspected ischemic stroke. The IPA is known to provide a substantiated machine learning-based prediction of the expected outcomes by solving an inverse matrix of variable coefficients. In case of carotid stenosis prediction, such risk factors as patient’s age, mean arterial pressure (MAP), glucose AC, low-density lipoprotein-cholesterol (LDL-C), and C-Reactive protein (CRP) were assessed for the main group of 217 patients. Their results were processed by the STATISTICA program with a customized loss function ([Formula: see text]), yielding the first-order nonlinear semi-empirical formula with 16 terms. The loss function was calculated via the total mismatch between the theoretical predictions and true carotid stenosis cases (%) for all 217 patients. Thus, the carotid stenosis (%) compromised solution array [[Formula: see text]] was optimized using [Formula: see text] individual data points via the proposed algorithm. The results showed a complete regression with loss function [Formula: see text]=2.3543, variance [Formula: see text]=87.46%, and correlation coefficient [Formula: see text]. The reference group of 55 more patients with the same preliminary diagnosis and symptoms was selected to validate the method predictive feasibility, which was found quite satisfactory. The decreasing order of three dominant risk factors was as follows: CRP, glucose AC, and MAP, whereas age and LDL-C weakly influenced the program computation results. The IPA showed a strong convergence by its default characteristic. The reduction of the number of variables in computation deteriorated the prediction accuracy, exhibiting the algorithm’s high sensitivity to the number of variables. Collapse

Li J, Yao L, Xu X, Cheng B, Ren J. Deep reinforcement learning for pedestrian collision avoidance and human-machine cooperative driving. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2020.03.105] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Liu R, Liang J, Alkhambashi M. Research on breakthrough and innovation of UAV mission planning method based on cloud computing-based reinforcement learning algorithm. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2019. [DOI: 10.3233/jifs-179130] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Yuan Y, Yu ZL, Gu Z, Deng X, Li Y. A novel multi-step reinforcement learning method for solving reward hacking. APPL INTELL 2019. [DOI: 10.1007/s10489-019-01417-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

A Q-learning approach based on human reasoning for navigation in a dynamic environment. ROBOTICA 2018. [DOI: 10.1017/s026357471800111x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Xu B. Composite Learning Finite-Time Control With Application to Quadrotors. IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS: SYSTEMS 2018;48:1806-1815. [DOI: 10.1109/tsmc.2017.2698473] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]

Zhu H, Paschalidis IC, Hasselmo ME. Neural circuits for learning context-dependent associations of stimuli. Neural Netw 2018;107:48-60. [PMID: 30177226 DOI: 10.1016/j.neunet.2018.07.018] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2017] [Revised: 07/08/2018] [Accepted: 07/09/2018] [Indexed: 10/28/2022]

Zhou Y, Duval B, Hao JK. Improving probability learning based local search for graph coloring. Appl Soft Comput 2018. [DOI: 10.1016/j.asoc.2018.01.027] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Automatic design of hyper-heuristic based on reinforcement learning. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2018.01.005] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]

Shi H, Lin Z, Zhang S, Li X, Hwang KS. An adaptive decision-making method with fuzzy Bayesian reinforcement learning for robot soccer. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2018.01.032] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Wang GF, Fang Z, Li P. Shaping in reinforcement learning by knowledge transferred from human-demonstrations of a simple similar task. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2018. [DOI: 10.3233/jifs-17052] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Yin B, Dridi M, Moudni AE. Recursive least-squares temporal difference learning for adaptive traffic signal control at intersection. Neural Comput Appl 2017. [DOI: 10.1007/s00521-017-3066-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]

Wang W, Chen X, He J. Adaptive Critic Design with Local Gaussian Process Models. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS 2016. [DOI: 10.20965/jaciii.2016.p1135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Lian C, Xu X, Chen H, He H. Near-Optimal Tracking Control of Mobile Robots Via Receding-Horizon Dual Heuristic Programming. IEEE TRANSACTIONS ON CYBERNETICS 2016;46:2484-2496. [PMID: 26642462 DOI: 10.1109/tcyb.2015.2478857] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]

Wang D, Li C, Liu D, Mu C. Data-based robust optimal control of continuous-time affine nonlinear systems with matched uncertainties. Inf Sci (N Y) 2016. [DOI: 10.1016/j.ins.2016.05.034] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Application of a Gradient Descent Continuous Actor-Critic Algorithm for Double-Side Day-Ahead Electricity Market Modeling. ENERGIES 2016. [DOI: 10.3390/en9090725] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Wei Q, Liu D, Lewis FL. Optimal distributed synchronization control for continuous-time heterogeneous multi-agent differential graphical games. Inf Sci (N Y) 2015. [DOI: 10.1016/j.ins.2015.04.044] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]

Fernandez-Gauna B, Graña M, Lopez-Guede JM, Etxeberria-Agiriano I, Ansoategui I. Reinforcement Learning endowed with safe veto policies to learn the control of Linked-Multicomponent Robotic Systems. Inf Sci (N Y) 2015. [DOI: 10.1016/j.ins.2015.04.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Embedded Adaptive Fuzzy Controller Based on Reinforcement Learning for DC Motor with Flexible Shaft. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2015. [DOI: 10.1007/s13369-015-1752-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]

Optimal scheduling for data transmission between mobile devices and cloud. Inf Sci (N Y) 2015. [DOI: 10.1016/j.ins.2014.12.059] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]

Bahrpeyma F, Zakerolhoseini A, Haghighi H. Using IDS fitted Q to develop a real-time adaptive controller for dynamic resource provisioning in Cloud's virtualized environment. Appl Soft Comput 2015. [DOI: 10.1016/j.asoc.2014.10.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]

Huang Z, Xu X, Zuo L. Reinforcement learning with automatic basis construction based on isometric feature mapping. Inf Sci (N Y) 2014. [DOI: 10.1016/j.ins.2014.07.008] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

A developmental approach to robotic pointing via human–robot interaction. Inf Sci (N Y) 2014. [DOI: 10.1016/j.ins.2014.03.104] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]

Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming. Inf Sci (N Y) 2014. [DOI: 10.1016/j.ins.2014.05.050] [Citation(s) in RCA: 111] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Zhu F, Liu Q, Wang H, Zhou X, Fu Y. Unregistered biological words recognition by Q-learning with transfer learning. ScientificWorldJournal 2014;2014:173290. [PMID: 24701139 PMCID: PMC3950481 DOI: 10.1155/2014/173290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2013] [Accepted: 01/08/2014] [Indexed: 11/23/2022] Open