• Reference Citation Analysis
  • v
  • v
  • Find an Article
Find an Article PDF (4632662)   Today's Articles (2442)   Subscriber (49920)
For: Xu X, Zuo L, Huang Z. Reinforcement learning algorithms with function approximation: Recent advances and applications. Inf Sci (N Y) 2014;261:1-31. [DOI: 10.1016/j.ins.2013.08.037] [Citation(s) in RCA: 114] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Number Cited by Other Article(s)
1
Alahi MEE, Sukkuea A, Tina FW, Nag A, Kurdthongmee W, Suwannarat K, Mukhopadhyay SC. Integration of IoT-Enabled Technologies and Artificial Intelligence (AI) for Smart City Scenario: Recent Advancements and Future Trends. SENSORS (BASEL, SWITZERLAND) 2023;23:s23115206. [PMID: 37299934 DOI: 10.3390/s23115206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 05/24/2023] [Accepted: 05/29/2023] [Indexed: 06/12/2023]
2
Hasanzadeh A, Hamblin MR, Kiani J, Noori H, Hardie JM, Karimi M, Shafiee H. Could artificial intelligence revolutionize the development of nanovectors for gene therapy and mRNA vaccines? NANO TODAY 2022;47:101665. [PMID: 37034382 PMCID: PMC10081506 DOI: 10.1016/j.nantod.2022.101665] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
3
Gao X, Chao F, Zhou C, Ge Z, Yang L, Chang X, Shang C, Shen Q. Error controlled actor-critic. Inf Sci (N Y) 2022. [DOI: 10.1016/j.ins.2022.08.079] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
4
Xiong K, Zhou P, Wei C. Autonomous Navigation of Unmanned Aircraft Using Space Target LOS Measurements and QLEKF. SENSORS (BASEL, SWITZERLAND) 2022;22:s22186992. [PMID: 36146339 PMCID: PMC9503636 DOI: 10.3390/s22186992] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/04/2022] [Revised: 09/10/2022] [Accepted: 09/13/2022] [Indexed: 06/01/2023]
5
Nievas N, Pagès-Bernaus A, Bonada F, Echeverria L, Abio A, Lange D, Pujante J. A Reinforcement Learning Control in Hot Stamping for Cycle Time Optimization. MATERIALS 2022;15:ma15144825. [PMID: 35888292 PMCID: PMC9322736 DOI: 10.3390/ma15144825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Revised: 06/23/2022] [Accepted: 07/10/2022] [Indexed: 11/16/2022]
6
Fathi A, Masoudi SF. Combining CNN and Q-learning for increasing the accuracy of lost gamma source finding. Sci Rep 2022;12:2644. [PMID: 35173217 PMCID: PMC8850423 DOI: 10.1038/s41598-022-06326-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2021] [Accepted: 01/24/2022] [Indexed: 11/30/2022]  Open
7
Ai M, Xie Y, Tang Z, Zhang J, Gui W. Deep learning feature-based setpoint generation and optimal control for flotation processes. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.07.060] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
8
Alharbi A, Alyami H, M P, Rauf HT, Kadry S. Intelligent scaling for 6G IoE services for resource provisioning. PeerJ Comput Sci 2021;7:e755. [PMID: 34805508 PMCID: PMC8576555 DOI: 10.7717/peerj-cs.755] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Accepted: 09/30/2021] [Indexed: 06/13/2023]
9
A reinforcement learning approach to distribution-free capacity allocation for sea cargo revenue management. Inf Sci (N Y) 2021. [DOI: 10.1016/j.ins.2021.04.092] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
10
Sami H, Otrok H, Bentahar J, Mourad A. AI-Based Resource Provisioning of IoE Services in 6G: A Deep Reinforcement Learning Approach. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT 2021. [DOI: 10.1109/tnsm.2021.3066625] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
11
Balkenius C, Tjøstheim TA, Johansson B, Wallin A, Gärdenfors P. The Missing Link Between Memory and Reinforcement Learning. Front Psychol 2020;11:560080. [PMID: 33362625 PMCID: PMC7758424 DOI: 10.3389/fpsyg.2020.560080] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2020] [Accepted: 11/16/2020] [Indexed: 11/16/2022]  Open
12
Deng C, Ji X, Rainey C, Zhang J, Lu W. Integrating Machine Learning with Human Knowledge. iScience 2020;23:101656. [PMID: 33134890 PMCID: PMC7588855 DOI: 10.1016/j.isci.2020.101656] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]  Open
13
Towards safe reinforcement-learning in industrial grid-warehousing. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2020.06.010] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
14
Konda R, La HM, Zhang J. Decentralized Function Approximated Q-Learning in Multi-Robot Systems For Predator Avoidance. IEEE Robot Autom Lett 2020. [DOI: 10.1109/lra.2020.3013920] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
15
LIN YAHUI, CHIU SHAOWEN, LIN YINGCHE, LIN CHIENCHUNG, PAN LUNGKWANG. INVERSE PROBLEM ALGORITHM APPLICATION TO SEMI-QUANTITATIVE ANALYSIS OF 272 PATIENTS WITH ISCHEMIC STROKE SYMPTOMS: CAROTID STENOSIS RISK ASSESSMENT FOR FIVE RISK FACTORS. J MECH MED BIOL 2020. [DOI: 10.1142/s0219519420400217] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
16
Li J, Yao L, Xu X, Cheng B, Ren J. Deep reinforcement learning for pedestrian collision avoidance and human-machine cooperative driving. Inf Sci (N Y) 2020. [DOI: 10.1016/j.ins.2020.03.105] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
17
Liu R, Liang J, Alkhambashi M. Research on breakthrough and innovation of UAV mission planning method based on cloud computing-based reinforcement learning algorithm. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2019. [DOI: 10.3233/jifs-179130] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
18
Yuan Y, Yu ZL, Gu Z, Deng X, Li Y. A novel multi-step reinforcement learning method for solving reward hacking. APPL INTELL 2019. [DOI: 10.1007/s10489-019-01417-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
19
A Q-learning approach based on human reasoning for navigation in a dynamic environment. ROBOTICA 2018. [DOI: 10.1017/s026357471800111x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
20
Xu B. Composite Learning Finite-Time Control With Application to Quadrotors. IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS: SYSTEMS 2018;48:1806-1815. [DOI: 10.1109/tsmc.2017.2698473] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
21
Zhu H, Paschalidis IC, Hasselmo ME. Neural circuits for learning context-dependent associations of stimuli. Neural Netw 2018;107:48-60. [PMID: 30177226 DOI: 10.1016/j.neunet.2018.07.018] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2017] [Revised: 07/08/2018] [Accepted: 07/09/2018] [Indexed: 10/28/2022]
22
Zhou Y, Duval B, Hao JK. Improving probability learning based local search for graph coloring. Appl Soft Comput 2018. [DOI: 10.1016/j.asoc.2018.01.027] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
23
Automatic design of hyper-heuristic based on reinforcement learning. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2018.01.005] [Citation(s) in RCA: 44] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
24
Shi H, Lin Z, Zhang S, Li X, Hwang KS. An adaptive decision-making method with fuzzy Bayesian reinforcement learning for robot soccer. Inf Sci (N Y) 2018. [DOI: 10.1016/j.ins.2018.01.032] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
25
Wang GF, Fang Z, Li P. Shaping in reinforcement learning by knowledge transferred from human-demonstrations of a simple similar task. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS 2018. [DOI: 10.3233/jifs-17052] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
26
Yin B, Dridi M, Moudni AE. Recursive least-squares temporal difference learning for adaptive traffic signal control at intersection. Neural Comput Appl 2017. [DOI: 10.1007/s00521-017-3066-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
27
Wang W, Chen X, He J. Adaptive Critic Design with Local Gaussian Process Models. JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS 2016. [DOI: 10.20965/jaciii.2016.p1135] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
28
Lian C, Xu X, Chen H, He H. Near-Optimal Tracking Control of Mobile Robots Via Receding-Horizon Dual Heuristic Programming. IEEE TRANSACTIONS ON CYBERNETICS 2016;46:2484-2496. [PMID: 26642462 DOI: 10.1109/tcyb.2015.2478857] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
29
Wang D, Li C, Liu D, Mu C. Data-based robust optimal control of continuous-time affine nonlinear systems with matched uncertainties. Inf Sci (N Y) 2016. [DOI: 10.1016/j.ins.2016.05.034] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
30
Application of a Gradient Descent Continuous Actor-Critic Algorithm for Double-Side Day-Ahead Electricity Market Modeling. ENERGIES 2016. [DOI: 10.3390/en9090725] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
31
Wei Q, Liu D, Lewis FL. Optimal distributed synchronization control for continuous-time heterogeneous multi-agent differential graphical games. Inf Sci (N Y) 2015. [DOI: 10.1016/j.ins.2015.04.044] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
32
Fernandez-Gauna B, Graña M, Lopez-Guede JM, Etxeberria-Agiriano I, Ansoategui I. Reinforcement Learning endowed with safe veto policies to learn the control of Linked-Multicomponent Robotic Systems. Inf Sci (N Y) 2015. [DOI: 10.1016/j.ins.2015.04.005] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
33
Embedded Adaptive Fuzzy Controller Based on Reinforcement Learning for DC Motor with Flexible Shaft. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2015. [DOI: 10.1007/s13369-015-1752-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
34
Optimal scheduling for data transmission between mobile devices and cloud. Inf Sci (N Y) 2015. [DOI: 10.1016/j.ins.2014.12.059] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
35
Bahrpeyma F, Zakerolhoseini A, Haghighi H. Using IDS fitted Q to develop a real-time adaptive controller for dynamic resource provisioning in Cloud's virtualized environment. Appl Soft Comput 2015. [DOI: 10.1016/j.asoc.2014.10.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
36
Huang Z, Xu X, Zuo L. Reinforcement learning with automatic basis construction based on isometric feature mapping. Inf Sci (N Y) 2014. [DOI: 10.1016/j.ins.2014.07.008] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
37
A developmental approach to robotic pointing via human–robot interaction. Inf Sci (N Y) 2014. [DOI: 10.1016/j.ins.2014.03.104] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
38
Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming. Inf Sci (N Y) 2014. [DOI: 10.1016/j.ins.2014.05.050] [Citation(s) in RCA: 111] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
39
Zhu F, Liu Q, Wang H, Zhou X, Fu Y. Unregistered biological words recognition by Q-learning with transfer learning. ScientificWorldJournal 2014;2014:173290. [PMID: 24701139 PMCID: PMC3950481 DOI: 10.1155/2014/173290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2013] [Accepted: 01/08/2014] [Indexed: 11/23/2022]  Open
PrevPage 1 of 1 1Next
© 2004-2024 Baishideng Publishing Group Inc. All rights reserved. 7041 Koll Center Parkway, Suite 160, Pleasanton, CA 94566, USA