• Reference Citation Analysis
  • v
  • v
  • Find an Article
Find an Article PDF (4623375)   Today's Articles (5096)   Subscriber (49407)
For: Shi C, Zhang S, Lu W, Song R. Statistical inference of the value function for reinforcement learning in infinite‐horizon settings. J R Stat Soc Series B Stat Methodol 2021. [DOI: 10.1111/rssb.12465] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Number Cited by Other Article(s)
1
Zhou Y, Shi C, Li L, Yao Q. Testing for the Markov property in time series via deep conditional generative learning. J R Stat Soc Series B Stat Methodol 2023;85:1204-1222. [PMID: 37780936 PMCID: PMC10541293 DOI: 10.1093/jrsssb/qkad064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2022] [Accepted: 05/27/2023] [Indexed: 10/03/2023]
2
Yu M, Lu W, Yang S, Ghosh P. A multiplicative structural nested mean model for zero-inflated outcomes. Biometrika 2023;110:519-536. [PMID: 37197742 PMCID: PMC10183836 DOI: 10.1093/biomet/asac050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2021] [Indexed: 11/13/2022]  Open
3
Liao P, Qi Z, Wan R, Klasnja P, Murphy SA. Batch policy learning in average reward Markov decision processes. Ann Stat 2022;50:3364-3387. [PMID: 37022318 PMCID: PMC10072865 DOI: 10.1214/22-aos2231] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
4
Shi C, Zhu J, Ye S, Luo S, Zhu H, Song R. Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process. J Am Stat Assoc 2022. [DOI: 10.1080/01621459.2022.2110878] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]
5
Shi C, Zhu J, Shen Y, Luo S, Zhu H, Song R. Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process. J Am Stat Assoc 2022. [DOI: 10.1080/01621459.2022.2110876] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2022]
6
Ramprasad P, Li Y, Yang Z, Wang Z, Sun WW, Cheng G. Online Bootstrap Inference For Policy Evaluation In Reinforcement Learning. J Am Stat Assoc 2022. [DOI: 10.1080/01621459.2022.2096620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
7
Shi C, Wang X, Luo S, Zhu H, Ye J, Song R. Dynamic Causal Effects Evaluation in A/B Testing with a Reinforcement Learning Framework. J Am Stat Assoc 2022. [DOI: 10.1080/01621459.2022.2027776] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
PrevPage 1 of 1 1Next
© 2004-2024 Baishideng Publishing Group Inc. All rights reserved. 7041 Koll Center Parkway, Suite 160, Pleasanton, CA 94566, USA