1
|
Dang TKN, Bucur D, Atil B, Pitel G, Ruis F, Kadkhodaei H, Litvak N. Look back, look around: A systematic analysis of effective predictors for new outlinks in focused Web crawling. Knowl Based Syst 2022. [DOI: 10.1016/j.knosys.2022.110126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
2
|
Srinivasan K, Currim F, Ram S. A Human-in-the-loop Segmented Mixed-effects Modeling Method For Analyzing Wearables Data. ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS 2022. [DOI: 10.1145/3564276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/14/2022]
Abstract
Wearables are an important source of big data as they provide real-time high-resolution data logs of health indicators of individuals. Higher-order associations between pairs of variables is common in wearables data. Representing higher-order association curves as piece-wise linear segments in a regression model makes them more interpretable. However, existing methods for identifying the change points for segmented modeling either overfit or have low external validity for wearables data containing repeated measures. Therefore, we propose a human-in-the-loop method for segmented modeling of higher-order pairwise associations between variables in wearables data. Our method uses the smooth function estimated by a generalized additive mixed model to allow the analyst to annotate change point estimates for a segmented mixed-effects model, and thereafter employs the Brent's constrained optimization procedure to fine-tuning the manually provided estimates. We validate our method using three real-world wearables datasets. Our method not only outperforms state-of-the-art modeling methods in terms of prediction performance but also provides more interpretable results. Our study contributes to health data science in terms of developing a new method for interpretable modeling of wearables data. Our analysis uncovers interesting insights on higher order associations for health researchers.
Collapse
Affiliation(s)
| | - Faiz Currim
- Eller College of Management, University of Arizona, Tucson AZ, U.S
| | - Sudha Ram
- Eller College of Management, University of Arizona, Tucson AZ, U.S
| |
Collapse
|
3
|
Liu R, Mai F, Shan Z, Wu Y. Predicting shareholder litigation on insider trading from financial text: An interpretable deep learning approach. INFORMATION & MANAGEMENT 2020. [DOI: 10.1016/j.im.2020.103387] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
4
|
Kim I, Pant G. Predicting web site audience demographics using content and design cues. INFORMATION & MANAGEMENT 2019. [DOI: 10.1016/j.im.2018.11.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
5
|
Leong LY, Hew TS, Ooi KB, Lin B. Do Electronic Word-of-Mouth and Elaboration Likelihood Model Influence Hotel Booking? JOURNAL OF COMPUTER INFORMATION SYSTEMS 2017. [DOI: 10.1080/08874417.2017.1320953] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Affiliation(s)
- Lai-Ying Leong
- Universiti Tunku Abdul Rahman, Kampar, Malaysia
- University of Malaya, Kuala Lumpur, Malaysia
| | | | | | - Binshan Lin
- Louisiana State University, Shreveport, LA, USA
| |
Collapse
|
6
|
|
7
|
Pant G, Srinivasan P. Status Locality on the Web: Implications for Building Focused Collections. INFORMATION SYSTEMS RESEARCH 2013. [DOI: 10.1287/isre.1120.0457] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
8
|
Abstract
Despite the increased prevalence of sentiment-related information on the Web, there has been limited work on focused crawlers capable of effectively collecting not only topic-relevant but also sentiment-relevant content. In this article, we propose a novel focused crawler that incorporates topic and sentiment information as well as a graph-based tunneling mechanism for enhanced collection of opinion-rich Web content regarding a particular topic. The graph-based sentiment (GBS) crawler uses a text classifier that employs both topic and sentiment categorization modules to assess the relevance of candidate pages. This information is also used to label nodes in web graphs that are employed by the tunneling mechanism to improve collection recall. Experimental results on two test beds revealed that GBS was able to provide better precision and recall than seven comparison crawlers. Moreover, GBS was able to collect a large proportion of the relevant content after traversing far fewer pages than comparison methods. GBS outperformed comparison methods on various categories of Web pages in the test beds, including collection of blogs, Web forums, and social networking Web site content. Further analysis revealed that both the sentiment classification module and graph-based tunneling mechanism played an integral role in the overall effectiveness of the GBS crawler.
Collapse
|
9
|
Huang Z, Zhao H, Zhu D. Two New Prediction-Driven Approaches to Discrete Choice Prediction. ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS 2012. [DOI: 10.1145/2229156.2229159] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
The ability to predict consumer choices is essential in understanding the demand structure of products and services. Typical discrete choice models that are targeted at providing an understanding of the behavioral process leading to choice outcomes are developed around two main assumptions: the existence of a utility function that represents the preferences over a choice set and the relatively simple and interpretable functional form for the utility function with respect to attributes of alternatives and decision makers. These assumptions lead to models that can be easily interpreted to provide insights into the effects of individual variables, such as price and promotion, on consumer choices. However, these restrictive assumptions might impede the ability of such theory-driven models to deliver accurate predictions and forecasts. In this article, we develop novel approaches targeted at providing more accurate choice predictions. Specifically, we propose two prediction-driven approaches: pairwise preference learning using classification techniques and ranking function learning using evolutionary computation. We compare our proposed approaches with a multiclass classification approach, as well as a standard discrete choice model. Our empirical results show that the proposed approaches achieved significantly higher choice prediction accuracy.
Collapse
|