1
|
García Subies G, Barbero Jiménez Á, Martínez Fernández P. A comparative analysis of Spanish Clinical encoder-based models on NER and classification tasks. J Am Med Inform Assoc 2024:ocae054. [PMID: 38489543 DOI: 10.1093/jamia/ocae054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 02/16/2024] [Accepted: 03/04/2024] [Indexed: 03/17/2024] Open
Abstract
OBJECTIVES This comparative analysis aims to assess the efficacy of encoder Language Models for clinical tasks in the Spanish language. The primary goal is to identify the most effective resources within this context. IMPORTANCE This study highlights a critical gap in NLP resources for the Spanish language, particularly in the clinical sector. Given the vast number of Spanish speakers globally and the increasing reliance on electronic health records, developing effective Spanish language models is crucial for both clinical research and healthcare delivery. Our work underscores the urgent need for specialized encoder models in Spanish that can handle clinical data with high accuracy, thus paving the way for advancements in healthcare services and biomedical research for Spanish-speaking populations. MATERIALS AND METHODS We examined 17 distinct corpora with a focus on clinical tasks. Our evaluation centered on Spanish Language Models and Spanish Clinical Language models (both encoder-based). To ascertain performance, we meticulously benchmarked these models across a curated subset of the corpora. This extensive study involved fine-tuning over 3000 models. RESULTS Our analysis revealed that the best models are not clinical models, but general-purpose models. Also, the biggest models are not always the best ones. The best-performing model, RigoBERTa 2, obtained an average F1 score of 0.880 across all tasks. DISCUSSION Our study demonstrates the advantages of dedicated encoder-based Spanish Clinical Language models over generative models. However, the scarcity of diverse corpora, mostly focused on NER tasks, underscores the need for further research. The limited availability of high-performing models emphasizes the urgency for development in this area. CONCLUSION Through systematic evaluation, we identified the current landscape of encoder Language Models for clinical tasks in the Spanish language. While challenges remain, the availability of curated corpora and models offers a foundation for advancing Spanish Clinical Language models. Future efforts in refining these models are essential to elevate their effectiveness in clinical NLP.
Collapse
Affiliation(s)
- Guillem García Subies
- Computer Science Department, Universidad Carlos III de Madrid, Leganés, Spain
- AI Department, Instituto de Ingeniería del Conocimiento, Madrid, Spain
| | | | | |
Collapse
|
2
|
Zhang Y, Yang W. The landslide's conceptualizing economic decline and its framing effect: Mandarin evidence. Front Psychol 2023; 14:1271911. [PMID: 37965670 PMCID: PMC10642486 DOI: 10.3389/fpsyg.2023.1271911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 10/03/2023] [Indexed: 11/16/2023] Open
Abstract
Conceptual metaphors are essential for explaining and understanding social concerns. Natural disaster metaphors are commonly employed to access the abstract and negative impacts of social issues. Five of the top 10 most prevalent natural disaster frames in the Center for Chinese Linguistics (CCL)-earthquake, flood, fire hazard, drought, typhoon, landslide, volcano, sandstorm, tsunami, and debris flow-share a common economic target domain and show economic recession. Additionally, corpus-based research has revealed that the landslide frame is the most salient in figuratively representing economic declines. An experimental study derived from the corpus analysis has found that the landslide-framed economic crises posed more severity to participants and exerted a notable influence on their opinions and judgments. Therefore, when effective communication of economic hazards is to be realized, metaphorical representation of economic crises demands great consideration.
Collapse
Affiliation(s)
| | - Wenxing Yang
- College of International Studies, Yangzhou University, Yangzhou, Jiangsu, China
| |
Collapse
|
3
|
Sancar B, Çetiner Y, Dayı E. Evaluation of the pattern of fracture formation from trauma to the human mandible with finite element analysis. Part 2: The corpus and the angle regions. Dent Traumatol 2023; 39:437-447. [PMID: 36942890 DOI: 10.1111/edt.12841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Revised: 03/07/2023] [Accepted: 03/14/2023] [Indexed: 03/23/2023]
Abstract
BACKGROUND/AIMS Although the mandible is the largest and strongest bone of the facial skeleton, it is frequently broken. The fracture location in the mandible depends on the biomechanical features, direction and angle of the trauma, and masticatory muscles. This study aimed to evaluate the stresses caused by trauma to the corpus and angle regions from different angles. MATERIALS AND METHODS After computer-based mandible models were created using finite element analysis, a force of 2000 Newton(N) was simulated with the mouth open or closed to the corpus and the angle. To the corpus: at 90° (Model 1) in the lateromedial direction, 45° (Model 2) in the lateromedial-inferosuperior direction, and 90° (Model 3) in the inferosuperior direction. To angle: 90° (Model 4) in the lateromedial direction and 45° (Model 5) in the lateromedial-inferosuperior direction. The resulting stress intensity was assessed using FEA. RESULTS Following the simulated forces, the maximum stress in the mandible occurred in the condylar region, except in Model 3 (Left(L)Corpus2[36 megapascals(MPa)]) in the mouth-closed condition. After traumas in Model 1 (open-mouth: LCondyle2[547 MPa]) and Model 4 (closed-mouth: LCondyle2[607 MPa]), higher stress values occurred in the condyle. In the mouth open-closed state, there was no significant stress change in the condyle region in Model 1 (open-mouth: LCondyle2[547 MPa], closed-mouth:LCondyle2[546 MPa]) or in Model 2 (open mouth: Right(R)Condyle2[431 MPa], closed-mouth:LCondyle2[439 MPa]). In Model 3, lower stress values occurred in the closed-mouth rather than the open-mouth (LCondyle1[167 MPa]) state. In Models 4 and 5, the stress values increased in the mouth-closed condition compared with the mouth-open condition. CONCLUSIONS Stress in the mandible is affected by the location of the trauma and the angle of incidence of the blow. In trauma to both the corpus and the angle, the most common area to be fractured is the condyle.
Collapse
Affiliation(s)
- Bahadır Sancar
- Dentistry Faculty, Department of Oral and Maxillofacial Surgery, Inonu University, Malatya, Turkey
| | - Yunus Çetiner
- Dentistry Faculty, Department of Oral and Maxillofacial Surgery, Inonu University, Malatya, Turkey
| | - Ertunç Dayı
- Dentistry Faculty, Department of Oral and Maxillofacial Surgery, Ataturk University, Erzurum, Turkey
| |
Collapse
|
4
|
Robinson JA, Sandow RJ, Piazza R. Introducing the keyconcept approach to the analysis of language: the case of regulation in COVID-19 diaries. Front Artif Intell 2023; 6:1176283. [PMID: 37808621 PMCID: PMC10552565 DOI: 10.3389/frai.2023.1176283] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Accepted: 07/20/2023] [Indexed: 10/10/2023] Open
Abstract
Using the Mass Observation corpus of 12th of May Diaries, we investigate concepts that are characteristic of the first coronavirus lockdown in the UK. More specifically, we extract and analyse concepts which are distinctive of the discourses produced in May 2020 in relation to concepts used in the 10 previous years, 2010-2019. In the current paper we focus on the concept of regulation, which we identify through a novel approach to querying semantic content in large datasets. Typically, linguists look at keywords to understand differences between two datasets. We demonstrate that taking the perspective of a keyconcept rather than the keyword in linguistic analysis is a beneficial way of identifying trends in broader patterns of thoughts and behaviours which reflect lived-experiences that are particularly prominent of a given dataset, which, in this current paper, is the COVID-19 era dataset. In order to contextualise the keyconcept analysis, we investigate the discourses surrounding the concept of regulation. We find that diarists communicate collective experience of limited individual agency, surrounded by feelings of fear and gratitude. Diarists' reporting on events is often fragmented, focused on new information, and firmly placed in a temporal frame.
Collapse
Affiliation(s)
- Justyna A. Robinson
- School of Media, Arts and Humanities, University of Sussex, Brighton, United Kingdom
| | - Rhys J. Sandow
- Department of Language and Linguistic Science, University of York, York, United Kingdom
| | - Roberta Piazza
- School of Media, Arts and Humanities, University of Sussex, Brighton, United Kingdom
| |
Collapse
|
5
|
Tatsumi T, Sala G. Learning conversational dependency: Children's response using un in Japanese. J Child Lang 2023; 50:1226-1244. [PMID: 35786206 DOI: 10.1017/s0305000922000344] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
This study investigates how Japanese-speaking children learn interactional dependencies in conversations that determine the use of un, a token typically used as a positive response for yes-no questions, backchannel, and acknowledgement. We hypothesise that children learn to produce un appropriately by recognising different types of cues occurring in the immediately preceding turns. We built a set of generalised linear models on the longitudinal conversation data from seven children aged 1 to 5 years and their caregivers. Our models revealed that children not only increased their un production, but also learned to attend relevant cues in the preceding turns to understand when to respond by producing un. Children increasingly produced un when their interlocutors asked a yes-no question or signalled the continuation of their own speech. These results illustrate how children learn the probabilistic dependency between adjacent turns, and become able to participate in conversational interactions.
Collapse
Affiliation(s)
| | - Giovanni Sala
- National Center for Geriatrics and Gerontology, Obu, Japan
| |
Collapse
|
6
|
Huang Q, Li J, Yu J. Distribution characteristics of mitochondria-rich segments in the epididymis. Am J Clin Exp Immunol 2023; 12:72-73. [PMID: 37736076 PMCID: PMC10509488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 08/06/2023] [Accepted: 08/18/2023] [Indexed: 09/23/2023]
Abstract
The epididymis is a highly specialized tissue that plays vital roles in sperm maturation and storage. The spatio-temporal repertoire of epididymal cells and their gene expression in the epididymis remain less characterized. With the help of single-cell RNA sequencing (scRNA-seq), Shi et al., reveal a spatio- and segment-specific distribution pattern of mitochondria that adds another layer of complexity to our understanding of the epididymis. They unexpectedly find a higher abundance of mitochondria and mitochondrial transcription in the corpus and cauda compared to the caput of epididymis, which are believed to be responsible for providing the energy necessary for sperm maturation and motility.
Collapse
Affiliation(s)
- Qiuru Huang
- Institute of Reproductive Medicine, Medical School of Nantong University Nantong 226001, Jiangsu, PR China
| | - Jiaxin Li
- Institute of Reproductive Medicine, Medical School of Nantong University Nantong 226001, Jiangsu, PR China
| | - Jun Yu
- Institute of Reproductive Medicine, Medical School of Nantong University Nantong 226001, Jiangsu, PR China
| |
Collapse
|
7
|
Wei J. Kindness in British communities: Discursive practices of promoting kindness during the Covid pandemic. Discourse Soc 2023; 34:502-520. [PMID: 37829999 PMCID: PMC9834622 DOI: 10.1177/09579265221148691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 10/14/2023]
Abstract
This research adopts Critical Discourse Analysis as a perspective to explore how kindness was expressed and promoted in university communities and city communities from January to March in 2020 when the Covid pandemic broke out in the UK and provide a window on British culture in which kindness was expressed and promoted through discourse. It combines a qualitative method with a corpus-based quantitative method. It is found that kindness was meant for providing support and showing compassion and inclusion to community members and that strategies in lexis, syntax and metaphor can reproduce or resist the expression and promotion of kindness in communities. During the pandemic, the intentional kindness expressed by community authorities was respect of diversity rather than inclusion of different values or ethnicity and no substantial support was provided to vulnerable members even though authorities were trying to impress the public by claiming that they were making constant efforts to support the community. Case studies revealed that we should caution against the use of passivation and the pronouns like they.
Collapse
Affiliation(s)
- Jilan Wei
- Jilan Wei, Changzhou Institute of
Technology, Changzhou, Jiangsu 213022, China; School of Media, Arts and
Humanities, University of Sussex, Arts B348, Brighton, England BN1 9RH, UK.
| |
Collapse
|
8
|
Dhrangadhariya A, Hilfiker R, Sattelmayer M, Giacomino K, Caliesch R, Elsig S, Naderi N, Müller H. First Steps Towards a Risk of Bias Corpus of Randomized Controlled Trials. Stud Health Technol Inform 2023; 302:586-590. [PMID: 37203753 DOI: 10.3233/shti230210] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
Risk of bias (RoB) assessment of randomized clinical trials (RCTs) is vital to conducting systematic reviews. Manual RoB assessment for hundreds of RCTs is a cognitively demanding, lengthy process and is prone to subjective judgment. Supervised machine learning (ML) can help to accelerate this process but requires a hand-labelled corpus. There are currently no RoB annotation guidelines for randomized clinical trials or annotated corpora. In this pilot project, we test the practicality of directly using the revised Cochrane RoB 2.0 guidelines for developing an RoB annotated corpus using a novel multi-level annotation scheme. We report inter-annotator agreement among four annotators who used Cochrane RoB 2.0 guidelines. The agreement ranges between 0% for some bias classes and 76% for others. Finally, we discuss the shortcomings of this direct translation of annotation guidelines and scheme and suggest approaches to improve them to obtain an RoB annotated corpus suitable for ML.
Collapse
Affiliation(s)
- Anjani Dhrangadhariya
- Informatics Institute, HES-SO Valais-Wallis, Sierre, Switzerland
- University of Geneva (UNIGE), Geneva, Switzerland
| | | | | | - Katia Giacomino
- School of Health Sciences, HES-SO Valais-Wallis, Leukerbad, Switzerland
| | - Rahel Caliesch
- School of Health Sciences, HES-SO Valais-Wallis, Leukerbad, Switzerland
| | - Simone Elsig
- School of Health Sciences, HES-SO Valais-Wallis, Leukerbad, Switzerland
| | - Nona Naderi
- Geneva School of Business Administration, HES-SO Geneva, Switzerland
- SIB Swiss Institute of Bioinformatics (SIB), Geneva, Switzerland
| | - Henning Müller
- Informatics Institute, HES-SO Valais-Wallis, Sierre, Switzerland
- University of Geneva (UNIGE), Geneva, Switzerland
| |
Collapse
|
9
|
Karapetian K, Jeon SM, Kwon JW, Suh YK. Supervised Relation Extraction Between Suicide-Related Entities and Drugs: Development and Usability Study of an Annotated PubMed Corpus. J Med Internet Res 2023; 25:e41100. [PMID: 36884281 PMCID: PMC10034613 DOI: 10.2196/41100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2022] [Revised: 11/18/2022] [Accepted: 12/19/2022] [Indexed: 12/23/2022] Open
Abstract
BACKGROUND Drug-induced suicide has been debated as a crucial issue in both clinical and public health research. Published research articles contain valuable data on the drugs associated with suicidal adverse events. An automated process that extracts such information and rapidly detects drugs related to suicide risk is essential but has not been well established. Moreover, few data sets are available for training and validating classification models on drug-induced suicide. OBJECTIVE This study aimed to build a corpus of drug-suicide relations containing annotated entities for drugs, suicidal adverse events, and their relations. To confirm the effectiveness of the drug-suicide relation corpus, we evaluated the performance of a relation classification model using the corpus in conjunction with various embeddings. METHODS We collected the abstracts and titles of research articles associated with drugs and suicide from PubMed and manually annotated them along with their relations at the sentence level (adverse drug events, treatment, suicide means, or miscellaneous). To reduce the manual annotation effort, we preliminarily selected sentences with a pretrained zero-shot classifier or sentences containing only drug and suicide keywords. We trained a relation classification model using various Bidirectional Encoder Representations from Transformer embeddings with the proposed corpus. We then compared the performances of the model with different Bidirectional Encoder Representations from Transformer-based embeddings and selected the most suitable embedding for our corpus. RESULTS Our corpus comprised 11,894 sentences extracted from the titles and abstracts of the PubMed research articles. Each sentence was annotated with drug and suicide entities and the relationship between these 2 entities (adverse drug events, treatment, means, and miscellaneous). All of the tested relation classification models that were fine-tuned on the corpus accurately detected sentences of suicidal adverse events regardless of their pretrained type and data set properties. CONCLUSIONS To our knowledge, this is the first and most extensive corpus of drug-suicide relations.
Collapse
Affiliation(s)
- Karina Karapetian
- School of Computer Science and Engineering, Kyungpook National University, Daegu, Republic of Korea
| | - Soo Min Jeon
- College of Pharmacy, Jeju National University, Jeju, Republic of Korea
| | - Jin-Won Kwon
- BK21 FOUR Community-Based Intelligent Novel Drug Discovery Education Unit, College of Pharmacy and Research Institute of Pharmaceutical Sciences, Kyungpook National University, Daegu, Republic of Korea
| | - Young-Kyoon Suh
- School of Computer Science and Engineering, Kyungpook National University, Daegu, Republic of Korea
| |
Collapse
|
10
|
Huang R. A corpus-assisted discourse study of Chinese university students' perceptions of sustainability. Front Psychol 2023; 14:1124909. [PMID: 36874853 PMCID: PMC9980348 DOI: 10.3389/fpsyg.2023.1124909] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 01/16/2023] [Indexed: 02/18/2023] Open
Abstract
Education for sustainable development (ESD) in the higher education context plays a critical role in advancing the cause of sustainable development. However, previous research on university students' perceptions of sustainable development is limited. This study used a corpus-assisted eco-linguistic approach to investigate students' perceptions of sustainability issues and responsible actors to address these issues. This quantitative and qualitative study is based on a corpus of 501 collaborative essays on sustainability written by ~2,000 Chinese university students collected with their permission. The results show that the students had a comprehensive perception of the three dimensions of sustainable development. Environmental issues have received the most attention from students, followed by economic and social issues. With regard to perceived actors, students were inclined to view themselves as active participants in the cause of sustainable development, rather than as observers. They called for coordinated action of all relevant parties, such as the government, business sectors, institutions, and individuals. On the other hand, the author also noticed a tendency toward superficial green talk and anthropocentrism in students' discourse. This study aims to contribute to sustainability education by integrating findings into English as a foreign language (EFL) classes. Implications for sustainability education in the context of higher education are also discussed.
Collapse
|
11
|
Lyu Y, Han Z. Applying data-driven learning in self-translation of academic discourse: A case study of a Chinese medical student. Front Psychol 2023; 14:1071123. [PMID: 36891202 PMCID: PMC9986535 DOI: 10.3389/fpsyg.2023.1071123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2022] [Accepted: 01/24/2023] [Indexed: 02/22/2023] Open
Abstract
This article reports on an experiment on the use of data-driven learning (DDL) in the revision of self-translation by a Chinese medical student. The think-aloud method is employed to investigate the difficulties the student encountered in self-translation and the effectiveness of DDL in improving the quality of self-translation. Results show that difficulties in the self-translation of medical abstracts are mostly associated with markers of rhetorical moves, terminologies, and conventional academic expressions and that they can be effectively solved by such corpus consultation strategies as checking possible options in bilingual dictionaries, using the most certain keywords to find collocations, and using the most possible accompanying words to find contexts. A comparison of translations before and after the application of DDL reveals that it could help improve translation quality in lexical choices, syntactic structures, and discourse practice. An immediate interview shows that the participant holds a positive attitude toward DDL.
Collapse
Affiliation(s)
- Ying Lyu
- Institute of Corpus Studies and Applications, Shanghai International Studies University, Shanghai, China.,School of Foreign Language Studies, Wenzhou Medical University, Wenzhou, Zhejiang, China
| | - Ziman Han
- Institute of Corpus Studies and Applications, Shanghai International Studies University, Shanghai, China
| |
Collapse
|
12
|
Song J. How interaction molds semantics: The mood functions of Chinese "sum-up" adverbs. Front Psychol 2022; 13:1014858. [PMID: 36643698 PMCID: PMC9838191 DOI: 10.3389/fpsyg.2022.1014858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Accepted: 11/04/2022] [Indexed: 12/31/2022] Open
Abstract
"Hezhe" (), "ganqing" (), and "nao le bantian" () are common mood expressions in modern Chinese which have a common function of summarizing the information before (so called 'sum-up') as well as similar pragmatic functions. Study on these mood adverbs could reveal how the interactional mechanism molds the original semantic meanings of mood words and leads to new pragmatic functions. Five verbal corpora are applied to collect the real materials of usage containing the above three mood adverbs. Functional analysis and data statistics have been carried out to categorize the pragmatic functions of these words, calculate their distributions, and reconstruct their evolutional approach through an interactional perspective. We have found a core theoretical viewpoint that the similar functions of the three words emerged through the same pragmatic mechanism called "violation". These functions are: (1) "unexpectation" from the violation of psychological expectation; (2) "criticism" from the violation of universal principles; (3) "humor" from the violation of communicative principles. A statistic work of several corpora showed that these functions of the above three words appear broadly in verbal materials, with differences in their proportions according to the communication types and genres. In Chinese teaching to speakers of other languages, more attention should be paid to these words stemming from dialects, especially during intermediate and advanced levels.
Collapse
|
13
|
Gao Y, Dligach D, Miller T, Tesch S, Laffin R, Churpek MM, Afshar M. Hierarchical Annotation for Building A Suite of Clinical Natural Language Processing Tasks: Progress Note Understanding. LREC Int Conf Lang Resour Eval 2022; 2022:5484-5493. [PMID: 35939277 PMCID: PMC9354726] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Applying methods in natural language processing on electronic health records (EHR) data is a growing field. Existing corpus and annotation focus on modeling textual features and relation prediction. However, there is a paucity of annotated corpus built to model clinical diagnostic thinking, a process involving text understanding, domain knowledge abstraction and reasoning. This work introduces a hierarchical annotation schema with three stages to address clinical text understanding, clinical reasoning, and summarization. We created an annotated corpus based on an extensive collection of publicly available daily progress notes, a type of EHR documentation that is collected in time series in a problem-oriented format. The conventional format for a progress note follows a Subjective, Objective, Assessment and Plan heading (SOAP). We also define a new suite of tasks, Progress Note Understanding, with three tasks utilizing the three annotation stages. The novel suite of tasks was designed to train and evaluate future NLP models for clinical text understanding, clinical knowledge representation, inference, and summarization.
Collapse
Affiliation(s)
- Yanjun Gao
- ICU Data Science Lab, School of Medicine and Public Health, University of Wisconsin
| | | | | | - Samuel Tesch
- School of Medicine and Public Health, University of Wisconsin
| | - Ryan Laffin
- School of Medicine and Public Health, University of Wisconsin
| | - Matthew M. Churpek
- ICU Data Science Lab, School of Medicine and Public Health, University of Wisconsin
| | - Majid Afshar
- ICU Data Science Lab, School of Medicine and Public Health, University of Wisconsin
| |
Collapse
|
14
|
Abstract
This study investigated whether Korean children follow the acquisition pattern predicted by the Aspect Hypothesis (Shirai & Andersen, 1995), and the relationship between caretakers' and children's speech. Accordingly, we analyzed a Korean corpus (Ryu-Corpus) on the CHILDES database (MacWhinney, 2000), which comprised longitudinal video-recorded interactions of three Korean children and their caregivers. Results indicate that the children used the past marker -ess principally with telic verbs, consistent with the Aspect Hypothesis. Each child's usage closely reflects the caretaker's frequency, yielding a high correlation (τb = 0.79). However, the acquisition of the imperfective marker -ko iss did not show a predicted association with activity verbs, contrary to the Aspect Hypothesis. Furthermore, caretakers' input did not correlate with the children's utterances of the imperfective marker (τb = 0.40). We argue that multiple factors such as input frequency, language-specific organization of aspectual semantics, and individual differences should be considered to explain tense-aspect acquisition.
Collapse
|
15
|
Liu Q, Bilal, Komal B. A Corpus-Based Comparison of the Chief Executive Officer Statements in Annual Reports and Corporate Social Responsibility Reports. Front Psychol 2022; 13:851405. [PMID: 35529572 PMCID: PMC9072963 DOI: 10.3389/fpsyg.2022.851405] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2022] [Accepted: 02/21/2022] [Indexed: 11/13/2022] Open
Abstract
This study presents a corpus-based comparison of the Chief Executive Officer (CEO) statements between annual reports and corporate social responsibility (CSR) reports. Using a corpus of 209 CEO statements from annual reports and CSR reports of Chinese companies, this study employs the Discourse-Historical Approach of critical discourse analysis to investigate the nomination strategies and key topics in these two related reports. The results showed that corporate leaders tend to have different priorities in annual reports and CSR reports. In annual reports, corporate leaders highlight the economic and pragmatic concerns of stakeholders to create a professionally capable and objective corporate image. In CSR reports, corporate leaders highlight the ethical concerns of stakeholders to create a socially responsible corporate image and adopt a more engaging and affiliative voice through the use of first-person pronouns. This study has significance in understanding the differences in the related genres of annual reports and CSR reports for the stakeholders.
Collapse
Affiliation(s)
- Qingrong Liu
- Department of English, Wuhan University, Wuhan, China
| | - Bilal
- School of Accounting, Hubei University of Economics, Wuhan, China
| | - Bushra Komal
- Business School, University of International Business and Economics, Beijing, China
| |
Collapse
|
16
|
Di Natale MR, Patten L, Molero JC, Stebbing MJ, Hunne B, Wang X, Liu Z, Furness JB. Organisation of the musculature of the rat stomach. J Anat 2022; 240:711-723. [PMID: 34747011 PMCID: PMC8930815 DOI: 10.1111/joa.13587] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Revised: 10/26/2021] [Accepted: 10/27/2021] [Indexed: 01/02/2023] Open
Abstract
The strengths, directions and coupling of the movements of the stomach depend on the organisation of its musculature. Although the rat has been used as a model species to study gastric function, there is no detailed, quantitative study of the arrangement of the gastric muscles in rat. Here we provide a descriptive and quantitative account, and compare it with human gastric anatomy. The rat stomach has three components of the muscularis externa, a longitudinal coat, a circular coat and an internal oblique (sling) muscle in the region of the gastro-oesophageal junction. These layers are similar to human. Unlike human, the rat stomach is also equipped with paired muscular oesophago-pyloric ligaments that lie external to the longitudinal muscle. There is a prominent muscularis mucosae throughout the stomach and strands of smooth muscle occur in the mucosa, between the glands of the corpus and antrum. The striated muscle of the oesophageal wall reaches to the stomach, unlike the human, in which the wall of the distal oesophagus is smooth muscle. Thus, the continuity of gastric and oesophageal smooth muscle bundles, that occurs in human, does not occur in rat. Circular muscle bundles extend around the circumference of the stomach, in the fundus forming a cap of parallel muscle bundles. This arrangement favours co-ordinated circumferential contractions. Small bands of muscle make connections between the circular muscle bundles. This is consistent with a slower conduction of excitation orthogonal to the circular muscle bundles, across the corpus towards the distal antrum. The oblique muscle merged and became continuous with the circular muscle close to the gastro-oesophageal junction at the base of the fundus, and in the corpus, lateral to the lesser curvature. Quantitation of muscle thickness revealed gradients of thickness of both the longitudinal and circular muscle. This anatomical study provides essential data for interpreting gastric movements.
Collapse
Affiliation(s)
- Madeleine R. Di Natale
- Department of Anatomy & PhysiologyUniversity of MelbourneParkvilleVictoriaAustralia
- Florey Institute of Neuroscience and Mental HealthParkvilleVictoriaAustralia
| | - Lauren Patten
- Florey Institute of Neuroscience and Mental HealthParkvilleVictoriaAustralia
| | - Juan C. Molero
- Department of Anatomy & PhysiologyUniversity of MelbourneParkvilleVictoriaAustralia
- Florey Institute of Neuroscience and Mental HealthParkvilleVictoriaAustralia
| | - Martin J. Stebbing
- Department of Anatomy & PhysiologyUniversity of MelbourneParkvilleVictoriaAustralia
- Florey Institute of Neuroscience and Mental HealthParkvilleVictoriaAustralia
| | - Billie Hunne
- Department of Anatomy & PhysiologyUniversity of MelbourneParkvilleVictoriaAustralia
| | - Xiaokai Wang
- Department of Biomedical EngineeringUniversity of MichiganAnn ArborMichiganUSA
| | - Zhongming Liu
- Department of Biomedical EngineeringUniversity of MichiganAnn ArborMichiganUSA
| | - John B. Furness
- Department of Anatomy & PhysiologyUniversity of MelbourneParkvilleVictoriaAustralia
- Florey Institute of Neuroscience and Mental HealthParkvilleVictoriaAustralia
| |
Collapse
|
17
|
Funkner AA, Zhurman DA, Kovalchuk SV. Extraction of Temporal Structures for Clinical Events in Unlabeled Free-Text Electronic Health Records in Russian. Stud Health Technol Inform 2021; 287:55-56. [PMID: 34795079 DOI: 10.3233/shti210811] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
The important information about a patient is often stored in a free-form text to describe the events in the patient's medical history. In this work, we propose and evaluate a hybrid approach based on rules and syntactical analysis to normalise temporal expressions and assess uncertainty depending on the remoteness of the event. A dataset of 500 sentences was manually labelled to measure the accuracy. On this dataset, the accuracy of extracting temporal expressions is 95,5%, and the accuracy of normalization is 94%. The event extraction accuracy is 74.80%. The essential advantage of this work is the implementation of the considered approach for the non-English language where NLP tools are limited.
Collapse
Affiliation(s)
| | | | - Sergey V Kovalchuk
- ITMO University, Saint Petersburg, Russia.,National Almazov Medical Research Centre, Saint Petersburg, Russia
| |
Collapse
|
18
|
Jaffey DM, Chesney L, Powley TL. Stomach serosal arteries distinguish gastric regions of the rat. J Anat 2021; 239:903-912. [PMID: 34142374 PMCID: PMC8450471 DOI: 10.1111/joa.13480] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Revised: 05/12/2021] [Accepted: 05/22/2021] [Indexed: 11/29/2022] Open
Abstract
Because the stomach in situ has few distinctive surface features and changes shape dramatically with food intake, we have used micro-CT imaging combined with two distinct contrast agents to (1) characterize the pattern of arteries, potential landmarks, on the stomach wall and (2) evaluate how meal-related shape changes affect the size of the different regions. Images generated with a contrast agent injected directly into the heart during perfusion enabled a thorough look at the organizational features of the stomach angioarchitecture. The stomach receives its blood supply primarily from two pairs of vessels, the gastric and gastroepiploic arteries. Each of the three regions of the stomach is delineated by a distinctive combination of arterial fields: the corpus, consistent with its dynamic secretory activity and extensive mucosa, is supplied by extensive arterial trees formed by the left and right gastric arteries, travelling, respectively, on the ventral and dorsal stomach surfaces. These major arteries course circularly from the lesser towards the greater curvature, distally along both left (or ventral) and right (or dorsal) walls of the corpus, and branch rostrally to supply the region. The muscular antrum is characterized by smaller arterial branches arising primarily from the right gastroepiploic artery that follows the distal greater curvature and secondarily from small, distally directed arteries supplied by the large vessels of the left and right gastric arteries. The forestomach, essentially devoid of mucosal tissue and separated from the corpus by the limiting ridge, is vascularized predominantly by a network of small arteries issued from the left gastroepiploic artery coursing around the proximal greater curvature, as well as from higher order and smaller branches issued by the gastric and celiac arteries. These distinctive arterial fields appear to distinguish the major gastric regions, irrespective of the degree of fill of the stomach. Volume assessments of stomach compartments were made from images of iodine-stained stomachs. By varying the delay time between eating and perfusion, we were able to probe the emptying behavior of the stomach and demonstrate that the regions of the stomach empty at different rates, thus changing the relative dimensions of the organ regions. Notably, and despite these shape changes, the gastric arteries appear to form a regular, particularly recognizable, and lateralized pattern corresponding to the corpus that should be of use in guiding surgical and experimental interventions.
Collapse
Affiliation(s)
- Deborah M. Jaffey
- Department of Psychological SciencesPurdue UniversityWest LafayetteIndianaUSA
| | - Logan Chesney
- Department of Psychological SciencesPurdue UniversityWest LafayetteIndianaUSA
| | - Terry L. Powley
- Department of Psychological SciencesPurdue UniversityWest LafayetteIndianaUSA
| |
Collapse
|
19
|
Froese T, Broome M, Carel H, Humpston C, Malpass A, Mori T, Ratcliffe M, Rodrigues J, Sangati F. The Pandemic Experience: A Corpus of Subjective Reports on Life During the First Wave of COVID-19 in the UK, Japan, and Mexico. Front Public Health 2021; 9:725506. [PMID: 34490201 PMCID: PMC8418530 DOI: 10.3389/fpubh.2021.725506] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 07/16/2021] [Indexed: 11/13/2022] Open
Affiliation(s)
- Tom Froese
- Embodied Cognitive Science Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
| | - Matthew Broome
- School of Psychology, University of Birmingham, Birmingham, United Kingdom
| | - Havi Carel
- Department of Philosophy, University of Bristol, Bristol, United Kingdom
| | - Clara Humpston
- School of Psychology, University of Birmingham, Birmingham, United Kingdom
| | - Alice Malpass
- Bristol Medical School, University of Bristol, Bristol, United Kingdom
| | - Tomoari Mori
- Human Subjects Research Review Committee, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
| | | | - Jamila Rodrigues
- Embodied Cognitive Science Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
| | - Federico Sangati
- Embodied Cognitive Science Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan
| |
Collapse
|
20
|
Ouyang S, Wang Y, Zhou K, Xia J. LitCovid-AGAC: cellular and molecular level annotation data set based on COVID-19. Genomics Inform 2021; 19:e23. [PMID: 34638170 PMCID: PMC8510875 DOI: 10.5808/gi.21013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Revised: 09/03/2021] [Accepted: 09/13/2021] [Indexed: 12/20/2022] Open
Abstract
Currently, coronavirus disease 2019 (COVID-19) literature has been increasing dramatically, and the increased text amount make it possible to perform large scale text mining and knowledge discovery. Therefore, curation of these texts becomes a crucial issue for Bio-medical Natural Language Processing (BioNLP) community, so as to retrieve the important information about the mechanism of COVID-19. PubAnnotation is an aligned annotation system which provides an efficient platform for biological curators to upload their annotations or merge other external annotations. Inspired by the integration among multiple useful COVID-19 annotations, we merged three annotations resources to LitCovid data set, and constructed a cross-annotated corpus, LitCovid-AGAC. This corpus consists of 12 labels including Mutation, Species, Gene, Disease from PubTator, GO, CHEBI from OGER, Var, MPA, CPA, NegReg, PosReg, Reg from AGAC, upon 50,018 COVID-19 abstracts in LitCovid. Contain sufficient abundant information being possible to unveil the hidden knowledge in the pathological mechanism of COVID-19.
Collapse
Affiliation(s)
- Sizhuo Ouyang
- Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, 430070 Wuhan, China
| | - Yuxing Wang
- Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, 430070 Wuhan, China
| | - Kaiyin Zhou
- Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, 430070 Wuhan, China
| | - Jingbo Xia
- Hubei Key Lab of Agricultural Bioinformatics, College of Informatics, Huazhong Agricultural University, 430070 Wuhan, China
| |
Collapse
|
21
|
Hobbs ET, Goralski SM, Mitchell A, Simpson A, Leka D, Kotey E, Sekira M, Munro JB, Nadendla S, Jackson R, Gonzalez-Aguirre A, Krallinger M, Giglio M, Erill I. ECO-CollecTF: A Corpus of Annotated Evidence-Based Assertions in Biomedical Manuscripts. Front Res Metr Anal 2021; 6:674205. [PMID: 34327299 PMCID: PMC8313968 DOI: 10.3389/frma.2021.674205] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2021] [Accepted: 06/28/2021] [Indexed: 11/20/2022] Open
Abstract
Analysis of high-throughput experiments in the life sciences frequently relies upon standardized information about genes, gene products, and other biological entities. To provide this information, expert curators are increasingly relying on text mining tools to identify, extract and harmonize statements from biomedical journal articles that discuss findings of interest. For determining reliability of the statements, curators need the evidence used by the authors to support their assertions. It is important to annotate the evidence directly used by authors to qualify their findings rather than simply annotating mentions of experimental methods without the context of what findings they support. Text mining tools require tuning and adaptation to achieve accurate performance. Many annotated corpora exist to enable developing and tuning text mining tools; however, none currently provides annotations of evidence based on the extensive and widely used Evidence and Conclusion Ontology. We present the ECO-CollecTF corpus, a novel, freely available, biomedical corpus of 84 documents that captures high-quality, evidence-based statements annotated with the Evidence and Conclusion Ontology.
Collapse
Affiliation(s)
- Elizabeth T Hobbs
- Department of Biological Sciences, University of Maryland Baltimore County, Baltimore, MD, United States
| | - Stephen M Goralski
- Department of Biological Sciences, University of Maryland Baltimore County, Baltimore, MD, United States
| | - Ashley Mitchell
- Department of Biological Sciences, University of Maryland Baltimore County, Baltimore, MD, United States
| | - Andrew Simpson
- Department of Biological Sciences, University of Maryland Baltimore County, Baltimore, MD, United States
| | - Dorjan Leka
- Department of Biological Sciences, University of Maryland Baltimore County, Baltimore, MD, United States
| | - Emmanuel Kotey
- Department of Biological Sciences, University of Maryland Baltimore County, Baltimore, MD, United States
| | - Matt Sekira
- Department of Biological Sciences, University of Maryland Baltimore County, Baltimore, MD, United States
| | - James B Munro
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, United States
| | - Suvarna Nadendla
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, United States
| | - Rebecca Jackson
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, United States
| | | | - Martin Krallinger
- Barcelona Supercomputing Center (BSC), Barcelona, Spain.,Centro Nacional de Investigaciones Oncológicas (CNIO), Madrid, Spain
| | - Michelle Giglio
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD, United States
| | - Ivan Erill
- Department of Biological Sciences, University of Maryland Baltimore County, Baltimore, MD, United States
| |
Collapse
|
22
|
Ribaldone DG, Zurlo C, Fagoonee S, Rosso C, Armandi A, Caviglia GP, Saracco GM, Pellicano R. A Retrospective Experience of Helicobacter pylori Histology in a Large Sample of Subjects in Northern Italy. Life (Basel) 2021; 11:life11070650. [PMID: 34357021 PMCID: PMC8303344 DOI: 10.3390/life11070650] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2021] [Revised: 07/01/2021] [Accepted: 07/02/2021] [Indexed: 12/12/2022] Open
Abstract
Updated data about the prevalence of Helicobacter pylori (H. pylori) and its correlation with histological results are scarce. The aim of our study was to provide current data on the impact of H. pylori in a third-level endoscopy service. We performed a large, retrospective study analyzing the results of all histological samples of gastroscopy from the year 2019. In total, 1512 subjects were included. The prevalence of H. pylori was 16.8%. A significant difference between the prevalence in subjects born in Italy and those from eastern Europe, south America, or Africa was found (p < 0.0001, p = 0.006, and p = 0.0006, respectively). An association was found between H. pylori and active superficial gastritis (p < 0.0001). Current H. pylori and/or a previous finding of H. pylori was related to antral atrophy (p < 0.0001). Fifteen patients had low-grade dysplasia. There were no statistically significant associations with current or past H. pylori infection. One patient presented gastric cardia adenocarcinoma with regular gastric mucosa. One patient, H. pylori positive, was diagnosed with gastric signet ring cell adenocarcinoma in a setting of diffuse atrophy, without metaplasia.. Our study provides updated, solid (biopsy diagnosis and large population) data on the prevalence of H. pylori infection in a representative region of southern Europe.
Collapse
Affiliation(s)
- Davide Giuseppe Ribaldone
- Department of Medical Sciences, Division of Gastroenterology, University of Torino, 10124 Torino, Italy; (C.Z.); (C.R.); (A.A.); (G.M.S.)
- Correspondence: (D.G.R.); (G.P.C.)
| | - Carlo Zurlo
- Department of Medical Sciences, Division of Gastroenterology, University of Torino, 10124 Torino, Italy; (C.Z.); (C.R.); (A.A.); (G.M.S.)
| | - Sharmila Fagoonee
- Institute of Biostructure and Bioimaging (CNR), Molecular Biotechnology Center, 10126 Turin, Italy;
| | - Chiara Rosso
- Department of Medical Sciences, Division of Gastroenterology, University of Torino, 10124 Torino, Italy; (C.Z.); (C.R.); (A.A.); (G.M.S.)
| | - Angelo Armandi
- Department of Medical Sciences, Division of Gastroenterology, University of Torino, 10124 Torino, Italy; (C.Z.); (C.R.); (A.A.); (G.M.S.)
| | - Gian Paolo Caviglia
- Department of Medical Sciences, Division of Gastroenterology, University of Torino, 10124 Torino, Italy; (C.Z.); (C.R.); (A.A.); (G.M.S.)
- Correspondence: (D.G.R.); (G.P.C.)
| | - Giorgio Maria Saracco
- Department of Medical Sciences, Division of Gastroenterology, University of Torino, 10124 Torino, Italy; (C.Z.); (C.R.); (A.A.); (G.M.S.)
| | - Rinaldo Pellicano
- Department of General and Specialist Medicine, Gastroenterologia-U, Città della Salute e della Scienza di Torino, C.so Bramante 88, 10126 Turin, Italy;
| |
Collapse
|
23
|
Chen L, Chang K. A novel corpus-based computing method for handling critical word-ranking issues: An example of COVID-19 research articles. INT J INTELL SYST 2021; 36:3190-3216. [PMID: 38607844 PMCID: PMC8207067 DOI: 10.1002/int.22413] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Revised: 02/10/2021] [Accepted: 02/21/2021] [Indexed: 12/24/2022]
Abstract
A corpus is a massive body of structured textual data that are stored and operated electronically. It usually combines with statistics, machine learning algorithms, or artificial intelligence (AI) technologies to explore the semantic relationship between lexical units, and beneficial when applied to language learning, information processing, translation, and so forth. In the face of a novel disease, like, COVID-19, establishing medical-specific corpus will enhance frontline medical personnel's information acquisition efficiency, guiding them on the right approaches to respond to and prevent the novel disease. To effectively retrieve critical messages from the corpus, appropriately handling word-ranking issues is quite crucial. However, traditional frequency-based approaches may cause bias in handling word-ranking issues because they neither optimize the corpus nor integrally take words' frequency dispersion and concentration criteria into consideration. Thus, this paper develops a novel corpus-based approach that combines a corpus software and Hirsch index (H-index) algorithm to handle the aforementioned issues simultaneously, making word-ranking processes more accurate. This paper compiled 100 COVID-19-related research articles as an empirical example of the target corpus. To verify the proposed approach, this study compared the results of two traditional frequency-based approaches and the proposed approach. The results indicate that the proposed approach can refine corpus and simultaneously compute words' frequency dispersion and concentration criteria in handling word-ranking issues.
Collapse
Affiliation(s)
- Liang‐Ching Chen
- Department of Foreign LanguagesR.O.C. Military AcademyKaohsiungTaiwan
- Institute of Education, National Sun Yat‐sen UniversityKaohsiungTaiwan
| | - Kuei‐Hu Chang
- Department of Management SciencesR.O.C. Military AcademyKaohsiungTaiwan
- Institute of Innovation and Circular Economy, Asia UniversityTaichungTaiwan
| |
Collapse
|
24
|
Hansen NC, Treider JMG, Swarbrick D, Bamford JS, Wilson J, Vuoskoski JK. A Crowd-Sourced Database of Coronamusic: Documenting Online Making and Sharing of Music During the COVID-19 Pandemic. Front Psychol 2021; 12:684083. [PMID: 34248787 PMCID: PMC8262515 DOI: 10.3389/fpsyg.2021.684083] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2021] [Accepted: 05/25/2021] [Indexed: 11/13/2022] Open
Affiliation(s)
- Niels Chr. Hansen
- Aarhus Institute of Advanced Studies, Aarhus University, Aarhus, Denmark
- Department of Clinical Medicine, Center for Music in the Brain, Aarhus University and Royal Academy of Music Aarhus/Aalborg, Aarhus, Denmark
| | - John Melvin G. Treider
- Department of Musicology & Department of Psychology, RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Movement, University of Oslo, Oslo, Norway
| | - Dana Swarbrick
- Department of Musicology & Department of Psychology, RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Movement, University of Oslo, Oslo, Norway
| | - Joshua S. Bamford
- Social Body Lab, Institute of Cognitive and Evolutionary Anthropology, School of Anthropology & Museum Ethnography, University of Oxford, Oxford, United Kingdom
- School of Psychological Science, The University of Western Australia, Perth, WA, Australia
| | - Johanna Wilson
- Department of Music, Arts and Culture Studies, Centre for Interdisciplinary Music Research, University of Jyväskylä, Jyväskylä, Finland
| | - Jonna Katariina Vuoskoski
- Department of Musicology & Department of Psychology, RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Movement, University of Oslo, Oslo, Norway
| |
Collapse
|
25
|
Abstract
Based on the Competition Model, the current study investigated how cue availability and cue reliability as two important input factors influenced second language (L2) learners' cue learning of the English article construction. Written corpus data of university-level Chinese-L1 learners of English were sampled for a comparison of English majors and non-English majors who demonstrated two levels of L2 competence in English article usage. The path model analysis in structural equation modeling was utilized to investigate the relationship between the input factors and L2 usage (frequency and accuracy of article cue production). The findings contribute novel and scarce empirical evidence that confirms a central claim of the Competition Model, i.e., the changing importance of cue availability and cue reliability in the frequency and accuracy of production. Cue availability was found to determine L2 production frequency regardless of level of L2 competence. Cue reliability was the input factor that differentiated competence levels. When learners stayed at a relatively lower L2 proficiency, cue reliability played an important role in influencing L2 frequency of usage rather than accuracy of usage. When learners developed increased exposure to and stronger competence in the target language, cue reliability played a significant role in determining learners' success of cue learning. The study is methodologically innovative and expands the empirical applicability of the Competition Model to the domain of second language production and construction learning.
Collapse
Affiliation(s)
- Helen Zhao
- School of Languages and Linguistics, University of Melbourne, Melbourne, VIC, Australia
| | - Jason Fan
- Language Testing Research Centre, University of Melbourne, Melbourne, VIC, Australia
| |
Collapse
|
26
|
Ozturk T, Sengul D, Sengul I. Helicobacter pylori and association between its positivity and anatomotopographic settlement in the stomach with the host age range. Ann Afr Med 2021; 20:1-8. [PMID: 33727504 PMCID: PMC8102889 DOI: 10.4103/aam.aam_69_19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Helicobacter pylori (H. pylori) is a Gram-negative, helically shaped flagellated bacterium. Major diseases associated with H. pylori infection include peptic ulcer, gastric adenocarcinoma, and mucosa-associated lymphoid tissue lymphoma. The incidence of H. pylori in the anatomotopographic regions of the stomach, such as antrum, corpus, fundus, and incisura angularis, has been investigated. Do the rates of H. pylori in the settlements change over time according to the age ranges of the hosts? Does this change affect the diseases caused by or related to H. pylori? It is estimated that the outcomes, which have been obtained, may provide a new perspective in terms of understanding the etiopathogenesis of H. pylori-induced diseases. A comprehensive literature search of PubMed/MEDLINE databases had been conducted using a combination of terms, “Helicobacter pylori,” “Sydney System,” “stomach,” “pyloric antrum,” “gastric corpus,” “stomach cancer,” and “Helicobacter pylori and age.” There are very few articles examining the relationship between the topographic locations of H. pylori and host age range in the English language literature. Therefore, it is also purposed to emphasize the outcomes of our current research about the mentioned topic. In our opinion, similar studies should reveal the settlement and age range in the different geographic locations and societies as in our study. We believe that these findings will contribute to the efforts for understanding overtly of H. pylori-induced disease of the stomach.
Collapse
Affiliation(s)
- Tuncer Ozturk
- Department of General Surgery, Giresun University Faculty of Medicine, TR28100 Giresun, Turkey
| | - Demet Sengul
- Department of Pathology, Giresun University Faculty of Medicine, TR28100 Giresun, Turkey
| | - Ilker Sengul
- Department of General Surgery, Giresun University Faculty of Medicine, TR28100 Giresun, Turkey
| |
Collapse
|
27
|
Abstract
We describe a corpus of speech taking place between 30 Korean mother–child pairs, divided in three groups of Prelexical (M = 0;08), Early-Lexical (M = 1;02), and Advanced-Lexical (M = 2;03). In addition to the child-directed speech (CDS), this corpus includes two different formalities of adult-directed speech (ADS), i.e., family-directed ADS (ADS_Fam) and experimenter-directed ADS (ADS_Exp). Our analysis of the MLU in CDS, family-, and experimenter-directed ADS found significant differences between CDS and ADS_Fam, and between ADS_Fam and ADS_Exp, but not between CDS and ADS_Exp. Our finding suggests that researchers should pay more attention to controlling the level of formality in CDS and ADS when comparing the two registers for their speech characteristics. The corpus was transcribed in the CHAT format of the CHILDES system, so users can easily extract data related to verbal behavior in the mother–child interaction using the CLAN program of CHILDES.
Collapse
Affiliation(s)
- Eon-Suk Ko
- Department of English Language and Literature, Chosun University, Gwangju, South Korea
| | - Jinyoung Jo
- Department of Linguistics, University of California, Los Angeles, Los Angeles, CA, United States
| | - Kyung-Woon On
- Department of Computer Science and Engineering, Seoul National University, Seoul, South Korea
| | - Byoung-Tak Zhang
- Department of Computer Science and Engineering, Seoul National University, Seoul, South Korea
| |
Collapse
|
28
|
Abstract
That demonstratives often have endophoric functions marking referents outside the physical space of interaction but accessible through cognition, especially memory, is well-known. These functions are often classified as independent from exophoric ones and are typically seen as secondary with respect to spatial deixis. However, data from multiple languages show that cognitive access to referents functions alongside of perceptual access, including vision. Cognitive access is enabled by prior interactions and prior familiarity with the referents. As a result of such interactions, the interlocutors share a great deal of knowledge about the referents, which facilitates reference to objects in the interactive field. The centrality of common ground in reference to an object at the interactive scene challenges the often assumed classification of demonstrative reference into exophoric and endophoric. I illustrate this idea throughout the paper by using first-hand data from Mano, a Mande language of Guinea. Adding another argument in favor of viewing demonstrative reference as a social, interactive process, the Mano data push the idea of salience of non-spatial parameters further and emphasizes the importance of short and long-term interactional history and cultural knowledge both for the choice of demonstratives in exophoric reference and for the structuring of the demonstrative paradigm.
Collapse
Affiliation(s)
- Maria Khachaturyan
- Helsinki University Humanities Programme, University of Helsinki, Helsinki, Finland
| |
Collapse
|
29
|
Zhang X, Xue Y. A Novel GAN-Based Synthesis Method for In-Air Handwritten Words. Sensors (Basel) 2020; 20:E6548. [PMID: 33207826 DOI: 10.3390/s20226548] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Revised: 11/07/2020] [Accepted: 11/13/2020] [Indexed: 11/16/2022]
Abstract
In recent years, with the miniaturization and high energy efficiency of MEMS (micro-electro-mechanical systems), in-air handwriting technology based on inertial sensors has come to the fore. Most of the previous works have focused on character-level in-air handwriting recognition. In contrast, few works focus on word-level in-air handwriting tasks. In the field of word-level recognition, researchers have to face the problems of insufficient data and poor generalization performance of recognition methods. On one hand, the training of deep neural learning networks usually requires a particularly large dataset, but collecting data will take a lot of time and money. On the other hand, a deep recognition network trained on a small dataset can hardly recognize samples whose labels do not appear in the training set. To address these problems, we propose a two-stage synthesis method of in-air handwritten words. The proposed method includes a splicing module guided by an additional corpus and a generating module trained by adversarial learning. We carefully design the proposed network so that it can handle word sample inputs of arbitrary length and pay more attention to the details of the samples. We design multiple sets of experiments on a public dataset. The experimental results demonstrate the success of the proposed method. What is impressive is that with the help of the air-writing word synthesizer, the recognition model learns the context information (combination information of characters) of the word. In this way, it can recognize words that have never appeared in the training process. In this paper, the recognition model trained on synthetic data achieves a word-level recognition accuracy of 62.3% on the public dataset. Compared with the model trained using only the public dataset, the word-level accuracy is improved by 62%. Furthermore, the proposed method can synthesize realistic samples under the condition of limited of in-air handwritten character samples and word samples. It largely solves the problem of insufficient data. In the future, mathematically modeling the strokes between characters in words may help us find a better way to splice character samples. In addition, we will apply our method to various datasets and improve the splicing module and generating module for different tasks.
Collapse
|
30
|
Garcia B, Sallandre MA. Contribution of the Semiological Approach to Deixis-Anaphora in Sign Language: The Key Role of Eye-Gaze. Front Psychol 2020; 11:583763. [PMID: 33240174 PMCID: PMC7677344 DOI: 10.3389/fpsyg.2020.583763] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Accepted: 09/10/2020] [Indexed: 11/13/2022] Open
Abstract
We address the issue of deixis–anaphora in sign language (SL) discourse, focusing on the role of eye-gaze. According to the Semiological Approach, SL structuring stems from a maximum exploitation of the visuo-gestural modality, which results in two modes of meaning production, depending on the signer’s semiotic intent. Involving both non-manual and manual parameters, the first mode, expressing the intent to say while showing, uses constructions based on structures, the termed “transfer structures.” The second one, expressing the intent to say without showing, involves lexical, pointing and fingerspelling units. In order to situate our descriptive concepts with respect to those used by SL linguists who, like us, adopt a cognitive–functionalist perspective, we expose a specific theoretical foundation of our approach, the “enunciation theories.” The concept of “enunciation” is decisive for understanding the role of eye-gaze, as being at the foundation of deixis and the key vector of referential creation and tracking in SL discourse. “Enunciation” entails the opposition between “Enunciation” and “Utterance” Domains. The first links, as co-enunciators, the signer/speaker and his/her addressee, establishing them by the very “act of enunciation” as 1st and 2nd person. The second is internal to the discourse produced. Grounding on corpora of narratives in several SLs (some with no historical link), we illustrate this crucial role of eye-gaze and the diversity of functions it fulfills. Our analyses, carried out in this perspective, attest to the multiple structural similarities between SLs, particularly with regard to transfer structures. This result strongly supports the typological hypothesis underlying our approach, namely, that these structures are common to all SLs. We thus show that an enunciative analysis, based on the key role of eye-gaze in these visual languages that are SLs, is able to give the simplest account of their own linguistic economy and, in particular, of deixis–anaphora in these languages.
Collapse
Affiliation(s)
- Brigitte Garcia
- Structures Formelles du Langage Laboratory, UMR 7023, Centre National de la Recherche Scientifique, University of Paris 8 - University Paris Lumières, Paris, France
| | - Marie-Anne Sallandre
- Structures Formelles du Langage Laboratory, UMR 7023, Centre National de la Recherche Scientifique, University of Paris 8 - University Paris Lumières, Paris, France
| |
Collapse
|
31
|
Kim S, Kim R, Nam HJ, Kim RG, Ko E, Kim HS, Shin J, Cho D, Jin Y, Bae S, Jo YW, Jeong SA, Kim Y, Ahn S, Jang B, Seong J, Lee Y, Seo SE, Kim Y, Kim HJ, Kim H, Sung HL, Lho H, Koo J, Chu J, Lim J, Kim Y, Lee K, Lim Y, Kim M, Hwang S, Han S, Bae S, Kim S, Yoo S, Seo Y, Shin Y, Kim Y, Ko YJ, Baek J, Hyun H, Choi H, Oh JH, Kim DY, Park HS. Organizing an in-class hackathon to correct PDF-to-text conversion errors of Genomics & Informatics 1.0. Genomics Inform 2020; 18:e33. [PMID: 33017877 PMCID: PMC7560450 DOI: 10.5808/gi.2020.18.3.e33] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
This paper describes a community effort to improve earlier versions of the full-text corpus of Genomics & Informatics by semi-automatically detecting and correcting PDF-to-text conversion errors and optical character recognition errors during the first hackathon of Genomics & Informatics Annotation Hackathon (GIAH) event. Extracting text from multi-column biomedical documents such as Genomics & Informatics is known to be notoriously difficult. The hackathon was piloted as part of a coding competition of the ELTEC College of Engineering at Ewha Womans University in order to enable researchers and students to create or annotate their own versions of the Genomics & Informatics corpus, to gain and create knowledge about corpus linguistics, and simultaneously to acquire tangible and transferable skills. The proposed projects during the hackathon harness an internal database containing different versions of the corpus and annotations.
Collapse
Affiliation(s)
- Sunho Kim
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Royoung Kim
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Hee-Jo Nam
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Ryeo-Gyeong Kim
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Enjin Ko
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Han-Su Kim
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Jihye Shin
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Daeun Cho
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Yurhee Jin
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Soyeon Bae
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Ye Won Jo
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - San Ah Jeong
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Yena Kim
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Seoyeon Ahn
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Bomi Jang
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Jiheyon Seong
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Yujin Lee
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Si Eun Seo
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Yujin Kim
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Ha-Jeong Kim
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Hyeji Kim
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Hye-Lynn Sung
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Hyoyoung Lho
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Jaywon Koo
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Jion Chu
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Juwon Lim
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Youngju Kim
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Kyungyeon Lee
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Yuri Lim
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Meongeun Kim
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Seonjeong Hwang
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Shinhye Han
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Sohyeun Bae
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Sua Kim
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Suhyeon Yoo
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Yeonjeong Seo
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Yerim Shin
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Yonsoo Kim
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - You-Jung Ko
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Jihee Baek
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Hyejin Hyun
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Hyemin Choi
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Ji-Hye Oh
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Da-Young Kim
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea
| | - Hyun-Seok Park
- Bioinformatics & Natural Language Processing Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea.,Center for Convergence Research of Advanced Technologies, Ewha Womans University, Seoul 03760, Korea
| |
Collapse
|
32
|
Uzundag BA, Küntay AC. The acquisition and use of relative clauses in Turkish-learning children's conversational interactions: a cross-linguistic approach. J Child Lang 2019; 46:1142-1168. [PMID: 31495350 DOI: 10.1017/s030500091900045x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Using a cross-linguistic approach, we investigated Turkish-speaking children's acquisition and use of relative clauses (RCs) by examining longitudinal child-caregiver interactions and cross-sectional peer conversations. Longitudinal data were collected from 8 children between the ages of 8 and 36 months. Peer conversational corpus came from 78 children aged between 43 and 64 months. Children produced RCs later than in English (Diessel, 2004) and Mandarin (Chen & Shirai, 2015), and demonstrated increasing semantic and structural complexity with age. Despite the morphosyntactic difficulty of object RCs, and prior experimental findings showing a subject RC advantage, preschool-aged children produced object RCs, which were highly frequent in child-directed speech, as frequently as subject RCs. Object RCs in spontaneous speech were semantically less demanding (with pronominal subjects and inanimate head nouns) than the stimuli used in prior experiments. Results suggest that multiple factors such as input frequency and morphosyntactic and semantic difficulty affect the acquisition patterns.
Collapse
|
33
|
Abstract
The epithelium of the human epididymis maintains an appropriate luminal environment for sperm maturation that is essential for male fertility. Regional expression of small noncoding RNAs such as microRNAs contributes to segment-specific gene expression and differentiated functions. MicroRNA profiles were reported in human epididymal tissues but not specifically in the epithelial cells derived from those regions. Here, we reveal miRNA signatures of primary cultures of caput, corpus, and cauda epididymis epithelial cells and of the tissues from which they were derived. We identify 324 epithelial cell-derived microRNAs and 259 tissue-derived microRNAs in the epididymis, some of which displayed regionalized expression patterns in cells and/or tissues. Caput cell-enriched miRNAs included miR-573 and miR-155. Cauda cell-enriched miRNAs included miR-1204 and miR-770. Next, we determined the gene ontology pathways associated with in silico predicted target genes of the differentially expressed miRNAs. The effect of androgen receptor stimulation on miRNA expression was also investigated. These data show novel epithelial cell-derived miRNAs that may regulate the expression of important gene networks that are responsible for the regionalized gene expression and function of the epididymis.
Collapse
Affiliation(s)
- James A Browne
- Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, OH 44106, USA.,Human Molecular Genetics Program, Lurie Children's Research Center, Chicago, IL 60614, USA.,Department of Pediatrics, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA
| | - Shih-Hsing Leir
- Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, OH 44106, USA.,Human Molecular Genetics Program, Lurie Children's Research Center, Chicago, IL 60614, USA.,Department of Pediatrics, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA
| | - Scott E Eggener
- Section of Urology, University of Chicago Medical Center, Chicago, IL 60611, USA
| | - Ann Harris
- Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, OH 44106, USA.,Human Molecular Genetics Program, Lurie Children's Research Center, Chicago, IL 60614, USA.,Department of Pediatrics, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA
| |
Collapse
|
34
|
Daikoku T. Tonality Tunes the Statistical Characteristics in Music: Computational Approaches on Statistical Learning. Front Comput Neurosci 2019; 13:70. [PMID: 31632260 PMCID: PMC6783562 DOI: 10.3389/fncom.2019.00070] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2019] [Accepted: 09/19/2019] [Indexed: 12/28/2022] Open
Abstract
Statistical learning is a learning mechanism based on transition probability in sequences such as music and language. Recent computational and neurophysiological studies suggest that the statistical learning contributes to production, action, and musical creativity as well as prediction and perception. The present study investigated how statistical structure interacts with tonalities in music based on various-order statistical models. To verify this in all 24 major and minor keys, the transition probabilities of the sequences containing the highest pitches in Bach's Well-Tempered Clavier, which is a collection of two series (No. 1 and No. 2) of preludes and fugues in all of the 24 major and minor keys, were calculated based on nth-order Markov models. The transition probabilities of each sequence were compared among tonalities (major and minor), two series (No. 1 and No. 2), and music types (prelude and fugue). The differences in statistical characteristics between major and minor keys were detected in lower- but not higher-order models. The results also showed that statistical knowledge in music might be modulated by tonalities and composition periods. Furthermore, the principal component analysis detected the shared components of related keys, suggesting that the tonalities modulate statistical characteristics in music. The present study may suggest that there are at least two types of statistical knowledge in music that are interdependent on and independent of tonality, respectively.
Collapse
Affiliation(s)
- Tatsuya Daikoku
- Department of Neuropsychology, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany
| |
Collapse
|
35
|
Wang Y, Sun X, Ding Y, Fei Z, Jiao C, Fan M, Yao B, Xin P, Chu J, Wei Q. Cellular and molecular characterization of a thick-walled variant reveal a pivotal role of shoot apical meristem in transverse development of bamboo culm. J Exp Bot 2019; 70:3911-3926. [PMID: 31037305 DOI: 10.1093/jxb/erz201] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Accepted: 04/17/2019] [Indexed: 05/27/2023]
Abstract
Little is known about the mechanisms underlying the development of bamboo culm. Using anatomical, mathematical modeling, and genomics methods, we investigated the role of shoot apical meristem (SAM) in the development of the transverse morphology of bamboo culm and explored the underlying cellular and molecular processes. We discovered that maintenance of SAM morphology that can produce circular culm and increase in SAM cell numbers, especially corpus cells, is the means by which bamboo makes a larger culm with a regular pith cavity and culm wall during development. A less cellular form of SAM with a lower proportion of corpus cells causes an abnormal higher ratio of wall component cells to pith cells, which breaks the balance of their interaction and triggers the random invasion of wall component cells into pith tissues during development, and finally results in the various thick culm walls of Phyllostachys nidularia f. farcta. The smaller SAM also results in a lower level of hormones such as cytokinin and auxin, and down-regulates hormone signaling and the downstream functional genes such as those related to metabolism, which finally results in a dwarf and smaller diameter culm with lower biomass. These results provide an important perspective on the culm development of bamboo, and support a plausible mechanism causing the size-reduced culm and various thick culm walls of P. nidularia f. farcta.
Collapse
Affiliation(s)
- Yujun Wang
- Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, Jiangsu, China
| | - Xuepeng Sun
- Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, NY, USA
| | - Yulong Ding
- Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, Jiangsu, China
| | - Zhangjun Fei
- Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, NY, USA
| | - Chen Jiao
- Boyce Thompson Institute for Plant Research, Cornell University, Ithaca, NY, USA
| | - Mingyuan Fan
- International Education College, Nanjing Forestry University, Nanjing, Jiangsu, China
| | | | - Peiyong Xin
- National Centre for Plant Gene Research (Beijing), Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - Jinfang Chu
- National Centre for Plant Gene Research (Beijing), Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, China
| | - Qiang Wei
- Co-Innovation Center for Sustainable Forestry in Southern China, Nanjing Forestry University, Nanjing, Jiangsu, China
| |
Collapse
|
36
|
Abstract
Persistence is the tendency of speakers to repeat the choice of sociolinguistic variant they have recently made in conversational speech. A longstanding debate is whether this tendency toward repetitiveness reflects the direct influence of one outcome on the next instance of the variable, which I call sequential dependence, or the shared influence of shifting contextual factors on proximal instances of the variable, which I call baseline deflection. I propose that these distinct types of clustering make different predictions for sequences of variable observations that are longer than the typical prime-target pairs of typical corpus persistence studies. In corpus ING data from conversational speech, I show that there are two effects to be accounted for: an effect of how many times the /ing/ variant occurs in the 2, 3, or 4-token sequence prior to the target (regardless of order), and an effect of whether the immediately prior (1-back) token was /ing/. I then build a series of simulations involving Bernoulli trials at sequences of different probabilities that incorporate either a sequential dependence mechanism, a baseline deflection mechanism, or both. I argue that the model incorporating both baseline deflection and sequential dependence is best able to produce simulated data that shares the relevant properties of the corpus data, which is an encouraging outcome because we have independent reasons to expect both baseline deflection and sequential dependence to exist. I conclude that this exploratory analysis of longer sociolinguistic sequences reflects a promising direction for future research on the mechanisms involved in the production of sociolinguistic variation.
Collapse
Affiliation(s)
- Meredith Tamminga
- Department of Linguistics, University of Pennsylvania, Philadelphia, PA, United States
| |
Collapse
|
37
|
Browne JA, Leir SH, Yin S, Harris A. Transcriptional networks in the human epididymis. Andrology 2019; 7:741-747. [PMID: 31050198 DOI: 10.1111/andr.12629] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2018] [Revised: 12/26/2018] [Accepted: 03/29/2019] [Indexed: 12/31/2022]
Abstract
BACKGROUND The epithelial lining of the human epididymis is critical for sperm maturation. This process requires distinct specialized functions in the head, body, and tail of the duct. These region-specific properties are maintained by distinct gene expression profiles which are governed by transcription factor networks, non-coding RNAs, and other factors. MATERIALS AND METHODS We used genome-wide protocols including DNase-seq, RNA-seq and ChIP-seq to characterize open (active) chromatin, the transcriptome and occupancy of specific transcription factors (TFs) respectively, in caput, corpus, and cauda segments of adult human epididymis tissue and primary human epididymis epithelial (HEE) cell cultures derived from them. RNA-seq following TF depletion or activation, combined with gene ontology analysis also determined TF targets. RESULTS Among regional differentially expressed transcripts were epithelial-selective transcription factors (TFs), microRNAs, and antiviral response genes. Caput-enriched TFs included hepatocyte nuclear factor 1 (HNF1) and the androgen receptor (AR), both of which were also predicted to occupy cis-regulatory elements identified as open chromatin in HEE cells. HNF1 targets were identified genome-wide using ChIP-seq, in HEE cells. Next, siRNA-mediated depletion of HNF1 revealed a pivotal role for this TF in coordinating epithelial water and solute transport in caput epithelium. The importance of AR in HEE cells was shown by AR ChIP-seq, and by RNA-seq after synthetic androgen (R1881) treatment. AR has a distinct transcriptional program in the HEE cells and likely recruits different co-factors (RUNX1 and CEBPβ) in comparison to those used in prostate epithelium. DISCUSSION AND CONCLUSION Our data identify many transcription factors that regulate the development and differentiation of HEE cells. Moreover, a comparison between immature and adult HEE cells showed key TFs in the transition to fully differentiated function of this epithelium. These data may help identify new targets to treat male infertility and have the potential to open new avenues for male contraception.
Collapse
Affiliation(s)
- J A Browne
- Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, OH, 44106, USA
| | - S-H Leir
- Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, OH, 44106, USA
| | - S Yin
- Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, OH, 44106, USA
| | - A Harris
- Department of Genetics and Genome Sciences, Case Western Reserve University, Cleveland, OH, 44106, USA
| |
Collapse
|
38
|
Abstract
We herein report two cases of miliary lung metastases from genital carcinoma in uterine cervix and endometrium. Notably, these patients were unable to receive any anti-tumor chemotherapy due to rapid progression causing respiratory failure, and they ultimately died of disease progression within only a month after the first visit to our hospitals. A postmortem examination confirmed the diagnosis of genital large-cell neuroendocrine carcinoma (LCNEC). Chest physicians should be aware of genital LCNEC with a dismal prognostic entity as an important differential diagnosis of miliary lung metastases.
Collapse
Affiliation(s)
- Akimasa Sekine
- Department of Respiratory Medicine, Kanagawa Cardiovascular and Respiratory Center, Japan
| | - Makiko Satoh
- Department of Obstetrics and Gynecology, Graduate School of Medicine, Yokohama City University, Japan
- Department of Obstetrics and Gynecology, Aomori City Hospital, Japan
| | - Koji Okudela
- Department of Pathology, Graduate School of Medicine, Yokohama City University, Japan
| | - Mai Matsumura
- Department of Pathology, Graduate School of Medicine, Yokohama City University, Japan
| | - Yukio Morishita
- Diagnostic Pathology Division, Tokyo Medical University Ibaraki Medical Center, Japan
| | - Yuko Minami
- Department of Respiratory Medicine, National Hospital Organization, Ibarakihigashi National Hospital, Japan
| | - Kenji Hayashihara
- Department of Respiratory Medicine, National Hospital Organization, Ibarakihigashi National Hospital, Japan
| | - Takefumi Saito
- Department of Respiratory Medicine, National Hospital Organization, Ibarakihigashi National Hospital, Japan
| | - Tae Iwasawa
- Department of Radiology, Kanagawa Cardiovascular and Respiratory Center, Japan
| | - Takashi Ogura
- Department of Respiratory Medicine, Kanagawa Cardiovascular and Respiratory Center, Japan
| |
Collapse
|
39
|
Abstract
We report the case of a 45-year-old male referred to our hospital with fever, asthenia, visual disturbances and increasing headaches. Diffusion-weighted imaging of the brain showed high signal intensity in the splenium of corpus callosum with low apparent diffusion coefficient values. Diagnosis of cytotoxic lesion of corpus callosum was made with Puumala Hantavirus infection serologically confirmed and should not be mistaken for ischemia. Patient was discharged 8 days after admission and imaging findings had resolved 3 weeks later.
Collapse
|
40
|
Manocha S, Farokhnia N, Khosropanah S, Bertol JW, Santiago J, Fakhouri WD. Systematic review of hormonal and genetic factors involved in the nonsyndromic disorders of the lower jaw. Dev Dyn 2019; 248:162-172. [PMID: 30576023 DOI: 10.1002/dvdy.8] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2018] [Revised: 11/30/2018] [Accepted: 12/14/2018] [Indexed: 12/14/2022] Open
Abstract
Mandibular disorders are among the most common birth defects in humans, yet the etiological factors are largely unknown. Most of the neonates affected by mandibular abnormalities have a sequence of secondary anomalies, including airway obstruction and feeding problems, that reduce the quality of life. In the event of lacking corrective surgeries, patients with mandibular congenital disorders suffer from additional lifelong problems such as sleep apnea and temporomandibular disorders, among others. The goal of this systematic review is to gather evidence on hormonal and genetic factors that are involved in signaling pathways and interactions that are potentially associated with the nonsyndromic mandibular disorders. We found that members of FGF and BMP pathways, including FGF8/10, FGFR2/3, BMP2/4/7, BMPR1A, ACVR1, and ACVR2A/B, have a prominent number of gene-gene interactions among all identified genes in this review. Gene ontology of the 154 genes showed that the functional gene sets are involved in all aspects of cellular processes and organogenesis. Some of the genes identified by the genome-wide association studies of common mandibular disorders are involved in skeletal formation and growth retardation based on animal models, suggesting a potential direct role as genetic risk factors in the common complex jaw disorders. Developmental Dynamics 248:162-172, 2019. © 2018 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Srishti Manocha
- Center for Craniofacial Research, Department of Diagnostic and Biomedical Sciences, School of Dentistry, University of Texas Health Science Center at Houston, Houston, Texas
| | - Nadia Farokhnia
- Center for Craniofacial Research, Department of Diagnostic and Biomedical Sciences, School of Dentistry, University of Texas Health Science Center at Houston, Houston, Texas
| | - Sepideh Khosropanah
- Ostrow School of Dentistry, University of Southern California, California, Los Angeles
| | - Jessica W Bertol
- Center for Craniofacial Research, Department of Diagnostic and Biomedical Sciences, School of Dentistry, University of Texas Health Science Center at Houston, Houston, Texas
| | - Joel Santiago
- Pró-Reitoria de Pesquisa e Pós-graduação (PRPPG), Universidade do Sagrado Coração, Jardim Brasil, Bauru, Sao Paulo, Brazil
| | - Walid D Fakhouri
- Center for Craniofacial Research, Department of Diagnostic and Biomedical Sciences, School of Dentistry, University of Texas Health Science Center at Houston, Houston, Texas.,Department of Pediatrics, McGovern Medical School, University of Texas Health Science Center, Houston, Texas
| |
Collapse
|
41
|
Park HS. Opinion: Strategy of Semi-Automatically Annotating Full Text Corpus of Genomics & Informatics. Genomics Inform 2019; 16:e40. [PMID: 30602101 PMCID: PMC6440653 DOI: 10.5808/gi.2018.16.4.e40] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Accepted: 12/20/2018] [Indexed: 11/20/2022] Open
Abstract
There is a communal need for an annotated corpus consisting of the full texts of biomedical journal articles. In response to community needs, a prototype version of the full-text corpus of Genomics & Informatics, called GNI version 1.0, has recently been published, with 499 annotated full-text articles available as a corpus resource. However, GNI needs to be updated, as the texts were shallow-parsed and annotated with several existing parsers. I list issues associated with upgrading annotations and give an opinion on the methodology for developing the next version of the GNI corpus, based on a semi-automatic strategy for more linguistically rich corpus annotation.
Collapse
Affiliation(s)
- Hyun-Seok Park
- Bioinformatics Laboratory, ELTEC College of Engineering, Ewha Womans University, Seoul 03760, Korea.,Center for Convergence Research of Advanced Technologies, Ewha Womans University, Seoul 03760, Korea
| |
Collapse
|
42
|
Harris JA, Carlson K. Information Structure Preferences in Focus-Sensitive Ellipsis: How Defaults Persist. Lang Speech 2018; 61:480-512. [PMID: 29119869 PMCID: PMC5930160 DOI: 10.1177/0023830917737110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
We compare the roles of overt accent and default focus marking in processing ellipsis structures headed by focus-sensitive coordinators (such as Danielle couldn't pass the quiz, let alone the final/Kayla). In a small auditory corpus study of radio transcripts, we establish that such structures overwhelmingly occur with contrastive pitch accents on the correlate and remnant ( the quiz and the final, or Danielle and Kayla), and that there is a strong bias to pair the remnant with the most local plausible correlate in production. In two auditory naturalness ratings experiments, we observe that marking a non-local correlate with contrastive pitch accent moderates, but does not fully overturn, the bias for local correlates in comprehension. We propose that the locality preference is due to a sentence-final default position for sentence accent, and that auditory processing is subject to "enduring focus," in which default positions for focus continue to influence the focus structure of the sentence even in the presence of overt accents. The importance of these results for models of auditory processing and of the processing of remnants in ellipsis structures is discussed.
Collapse
|
43
|
Couvee S, Pfau R. Structure and Grammaticalization of Serial Verb Constructions in Sign Language of the Netherlands-A Corpus-Based Study. Front Psychol 2018; 9:993. [PMID: 30065671 PMCID: PMC6056838 DOI: 10.3389/fpsyg.2018.00993] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2018] [Accepted: 05/28/2018] [Indexed: 11/17/2022] Open
Abstract
In serial verb constructions (SVCs), multiple independent lexical verbs are combined in a mono-clausal construction. SVCs express a range of grammatical meanings and are attested in numerous spoken languages all around the world. Yet, to date only few studies have investigated the existence and functions of SVCs in sign languages. For the most part, these studies—including a previous study on Sign Language of the Netherlands (NGT)—relied on elicited data. In this article, we offer a cross-modal typological contribution to the study of SVCs by investigating the phenomenon based on naturalistic corpus data from NGT. A search of the Corpus NGT yielded 41 mono-clausal utterances in which one of a closed set of verbs—namely go, give, take, and call—combines with another lexical verb. While the combinations we found are in important respects reminiscent of SVCs described for spoken languages, our data also confirm the previous finding that the fixed verb in the SVC serves to express agreement (by means of spatial modulation) when the other verb cannot do so. In addition, we identified some novel uses of the verbs go and give: (i) go functioning as a future tense marker and (ii) give functioning as a light verb. We will also discuss aspects of the grammaticalization of SVCs in NGT: from lexical verb to light verb to auxiliary, again offering some comparison to grammaticalization paths described for spoken languages.
Collapse
Affiliation(s)
- Sascha Couvee
- Department of Linguistics, University of Amsterdam, Amsterdam, Netherlands
| | - Roland Pfau
- Department of Linguistics, University of Amsterdam, Amsterdam, Netherlands
| |
Collapse
|
44
|
Alvaro N, Miyao Y, Collier N. TwiMed: Twitter and PubMed Comparable Corpus of Drugs, Diseases, Symptoms, and Their Relations. JMIR Public Health Surveill 2017; 3:e24. [PMID: 28468748 PMCID: PMC5438461 DOI: 10.2196/publichealth.6396] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2016] [Revised: 11/24/2016] [Accepted: 03/20/2017] [Indexed: 12/11/2022] Open
Abstract
BACKGROUND Work on pharmacovigilance systems using texts from PubMed and Twitter typically target at different elements and use different annotation guidelines resulting in a scenario where there is no comparable set of documents from both Twitter and PubMed annotated in the same manner. OBJECTIVE This study aimed to provide a comparable corpus of texts from PubMed and Twitter that can be used to study drug reports from these two sources of information, allowing researchers in the area of pharmacovigilance using natural language processing (NLP) to perform experiments to better understand the similarities and differences between drug reports in Twitter and PubMed. METHODS We produced a corpus comprising 1000 tweets and 1000 PubMed sentences selected using the same strategy and annotated at entity level by the same experts (pharmacists) using the same set of guidelines. RESULTS The resulting corpus, annotated by two pharmacists, comprises semantically correct annotations for a set of drugs, diseases, and symptoms. This corpus contains the annotations for 3144 entities, 2749 relations, and 5003 attributes. CONCLUSIONS We present a corpus that is unique in its characteristics as this is the first corpus for pharmacovigilance curated from Twitter messages and PubMed sentences using the same data selection and annotation strategies. We believe this corpus will be of particular interest for researchers willing to compare results from pharmacovigilance systems (eg, classifiers and named entity recognition systems) when using data from Twitter and from PubMed. We hope that given the comprehensive set of drug names and the annotated entities and relations, this corpus becomes a standard resource to compare results from different pharmacovigilance studies in the area of NLP.
Collapse
Affiliation(s)
- Nestor Alvaro
- National Institute of Informatics, Department of Informatics, Tokyo, Japan
- The Graduate University for Advanced Studies (SOKENDAI), Kanagawa, Japan
| | - Yusuke Miyao
- National Institute of Informatics, Department of Informatics, Tokyo, Japan
- The Graduate University for Advanced Studies (SOKENDAI), Kanagawa, Japan
| | - Nigel Collier
- Faculty of Modern & Medieval Languages, Department of Theoretical and Applied Linguistics, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
45
|
Syrjänen K. A Panel of Serum Biomarkers (GastroPanel®) in Non-invasive Diagnosis of Atrophic Gastritis. Systematic Review and Meta-analysis. Anticancer Res 2017; 36:5133-5144. [PMID: 27798873 DOI: 10.21873/anticanres.11083] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2016] [Accepted: 08/29/2016] [Indexed: 12/13/2022]
Abstract
BACKGROUND/AIM To meet the increasing demand of non-invasive tests for screening of gastric cancer (GC) risk, biomarker panel (GastroPanel®) (GP) was designed by Biohit Oyj as the first serological test for stomach health. The aim of the present study was to perform a systematic review and meta-analysis of all studies on GP in diagnosis of atrophic gastritis (AG). MATERIALS AND METHODS Studies were eligible, if i) GP was used to diagnose biopsy-confirmed AG of the corpus (AGC) and/or antrum (AGA) and ii) exact numbers were available to enable calculating sensitivity (SE) and specificity (SP). Comprehensive Meta-Analysis software was used with maximum likelihood meta-regression (R2 analog). Effect size estimates (SE; SP, 95% confidence interval (CI)) were tested for homogeneity with Cochran's Q and I2 statistics. Potential publication bias was estimated by funnel plot statistics. RESULTS Altogether, 27 studies were eligible comprising of 8,654 patients from different geographic regions. Significant heterogeneity between studies reporting AGC (n=27) or AGA (n=13) warranted random effects (RE) model for summary statistics. GP performs better in diagnosing AGC than AGA with 70.2% vs. 51.6% pooled SE and 93.9% vs. 84.1% pooled SP, respectively. Limited number of studies erodes the Q test's power to detect true heterogeneity in meta-analysis stratified by geographic study origin. Few hypothetical missing studies had only marginal effect on pooled estimates of SE and SP. CONCLUSION This first meta-analysis of GP literature corroborates the statement of international experts, advocating GP in diagnosis and screening of AG. Due to its high specificity for both AGA and AGC, GastroPanel® is truly a test for stomach health.
Collapse
Affiliation(s)
- Kari Syrjänen
- Department of Clinical Research, Biohit Oyj, Helsinki, Finland .,Molecular Oncology Research Center, Barretos Cancer Hospital, Barretos, SP, Brazil
| |
Collapse
|
46
|
Guido M, Brooks M, Grover J, Katz E, Ondricek J, Rogers M, Sharpe L. Generating a Corpus of Mobile Forensic Images for Masquerading user Experimentation. J Forensic Sci 2016; 61:1467-1472. [PMID: 27545967 DOI: 10.1111/1556-4029.13178] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2015] [Revised: 11/16/2015] [Accepted: 01/17/2016] [Indexed: 11/30/2022]
Abstract
The Periodic Mobile Forensics (PMF) system investigates user behavior on mobile devices. It applies forensic techniques to an enterprise mobile infrastructure, utilizing an on-device agent named TractorBeam. The agent collects changed storage locations for later acquisition, reconstruction, and analysis. TractorBeam provides its data to an enterprise infrastructure that consists of a cloud-based queuing service, relational database, and analytical framework for running forensic processes. During a 3-month experiment with Purdue University, TractorBeam was utilized in a simulated operational setting across 34 users to evaluate techniques to identify masquerading users (i.e., users other than the intended device user). The research team surmises that all masqueraders are undesirable to an enterprise, even when a masquerader lacks malicious intent. The PMF system reconstructed 821 forensic images, extracted one million audit events, and accurately detected masqueraders. Evaluation revealed that developed methods reduced storage requirements 50-fold. This paper describes the PMF architecture, performance of TractorBeam throughout the protocol, and results of the masquerading user analysis.
Collapse
Affiliation(s)
- Mark Guido
- The MITRE Corporation, McLean, VA, 22102
| | | | | | - Eric Katz
- The MITRE Corporation, McLean, VA, 22102
| | | | | | | |
Collapse
|
47
|
Roman LD, Lukyanchuk R, Sablin OA, Araslanova EI, Eklund C, Hendolin P, Paloheimo L, Syrjänen K. Prevalence of H. pylori Infection and Atrophic Gastritis in a Population-based Screening with Serum Biomarker Panel (GastroPanel®) in St. Petersburg. Anticancer Res 2016; 36:4129-4138. [PMID: 27466521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2016] [Accepted: 06/28/2016] [Indexed: 06/06/2023]
Abstract
BACKGROUND/AIM Russian Federation is among the high-incidence countries for gastric cancer (GC), with the incidence being projected to continue increasing. Using a non-invasive blood test with four stomach-specific biomarkers (pepsinogen-I (PG-I) and -II (PG-II), amidated gastrin-17 (G-17) and Helicobacter pylori (HP) IgG antibodies) in a hospital-based screening setting, we aimed to determine the prevalence of GC risk conditions: HP-infection and atrophic gastritis (AG). PATIENTS AND METHODS A population-derived cohort of 918 asymptomatic subjects (646 women and 272 men) with a mean age of 51.8 years (range=26-83) was examined with the GastroPanel® (GP) test. GP results were verified by gastroscopy and biopsies (the Updated Sydney System (USS) classification for all test-positive AG cases and for random 5% test-negatives (n=263) to correct for the verification bias. RESULTS Of the 918 subjects, only 199 (21.7%) tested completely normal, while 76.7% (704/918) had HP-infection. Altogether, in 99 subjects (10.8%), GP suggested AG: atrophic gastritis in the antrum (AGA) (n=21), atrophic gastritis in the corpus (AGC) (n=69) or atrophic pangastritis (AGpan) (n=9). The overall concordance between GP and USS classification was 82.5% (217/263) with weighted kappa intra-class correlation coefficient (ICC)=0.875 (95% confidence interval (CI)=0.840-0.901). The sensitivity/specificity balance in receiver operating characteristic (ROC) analysis for PG-I as a marker of moderate/severe AGC (AGC2+) had area under the curve (AUC)=0.895 (95%CI=0.837-0.953). Using the AGC2+ end-point, verification bias-corrected specificity of PGI reached 96.4% (95%CI=94.7-97.9) and that of PGI/PGII ratio 94.6% (95%CI=92.6-96.3), with inevitable erosion in sensitivities. CONCLUSION While capable of detecting the subjects at risk for GC (HP and/or AG), GP should be the cost-effective means to break the current ominous trend in GC incidence in Russian Federation.
Collapse
Affiliation(s)
- Laslo D Roman
- Department of Endoscopy, the Leningrad Regional Oncological Dispensary, St. Petersburg, Russian Federation
| | - Ruslan Lukyanchuk
- Department of Endoscopy, the Leningrad Regional Oncological Dispensary, St. Petersburg, Russian Federation
| | - Oleg A Sablin
- The Federal State Institute of Public Health 'The Nikiforov Russian Center of Emergency and Radiation Medicine', St. Petersburg, Russian Federation
| | - Elena I Araslanova
- The Federal State Institute of Public Health 'The Nikiforov Russian Center of Emergency and Radiation Medicine', St. Petersburg, Russian Federation
| | - Carita Eklund
- Department of Research and Development, Biohit Oyj, Helsinki, Finland
| | - Panu Hendolin
- Department of Research and Development, Biohit Oyj, Helsinki, Finland
| | - Lea Paloheimo
- Department of Research and Development, Biohit Oyj, Helsinki, Finland
| | - Kari Syrjänen
- Department of Clinical Research, Biohit Oyj, Helsinki, Finland Molecular Oncology Research Center, Barretos Cancer Hospital, Barretos, SP, Brazil
| |
Collapse
|
48
|
Cohen KB, Baumgartner WA, Temnikova I. SuperCAT: The (New and Improved) Corpus Analysis Toolkit. LREC Int Conf Lang Resour Eval 2016; 2016:2784-2788. [PMID: 29568820 PMCID: PMC5860820] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
This paper reports SuperCAT, a corpus analysis toolkit. It is a radical extension of SubCAT, the Sublanguage Corpus Analysis Toolkit, from sublanguage analysis to corpus analysis in general. The idea behind SuperCAT is that representative corpora have no tendency towards closure-that is, they tend towards infinity. In contrast, non-representative corpora have a tendency towards closure-roughly, finiteness. SuperCAT focuses on general techniques for the quantitative description of the characteristics of any corpus (or other language sample), particularly concerning the characteristics of lexical distributions. Additionally, SuperCAT features a complete re-engineering of the previous SubCAT architecture.
Collapse
|
49
|
Browne JA, Yang R, Leir SH, Eggener SE, Harris A. Expression profiles of human epididymis epithelial cells reveal the functional diversity of caput, corpus and cauda regions. Mol Hum Reprod 2015; 22:69-82. [PMID: 26612782 DOI: 10.1093/molehr/gav066] [Citation(s) in RCA: 55] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2015] [Accepted: 11/20/2015] [Indexed: 01/01/2023] Open
Abstract
STUDY HYPOTHESIS Region-specific transcriptional profiling of tissues and cultured epithelial cells from the human epididymis will predict functional specialization along the duct. STUDY FINDING We identified the molecular signature driving functions of the caput, corpus and cauda epithelium, and determined how these differ to establish the regional differentiation of the organ. WHAT IS KNOWN ALREADY The epithelium lining the human male genital ducts has a critical role in fertility. In particular, it controls the luminal environment in the epididymis, which is required for normal sperm maturation and reproductive competence. Studies in many animal species have largely informed our understanding of the molecular basis of epididymis function. However, there are substantial differences between species. STUDY DESIGN, SAMPLES/MATERIALS, METHODS Using RNA sequencing on biological replicates, we described gene expression profiles for tissue from each region of the epididymis and cultured epithelial cells derived from these regions. Bioinformatic tools were then utilized to identify differentially expressed genes (DEGs) between tissues and cells from the caput, corpus and cauda. MAIN RESULTS AND THE ROLE OF CHANCE The data showed that the caput is functionally divergent from the corpus and cauda, which have very similar transcriptomes. Interrogation of DEGs using gene ontology process enrichment analyses showed that processes of ion transport, response to hormone stimulus and urogenital tract development are more evident in the caput, while defense response processes are more important in the corpus/cauda. Consistent with these regional differences in epididymis function, we observed differential expression of transcription factors in the caput and corpus/cauda. LIMITATIONS, REASONS FOR CAUTION Cultured caput, corpus and cauda cells may not faithfully represent the same cells in the intact organ, due to loss of hormonal signals from the testis and communication from other cell types. WIDER IMPLICATIONS OF THE FINDINGS Our data provide a molecular characterization that will facilitate advances in understanding human epididymis epithelium biology in health and disease. They may also reveal the mechanisms coordinating epididymis luminal environment and sperm maturation. LARGE SCALE DATA Data deposited at http://www.ncbi.nlm.nih.gov/geo/GSE72986. STUDY FUNDING AND COMPETING INTERESTS This work was supported by the National Institutes of Health: R01HD068901 (PI: A.H.). The authors declare no conflict of interest.
Collapse
Affiliation(s)
- James A Browne
- Human Molecular Genetics Program, Lurie Children's Research Center, 2430 North Halsted Street, Box 211, Chicago, IL 60614, USA Department of Pediatrics, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA
| | - Rui Yang
- Human Molecular Genetics Program, Lurie Children's Research Center, 2430 North Halsted Street, Box 211, Chicago, IL 60614, USA Department of Pediatrics, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA
| | - Shih-Hsing Leir
- Human Molecular Genetics Program, Lurie Children's Research Center, 2430 North Halsted Street, Box 211, Chicago, IL 60614, USA Department of Pediatrics, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA
| | - Scott E Eggener
- Section of Urology, University of Chicago Medical Center, Chicago, IL, USA
| | - Ann Harris
- Human Molecular Genetics Program, Lurie Children's Research Center, 2430 North Halsted Street, Box 211, Chicago, IL 60614, USA Department of Pediatrics, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA
| |
Collapse
|
50
|
Powley TL, Hudson CN, McAdams JL, Baronowsky EA, Phillips RJ. Vagal Intramuscular Arrays: The Specialized Mechanoreceptor Arbors That Innervate the Smooth Muscle Layers of the Stomach Examined in the Rat. J Comp Neurol 2015; 524:713-37. [PMID: 26355387 DOI: 10.1002/cne.23892] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2015] [Revised: 08/18/2015] [Accepted: 08/19/2015] [Indexed: 01/14/2023]
Abstract
The fundamental roles that the stomach plays in ingestion and digestion notwithstanding, little morphological information is available on vagal intramuscular arrays (IMAs), the afferents that innervate gastric smooth muscle. To characterize IMAs better, rats were given injections of dextran biotin in the nodose ganglia, and, after tracer transport, stomach whole mounts were collected. Specimens were processed for avidin-biotin permanent labeling, and subsets of the whole mounts were immunohistochemically processed for c-Kit or stained with cuprolinic blue. IMAs (n = 184) were digitized for morphometry and mapping. Throughout the gastric muscle wall, IMAs possessed common phenotypic features. Each IMA was generated by a parent neurite arborizing extensively, forming an array of multiple (mean = 212) branches averaging 193 µm in length. These branches paralleled, and coursed in apposition with, bundles of muscle fibers and interstitial cells of Cajal. Individual arrays averaged 4.3 mm in length and innervated volumes of muscle sheet, presumptive receptive fields, averaging 0.1 mm(3) . Evaluated by region and by muscle sheet, IMAs displayed architectural adaptations to the different loci. A subset (32%) of circular muscle IMAs issued specialized polymorphic collaterals to myenteric ganglia, and a subset (41%) of antral longitudinal muscle IMAs formed specialized net endings associated with the serosal boundary. IMAs were concentrated in regional patterns that correlated with the unique biomechanical adaptations of the stomach, specifically proximal stomach reservoir functions and antral emptying operations. Overall, the structural adaptations and distributions of the IMAs were consonant with the hypothesized stretch receptor roles of the afferents.
Collapse
Affiliation(s)
- Terry L Powley
- Department of Psychological Sciences, Purdue University, West Lafayette, Indiana, 47907-2081
| | - Cherie N Hudson
- Department of Psychological Sciences, Purdue University, West Lafayette, Indiana, 47907-2081
| | - Jennifer L McAdams
- Department of Psychological Sciences, Purdue University, West Lafayette, Indiana, 47907-2081
| | - Elizabeth A Baronowsky
- Department of Psychological Sciences, Purdue University, West Lafayette, Indiana, 47907-2081
| | - Robert J Phillips
- Department of Psychological Sciences, Purdue University, West Lafayette, Indiana, 47907-2081
| |
Collapse
|