Reference Citation Analysis

For: Bock HH. Clustering Methods: A History of k-Means Algorithms. Selected Contributions in Data Analysis and Classification 2007. [DOI: 10.1007/978-3-540-73560-1_15] [Citation(s) in RCA: 68] [Impact Index Per Article: 4.0] [Indexed: 12/13/2022]

Cited by Other Article(s)

Zhao Z, Guo Y, Chowdhury T, Anjum S, Li J, Huang L, Cupp-Sutton KA, Burgett A, Shi D, Wu S. Top-Down Proteomics Analysis of Picogram-Level Complex Samples Using Spray-Capillary-Based Capillary Electrophoresis-Mass Spectrometry. Anal Chem 2024;96:8763-8771. [PMID: 38722793 DOI: 10.1021/acs.analchem.4c01119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/29/2024]

Abstract

Proteomics analysis of mass-limited samples has become increasingly important for understanding biological systems in physiologically relevant contexts such as patient samples, multicellular organoids, spheroids, and single cells. However, relatively low sensitivity in top-down proteomics methods makes their application to mass-limited samples challenging. Capillary electrophoresis (CE) has emerged as an ideal separation method for mass-limited samples due to its high separation resolution, ultralow detection limit, and minimal sample volume requirements. Recently, we developed "spray-capillary", an electrospray ionization (ESI)-assisted device, that is capable of quantitative ultralow-volume sampling (e.g., pL-nL level). Here, we developed a spray-capillary-CE-MS platform for ultrasensitive top-down proteomics analysis of intact proteins in mass-limited complex biological samples. Specifically, to improve the sensitivity of the spray-capillary platform, we incorporated a polyethylenimine (PEI)-coated capillary and optimized the spray-capillary inner diameter. Under optimized conditions, we successfully detected over 200 proteoforms from 50 pg of E. coli lysate. To our knowledge, the spray-capillary CE-MS platform developed here represents one of the most sensitive detection methods for top-down proteomics. Furthermore, in a proof-of-principle experiment, we detected 261 ± 65 and 174 ± 45 intact proteoforms from fewer than 50 HeLa and OVCAR-8 cells, respectively, by coupling nanodroplet-based sample preparation with our optimized CE-MS platform. Overall, our results demonstrate the capability of the modified spray-capillary CE-MS platform to perform top-down proteomics analysis on picogram amounts of samples. This advancement presents the possibility of meaningful top-down proteomics analysis of mass-limited samples down to the level of single mammalian cells.

Zhao Y, Gong P. Optimal site selection strategies for urban parks green spaces under the joint perspective of spatial equity and social equity. Front Public Health 2024;12:1310340. [PMID: 38638465 PMCID: PMC11024374 DOI: 10.3389/fpubh.2024.1310340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/09/2023] [Accepted: 03/22/2024] [Indexed: 04/20/2024] Open

Abstract

Urban park green spaces (UPGS) are a crucial element of social public resources closely related to the health and well-being of urban residents, and issues of equity have always been a focal point of concern. This study takes the downtown area of Nanchang as an example and uses more accurate point of interest (POI) and area of interest (AOI) data as analysis sources. The improved Gaussian two-step floating catchment area (G2SFCA) and spatial autocorrelation models are then used to assess the spatial and social equity in the study area, and the results of the two assessments were coupled to determine the optimization objective using the community as the smallest unit. Finally, the assessment results are combined with the k-means algorithm and particle swarm algorithm (PSO) to propose practical optimization strategies with the objectives of minimum walking distance and maximum fairness. The results indicate (1) There are significant differences in UPGS accessibility among residents with different walking distances, with the more densely populated Old Town and Honggu Tan areas having lower average accessibility and being the main areas of hidden blindness, while the fringe areas in the northern and south-western parts of the city are the main areas of visible blindness. (2) Overall, the UPGS accessibility in Nanchang City exhibits a spatial pattern of decreasing from the east, south, and west to the center. Nanchang City is in transition towards improving spatial and social equity while achieving basic regional equity. (3) There is a spatial positive correlation between socioeconomic level and UPGS accessibility, reflecting certain social inequity. (4) Based on the above research results, the UPGS layout optimization scheme was proposed, 29 new UPGS locations and regions were identified, and the overall accessibility was improved by 2.76. 
The research methodology and framework can be used as a tool to identify the underserved areas of UPGS and optimize the spatial and social equity of UPGS, which is in line with the current trend of urban development in the world and provides a scientific basis for urban infrastructure planning and spatial resource allocation.

Claréus B, Daukantaité D. Off track or on? Associations of positive and negative life events with the continuation versus cessation of repetitive adolescent nonsuicidal self-injury. J Clin Psychol 2023;79:2459-2477. [PMID: 37178314 DOI: 10.1002/jclp.23533] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/11/2022] [Revised: 02/13/2023] [Accepted: 05/03/2023] [Indexed: 05/15/2023]

Yiakoumetti A, Hanko EKR, Zou Y, Chua J, Chromy J, Stoney RA, Valdehuesa KNG, Connolly JA, Yan C, Hollywood KA, Takano E, Breitling R. Expanding flavone and flavonol production capabilities in Escherichia coli. Front Bioeng Biotechnol 2023;11:1275651. [PMID: 37920246 PMCID: PMC10619664 DOI: 10.3389/fbioe.2023.1275651] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/10/2023] [Accepted: 10/04/2023] [Indexed: 11/04/2023] Open

Abstract

Flavones and flavonols are important classes of flavonoids with nutraceutical and pharmacological value, and their production by fermentation with recombinant microorganisms promises to be a scalable and economically favorable alternative to extraction from plant sources. Flavones and flavonols have been produced recombinantly in a number of microorganisms, with Saccharomyces cerevisiae typically being a preferred production host for these compounds due to higher yields and titers of precursor compounds, as well as generally improved ability to functionally express cytochrome P450 enzymes without requiring modification to improve their solubility. Recently, a rapid prototyping platform has been developed for high-value compounds in E. coli, and a number of gatekeeper (2S)-flavanones, from which flavones and flavonols can be derived, have been produced to high titers in E. coli using this platform. In this study, we extended these metabolic pathways using the previously reported platform to produce apigenin, chrysin, luteolin and kaempferol from the gatekeeper flavonoids naringenin, pinocembrin and eriodictyol by the expression of either type-I flavone synthases (FNS-I) or type-II flavone synthases (FNS-II) for flavone biosynthesis, and by the expression of flavanone 3-dioxygenases (F3H) and flavonol synthases (FLS) for the production of the flavonol kaempferol. In our best-performing strains, titers of apigenin and kaempferol reached 128 mg L-1 and 151 mg L-1 in 96-DeepWell plates in cultures supplemented with an additional 3 mM tyrosine, though titers for chrysin (6.8 mg L-1) from phenylalanine, and luteolin (5.0 mg L-1) from caffeic acid were considerably lower. In strains with upregulated tyrosine production, apigenin and kaempferol titers reached 80.2 mg L-1 and 42.4 mg L-1 respectively, without the further supplementation of tyrosine beyond the amount present in the rich medium. 
Notably, the highest apigenin, chrysin and luteolin titers were achieved with FNS-II enzymes, suggesting that cytochrome P450s can show competitive performance compared with non-cytochrome P450 enzymes in prokaryotes for the production of flavones.

Casalino L, Seitz C, Lederhofer J, Tsybovsky Y, Wilson IA, Kanekiyo M, Amaro RE. Breathing and Tilting: Mesoscale Simulations Illuminate Influenza Glycoprotein Vulnerabilities. ACS Cent Sci 2022;8:1646-1663. [PMID: 36589893 PMCID: PMC9801513 DOI: 10.1021/acscentsci.2c00981] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Received: 08/19/2022] [Indexed: 05/28/2023]

Hacking C, Verbeek H, Hamers JPH, Sion K, Aarts S. Text mining in long-term care: Exploring the usefulness of artificial intelligence in a nursing home setting. PLoS One 2022;17:e0268281. [PMID: 36006921 PMCID: PMC9409502 DOI: 10.1371/journal.pone.0268281] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 06/10/2021] [Accepted: 04/27/2022] [Indexed: 11/19/2022] Open

Abstract

Objectives

In nursing homes, narrative data are collected to evaluate quality of care as perceived by residents or their family members. This results in a large amount of textual data. However, as the volume of data increases, it becomes beyond the capability of humans to analyze it. This study aims to explore the usefulness of text mining approaches regarding narrative data gathered in a nursing home setting.

Design

Exploratory study showing a variety of text mining approaches.

Setting and participants

Data has been collected as part of the project ‘Connecting Conversations’: assessing experienced quality of care by conducting individual interviews with residents of nursing homes (n = 39), family members (n = 37) and care professionals (n = 49).

Methods

Several pre-processing steps were applied. A variety of text mining analyses were conducted: individual word frequencies, bigram frequencies, a correlation analysis and a sentiment analysis. A survey was conducted to establish a sentiment analysis model tailored to text collected in long-term care for older adults.

Results

Residents, family members and care professionals uttered respectively 285, 362 and 549 words per interview. Word frequency analysis showed that words that occurred most frequently in the interviews are often positive. Despite some differences in word usage, correlation analysis displayed that similar words are used by all three groups to describe quality of care. Most interviews displayed a neutral sentiment. Care professionals expressed a more diverse sentiment compared to residents and family members. A topic clustering analysis showed a total of 12 topics including ‘relations’ and ‘care environment’.

Conclusions and implications

This study demonstrates the usefulness of text mining to extend our knowledge regarding quality of care in a nursing home setting. With the rise of textual (narrative) data, text mining can lead to valuable new insights for long-term care for older adults.

Casalino L, Seitz C, Lederhofer J, Tsybovsky Y, Wilson IA, Kanekiyo M, Amaro RE. Breathing and tilting: mesoscale simulations illuminate influenza glycoprotein vulnerabilities. bioRxiv 2022:2022.08.02.502576. [PMID: 35982676 PMCID: PMC9387122 DOI: 10.1101/2022.08.02.502576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/05/2023]

Xu Z, York LM, Seethepalli A, Bucciarelli B, Cheng H, Samac DA. Objective Phenotyping of Root System Architecture Using Image Augmentation and Machine Learning in Alfalfa (Medicago sativa L.). Plant Phenomics 2022;2022:9879610. [PMID: 35479182 PMCID: PMC9012978 DOI: 10.34133/2022/9879610] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 10/13/2021] [Accepted: 03/03/2022] [Indexed: 12/28/2022]

Yuan M, Zobel J, Lin P. Measurement of clustering effectiveness for document collections. Inf Retr J 2022. [DOI: 10.1007/s10791-021-09401-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 11/24/2022]

Schmidt MN, Seddig D, Davidov E, Mørup M, Albers KJ, Bauer JM, Glückstad FK. Latent profile analysis of human values: What is the optimal number of clusters? Methodology 2021. [DOI: 10.5964/meth.5479] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Indexed: 11/20/2022] Open

Phan HP, Ngu BH. Introducing the Concept of Consonance-Disconsonance of Best Practice: A Focus on the Development of 'Student Profiling'. Front Psychol 2021;12:557968. [PMID: 33995160 PMCID: PMC8121024 DOI: 10.3389/fpsyg.2021.557968] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 05/01/2020] [Accepted: 04/07/2021] [Indexed: 11/17/2022] Open

Sessa M, Khan AR, Liang D, Andersen M, Kulahci M. Artificial Intelligence in Pharmacoepidemiology: A Systematic Review. Part 1-Overview of Knowledge Discovery Techniques in Artificial Intelligence. Front Pharmacol 2020;11:1028. [PMID: 32765261 PMCID: PMC7378532 DOI: 10.3389/fphar.2020.01028] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Received: 10/23/2019] [Accepted: 06/24/2020] [Indexed: 12/14/2022] Open

Spurek P, Byrski K, Tabor J. Online updating of active function cross-entropy clustering. Pattern Anal Appl 2019. [DOI: 10.1007/s10044-018-0701-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/29/2022]

Unsupervised classification of children’s bodies using currents. Adv Data Anal Classif 2017. [DOI: 10.1007/s11634-017-0283-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Indexed: 11/26/2022]

Kriegel HP, Schubert E, Zimek A. The (black) art of runtime evaluation: Are we comparing algorithms or implementations? Knowl Inf Syst 2016. [DOI: 10.1007/s10115-016-1004-2] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Indexed: 10/20/2022]

Köhn HF, Chiu CY, Brusco MJ. Heuristic cognitive diagnosis when the Q-matrix is unknown. Br J Math Stat Psychol 2015;68:268-291. [PMID: 25496248 DOI: 10.1111/bmsp.12044] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Received: 10/24/2012] [Revised: 07/12/2014] [Indexed: 06/04/2023]

Lord E, Diallo AB, Makarenkov V. Classification of bioinformatics workflows using weighted versions of partitioning and hierarchical clustering algorithms. BMC Bioinformatics 2015;16:68. [PMID: 25887434 PMCID: PMC4354763 DOI: 10.1186/s12859-015-0508-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Received: 10/01/2014] [Accepted: 02/20/2015] [Indexed: 11/10/2022] Open

Abstract

Background

Workflows, or computational pipelines, consisting of collections of multiple linked tasks are becoming more and more popular in many scientific fields, including computational biology. For example, simulation studies, which are now a must for statistical validation of new bioinformatics methods and software, are frequently carried out using the available workflow platforms. Workflows are typically organized to minimize the total execution time and to maximize the efficiency of the included operations. Clustering algorithms can be applied either for regrouping similar workflows for their simultaneous execution on a server, or for dispatching some lengthy workflows to different servers, or for classifying the available workflows with a view to performing a specific keyword search.

Results

In this study, we consider four different workflow encoding and clustering schemes which are representative for bioinformatics projects. Some of them allow for clustering workflows with similar topological features, while the others regroup workflows according to their specific attributes (e.g. associated keywords) or execution time. The four types of workflow encoding examined in this study were compared using the weighted versions of k-means and k-medoids partitioning algorithms. The Calinski-Harabasz, Silhouette and logSS clustering indices were considered. Hierarchical classification methods, including the UPGMA, Neighbor Joining, Fitch and Kitsch algorithms, were also applied to classify bioinformatics workflows. Moreover, a novel pairwise measure of clustering solution stability, which can be computed in situations when a series of independent program runs is carried out, was introduced.

Conclusions

Our findings based on the analysis of 220 real-life bioinformatics workflows suggest that the weighted clustering models based on keywords information or tasks execution times provide the most appropriate clustering solutions. Using datasets generated by the Armadillo and Taverna scientific workflow management system, we found that the weighted cosine distance in association with the k-medoids partitioning algorithm and the presence-absence workflow encoding provided the highest values of the Rand index among all compared clustering strategies. The introduced clustering stability indices, PS and PSG, can be effectively used to identify elements with a low clustering support.

Electronic supplementary material

The online version of this article (doi:10.1186/s12859-015-0508-1) contains supplementary material, which is available to authorized users.

The k-means algorithm for 3D shapes with an application to apparel design. Adv Data Anal Classif 2014. [DOI: 10.1007/s11634-014-0187-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Indexed: 10/24/2022]

Fritz H, García-Escudero LA, Mayo-Iscar A. A fast algorithm for robust constrained clustering. Comput Stat Data Anal 2013. [DOI: 10.1016/j.csda.2012.11.018] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Indexed: 10/27/2022]

Ruwet C, Haesbroeck G. Classification performance resulting from a 2-means. J Stat Plan Inference 2013. [DOI: 10.1016/j.jspi.2012.08.004] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Indexed: 11/27/2022]