1
|
David I, Asiribo O, Dikko H. Nonlinear split-plot design modeling and analysis of rice varieties yield. SCIENTIFIC AFRICAN 2023. [DOI: 10.1016/j.sciaf.2022.e01444] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/10/2022] Open
|
2
|
Herrera JG, Ramos MP, de Lima Albuquerque BN, de Oliveira Farias de Aguiar JCR, Agra Neto AC, Guedes Paiva PM, do Amaral Ferraz Navarro DM, Pinto L. Multivariate evaluation of process parameters to obtain essential oil of Piper corcovadensis using supercritical fluid extraction. Microchem J 2022. [DOI: 10.1016/j.microc.2022.107747] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
|
3
|
Wong WK, Zhou J. Using CVX to construct optimal designs for biomedical studies with multiple objectives. J Comput Graph Stat 2022. [DOI: 10.1080/10618600.2022.2104858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Affiliation(s)
- Weng Kee Wong
- Department of Biostatistics, University of California, Los Angeles, CA 90095-1772, U.S.A.
| | - Julie Zhou
- Department of Mathematics and Statistics, University of Victoria, Victoria, BC, Canada V8W 2Y2
| |
Collapse
|
4
|
Singh R, Stufken J. Selection of two-level supersaturated designs for main effects models. Technometrics 2022. [DOI: 10.1080/00401706.2022.2102080] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Affiliation(s)
- Rakhi Singh
- University of North Carolina at Greensboro, Greensboro, NC, USA
| | - John Stufken
- University of North Carolina at Greensboro, Greensboro, NC, USA
| |
Collapse
|
5
|
Haines LM. Approximate I-optimal designs for polynomial models over the unit ball. J Stat Plan Inference 2022. [DOI: 10.1016/j.jspi.2022.02.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
6
|
Lu L, Anderson‐Cook CM, Zhang M. Understanding the merits of winning data competition solutions for varied sets of objectives. Stat Anal Data Min 2021. [DOI: 10.1002/sam.11494] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Lu Lu
- University of South Florida Tampa Florida USA
| | | | | |
Collapse
|
7
|
Peris-Díaz MD, Krężel A. A guide to good practice in chemometric methods for vibrational spectroscopy, electrochemistry, and hyphenated mass spectrometry. Trends Analyt Chem 2021. [DOI: 10.1016/j.trac.2020.116157] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
8
|
Loftis C, Yuan K, Zhao Y, Hu M, Hu J. Lattice Thermal Conductivity Prediction Using Symbolic Regression and Machine Learning. J Phys Chem A 2020; 125:435-450. [PMID: 33355459 DOI: 10.1021/acs.jpca.0c08103] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
Prediction models of lattice thermal conductivity (κL) have wide applications in the discovery of thermoelectrics, thermal barrier coatings, and thermal management of semiconductors. However, κL is notoriously difficult to predict. Although classic models such as the Debye-Callaway model and the Slack model have been used to approximate the κL of inorganic compounds, their accuracy is far from being satisfactory. Herein we propose a genetic programming-based symbolic regression (SR) approach for finding analytical κL models and compare them with multilayer perceptron neural networks and random forest regression models using a hybrid cross-validation (CV) approach including both K-fold CV and holdout validation. Four formulae have been discovered by our SR approach that outperform the Slack formula as evaluated on our dataset. Through the analysis of our models' performance and the formulae generated, we found that the trained formulae successfully reproduce the correct physical law that governs the lattice thermal conductivity of materials. We also systematically show that currently extrapolative prediction over datasets with different distributions as the training set remains to be a big challenge for both SR and machine learning-based prediction models.
Collapse
Affiliation(s)
- Christian Loftis
- Department of Computer Science and Engineering, University of South Carolina, Columbia, South Carolina 29201, United States
| | - Kunpeng Yuan
- Department of Mechanical Engineering, University of South Carolina, Columbia, South Carolina 29208, United States.,Key Laboratory of Ocean Energy Utilization and Energy Conservation of Ministry of Education, School of Energy and Power Engineering, Dalian University of Technology, Dalian 116024, China
| | - Yong Zhao
- Department of Computer Science and Engineering, University of South Carolina, Columbia, South Carolina 29201, United States
| | - Ming Hu
- Department of Mechanical Engineering, University of South Carolina, Columbia, South Carolina 29208, United States
| | - Jianjun Hu
- Department of Computer Science and Engineering, University of South Carolina, Columbia, South Carolina 29201, United States
| |
Collapse
|
9
|
Nitzsche R, Gröngröft A, Goj I, Kraume M. Ultrafiltration of Beechwood Hydrolysate for Concentrating Hemicellulose Sugars and Removal of Lignin—Parameter Estimation Using Statistical Methods and Multiobjective Optimization. Ind Eng Chem Res 2020. [DOI: 10.1021/acs.iecr.0c00487] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Roy Nitzsche
- DBFZ—Deutsches Biomasseforschungszentrum gemeinnützige GmbH, Torgauer Straße 116, 04347 Leipzig, Germany
| | - Arne Gröngröft
- DBFZ—Deutsches Biomasseforschungszentrum gemeinnützige GmbH, Torgauer Straße 116, 04347 Leipzig, Germany
| | - Ilona Goj
- Department SciTec, Ernst-Abbe-Hochschule Jena, Carl-Zeiss-Promenade 2, 07745 Jena, Germany
| | - Matthias Kraume
- Chair of Chemical and Process Engineering, Technische Universität Berlin, Fraunhoferstraße 33-36, 10587 Berlin, Germany
| |
Collapse
|
10
|
Anderson‐Cook CM, Myers KL, Lu L, Fugate ML, Quinlan KR, Pawley N. How to Host An Effective Data Competition: Statistical Advice for Competition Design and Analysis. Stat Anal Data Min 2019. [DOI: 10.1002/sam.11404] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
| | - Kary L. Myers
- Statistical SciencesLos Alamos National Laboratory Los Alamos New Mexico
| | - Lu Lu
- Department of Mathematics & StatisticsUniversity of South Florida Tampa Florida
| | - Michael L. Fugate
- Statistical SciencesLos Alamos National Laboratory Los Alamos New Mexico
| | - Kevin R. Quinlan
- Department of StatisticsPennsylvania State University State College Pennsylvania
| | - Norma Pawley
- Statistical SciencesLos Alamos National Laboratory Los Alamos New Mexico
| |
Collapse
|
11
|
Núñez Ares J, Goos P. Enumeration and Multicriteria Selection of Orthogonal Minimally Aliased Response Surface Designs. Technometrics 2019. [DOI: 10.1080/00401706.2018.1549103] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
| | - Peter Goos
- Department of Biosystems, KU Leuven, Leuven, Belgium
- Department of Engineering Management, University of Antwerp, Antwerp, Belgium
| |
Collapse
|
12
|
Peris-Díaz MD, Rodak O, Sweeney SR, Krężel A, Sentandreu E. Chemometrics-assisted optimization of liquid chromatography-quadrupole-time-of-flight mass spectrometry analysis for targeted metabolomics. Talanta 2019; 199:380-387. [PMID: 30952273 DOI: 10.1016/j.talanta.2019.02.075] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2018] [Revised: 02/19/2019] [Accepted: 02/20/2019] [Indexed: 12/15/2022]
Abstract
Mass spectrometry-based metabolomics is characterized by a vast number of variables leading to a great degree of complexity. In this work, we aimed to simplify this process with a stepped chemometric optimization of the both funnel technology (funnel exit DC, FDC; funnel RF LP, FLC; funnel RF HP, FRP) and ion source parameters (Octopolo, Oct; and Fragmentor, Frag) of a quadrupole-time of flight (qTOF) for a human urinary metabolites. The workflow comprised a Box-Behnken experimental design with 47 experiments followed by the identification and quantification of a set of metabolites using high-resolution full-scan MS mode and feature extraction with an inclusion list. Metabolite peak areas were grouped according to abundance (high and low) and modeled by Random Forest regression (variance explained >85%). The full three-level factorial design consisting in 243 experiments was predicted and top 10 solutions for desirability function and those comprising the Pareto front were extracted and investigated. To guarantee the quality of results, we compared the Pareto front solutions with those achieved by standard instrumental parameters suggested by the manufacturer. A set of five solutions were identified that increased the mean peak area by 56-59% and 17%, for high- and low-abundance metabolites, respectively. The optimal parameters were determined to be: FLP, 100 V; FDC, 40 and 30 V; Frag, 275 and 400 V; and Oct, 600 and 800 V. The methodology applied throughout this work represents a flexible strategy to optimize instrumental parameters and exploit the performance of a qTOF MS detector.
Collapse
Affiliation(s)
- Manuel David Peris-Díaz
- Department of Chemical Biology, Faculty of Biotechnology, University of Wrocław, J.Curie 14a, 50-383 Wrocław, Poland.
| | - Olga Rodak
- Department of Reproduction and Clinic of Farm Animals, Faculty of Veterinary Medicine, Wrocław University of Environmental and Life Sciences, Wrocław, Poland
| | - Shannon R Sweeney
- Dell Pediatric Research Institute (DPRI), Austin, USA; Institute for Cellular and Molecular Biology, The University of Texas at Austin, Austin, USA
| | - Artur Krężel
- Department of Chemical Biology, Faculty of Biotechnology, University of Wrocław, J.Curie 14a, 50-383 Wrocław, Poland
| | - Enrique Sentandreu
- Institute of Agrochemistry and Food Technology (IATA-CSIC), Paterna, Valencia, Spain; Analytical Unit, Instituto de Investigación Sanitaria La Fe (IIS La Fe), Valencia, Spain
| |
Collapse
|
13
|
Wang Y, Biegler LT, Patel M, Wassick J. Parameters estimation and model discrimination for solid-liquid reactions in batch processes. Chem Eng Sci 2018. [DOI: 10.1016/j.ces.2018.05.040] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
14
|
Peris-Díaz MD, Sentandreu MA, Sentandreu E. Multiobjective optimization of liquid chromatography–triple-quadrupole mass spectrometry analysis of underivatized human urinary amino acids through chemometrics. Anal Bioanal Chem 2018; 410:4275-4284. [DOI: 10.1007/s00216-018-1083-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2017] [Revised: 03/27/2018] [Accepted: 04/11/2018] [Indexed: 01/04/2023]
|
15
|
|
16
|
|
17
|
Smucker BJ, Jensen W, Wu Z, Wang B. Robustness of classical and optimal designs to missing observations. Comput Stat Data Anal 2017. [DOI: 10.1016/j.csda.2016.12.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
18
|
Affiliation(s)
- Luzia A. Trinca
- Department of Biostatistics, São Paulo State University, Botucatu, Brazil
| | | |
Collapse
|
19
|
Volety KK, Huyberechts GPJ. Trade-Off Analysis in High-Throughput Materials Exploration. ACS COMBINATORIAL SCIENCE 2017; 19:145-152. [PMID: 28045488 DOI: 10.1021/acscombsci.6b00122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
This Research Article presents a strategy to identify the optimum compositions in metal alloys with certain desired properties in a high-throughput screening environment, using a multiobjective optimization approach. In addition to the identification of the optimum compositions in a primary screening, the strategy also allows pointing to regions in the compositional space where further exploration in a secondary screening could be carried out. The strategy for the primary screening is a combination of two multiobjective optimization approaches namely Pareto optimality and desirability functions. The experimental data used in the present study have been collected from over 200 different compositions belonging to four different alloy systems. The metal alloys (comprising Fe, Ti, Al, Nb, Hf, Zr) are synthesized and screened using high-throughput technologies. The advantages of such a kind of approach compared to the limitations of the traditional and comparatively simpler approaches like ranking and calculating figures of merit are discussed.
Collapse
Affiliation(s)
- Kalpana K. Volety
- Flamac, a division of SIM vzw, Technologiepark 903 A, 9052 Zwijnaarde, Belgium
| | | |
Collapse
|
20
|
|
21
|
Chapman JL, Lu L, Anderson‐Cook CM. Impact of response variability on Pareto front optimization. Stat Anal Data Min 2015. [DOI: 10.1002/sam.11279] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Jessica L. Chapman
- Department of Mathematics, Computer Science, and Statistics St. Lawrence University Canton, NY 13617 USA
| | - Lu Lu
- Department of Mathematics and Statistics University of South Florida Tampa, FL 33620 USA
| | | |
Collapse
|
22
|
Statistical Model Selection for Better Prediction and Discovering Science Mechanisms That Affect Reliability. SYSTEMS 2015. [DOI: 10.3390/systems3030109] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
23
|
Cao Y, Smucker BJ, Robinson TJ. On using the hypervolume indicator to compare Pareto fronts: Applications to multi-criteria optimal experimental design. J Stat Plan Inference 2015. [DOI: 10.1016/j.jspi.2014.12.004] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
24
|
Affiliation(s)
- Jie Li
- Department of Statistics, Virginia Polytechnic Institute and State University, Blacksburg, VA 24601
| | - Dale L. Zimmerman
- Department of Statistics and Actuarial Science, 241 Schaeffer Hall, University of Iowa, Iowa City, IA 52242
| |
Collapse
|
25
|
Affiliation(s)
| | - Nathan M. Drew
- Nanotechnology Research Center, National Institute for Occupational Safety and Health, Cincinnati OH 45226
| |
Collapse
|
26
|
Müller WG, Pronzato L, Rendas J, Waldl H. Efficient prediction designs for random fields. APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY 2015; 31:178-194. [PMID: 26300698 PMCID: PMC4540167 DOI: 10.1002/asmb.2084] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/04/2013] [Revised: 10/20/2014] [Accepted: 10/20/2014] [Indexed: 06/04/2023]
Abstract
For estimation and predictions of random fields, it is increasingly acknowledged that the kriging variance may be a poor representative of true uncertainty. Experimental designs based on more elaborate criteria that are appropriate for empirical kriging (EK) are then often non-space-filling and very costly to determine. In this paper, we investigate the possibility of using a compound criterion inspired by an equivalence theorem type relation to build designs quasi-optimal for the EK variance when space-filling designs become unsuitable. Two algorithms are proposed, one relying on stochastic optimization to explicitly identify the Pareto front, whereas the second uses the surrogate criteria as local heuristic to choose the points at which the (costly) true EK variance is effectively computed. We illustrate the performance of the algorithms presented on both a simple simulated example and a real oceanographic dataset. © 2014 The Authors. Applied Stochastic Models in Business and Industry published by John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Werner G Müller
- Department of Applied Statistics, Johannes-Kepler-University of Linz Linz, Austria
| | - Luc Pronzato
- Laboratoire I3S, CNRS/Université de Nice-Sophia Antipolis Nice, France
| | - Joao Rendas
- Laboratoire I3S, CNRS/Université de Nice-Sophia Antipolis Nice, France
| | - Helmut Waldl
- Department of Applied Statistics, Johannes-Kepler-University of Linz Linz, Austria
| |
Collapse
|
27
|
Lu L, Anderson-Cook CM, Lin DK. Optimal designed experiments using a Pareto front search for focused preference of multiple objectives. Comput Stat Data Anal 2014. [DOI: 10.1016/j.csda.2013.04.008] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
28
|
A coordinate-exchange two-phase local search algorithm for the D- and I-optimal designs of split-plot experiments. Comput Stat Data Anal 2014. [DOI: 10.1016/j.csda.2013.03.015] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
29
|
Anderson-Cook CM, Hamada MS. Comment: Toward Guidelines for Practitioners on Screening Designs and Analysis. Technometrics 2014. [DOI: 10.1080/00401706.2013.822831] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
30
|
Lu L, Chapman JL, Anderson-cook CM. A Case Study on Selecting a Best Allocation of New Data for Improving the Estimation Precision of System and Subsystem Reliability Using Pareto Fronts. Technometrics 2013. [DOI: 10.1080/00401706.2013.831776] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
31
|
|
32
|
Sánchez M, Sarabia L, Ortiz M. On the construction of experimental designs for a given task by jointly optimizing several quality criteria: Pareto-optimal experimental designs. Anal Chim Acta 2012; 754:39-46. [DOI: 10.1016/j.aca.2012.10.014] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2012] [Revised: 09/23/2012] [Accepted: 10/08/2012] [Indexed: 11/28/2022]
|
33
|
|