1
|
Chen H, Lu D, Xiao Z, Li S, Zhang W, Luan X, Zhang W, Zheng G. Comprehensive applications of the artificial intelligence technology in new drug research and development. Health Inf Sci Syst 2024; 12:41. [PMID: 39130617 PMCID: PMC11310389 DOI: 10.1007/s13755-024-00300-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Accepted: 07/27/2024] [Indexed: 08/13/2024] Open
Abstract
Purpose Target-based strategy is a prevalent means of drug research and development (R&D), since targets provide effector molecules of drug action and offer the foundation of pharmacological investigation. Recently, the artificial intelligence (AI) technology has been utilized in various stages of drug R&D, where AI-assisted experimental methods show higher efficiency than sole experimental ones. It is a critical need to give a comprehensive review of AI applications in drug R &D for biopharmaceutical field. Methods Relevant literatures about AI-assisted drug R&D were collected from the public databases (Including Google Scholar, Web of Science, PubMed, IEEE Xplore Digital Library, Springer, and ScienceDirect) through a keyword searching strategy with the following terms [("Artificial Intelligence" OR "Knowledge Graph" OR "Machine Learning") AND ("Drug Target Identification" OR "New Drug Development")]. Results In this review, we first introduced common strategies and novel trends of drug R&D, followed by characteristic description of AI algorithms widely used in drug R&D. Subsequently, we depicted detailed applications of AI algorithms in target identification, lead compound identification and optimization, drug repurposing, and drug analytical platform construction. Finally, we discussed the challenges and prospects of AI-assisted methods for drug discovery. Conclusion Collectively, this review provides comprehensive overview of AI applications in drug R&D and presents future perspectives for biopharmaceutical field, which may promote the development of drug industry.
Collapse
Affiliation(s)
- Hongyu Chen
- Shanghai Frontiers Science Center for Chinese Medicine Chemical Biology, Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Shanghai, China
| | - Dong Lu
- Shanghai Frontiers Science Center for Chinese Medicine Chemical Biology, Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Shanghai, China
| | - Ziyi Xiao
- Johns Hopkins Bloomberg School of Public Health, Baltimore, MD USA
| | - Shensuo Li
- Shanghai Frontiers Science Center for Chinese Medicine Chemical Biology, Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Shanghai, China
| | - Wen Zhang
- Shanghai Frontiers Science Center for Chinese Medicine Chemical Biology, Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Shanghai, China
| | - Xin Luan
- Shanghai Frontiers Science Center for Chinese Medicine Chemical Biology, Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Shanghai, China
| | - Weidong Zhang
- Shanghai Frontiers Science Center for Chinese Medicine Chemical Biology, Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Shanghai, China
| | - Guangyong Zheng
- Shanghai Frontiers Science Center for Chinese Medicine Chemical Biology, Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Shanghai, China
| |
Collapse
|
2
|
Farisco M, Evers K, Changeux JP. Is artificial consciousness achievable? Lessons from the human brain. Neural Netw 2024; 180:106714. [PMID: 39270349 DOI: 10.1016/j.neunet.2024.106714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 07/29/2024] [Accepted: 09/06/2024] [Indexed: 09/15/2024]
Abstract
We here analyse the question of developing artificial consciousness from an evolutionary perspective, taking the evolution of the human brain and its relation with consciousness as a reference model or as a benchmark. This kind of analysis reveals several structural and functional features of the human brain that appear to be key for reaching human-like complex conscious experience and that current research on Artificial Intelligence (AI) should take into account in its attempt to develop systems capable of human-like conscious processing. We argue that, even if AI is limited in its ability to emulate human consciousness for both intrinsic (i.e., structural and architectural) and extrinsic (i.e., related to the current stage of scientific and technological knowledge) reasons, taking inspiration from those characteristics of the brain that make human-like conscious processing possible and/or modulate it, is a potentially promising strategy towards developing conscious AI. Also, it cannot be theoretically excluded that AI research can develop partial or potentially alternative forms of consciousness that are qualitatively different from the human form, and that may be either more or less sophisticated depending on the perspectives. Therefore, we recommend neuroscience-inspired caution in talking about artificial consciousness: since the use of the same word "consciousness" for humans and AI becomes ambiguous and potentially misleading, we propose to clearly specify which level and/or type of consciousness AI research aims to develop, as well as what would be common versus differ in AI conscious processing compared to human conscious experience.
Collapse
Affiliation(s)
- Michele Farisco
- Centre for Research Ethics and Bioethics, Department of Public Health and Caring Sciences, Uppsala University, Uppsala, Sweden; Biogem, Biology and Molecular Genetics Institute, Ariano Irpino (AV), Italy.
| | - Kathinka Evers
- Centre for Research Ethics and Bioethics, Department of Public Health and Caring Sciences, Uppsala University, Uppsala, Sweden
| | | |
Collapse
|
3
|
Huang P, Liu X, Xin Y, Gu Y, Lee A, Zhang Y, Xu Z, Chen P, Zhang Y, Deng W, Yu G, Wu D, Liu Z, Yao Q, Yang Y, Zhu Z, Kou X. Integrated Artificial Neural Network with Trainable Activation Function Enabled by Topological Insulator-Based Spin-Orbit Torque Devices. ACS NANO 2024; 18:29469-29478. [PMID: 39405579 DOI: 10.1021/acsnano.4c03278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/30/2024]
Abstract
Nonvolatile memristors offer a salient platform for artificial neural network (ANN), yet the integration of different function and algorithm blocks into one hardware system remains challenging. Here we demonstrate the brain-like synaptic (SOT-S) and neuronal (SOT-N) functions in the Bi2Te3/CrTe2 heterostructure-based spin-orbit torque (SOT) device. The SOT-S unit exhibits highly linear and symmetrical long-term potentiation/depression process, resulting in a fast-training of the MNIST data set with the classification accuracy above 90%. Meanwhile, the Sigmoid-shape transition curve inherited in the SOT-N cell replaces the software-based activation function block, hence reducing the system complexity. On this basis, we employ a serial-connected, voltage-mode sensing ANN architecture to enhance the vector-matrix multiplication signal strength with low reading error of 0.61% while simplifying the peripheral circuitry. Furthermore, the trainable activation function of SOT-N enables the implementation of the Batch Normalization algorithm and activation operation within one clock cycle, which bring about improved on/off-chip training performance close to the ideal baseline.
Collapse
Affiliation(s)
- Puyang Huang
- School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Xinqi Liu
- School of Physical Science and Technology, ShanghaiTech University, Shanghai 201210, China
- ShanghaiTech Laboratory for Topological Physics, ShanghaiTech University, Shanghai 201210, China
| | - Yue Xin
- School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Yu Gu
- School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Albert Lee
- Suzhou Inston Technology Co., Ltd., Suzhou, Jiangsu 215121, China
| | - Yifan Zhang
- School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Zhuo Xu
- School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Peng Chen
- School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Yu Zhang
- Beijing National Laboratory for Condensed Matter, Physics Institute of Physics, Chinese Academy of Sciences, Beijing 100190, China
| | - Weijie Deng
- School of Physical Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Guoqiang Yu
- Beijing National Laboratory for Condensed Matter, Physics Institute of Physics, Chinese Academy of Sciences, Beijing 100190, China
| | - Di Wu
- Suzhou Inston Technology Co., Ltd., Suzhou, Jiangsu 215121, China
| | - Zhongkai Liu
- School of Physical Science and Technology, ShanghaiTech University, Shanghai 201210, China
- ShanghaiTech Laboratory for Topological Physics, ShanghaiTech University, Shanghai 201210, China
| | - Qi Yao
- School of Physical Science and Technology, ShanghaiTech University, Shanghai 201210, China
- ShanghaiTech Laboratory for Topological Physics, ShanghaiTech University, Shanghai 201210, China
| | - Yumeng Yang
- School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Zhifeng Zhu
- School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China
| | - Xufeng Kou
- School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China
- ShanghaiTech Laboratory for Topological Physics, ShanghaiTech University, Shanghai 201210, China
| |
Collapse
|
4
|
Han T, Nebelung S, Khader F, Wang T, Müller-Franzes G, Kuhl C, Försch S, Kleesiek J, Haarburger C, Bressem KK, Kather JN, Truhn D. Medical large language models are susceptible to targeted misinformation attacks. NPJ Digit Med 2024; 7:288. [PMID: 39443664 PMCID: PMC11499642 DOI: 10.1038/s41746-024-01282-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2024] [Accepted: 10/02/2024] [Indexed: 10/25/2024] Open
Abstract
Large language models (LLMs) have broad medical knowledge and can reason about medical information across many domains, holding promising potential for diverse medical applications in the near future. In this study, we demonstrate a concerning vulnerability of LLMs in medicine. Through targeted manipulation of just 1.1% of the weights of the LLM, we can deliberately inject incorrect biomedical facts. The erroneous information is then propagated in the model's output while maintaining performance on other biomedical tasks. We validate our findings in a set of 1025 incorrect biomedical facts. This peculiar susceptibility raises serious security and trustworthiness concerns for the application of LLMs in healthcare settings. It accentuates the need for robust protective measures, thorough verification mechanisms, and stringent management of access to these models, ensuring their reliable and safe use in medical practice.
Collapse
Affiliation(s)
- Tianyu Han
- Department of Diagnostic and Interventional Radiology, University Hospital Aachen, Aachen, Germany.
| | - Sven Nebelung
- Department of Diagnostic and Interventional Radiology, University Hospital Aachen, Aachen, Germany
| | - Firas Khader
- Department of Diagnostic and Interventional Radiology, University Hospital Aachen, Aachen, Germany
| | - Tianci Wang
- Department of Diagnostic and Interventional Radiology, University Hospital Aachen, Aachen, Germany
| | - Gustav Müller-Franzes
- Department of Diagnostic and Interventional Radiology, University Hospital Aachen, Aachen, Germany
| | - Christiane Kuhl
- Department of Diagnostic and Interventional Radiology, University Hospital Aachen, Aachen, Germany
| | - Sebastian Försch
- Institute of Pathology, University Medical Center of the Johannes Gutenberg-University, Mainz, Germany
| | - Jens Kleesiek
- Institute for AI in Medicine, University Medicine Essen, Essen, Germany
| | | | - Keno K Bressem
- Department of Radiology, Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt Universität zu Berlin, Berlin, Germany
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany
| | - Jakob Nikolas Kather
- Else Kroener Fresenius Center for Digital Health (EKFZ), Technical University Dresden, Dresden, Germany
- Department of Medicine I, University Hospital Dresden, Dresden, Germany
- Medical Oncology, National Center for Tumor Diseases (NCT), University Hospital Heidelberg, Heidelberg, Germany
| | - Daniel Truhn
- Department of Diagnostic and Interventional Radiology, University Hospital Aachen, Aachen, Germany.
| |
Collapse
|
5
|
Luppi AI, Achterberg J, Schmidgall S, Bilgin IP, Herholz P, Sprang M, Fockter B, Ham AS, Thorat S, Ziaei R, Milisav F, Proca AM, Tolle HM, Suárez LE, Scotti P, Gellersen HM. Trainees' perspectives and recommendations for catalyzing the next generation of NeuroAI researchers. Nat Commun 2024; 15:9152. [PMID: 39443525 PMCID: PMC11499826 DOI: 10.1038/s41467-024-53375-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2023] [Accepted: 10/10/2024] [Indexed: 10/25/2024] Open
Abstract
New developing area of NeuroAI at the intersection of neuroscience and artificial intelligence has many open challenges, one of which is training the new generation of experts. In this Comment, the authors provide resources and outline training needs and recommendations for junior researchers working across artificial intelligence and neuroscience.
Collapse
Affiliation(s)
- Andrea I Luppi
- Centre for Eudaimonia and Human Flourishing, Department of Psychiatry, University of Oxford, Oxford, UK.
- Montreal Neurological Institute, McGill University, Montreal, QC, Canada.
- UNIQUE Neuro-AI Center, Montreal, QC, Canada.
- St John's College, University of Cambridge, Cambridge, UK.
| | - Jascha Achterberg
- MRC Cognition and Brain Sciences Unit, University of Cambridge, Cambridge, UK.
- Intel Labs, Santa Clara, USA.
- Centre for Neural Circuits and Behaviou, University of Oxford, Oxford, UK.
| | | | - Isil Poyraz Bilgin
- Centre de recherche de l'Institut universitaire de gériatrie de Montréal, University of Montreal, Montreal, QC, Canada
| | - Peer Herholz
- Montreal Neurological Institute, McGill University, Montreal, QC, Canada
| | - Maximilian Sprang
- Faculty of Biology and Institute of Quantitative & Computational Biosciences, Johannes Gutenberg University Mainz, Mainz, Germany
- German Research Center for Artificial Intelligence (DFKI), Kaiserslautern, Germany
| | | | | | | | - Rojin Ziaei
- University of Maryland College Park, College Park, MD, USA
| | - Filip Milisav
- Montreal Neurological Institute, McGill University, Montreal, QC, Canada
- UNIQUE Neuro-AI Center, Montreal, QC, Canada
| | | | - Hanna M Tolle
- Department of Computing, Imperial College London, London, UK
| | - Laura E Suárez
- Montreal Neurological Institute, McGill University, Montreal, QC, Canada
- UNIQUE Neuro-AI Center, Montreal, QC, Canada
- Innodem Neurosciences, Montreal, QC, Canada
| | - Paul Scotti
- Princeton Neuroscience Institute, Princeton University, Princeton, NJ, USA
| | - Helena M Gellersen
- St John's College, University of Cambridge, Cambridge, UK
- German Center for Neurodegenerative Diseases, Magdeburg, Germany
- Department of Psychology, University of Cambridge, Cambridge, UK
| |
Collapse
|
6
|
Nau M, Schmid AC, Kaplan SM, Baker CI, Kravitz DJ. Centering cognitive neuroscience on task demands and generalization. Nat Neurosci 2024; 27:1656-1667. [PMID: 39075326 DOI: 10.1038/s41593-024-01711-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 06/17/2024] [Indexed: 07/31/2024]
Abstract
Cognitive neuroscience seeks generalizable theories explaining the relationship between behavioral, physiological and mental states. In pursuit of such theories, we propose a theoretical and empirical framework that centers on understanding task demands and the mutual constraints they impose on behavior and neural activity. Task demands emerge from the interaction between an agent's sensory impressions, goals and behavior, which jointly shape the activity and structure of the nervous system on multiple spatiotemporal scales. Understanding this interaction requires multitask studies that vary more than one experimental component (for example, stimuli and instructions) combined with dense behavioral and neural sampling and explicit testing for generalization across tasks and data modalities. By centering task demands rather than mental processes that tasks are assumed to engage, this framework paves the way for the discovery of new generalizable concepts unconstrained by existing taxonomies, and moves cognitive neuroscience toward an action-oriented, dynamic and integrated view of the brain.
Collapse
Affiliation(s)
- Matthias Nau
- Laboratory of Brain and Cognition, National Institutes of Health, Bethesda, MD, USA.
| | - Alexandra C Schmid
- Laboratory of Brain and Cognition, National Institutes of Health, Bethesda, MD, USA
| | - Simon M Kaplan
- Department of Psychological & Brain Sciences, The George Washington University, Washington, DC, USA
| | - Chris I Baker
- Laboratory of Brain and Cognition, National Institutes of Health, Bethesda, MD, USA.
| | - Dwight J Kravitz
- Department of Psychological & Brain Sciences, The George Washington University, Washington, DC, USA.
- Division of Behavioral and Cognitive Sciences, Directorate for Social, Behavioral, and Economic Sciences, US National Science Foundation, Arlington, VA, USA.
| |
Collapse
|
7
|
Weilenmann C, Ziogas AN, Zellweger T, Portner K, Mladenović M, Kaniselvan M, Moraitis T, Luisier M, Emboras A. Single neuromorphic memristor closely emulates multiple synaptic mechanisms for energy efficient neural networks. Nat Commun 2024; 15:6898. [PMID: 39138160 PMCID: PMC11322324 DOI: 10.1038/s41467-024-51093-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 07/27/2024] [Indexed: 08/15/2024] Open
Abstract
Biological neural networks do not only include long-term memory and weight multiplication capabilities, as commonly assumed in artificial neural networks, but also more complex functions such as short-term memory, short-term plasticity, and meta-plasticity - all collocated within each synapse. Here, we demonstrate memristive nano-devices based on SrTiO3 that inherently emulate all these synaptic functions. These memristors operate in a non-filamentary, low conductance regime, which enables stable and energy efficient operation. They can act as multi-functional hardware synapses in a class of bio-inspired deep neural networks (DNN) that make use of both long- and short-term synaptic dynamics and are capable of meta-learning or learning-to-learn. The resulting bio-inspired DNN is then trained to play the video game Atari Pong, a complex reinforcement learning task in a dynamic environment. Our analysis shows that the energy consumption of the DNN with multi-functional memristive synapses decreases by about two orders of magnitude as compared to a pure GPU implementation. Based on this finding, we infer that memristive devices with a better emulation of the synaptic functionalities do not only broaden the applicability of neuromorphic computing, but could also improve the performance and energy costs of certain artificial intelligence applications.
Collapse
Affiliation(s)
| | | | - Till Zellweger
- Integrated Systems Laboratory, ETH Zurich, Zurich, Switzerland
| | - Kevin Portner
- Integrated Systems Laboratory, ETH Zurich, Zurich, Switzerland
| | | | | | | | - Mathieu Luisier
- Integrated Systems Laboratory, ETH Zurich, Zurich, Switzerland
| | | |
Collapse
|
8
|
Tuckute G, Kanwisher N, Fedorenko E. Language in Brains, Minds, and Machines. Annu Rev Neurosci 2024; 47:277-301. [PMID: 38669478 DOI: 10.1146/annurev-neuro-120623-101142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/28/2024]
Abstract
It has long been argued that only humans could produce and understand language. But now, for the first time, artificial language models (LMs) achieve this feat. Here we survey the new purchase LMs are providing on the question of how language is implemented in the brain. We discuss why, a priori, LMs might be expected to share similarities with the human language system. We then summarize evidence that LMs represent linguistic information similarly enough to humans to enable relatively accurate brain encoding and decoding during language processing. Finally, we examine which LM properties-their architecture, task performance, or training-are critical for capturing human neural responses to language and review studies using LMs as in silico model organisms for testing hypotheses about language. These ongoing investigations bring us closer to understanding the representations and processes that underlie our ability to comprehend sentences and express thoughts in language.
Collapse
Affiliation(s)
- Greta Tuckute
- Department of Brain and Cognitive Sciences and McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA;
| | - Nancy Kanwisher
- Department of Brain and Cognitive Sciences and McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA;
| | - Evelina Fedorenko
- Department of Brain and Cognitive Sciences and McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA;
| |
Collapse
|
9
|
Zhu X, Zhao L, Zhu W. Salience Interest Option: Temporal abstraction with salience interest functions. Neural Netw 2024; 176:106342. [PMID: 38692188 DOI: 10.1016/j.neunet.2024.106342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Revised: 04/02/2024] [Accepted: 04/23/2024] [Indexed: 05/03/2024]
Abstract
Reinforcement Learning (RL) is a significant machine learning subfield that emphasizes learning actions based on environment to obtain optimal behavior policy. RL agents can make decisions at variable time scales in the form of temporal abstractions, also known as options. The issue of discovering options has seen a considerable research effort. Most notably, the Interest Option Critic (IOC) algorithm first extends the initial set to the interest function, providing a method for learning options specialized to certain state space regions. This approach offers a specific attention mechanism for action selection. Unfortunately, this method still suffers from the classic issues of poor data efficiency and lack of flexibility in RL when learning options end-to-end through backpropagation. This paper proposes a new approach called Salience Interest Option Critic (SIOC), which chooses subsets of existing initiation sets for RL. Specifically, these subsets are not learned by backpropagation, which is slow and tends to overfit, but through particle filters. This approach enables the rapid and flexible identification of critical subsets using only reward feedback. We conducted experiments in discrete and continuous domains, and our proposed method demonstrate higher efficiency and flexibility than other methods. The generated options are more valuable within a single task and exhibited greater interpretability and reusability in multi-task learning scenarios.
Collapse
Affiliation(s)
- Xianchao Zhu
- Key Laboratory of Grain Information Processing and Control (Henan University of Technology), Ministry of Education, 450001, Zhengzhou, China; Henan Key Laboratory of Grain Photoelectric Detection and Control, Henan University of Technology, 450001, Zhengzhou, China; School of Artificial Intelligence and Big Data, Henan University of Technology, 450001, Zhengzhou, China.
| | - Liang Zhao
- College of Electrical Engineering, Henan University of Technology, 450001, Zhengzhou, China
| | - William Zhu
- Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, 610054, Chengdu, China
| |
Collapse
|
10
|
Xu Y, Ma S, Cui H, Chen J, Xu S, Gong F, Golubovic A, Zhou M, Wang KC, Varley A, Lu RXZ, Wang B, Li B. AGILE platform: a deep learning powered approach to accelerate LNP development for mRNA delivery. Nat Commun 2024; 15:6305. [PMID: 39060305 PMCID: PMC11282250 DOI: 10.1038/s41467-024-50619-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Accepted: 07/09/2024] [Indexed: 07/28/2024] Open
Abstract
Ionizable lipid nanoparticles (LNPs) are seeing widespread use in mRNA delivery, notably in SARS-CoV-2 mRNA vaccines. However, the expansion of mRNA therapies beyond COVID-19 is impeded by the absence of LNPs tailored for diverse cell types. In this study, we present the AI-Guided Ionizable Lipid Engineering (AGILE) platform, a synergistic combination of deep learning and combinatorial chemistry. AGILE streamlines ionizable lipid development with efficient library design, in silico lipid screening via deep neural networks, and adaptability to diverse cell lines. Using AGILE, we rapidly design, synthesize, and evaluate ionizable lipids for mRNA delivery, selecting from a vast library. Intriguingly, AGILE reveals cell-specific preferences for ionizable lipids, indicating tailoring for optimal delivery to varying cell types. These highlight AGILE's potential in expediting the development of customized LNPs, addressing the complex needs of mRNA delivery in clinical practice, thereby broadening the scope and efficacy of mRNA therapies.
Collapse
Affiliation(s)
- Yue Xu
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON, Canada
| | - Shihao Ma
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada
- Peter Munk Cardiac Centre, University Health Network, Toronto, ON, Canada
| | - Haotian Cui
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada
- Peter Munk Cardiac Centre, University Health Network, Toronto, ON, Canada
| | - Jingan Chen
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON, Canada
- Institute of Biomedical Engineering, University of Toronto, Toronto, ON, Canada
| | - Shufen Xu
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON, Canada
| | - Fanglin Gong
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON, Canada
- Institute of Biomedical Engineering, University of Toronto, Toronto, ON, Canada
| | - Alex Golubovic
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON, Canada
| | - Muye Zhou
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON, Canada
| | - Kevin Chang Wang
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON, Canada
| | - Andrew Varley
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON, Canada
| | - Rick Xing Ze Lu
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON, Canada
| | - Bo Wang
- Department of Computer Science, University of Toronto, Toronto, ON, Canada.
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada.
- Peter Munk Cardiac Centre, University Health Network, Toronto, ON, Canada.
- Princess Margaret Cancer Center, University Health Network, Toronto, ON, Canada.
| | - Bowen Li
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON, Canada.
- Institute of Biomedical Engineering, University of Toronto, Toronto, ON, Canada.
- Department of Chemistry, University of Toronto, Toronto, ON, Canada.
- Department of Laboratory Medicine and Pathobiology, University of Toronto, Toronto, ON, Canada.
| |
Collapse
|
11
|
Zhang R, Pitkow X, Angelaki DE. Inductive biases of neural network modularity in spatial navigation. SCIENCE ADVANCES 2024; 10:eadk1256. [PMID: 39028809 PMCID: PMC11259174 DOI: 10.1126/sciadv.adk1256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 06/14/2024] [Indexed: 07/21/2024]
Abstract
The brain may have evolved a modular architecture for daily tasks, with circuits featuring functionally specialized modules that match the task structure. We hypothesize that this architecture enables better learning and generalization than architectures with less specialized modules. To test this, we trained reinforcement learning agents with various neural architectures on a naturalistic navigation task. We found that the modular agent, with an architecture that segregates computations of state representation, value, and action into specialized modules, achieved better learning and generalization. Its learned state representation combines prediction and observation, weighted by their relative uncertainty, akin to recursive Bayesian estimation. This agent's behavior also resembles macaques' behavior more closely. Our results shed light on the possible rationale for the brain's modularity and suggest that artificial systems can use this insight from neuroscience to improve learning and generalization in natural tasks.
Collapse
Affiliation(s)
- Ruiyi Zhang
- Tandon School of Engineering, New York University, New York, NY, USA
| | - Xaq Pitkow
- Neuroscience Institute, Carnegie Mellon University, Pittsburgh, PA, USA
- Department of Machine Learning, Carnegie Mellon University, Pittsburgh, PA, USA
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, USA
- Department of Electrical and Computer Engineering, Rice University, Houston, TX, USA
- Center for Neuroscience and Artificial Intelligence, Baylor College of Medicine, Houston, TX, USA
| | - Dora E. Angelaki
- Tandon School of Engineering, New York University, New York, NY, USA
- Center for Neural Science, New York University, New York, NY, USA
| |
Collapse
|
12
|
Nakashima T, Otake S, Taniguchi A, Maeyama K, El Hafi L, Taniguchi T, Yamakawa H. Hippocampal formation-inspired global self-localization: quick recovery from the kidnapped robot problem from an egocentric perspective. Front Comput Neurosci 2024; 18:1398851. [PMID: 39092317 PMCID: PMC11291353 DOI: 10.3389/fncom.2024.1398851] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2024] [Accepted: 05/29/2024] [Indexed: 08/04/2024] Open
Abstract
It remains difficult for mobile robots to continue accurate self-localization when they are suddenly teleported to a location that is different from their beliefs during navigation. Incorporating insights from neuroscience into developing a spatial cognition model for mobile robots may make it possible to acquire the ability to respond appropriately to changing situations, similar to living organisms. Recent neuroscience research has shown that during teleportation in rat navigation, neural populations of place cells in the cornu ammonis-3 region of the hippocampus, which are sparse representations of each other, switch discretely. In this study, we construct a spatial cognition model using brain reference architecture-driven development, a method for developing brain-inspired software that is functionally and structurally consistent with the brain. The spatial cognition model was realized by integrating the recurrent state-space model, a world model, with Monte Carlo localization to infer allocentric self-positions within the framework of neuro-symbol emergence in the robotics toolkit. The spatial cognition model, which models the cornu ammonis-1 and -3 regions with each latent variable, demonstrated improved self-localization performance of mobile robots during teleportation in a simulation environment. Moreover, it was confirmed that sparse neural activity could be obtained for the latent variables corresponding to cornu ammonis-3. These results suggest that spatial cognition models incorporating neuroscience insights can contribute to improving the self-localization technology for mobile robots. The project website is https://nakashimatakeshi.github.io/HF-IGL/.
Collapse
Affiliation(s)
- Takeshi Nakashima
- Graduate School of Information Science and Engineering, Ritsumeikan University, Osaka, Japan
| | - Shunsuke Otake
- Graduate School of Information Science and Technology, Osaka University, Osaka, Japan
| | - Akira Taniguchi
- College of Information Science and Engineering, Ritsumeikan University, Osaka, Japan
| | - Katsuyoshi Maeyama
- Graduate School of Information Science and Engineering, Ritsumeikan University, Osaka, Japan
| | - Lotfi El Hafi
- Research Organization of Science and Technology, Ritsumeikan University, Shiga, Japan
| | - Tadahiro Taniguchi
- College of Information Science and Engineering, Ritsumeikan University, Osaka, Japan
| | - Hiroshi Yamakawa
- The Whole Brain Architecture Initiative, Tokyo, Japan
- School of Engineering, The University of Tokyo, Tokyo, Japan
- RIKEN Center for Advanced Intelligence Project, Tokyo, Japan
| |
Collapse
|
13
|
Zhu X. Temporally extended successor feature neural episodic control. Sci Rep 2024; 14:15103. [PMID: 38956201 PMCID: PMC11219751 DOI: 10.1038/s41598-024-65687-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 06/24/2024] [Indexed: 07/04/2024] Open
Abstract
One of the long-term goals of reinforcement learning is to build intelligent agents capable of rapidly learning and flexibly transferring skills, similar to humans and animals. In this paper, we introduce an episodic control framework based on the temporal expansion of subsequent features to achieve these goals, which we refer to as Temporally Extended Successor Feature Neural Episodic Control (TESFNEC). This method has shown impressive results in significantly improving sample efficiency and elegantly reusing previously learned strategies. Crucially, this model enhances agent training by incorporating episodic memory, significantly reducing the number of iterations required to learn the optimal policy. Furthermore, we adopt the temporal expansion of successor features a technique to capture the expected state transition dynamics of actions. This form of temporal abstraction does not entail learning a top-down hierarchy of task structures but focuses on the bottom-up combination of actions and action repetitions. Thus, our approach directly considers the temporal scope of sequences of temporally extended actions without requiring predefined or domain-specific options. Experimental results in the two-dimensional object collection environment demonstrate that the method proposed in this paper optimizes learning policies faster than baseline reinforcement learning approaches, leading to higher average returns.
Collapse
Affiliation(s)
- Xianchao Zhu
- Key Laboratory of Grain Information Processing and Control (Henan University of Technology), Ministry of Education, Zhengzhou, 450001, China.
- Henan Key Laboratory of Grain Photoelectric Detection and Control, Henan University of Technology, Zhengzhou, 450001, China.
- School of Artificial Intelligence and Big Data, Henan University of Technology, Zhengzhou, 450001, China.
| |
Collapse
|
14
|
Li Q, Sorscher B, Sompolinsky H. Representations and generalization in artificial and brain neural networks. Proc Natl Acad Sci U S A 2024; 121:e2311805121. [PMID: 38913896 PMCID: PMC11228472 DOI: 10.1073/pnas.2311805121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/26/2024] Open
Abstract
Humans and animals excel at generalizing from limited data, a capability yet to be fully replicated in artificial intelligence. This perspective investigates generalization in biological and artificial deep neural networks (DNNs), in both in-distribution and out-of-distribution contexts. We introduce two hypotheses: First, the geometric properties of the neural manifolds associated with discrete cognitive entities, such as objects, words, and concepts, are powerful order parameters. They link the neural substrate to the generalization capabilities and provide a unified methodology bridging gaps between neuroscience, machine learning, and cognitive science. We overview recent progress in studying the geometry of neural manifolds, particularly in visual object recognition, and discuss theories connecting manifold dimension and radius to generalization capacity. Second, we suggest that the theory of learning in wide DNNs, especially in the thermodynamic limit, provides mechanistic insights into the learning processes generating desired neural representational geometries and generalization. This includes the role of weight norm regularization, network architecture, and hyper-parameters. We will explore recent advances in this theory and ongoing challenges. We also discuss the dynamics of learning and its relevance to the issue of representational drift in the brain.
Collapse
Affiliation(s)
- Qianyi Li
- The Harvard Biophysics Graduate Program, Harvard University, Cambridge, MA02138
- Center for Brain Science, Harvard University, Cambridge, MA02138
| | - Ben Sorscher
- The Applied Physics Department, Stanford University, Stanford, CA94305
| | - Haim Sompolinsky
- Center for Brain Science, Harvard University, Cambridge, MA02138
- Edmond and Lily Safra Center for Brain Sciences, Hebrew University, Jerusalem9190401, Israel
| |
Collapse
|
15
|
Sharafeldin A, Imam N, Choi H. Active sensing with predictive coding and uncertainty minimization. PATTERNS (NEW YORK, N.Y.) 2024; 5:100983. [PMID: 39005491 PMCID: PMC11240181 DOI: 10.1016/j.patter.2024.100983] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Revised: 03/11/2024] [Accepted: 04/08/2024] [Indexed: 07/16/2024]
Abstract
We present an end-to-end architecture for embodied exploration inspired by two biological computations: predictive coding and uncertainty minimization. The architecture can be applied to any exploration setting in a task-independent and intrinsically driven manner. We first demonstrate our approach in a maze navigation task and show that it can discover the underlying transition distributions and spatial features of the environment. Second, we apply our model to a more complex active vision task, whereby an agent actively samples its visual environment to gather information. We show that our model builds unsupervised representations through exploration that allow it to efficiently categorize visual scenes. We further show that using these representations for downstream classification leads to superior data efficiency and learning speed compared to other baselines while maintaining lower parameter complexity. Finally, the modular structure of our model facilitates interpretability, allowing us to probe its internal mechanisms and representations during exploration.
Collapse
Affiliation(s)
- Abdelrahman Sharafeldin
- ML@GT, Georgia Institute of Technology, Atlanta, GA 30332, USA
- School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA
- School of Mathematics, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Nabil Imam
- ML@GT, Georgia Institute of Technology, Atlanta, GA 30332, USA
- School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Hannah Choi
- ML@GT, Georgia Institute of Technology, Atlanta, GA 30332, USA
- School of Mathematics, Georgia Institute of Technology, Atlanta, GA 30332, USA
| |
Collapse
|
16
|
Madadi Y, Delsoz M, Khouri AS, Boland M, Grzybowski A, Yousefi S. Applications of artificial intelligence-enabled robots and chatbots in ophthalmology: recent advances and future trends. Curr Opin Ophthalmol 2024; 35:238-243. [PMID: 38277274 PMCID: PMC10959691 DOI: 10.1097/icu.0000000000001035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2024]
Abstract
PURPOSE OF REVIEW Recent advances in artificial intelligence (AI), robotics, and chatbots have brought these technologies to the forefront of medicine, particularly ophthalmology. These technologies have been applied in diagnosis, prognosis, surgical operations, and patient-specific care in ophthalmology. It is thus both timely and pertinent to assess the existing landscape, recent advances, and trajectory of trends of AI, AI-enabled robots, and chatbots in ophthalmology. RECENT FINDINGS Some recent developments have integrated AI enabled robotics with diagnosis, and surgical procedures in ophthalmology. More recently, large language models (LLMs) like ChatGPT have shown promise in augmenting research capabilities and diagnosing ophthalmic diseases. These developments may portend a new era of doctor-patient-machine collaboration. SUMMARY Ophthalmology is undergoing a revolutionary change in research, clinical practice, and surgical interventions. Ophthalmic AI-enabled robotics and chatbot technologies based on LLMs are converging to create a new era of digital ophthalmology. Collectively, these developments portend a future in which conventional ophthalmic knowledge will be seamlessly integrated with AI to improve the patient experience and enhance therapeutic outcomes.
Collapse
Affiliation(s)
- Yeganeh Madadi
- Department of Ophthalmology, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Mohammad Delsoz
- Department of Ophthalmology, University of Tennessee Health Science Center, Memphis, TN, USA
| | - Albert S. Khouri
- Institute of Ophthalmology and Visual Science, University of Medicine and Dentistry of New Jersey, NJ, USA
| | - Michael Boland
- Department of Ophthalmology, Massachusetts Eye and Ear, Boston, MA, USA
| | - Andrzej Grzybowski
- Institute for Research in Ophthalmology, Foundation for Ophthalmology Development, Poznan, Poland
| | - Siamak Yousefi
- Department of Ophthalmology, University of Tennessee Health Science Center, Memphis, TN, USA
- Department of Genetics, Genomics, and Informatics, University of Tennessee Health Science Center, Memphis, TN, USA
| |
Collapse
|
17
|
Voigtlaender S, Pawelczyk J, Geiger M, Vaios EJ, Karschnia P, Cudkowicz M, Dietrich J, Haraldsen IRJH, Feigin V, Owolabi M, White TL, Świeboda P, Farahany N, Natarajan V, Winter SF. Artificial intelligence in neurology: opportunities, challenges, and policy implications. J Neurol 2024; 271:2258-2273. [PMID: 38367046 DOI: 10.1007/s00415-024-12220-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2023] [Revised: 01/20/2024] [Accepted: 01/22/2024] [Indexed: 02/19/2024]
Abstract
Neurological conditions are the leading cause of disability and mortality combined, demanding innovative, scalable, and sustainable solutions. Brain health has become a global priority with adoption of the World Health Organization's Intersectoral Global Action Plan in 2022. Simultaneously, rapid advancements in artificial intelligence (AI) are revolutionizing neurological research and practice. This scoping review of 66 original articles explores the value of AI in neurology and brain health, systematizing the landscape for emergent clinical opportunities and future trends across the care trajectory: prevention, risk stratification, early detection, diagnosis, management, and rehabilitation. AI's potential to advance personalized precision neurology and global brain health directives hinges on resolving core challenges across four pillars-models, data, feasibility/equity, and regulation/innovation-through concerted pursuit of targeted recommendations. Paramount actions include swift, ethical, equity-focused integration of novel technologies into clinical workflows, mitigating data-related issues, counteracting digital inequity gaps, and establishing robust governance frameworks balancing safety and innovation.
Collapse
Affiliation(s)
- Sebastian Voigtlaender
- Systems Neuroscience Division, Max-Planck-Institute for Biological Cybernetics, Tübingen, Germany
- Virtual Diagnostics Team, QuantCo Inc., Cambridge, MA, USA
| | - Johannes Pawelczyk
- Faculty of Medicine, Ruprecht-Karls-University, Heidelberg, Germany
- Graduate Center of Medicine and Health, Technical University Munich, Munich, Germany
| | - Mario Geiger
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
- NVIDIA, Zurich, Switzerland
| | - Eugene J Vaios
- Department of Radiation Oncology, Duke University Medical Center, Durham, NC, USA
| | - Philipp Karschnia
- Department of Neurosurgery, Ludwig-Maximilians-University and University Hospital Munich, Munich, Germany
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Merit Cudkowicz
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Jorg Dietrich
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA
| | - Ira R J Hebold Haraldsen
- Department of Neurology, Division of Clinical Neuroscience, Oslo University Hospital, Oslo, Norway
| | - Valery Feigin
- National Institute for Stroke and Applied Neurosciences, Auckland University of Technology, Auckland, New Zealand
| | - Mayowa Owolabi
- Center for Genomics and Precision Medicine, College of Medicine, University of Ibadan, Ibadan, Nigeria
- Neurology Unit, Department of Medicine, University of Ibadan, Ibadan, Nigeria
- Blossom Specialist Medical Center, Ibadan, Nigeria
- Lebanese American University of Beirut, Beirut, Lebanon
| | - Tara L White
- Department of Behavioral and Social Sciences, Brown University, Providence, RI, USA
| | | | | | | | - Sebastian F Winter
- Department of Neurology, Massachusetts General Hospital and Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
18
|
Qu Y, Wei C, Du P, Che W, Zhang C, Ouyang W, Bian Y, Xu F, Hu B, Du K, Wu H, Liu J, Liu Q. Integration of cognitive tasks into artificial general intelligence test for large models. iScience 2024; 27:109550. [PMID: 38595796 PMCID: PMC11001637 DOI: 10.1016/j.isci.2024.109550] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/11/2024] Open
Abstract
During the evolution of large models, performance evaluation is necessary for assessing their capabilities. However, current model evaluations mainly rely on specific tasks and datasets, lacking a united framework for assessing the multidimensional intelligence of large models. In this perspective, we advocate for a comprehensive framework of cognitive science-inspired artificial general intelligence (AGI) tests, including crystallized, fluid, social, and embodied intelligence. The AGI tests consist of well-designed cognitive tests adopted from human intelligence tests, and then naturally encapsulates into an immersive virtual community. We propose increasing the complexity of AGI testing tasks commensurate with advancements in large models and emphasizing the necessity for the interpretation of test results to avoid false negatives and false positives. We believe that cognitive science-inspired AGI tests will effectively guide the targeted improvement of large models in specific dimensions of intelligence and accelerate the integration of large models into human society.
Collapse
Affiliation(s)
- Youzhi Qu
- Department of Biomedical Engineering, Southern University of Science and Technology, Shenzhen 518055, China
| | - Chen Wei
- Department of Biomedical Engineering, Southern University of Science and Technology, Shenzhen 518055, China
| | - Penghui Du
- Department of Biomedical Engineering, Southern University of Science and Technology, Shenzhen 518055, China
| | - Wenxin Che
- Department of Biomedical Engineering, Southern University of Science and Technology, Shenzhen 518055, China
| | - Chi Zhang
- Department of Biomedical Engineering, Southern University of Science and Technology, Shenzhen 518055, China
| | | | | | - Feiyang Xu
- iFLYTEK AI Research, Hefei 230088, China
| | - Bin Hu
- School of Medical Technology, Beijing Institute of Technology, Beijing 100081, China
| | - Kai Du
- Institute for Artificial Intelligence, Peking University, Beijing 100871, China
| | - Haiyan Wu
- Centre for Cognitive and Brain Sciences and Department of Psychology, University of Macau, Macau 999078, China
| | - Jia Liu
- Department of Psychology, Tsinghua University, Beijing 100084, China
| | - Quanying Liu
- Department of Biomedical Engineering, Southern University of Science and Technology, Shenzhen 518055, China
| |
Collapse
|
19
|
Gershman SJ. What have we learned about artificial intelligence from studying the brain? BIOLOGICAL CYBERNETICS 2024; 118:1-5. [PMID: 38337064 DOI: 10.1007/s00422-024-00983-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Accepted: 01/11/2024] [Indexed: 02/12/2024]
Abstract
Neuroscience and artificial intelligence (AI) share a long, intertwined history. It has been argued that discoveries in neuroscience were (and continue to be) instrumental in driving the development of new AI technology. Scrutinizing these historical claims yields a more nuanced story, where AI researchers were loosely inspired by the brain, but ideas flowed mostly in the other direction.
Collapse
Affiliation(s)
- Samuel J Gershman
- Department of Psychology and Center for Brain Science, Harvard University, Cambridge, USA, Cambridge, USA.
- Center for Brains, Minds, and Machines,MIT, Cambridge, USA.
| |
Collapse
|
20
|
Pagkalos M, Makarov R, Poirazi P. Leveraging dendritic properties to advance machine learning and neuro-inspired computing. Curr Opin Neurobiol 2024; 85:102853. [PMID: 38394956 DOI: 10.1016/j.conb.2024.102853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 02/04/2024] [Accepted: 02/05/2024] [Indexed: 02/25/2024]
Abstract
The brain is a remarkably capable and efficient system. It can process and store huge amounts of noisy and unstructured information, using minimal energy. In contrast, current artificial intelligence (AI) systems require vast resources for training while still struggling to compete in tasks that are trivial for biological agents. Thus, brain-inspired engineering has emerged as a promising new avenue for designing sustainable, next-generation AI systems. Here, we describe how dendritic mechanisms of biological neurons have inspired innovative solutions for significant AI problems, including credit assignment in multi-layer networks, catastrophic forgetting, and high-power consumption. These findings provide exciting alternatives to existing architectures, showing how dendritic research can pave the way for building more powerful and energy efficient artificial learning systems.
Collapse
Affiliation(s)
- Michalis Pagkalos
- Institute of Molecular Biology and Biotechnology (IMBB), Foundation for Research and Technology Hellas (FORTH), Heraklion, 70013, Greece; Department of Biology, University of Crete, Heraklion, 70013, Greece. https://twitter.com/MPagkalos
| | - Roman Makarov
- Institute of Molecular Biology and Biotechnology (IMBB), Foundation for Research and Technology Hellas (FORTH), Heraklion, 70013, Greece; Department of Biology, University of Crete, Heraklion, 70013, Greece. https://twitter.com/_RomanMakarov
| | - Panayiota Poirazi
- Institute of Molecular Biology and Biotechnology (IMBB), Foundation for Research and Technology Hellas (FORTH), Heraklion, 70013, Greece.
| |
Collapse
|
21
|
Sievers B, Thornton MA. Deep social neuroscience: the promise and peril of using artificial neural networks to study the social brain. Soc Cogn Affect Neurosci 2024; 19:nsae014. [PMID: 38334747 PMCID: PMC10880882 DOI: 10.1093/scan/nsae014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 12/20/2023] [Accepted: 02/04/2024] [Indexed: 02/10/2024] Open
Abstract
This review offers an accessible primer to social neuroscientists interested in neural networks. It begins by providing an overview of key concepts in deep learning. It then discusses three ways neural networks can be useful to social neuroscientists: (i) building statistical models to predict behavior from brain activity; (ii) quantifying naturalistic stimuli and social interactions; and (iii) generating cognitive models of social brain function. These applications have the potential to enhance the clinical value of neuroimaging and improve the generalizability of social neuroscience research. We also discuss the significant practical challenges, theoretical limitations and ethical issues faced by deep learning. If the field can successfully navigate these hazards, we believe that artificial neural networks may prove indispensable for the next stage of the field's development: deep social neuroscience.
Collapse
Affiliation(s)
- Beau Sievers
- Department of Psychology, Stanford University, 420 Jane Stanford Way, Stanford, CA 94305, USA
- Department of Psychology, Harvard University, 33 Kirkland St., Cambridge, MA 02138, USA
| | - Mark A Thornton
- Department of Psychological and Brain Sciences, Dartmouth College, 6207 Moore Hall, Hanover, NH 03755, USA
| |
Collapse
|
22
|
Pezzulo G, Parr T, Cisek P, Clark A, Friston K. Generating meaning: active inference and the scope and limits of passive AI. Trends Cogn Sci 2024; 28:97-112. [PMID: 37973519 DOI: 10.1016/j.tics.2023.10.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 10/03/2023] [Accepted: 10/05/2023] [Indexed: 11/19/2023]
Abstract
Prominent accounts of sentient behavior depict brains as generative models of organismic interaction with the world, evincing intriguing similarities with current advances in generative artificial intelligence (AI). However, because they contend with the control of purposive, life-sustaining sensorimotor interactions, the generative models of living organisms are inextricably anchored to the body and world. Unlike the passive models learned by generative AI systems, they must capture and control the sensory consequences of action. This allows embodied agents to intervene upon their worlds in ways that constantly put their best models to the test, thus providing a solid bedrock that is - we argue - essential to the development of genuine understanding. We review the resulting implications and consider future directions for generative AI.
Collapse
Affiliation(s)
- Giovanni Pezzulo
- Institute of Cognitive Sciences and Technologies, National Research Council, Rome, Italy.
| | - Thomas Parr
- Nuffield Department of Clinical Neurosciences, University of Oxford
| | - Paul Cisek
- Department of Neuroscience, University of Montréal, Montréal, Québec, Canada
| | - Andy Clark
- Department of Philosophy, University of Sussex, Brighton, UK; Department of Informatics, University of Sussex, Brighton, UK; Department of Philosophy, Macquarie University, Sydney, New South Wales, Australia
| | - Karl Friston
- Wellcome Centre for Human Neuroimaging, Queen Square Institute of Neurology, University College London, London, UK; VERSES AI Research Lab, Los Angeles, CA, USA
| |
Collapse
|
23
|
Hwang GM, Simonian AL. Special Issue-Biosensors and Neuroscience: Is Biosensors Engineering Ready to Embrace Design Principles from Neuroscience? BIOSENSORS 2024; 14:68. [PMID: 38391987 PMCID: PMC10886788 DOI: 10.3390/bios14020068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/24/2023] [Accepted: 01/25/2024] [Indexed: 02/24/2024]
Abstract
In partnership with the Air Force Office of Scientific Research (AFOSR), the National Science Foundation's (NSF) Emerging Frontiers and Multidisciplinary Activities (EFMA) office of the Directorate for Engineering (ENG) launched an Emerging Frontiers in Research and Innovation (EFRI) topic for the fiscal years FY22 and FY23 entitled "Brain-inspired Dynamics for Engineering Energy-Efficient Circuits and Artificial Intelligence" (BRAID) [...].
Collapse
Affiliation(s)
- Grace M. Hwang
- Johns Hopkins University Applied Physics Laboratory, 111000 Johns Hopkins Road, Laurel, MD 20723, USA
| | | |
Collapse
|
24
|
Cui Z, Sa B, Xue KH, Zhang Y, Xiong R, Wen C, Miao X, Sun Z. Magnetic-ferroelectric synergic control of multilevel conducting states in van der Waals multiferroic tunnel junctions towards in-memory computing. NANOSCALE 2024; 16:1331-1344. [PMID: 38131373 DOI: 10.1039/d3nr04712a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
van der Waals (vdW) multiferroic tunnel junctions (MFTJs) based on two-dimensional materials have gained significant interest due to their potential applications in next-generation data storage and in-memory computing devices. In this study, we construct vdW MFTJs by employing monolayer Mn2Se3 as the spin-filter tunnel barrier, TiTe2 as the electrodes and In2S3 as the tunnel barrier to investigate the spin transport properties based on first-principles quantum transport calculations. It is highlighted that apparent tunneling magnetoresistance (TMR) and tunneling electroresistance (TER) effects with a maximum TMR ratio of 6237% and TER ratio of 1771% can be realized by using bilayer In2S3 as the tunnel barrier under finite bias. Furthermore, the physical origin of the distinguished TMR and TER effects is unraveled from the k||-resolved transmission spectra and spin-dependent projected local density of states analysis. Interestingly, four distinguishable conductance states reveal the implementation of four-state nonvolatile data storage using one MFTJ unit. More importantly, in-memory logic computing and multilevel data storage can be achieved at the same time by magnetic switching and electrical control, respectively. These results shed light on vdW MFTJs in the applications of in-memory computing as well as multilevel data storage devices.
Collapse
Affiliation(s)
- Zhou Cui
- Multiscale Computational Materials Facility & Materials Genome Institute, School of Materials Science and Engineering, Fuzhou University, Fuzhou 350108, China.
| | - Baisheng Sa
- Multiscale Computational Materials Facility & Materials Genome Institute, School of Materials Science and Engineering, Fuzhou University, Fuzhou 350108, China.
| | - Kan-Hao Xue
- School of Integrated Circuits, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Yinggan Zhang
- College of Materials, Fujian Provincial Key Laboratory of Theoretical and Computational Chemistry, Xiamen University, Xiamen 361005, P. R. China
| | - Rui Xiong
- Multiscale Computational Materials Facility & Materials Genome Institute, School of Materials Science and Engineering, Fuzhou University, Fuzhou 350108, China.
| | - Cuilian Wen
- Multiscale Computational Materials Facility & Materials Genome Institute, School of Materials Science and Engineering, Fuzhou University, Fuzhou 350108, China.
| | - Xiangshui Miao
- School of Integrated Circuits, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Zhimei Sun
- School of Materials Science and Engineering, and Center for Integrated Computational Materials Science, International Research Institute for Multidisciplinary Science, Beihang University, Beijing 100191, P. R. China.
| |
Collapse
|
25
|
Li H, Wan B, Fang Y, Li Q, Liu JK, An L. An FPGA implementation of Bayesian inference with spiking neural networks. Front Neurosci 2024; 17:1291051. [PMID: 38249589 PMCID: PMC10796689 DOI: 10.3389/fnins.2023.1291051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Accepted: 12/06/2023] [Indexed: 01/23/2024] Open
Abstract
Spiking neural networks (SNNs), as brain-inspired neural network models based on spikes, have the advantage of processing information with low complexity and efficient energy consumption. Currently, there is a growing trend to design hardware accelerators for dedicated SNNs to overcome the limitation of running under the traditional von Neumann architecture. Probabilistic sampling is an effective modeling approach for implementing SNNs to simulate the brain to achieve Bayesian inference. However, sampling consumes considerable time. It is highly demanding for specific hardware implementation of SNN sampling models to accelerate inference operations. Hereby, we design a hardware accelerator based on FPGA to speed up the execution of SNN algorithms by parallelization. We use streaming pipelining and array partitioning operations to achieve model operation acceleration with the least possible resource consumption, and combine the Python productivity for Zynq (PYNQ) framework to implement the model migration to the FPGA while increasing the speed of model operations. We verify the functionality and performance of the hardware architecture on the Xilinx Zynq ZCU104. The experimental results show that the hardware accelerator of the SNN sampling model proposed can significantly improve the computing speed while ensuring the accuracy of inference. In addition, Bayesian inference for spiking neural networks through the PYNQ framework can fully optimize the high performance and low power consumption of FPGAs in embedded applications. Taken together, our proposed FPGA implementation of Bayesian inference with SNNs has great potential for a wide range of applications, it can be ideal for implementing complex probabilistic model inference in embedded systems.
Collapse
Affiliation(s)
- Haoran Li
- Guangzhou Institute of Technology, Xidian University, Guangzhou, China
| | - Bo Wan
- School of Computer Science and Technology, Xidian University, Xi'an, China
- Key Laboratory of Smart Human Computer Interaction and Wearable Technology of Shaanxi Province, Xi'an, China
| | - Ying Fang
- College of Computer and Cyber Security, Fujian Normal University, Fuzhou, China
- Digital Fujian Internet-of-Thing Laboratory of Environmental Monitoring, Fujian Normal University, Fuzhou, China
| | - Qifeng Li
- Research Center of Information Technology, Beijing Academy of Agriculture and Forestry Sciences, National Engineering Research Center for Information Technology in Agriculture, Beijing, China
| | - Jian K. Liu
- School of Computer Science, University of Birmingham, Birmingham, United Kingdom
| | - Lingling An
- Guangzhou Institute of Technology, Xidian University, Guangzhou, China
- School of Computer Science and Technology, Xidian University, Xi'an, China
| |
Collapse
|
26
|
Konya A, Nematzadeh P. Recent applications of AI to environmental disciplines: A review. THE SCIENCE OF THE TOTAL ENVIRONMENT 2024; 906:167705. [PMID: 37820816 DOI: 10.1016/j.scitotenv.2023.167705] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/13/2023] [Revised: 10/06/2023] [Accepted: 10/07/2023] [Indexed: 10/13/2023]
Abstract
The rapid development and efficiency of Artificial Intelligence (AI) tools have made them increasingly popular in various fields and research domains. The environmental discipline is now experiencing an exponential interest in harnessing the potential of AI over the past decade. We have reviewed the latest applications of AI tools in the environmental disciplines, highlighting the opportunities they present and discussing their advantages and disadvantages in this field. After the emergence of deep learning algorithms in 2010, interest in using AI tools for environmental tasks has grown exponentially. Among the studied articles, over 65 % of environmental tasks that demonstrate interest in using AI tools initially relied on conventional statistical and mathematical models. Using AI tools can greatly benefit the areas of environmental science and engineering. One of the main advantages of utilizing AI tools is their ability to analyze and process large amounts of data efficiently. Recently, the European Union established a European supercomputing ecosystem program to advance science and enhance the quality of life for its citizens. Nine of these projects prioritize environmental and sustainable goals. Despite the benefits of AI, it is still in its early stages of development, which comes with environmental concerns. The amount of power consumed and the time required to train an AI model can greatly affect the carbon emissions it produces, exacerbating the challenges posed by climate change. Efforts are currently underway to develop AI technology that is environmentally sustainable, minimizes energy consumption, and has a low carbon footprint. Selecting the appropriate AI model architecture can reduce energy consumption by almost 90 %. The main finding suggests that collaboration between environmental and AI professionals becomes crucial in leveraging the full potential of AI in addressing pressing environmental challenges.
Collapse
Affiliation(s)
- Aniko Konya
- University of Illinois, Chicago, IL 60637, USA.
| | | |
Collapse
|
27
|
Manukian M, Bahdasariants S, Yakovenko S. Artificial physics engine for real-time inverse dynamics of arm and hand movement. PLoS One 2023; 18:e0295750. [PMID: 38091328 PMCID: PMC10718432 DOI: 10.1371/journal.pone.0295750] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 11/28/2023] [Indexed: 12/18/2023] Open
Abstract
Simulating human body dynamics requires detailed and accurate mathematical models. When solved inversely, these models provide a comprehensive description of force generation that considers subject morphology and can be applied to control real-time assistive technology, for example, orthosis or muscle/nerve stimulation. Yet, model complexity hinders the speed of its computations and may require approximations as a mitigation strategy. Here, we use machine learning algorithms to provide a method for accurate physics simulations and subject-specific parameterization. Several types of artificial neural networks (ANNs) with varied architecture were tasked to generate the inverse dynamic transformation of realistic arm and hand movement (23 degrees of freedom). Using a physical model, we generated representative limb movements with bell-shaped end-point velocity trajectories within the physiological workspace. This dataset was used to develop ANN transformations with low torque errors (less than 0.1 Nm). Multiple ANN implementations using kinematic sequences solved accurately and robustly the high-dimensional kinematic Jacobian and inverse dynamics of arm and hand. These results provide further support for the use of ANN architectures that use temporal trajectories of time-delayed values to make accurate predictions of limb dynamics.
Collapse
Affiliation(s)
- Mykhailo Manukian
- Faculty of Applied Science, Ukrainian Catholic University, Lviv, Ukraine
| | - Serhii Bahdasariants
- Department of Human Performance—Pathophysiology, Rehabilitation, and Performance, School of Medicine, West Virginia University, Morgantown, West Virginia, United States of America
| | - Sergiy Yakovenko
- Department of Human Performance—Pathophysiology, Rehabilitation, and Performance, School of Medicine, West Virginia University, Morgantown, West Virginia, United States of America
- Department of Neuroscience, School of Medicine, West Virginia University, Morgantown, West Virginia, United States of America
- Rockefeller Neuroscience Institute, School of Medicine, West Virginia University, Morgantown, West Virginia, United States of America
- Mechanical and Aerospace Engineering, Benjamin M. Statler College of Engineering and Mineral Resources, West Virginia University, Morgantown, West Virginia, United States of America
- Department of Biomedical Engineering, Benjamin M. Statler College of Engineering and Mineral Resources, West Virginia University, Morgantown, West Virginia, United States of America
| |
Collapse
|
28
|
Schilling A, Sedley W, Gerum R, Metzner C, Tziridis K, Maier A, Schulze H, Zeng FG, Friston KJ, Krauss P. Predictive coding and stochastic resonance as fundamental principles of auditory phantom perception. Brain 2023; 146:4809-4825. [PMID: 37503725 PMCID: PMC10690027 DOI: 10.1093/brain/awad255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2022] [Revised: 06/27/2023] [Accepted: 07/15/2023] [Indexed: 07/29/2023] Open
Abstract
Mechanistic insight is achieved only when experiments are employed to test formal or computational models. Furthermore, in analogy to lesion studies, phantom perception may serve as a vehicle to understand the fundamental processing principles underlying healthy auditory perception. With a special focus on tinnitus-as the prime example of auditory phantom perception-we review recent work at the intersection of artificial intelligence, psychology and neuroscience. In particular, we discuss why everyone with tinnitus suffers from (at least hidden) hearing loss, but not everyone with hearing loss suffers from tinnitus. We argue that intrinsic neural noise is generated and amplified along the auditory pathway as a compensatory mechanism to restore normal hearing based on adaptive stochastic resonance. The neural noise increase can then be misinterpreted as auditory input and perceived as tinnitus. This mechanism can be formalized in the Bayesian brain framework, where the percept (posterior) assimilates a prior prediction (brain's expectations) and likelihood (bottom-up neural signal). A higher mean and lower variance (i.e. enhanced precision) of the likelihood shifts the posterior, evincing a misinterpretation of sensory evidence, which may be further confounded by plastic changes in the brain that underwrite prior predictions. Hence, two fundamental processing principles provide the most explanatory power for the emergence of auditory phantom perceptions: predictive coding as a top-down and adaptive stochastic resonance as a complementary bottom-up mechanism. We conclude that both principles also play a crucial role in healthy auditory perception. Finally, in the context of neuroscience-inspired artificial intelligence, both processing principles may serve to improve contemporary machine learning techniques.
Collapse
Affiliation(s)
- Achim Schilling
- Neuroscience Lab, University Hospital Erlangen, 91054 Erlangen, Germany
- Cognitive Computational Neuroscience Group, University Erlangen-Nürnberg, 91058 Erlangen, Germany
| | - William Sedley
- Translational and Clinical Research Institute, Newcastle University Medical School, Newcastle upon Tyne NE2 4HH, UK
| | - Richard Gerum
- Cognitive Computational Neuroscience Group, University Erlangen-Nürnberg, 91058 Erlangen, Germany
- Department of Physics and Astronomy and Center for Vision Research, York University, Toronto, ON M3J 1P3, Canada
| | - Claus Metzner
- Neuroscience Lab, University Hospital Erlangen, 91054 Erlangen, Germany
| | | | - Andreas Maier
- Pattern Recognition Lab, University Erlangen-Nürnberg, 91058 Erlangen, Germany
| | - Holger Schulze
- Neuroscience Lab, University Hospital Erlangen, 91054 Erlangen, Germany
| | - Fan-Gang Zeng
- Center for Hearing Research, Departments of Anatomy and Neurobiology, Biomedical Engineering, Cognitive Sciences, Otolaryngology–Head and Neck Surgery, University of California Irvine, Irvine, CA 92697, USA
| | - Karl J Friston
- Wellcome Centre for Human Neuroimaging, Institute of Neurology, University College London, London WC1N 3AR, UK
| | - Patrick Krauss
- Neuroscience Lab, University Hospital Erlangen, 91054 Erlangen, Germany
- Cognitive Computational Neuroscience Group, University Erlangen-Nürnberg, 91058 Erlangen, Germany
- Pattern Recognition Lab, University Erlangen-Nürnberg, 91058 Erlangen, Germany
| |
Collapse
|
29
|
Chakraborty C, Bhattacharya M, Islam MA, Agoramoorthy G. ChatGPT indicates the path and initiates the research to open up the black box of artificial intelligence. Int J Surg 2023; 109:4367-4368. [PMID: 37830950 PMCID: PMC10720827 DOI: 10.1097/js9.0000000000000701] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2023] [Accepted: 08/12/2023] [Indexed: 10/14/2023]
Affiliation(s)
- Chiranjib Chakraborty
- Department of Biotechnology, School of Life Science and Biotechnology, Adamas University, Kolkata, West Bengal
| | | | - Md. Aminul Islam
- Advanced Molecular Lab, Department of Microbiology, President Abdul Hamid Medical College, Karimganj, Kishoreganj
- College of Pharmacy and Health Care, Tajen University, Yanpu, Pingtung, Taiwan
| | - Govindasamy Agoramoorthy
- Honeybee Population Health Foundation, Chennai, India
- College of Pharmacy and Health Care, Tajen University, Yanpu, Pingtung, Taiwan
| |
Collapse
|
30
|
Wang Q, Sun T, Li R. Does artificial intelligence (AI) reduce ecological footprint? The role of globalization. ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH INTERNATIONAL 2023; 30:123948-123965. [PMID: 37995036 DOI: 10.1007/s11356-023-31076-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 11/13/2023] [Indexed: 11/24/2023]
Abstract
This article explores the impact of artificial intelligence (AI) on global ecological footprints, which has important implications for global sustainability in the digital age. Using the comprehensive evaluation index of AI constructed by the entropy method and the dataset at the global national level, we find that from 2010 to 2019, the overall level of global AI shows an upward trend, in which the growth rate of AI in developed countries is more pronounced and exhibits a stable growth trend, while the growth rate of AI in developing countries displays a trend of instability. The research results show that AI has a significant inhibitory effect on ecological footprints. This conclusion holds even after endogeneity and robustness tests. In addition, under the effect of globalization, the impact of AI on ecological footprints shows nonlinear characteristics. As globalization deepens, the marginal effect of AI in reducing the ecological footprint shows an increasing trend. These findings emphasize the important role of AI in environmental governance and provide a new and comprehensive perspective for policymakers. Therefore, the government should continue to support the research and application of AI, promote the cross-industry integration of AI, and play a positive role in the process of globalization to promote global sustainable development.
Collapse
Affiliation(s)
- Qiang Wang
- School of Economics and Management, China University of Petroleum (East China), Qingdao, 266580, People's Republic of China.
- School of Economics and Management, Xinjiang University, Wulumuqi, 830046, People's Republic of China.
| | - Tingting Sun
- School of Economics and Management, China University of Petroleum (East China), Qingdao, 266580, People's Republic of China
| | - Rongrong Li
- School of Economics and Management, China University of Petroleum (East China), Qingdao, 266580, People's Republic of China
- School of Economics and Management, Xinjiang University, Wulumuqi, 830046, People's Republic of China
| |
Collapse
|
31
|
Bardal M, Chalmers E. Four attributes of intelligence, a thousand questions. BIOLOGICAL CYBERNETICS 2023; 117:407-409. [PMID: 38059989 DOI: 10.1007/s00422-023-00979-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/07/2023] [Accepted: 11/08/2023] [Indexed: 12/08/2023]
Abstract
Jeff Hawkins is one of those rare individuals who speaks the languages of both AI and neuroscience. In his recent book, "A Thousand Brains: A New Theory of Intelligence", Hawkins proposes that current learning algorithms lack four attributes which will be necessary for true machine intelligence. Here we demonstrate that a minimal learning system which satisfies all four points can be constructed using only simple, classical machine learning techniques. We illustrate that such a system falls short of biological intelligence in some important ways. We suggest that Hawkins' list is a useful model, but the "recipe" for true intelligence-if there is one-may not be so easily defined.
Collapse
Affiliation(s)
- Matthieu Bardal
- Department of Mathematics and Computing, Mount Royal University, 4825 Mt Royal Gate SW, Calgary, AB, T3E6K6, Canada
| | - Eric Chalmers
- Department of Mathematics and Computing, Mount Royal University, 4825 Mt Royal Gate SW, Calgary, AB, T3E6K6, Canada.
| |
Collapse
|
32
|
Aru J, Larkum ME, Shine JM. The feasibility of artificial consciousness through the lens of neuroscience. Trends Neurosci 2023; 46:1008-1017. [PMID: 37863713 DOI: 10.1016/j.tins.2023.09.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Revised: 08/23/2023] [Accepted: 09/27/2023] [Indexed: 10/22/2023]
Abstract
Interactions with large language models (LLMs) have led to the suggestion that these models may soon be conscious. From the perspective of neuroscience, this position is difficult to defend. For one, the inputs to LLMs lack the embodied, embedded information content characteristic of our sensory contact with the world around us. Secondly, the architectures of present-day artificial intelligence algorithms are missing key features of the thalamocortical system that have been linked to conscious awareness in mammals. Finally, the evolutionary and developmental trajectories that led to the emergence of living conscious organisms arguably have no parallels in artificial systems as envisioned today. The existence of living organisms depends on their actions and their survival is intricately linked to multi-level cellular, inter-cellular, and organismal processes culminating in agency and consciousness.
Collapse
Affiliation(s)
- Jaan Aru
- Institute of Computer Science, University of Tartu, Tartu, Estonia.
| | - Matthew E Larkum
- Institute of Biology, Humboldt University Berlin, Berlin, Germany.
| | - James M Shine
- Brain and Mind Center, The University of Sydney, Sydney, Australia.
| |
Collapse
|
33
|
Mah A, Schiereck SS, Bossio V, Constantinople CM. Distinct value computations support rapid sequential decisions. Nat Commun 2023; 14:7573. [PMID: 37989741 PMCID: PMC10663503 DOI: 10.1038/s41467-023-43250-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 11/03/2023] [Indexed: 11/23/2023] Open
Abstract
The value of the environment determines animals' motivational states and sets expectations for error-based learning1-3. How are values computed? Reinforcement learning systems can store or cache values of states or actions that are learned from experience, or they can compute values using a model of the environment to simulate possible futures3. These value computations have distinct trade-offs, and a central question is how neural systems decide which computations to use or whether/how to combine them4-8. Here we show that rats use distinct value computations for sequential decisions within single trials. We used high-throughput training to collect statistically powerful datasets from 291 rats performing a temporal wagering task with hidden reward states. Rats adjusted how quickly they initiated trials and how long they waited for rewards across states, balancing effort and time costs against expected rewards. Statistical modeling revealed that animals computed the value of the environment differently when initiating trials versus when deciding how long to wait for rewards, even though these decisions were only seconds apart. Moreover, value estimates interacted via a dynamic learning rate. Our results reveal how distinct value computations interact on rapid timescales, and demonstrate the power of using high-throughput training to understand rich, cognitive behaviors.
Collapse
Affiliation(s)
- Andrew Mah
- Center for Neural Science, New York University, New York, NY, 10003, USA
| | | | - Veronica Bossio
- Center for Neural Science, New York University, New York, NY, 10003, USA
- Zuckerman Institute, Columbia University, New York, NY, 10027, USA
| | | |
Collapse
|
34
|
Vafaii H, Yates JL, Butts DA. Hierarchical VAEs provide a normative account of motion processing in the primate brain. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.27.559646. [PMID: 37808629 PMCID: PMC10557690 DOI: 10.1101/2023.09.27.559646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/10/2023]
Abstract
The relationship between perception and inference, as postulated by Helmholtz in the 19th century, is paralleled in modern machine learning by generative models like Variational Autoencoders (VAEs) and their hierarchical variants. Here, we evaluate the role of hierarchical inference and its alignment with brain function in the domain of motion perception. We first introduce a novel synthetic data framework, Retinal Optic Flow Learning (ROFL), which enables control over motion statistics and their causes. We then present a new hierarchical VAE and test it against alternative models on two downstream tasks: (i) predicting ground truth causes of retinal optic flow (e.g., self-motion); and (ii) predicting the responses of neurons in the motion processing pathway of primates. We manipulate the model architectures (hierarchical versus non-hierarchical), loss functions, and the causal structure of the motion stimuli. We find that hierarchical latent structure in the model leads to several improvements. First, it improves the linear decodability of ground truth factors and does so in a sparse and disentangled manner. Second, our hierarchical VAE outperforms previous state-of-the-art models in predicting neuronal responses and exhibits sparse latent-to-neuron relationships. These results depend on the causal structure of the world, indicating that alignment between brains and artificial neural networks depends not only on architecture but also on matching ecologically relevant stimulus statistics. Taken together, our results suggest that hierarchical Bayesian inference underlines the brain's understanding of the world, and hierarchical VAEs can effectively model this understanding.
Collapse
|
35
|
Rouleau N, Levin M. The Multiple Realizability of Sentience in Living Systems and Beyond. eNeuro 2023; 10:ENEURO.0375-23.2023. [PMID: 37963652 PMCID: PMC10646883 DOI: 10.1523/eneuro.0375-23.2023] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2023] [Accepted: 10/23/2023] [Indexed: 11/16/2023] Open
Affiliation(s)
- Nicolas Rouleau
- Department of Health Sciences, Wilfrid Laurier University, Waterloo, Ontario N2L 3C5, Canada
- Department of Biomedical Engineering, Tufts University, Medford, MA 02155
- Allen Discovery Center at, Tufts University, Medford, MA 02155
| | - Michael Levin
- Allen Discovery Center at, Tufts University, Medford, MA 02155
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02215
| |
Collapse
|
36
|
Chakraborty C, Pal S, Bhattacharya M, Dash S, Lee SS. Overview of Chatbots with special emphasis on artificial intelligence-enabled ChatGPT in medical science. Front Artif Intell 2023; 6:1237704. [PMID: 38028668 PMCID: PMC10644239 DOI: 10.3389/frai.2023.1237704] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 10/05/2023] [Indexed: 12/01/2023] Open
Abstract
The release of ChatGPT has initiated new thinking about AI-based Chatbot and its application and has drawn huge public attention worldwide. Researchers and doctors have started thinking about the promise and application of AI-related large language models in medicine during the past few months. Here, the comprehensive review highlighted the overview of Chatbot and ChatGPT and their current role in medicine. Firstly, the general idea of Chatbots, their evolution, architecture, and medical use are discussed. Secondly, ChatGPT is discussed with special emphasis of its application in medicine, architecture and training methods, medical diagnosis and treatment, research ethical issues, and a comparison of ChatGPT with other NLP models are illustrated. The article also discussed the limitations and prospects of ChatGPT. In the future, these large language models and ChatGPT will have immense promise in healthcare. However, more research is needed in this direction.
Collapse
Affiliation(s)
- Chiranjib Chakraborty
- Department of Biotechnology, School of Life Science and Biotechnology, Adamas University, Kolkata, West Bengal, India
| | - Soumen Pal
- School of Mechanical Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | | | - Snehasish Dash
- School of Mechanical Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu, India
| | - Sang-Soo Lee
- Institute for Skeletal Aging and Orthopedic Surgery, Hallym University Chuncheon Sacred Heart Hospital, Chuncheon-si, Gangwon-do, Republic of Korea
| |
Collapse
|
37
|
Nayebi A, Rajalingham R, Jazayeri M, Yang GR. Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes. ARXIV 2023:arXiv:2305.11772v2. [PMID: 37292459 PMCID: PMC10246064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Humans and animals have a rich and flexible understanding of the physical world, which enables them to infer the underlying dynamical trajectories of objects and events, plausible future states, and use that to plan and anticipate the consequences of actions. However, the neural mechanisms underlying these computations are unclear. We combine a goal-driven modeling approach with dense neurophysiological data and high-throughput human behavioral readouts that contain thousands of comparisons to directly impinge on this question. Specifically, we construct and evaluate several classes of sensory-cognitive networks to predict the future state of rich, ethologically-relevant environments, ranging from self-supervised end-to-end models with pixel-wise or object-slot objectives, to models that future predict in the latent space of purely static image-pretrained or dynamic video-pretrained foundation models. We find that "scale is not all you need", and that many state-of-the-art machine learning models fail to perform well on our neural and behavioral benchmarks for future prediction. In fact, only one class of models matches these data well overall. We find that neural responses are currently best predicted by models trained to predict the future state of their environment in the latent space of pretrained foundation models optimized for dynamic scenes in a self-supervised manner. These models also approach the neurons' ability to predict the environmental state variables that are visually hidden from view, despite not being explicitly trained to do so. Finally, we find that not all foundation model latents are equal. Notably, models that future predict in the latent space of video foundation models that are optimized to support a diverse range of egocentric sensorimotor tasks, reasonably match both human behavioral error patterns and neural dynamics across all environmental scenarios that we were able to test. Overall, these findings suggest that the neural mechanisms and behaviors of primate mental simulation have strong inductive biases associated with them, and are thus far most consistent with being optimized to future predict on reusable visual representations that are useful for Embodied AI more generally.
Collapse
Affiliation(s)
- Aran Nayebi
- McGovern Institute for Brain Research, MIT; Cambridge, MA 02139
| | - Rishi Rajalingham
- McGovern Institute for Brain Research, MIT; Cambridge, MA 02139
- Reality Labs, Meta; 390 9th Ave, New York, NY 10001
| | - Mehrdad Jazayeri
- McGovern Institute for Brain Research, MIT; Cambridge, MA 02139
- Department of Brain and Cognitive Sciences, MIT; Cambridge, MA 02139
| | - Guangyu Robert Yang
- McGovern Institute for Brain Research, MIT; Cambridge, MA 02139
- Department of Brain and Cognitive Sciences, MIT; Cambridge, MA 02139
- Department of Electrical Engineering and Computer Science, MIT; Cambridge, MA 02139
| |
Collapse
|
38
|
Pham TQ, Matsui T, Chikazoe J. Evaluation of the Hierarchical Correspondence between the Human Brain and Artificial Neural Networks: A Review. BIOLOGY 2023; 12:1330. [PMID: 37887040 PMCID: PMC10604784 DOI: 10.3390/biology12101330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Revised: 09/22/2023] [Accepted: 10/10/2023] [Indexed: 10/28/2023]
Abstract
Artificial neural networks (ANNs) that are heavily inspired by the human brain now achieve human-level performance across multiple task domains. ANNs have thus drawn attention in neuroscience, raising the possibility of providing a framework for understanding the information encoded in the human brain. However, the correspondence between ANNs and the brain cannot be measured directly. They differ in outputs and substrates, neurons vastly outnumber their ANN analogs (i.e., nodes), and the key algorithm responsible for most of modern ANN training (i.e., backpropagation) is likely absent from the brain. Neuroscientists have thus taken a variety of approaches to examine the similarity between the brain and ANNs at multiple levels of their information hierarchy. This review provides an overview of the currently available approaches and their limitations for evaluating brain-ANN correspondence.
Collapse
Affiliation(s)
| | - Teppei Matsui
- Graduate School of Brain Science, Doshisha University, Kyoto 610-0321, Japan
| | | |
Collapse
|
39
|
Szorkovszky A, Veenstra F, Glette K. From real-time adaptation to social learning in robot ecosystems. Front Robot AI 2023; 10:1232708. [PMID: 37860631 PMCID: PMC10584317 DOI: 10.3389/frobt.2023.1232708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Accepted: 08/18/2023] [Indexed: 10/21/2023] Open
Abstract
While evolutionary robotics can create novel morphologies and controllers that are well-adapted to their environments, learning is still the most efficient way to adapt to changes that occur on shorter time scales. Learning proposals for evolving robots to date have focused on new individuals either learning a controller from scratch, or building on the experience of direct ancestors and/or robots with similar configurations. Here we propose and demonstrate a novel means for social learning of gait patterns, based on sensorimotor synchronization. Using movement patterns of other robots as input can drive nonlinear decentralized controllers such as CPGs into new limit cycles, hence encouraging diversity of movement patterns. Stable autonomous controllers can then be locked in, which we demonstrate using a quasi-Hebbian feedback scheme. We propose that in an ecosystem of robots evolving in a heterogeneous environment, such a scheme may allow for the emergence of generalist task-solvers from a population of specialists.
Collapse
Affiliation(s)
- Alex Szorkovszky
- RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Oslo, Norway
- Department of Informatics, University of Oslo, Oslo, Norway
| | - Frank Veenstra
- RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Oslo, Norway
- Department of Informatics, University of Oslo, Oslo, Norway
| | - Kyrre Glette
- RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Oslo, Norway
- Department of Informatics, University of Oslo, Oslo, Norway
| |
Collapse
|
40
|
Abstract
The current gap between computing algorithms and neuromorphic hardware to emulate brains is an outstanding bottleneck in developing neural computing technologies. Aimone and Parekh discuss the possibility of bridging this gap using theoretical computing frameworks from a neuroscience perspective.
Collapse
Affiliation(s)
- James B Aimone
- Neural Exploration and Research Laboratory, Center for Computing Research, Sandia National Laboratories, Albuquerque, NM, USA.
| | - Ojas Parekh
- Neural Exploration and Research Laboratory, Center for Computing Research, Sandia National Laboratories, Albuquerque, NM, USA.
| |
Collapse
|
41
|
Pagkalos M, Makarov R, Poirazi P. Leveraging dendritic properties to advance machine learning and neuro-inspired computing. ARXIV 2023:arXiv:2306.08007v1. [PMID: 37396619 PMCID: PMC10312913] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
The brain is a remarkably capable and efficient system. It can process and store huge amounts of noisy and unstructured information using minimal energy. In contrast, current artificial intelligence (AI) systems require vast resources for training while still struggling to compete in tasks that are trivial for biological agents. Thus, brain-inspired engineering has emerged as a promising new avenue for designing sustainable, next-generation AI systems. Here, we describe how dendritic mechanisms of biological neurons have inspired innovative solutions for significant AI problems, including credit assignment in multilayer networks, catastrophic forgetting, and high energy consumption. These findings provide exciting alternatives to existing architectures, showing how dendritic research can pave the way for building more powerful and energy-efficient artificial learning systems.
Collapse
Affiliation(s)
- Michalis Pagkalos
- Institute of Molecular Biology and Biotechnology (IMBB), Foundation for Research and Technology Hellas (FORTH), Heraklion, 70013, Greece
- Department of Biology, University of Crete, Heraklion, 70013, Greece
| | - Roman Makarov
- Institute of Molecular Biology and Biotechnology (IMBB), Foundation for Research and Technology Hellas (FORTH), Heraklion, 70013, Greece
- Department of Biology, University of Crete, Heraklion, 70013, Greece
| | - Panayiota Poirazi
- Institute of Molecular Biology and Biotechnology (IMBB), Foundation for Research and Technology Hellas (FORTH), Heraklion, 70013, Greece
| |
Collapse
|
42
|
Malakasis N, Chavlis S, Poirazi P. Synaptic turnover promotes efficient learning in bio-realistic spiking neural networks. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.22.541722. [PMID: 37292929 PMCID: PMC10245885 DOI: 10.1101/2023.05.22.541722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
While artificial machine learning systems achieve superhuman performance in specific tasks such as language processing, image and video recognition, they do so use extremely large datasets and huge amounts of power. On the other hand, the brain remains superior in several cognitively challenging tasks while operating with the energy of a small lightbulb. We use a biologically constrained spiking neural network model to explore how the neural tissue achieves such high efficiency and assess its learning capacity on discrimination tasks. We found that synaptic turnover, a form of structural plasticity, which is the ability of the brain to form and eliminate synapses continuously, increases both the speed and the performance of our network on all tasks tested. Moreover, it allows accurate learning using a smaller number of examples. Importantly, these improvements are most significant under conditions of resource scarcity, such as when the number of trainable parameters is halved and when the task difficulty is increased. Our findings provide new insights into the mechanisms that underlie efficient learning in the brain and can inspire the development of more efficient and flexible machine learning algorithms.
Collapse
Affiliation(s)
- Nikos Malakasis
- School of Medicine, University of Crete, Heraklion 70013, Greece
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology-Hellas, Heraklion 70013, Greece
| | - Spyridon Chavlis
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology-Hellas, Heraklion 70013, Greece
| | - Panayiota Poirazi
- Institute of Molecular Biology and Biotechnology, Foundation for Research and Technology-Hellas, Heraklion 70013, Greece
| |
Collapse
|