高级检索

类脑持续学习进展与趋势

Progress and Trends of Brain-Inspired Continual Learning

  • 摘要: 持续学习致力于使人工智能系统在不断掌握新任务与知识的同时,有效避免对已学内容的灾难性遗忘,从而在可塑性与稳定性之间取得平衡。当前,基于反向传播的传统神经网络通常依赖全局参数更新,在非平稳数据流中容易发生灾难性遗忘。近年来,大模型通过检索增强、参数高效微调、混合专家系统和超长上下文等机制,展现出一种“软持续学习”特性,即在上下文学习和外部记忆检索方面表现出与传统持续学习方法不同的优势。然而,这类方法仍面临超长上下文资源消耗大、内部预训练参数在持续训练过程中产生冲突和遗忘等问题。与此同时,类脑持续学习方法受到广泛关注,其借鉴了突触局部学习机制、人脑不同功能分区与核团结构,以及分级记忆系统(如短时/长时记忆、情景/语义记忆)之间的协同作用,以期实现类似人类与现实世界长期、持续交互与学习的能力。本文围绕以下3方面展开论述:(1)灾难性遗忘的产生根源、量化评估指标、经典持续学习方法及当前大模型在持续学习中的研究进展;(2)人类持续学习的行为特征、人脑持续学习的神经机制,以及脑启发算法在持续学习中的应用,如基于互补学习系统理论的海马-新皮层长短时记忆交互、记忆巩固与重放等机制;(3)当前持续学习领域面临的开放挑战与未来评价体系的建议。

     

    Abstract: Continual learning aims to enable artificial intelligence systems to continuously master new tasks and knowledge while effectively avoiding catastrophic forgetting of previously learned content, thereby striking a balance between plasticity and stability. Currently, neural networks based on backpropagation typically rely on global parameter updates, making them prone to catastrophic forgetting in non-stationary data streams. In recent years, large models have demonstrated a form of “soft continual learning” through mechanisms such as retrieval augmentation, parameter-efficient fine-tuning, mixture of experts, and extended long contexts, showcasing advantages over traditional continual learning methods in terms of in-context learning and external memory retrieval. However, such methods still face challenges, including the high resource consumption of processing ultra-long contexts and the conflicts and forgetting that arise within internal pre-trained parameters during continual training. Meanwhile, brain-inspired continual learning methods have attracted considerable attention. These methods draw on mechanisms such as local synaptic plasticity, the distinct functional areas and nuclear structures of the human brain, and the synergistic interaction between hierarchical memory systems (e.g., short-term/long-term memory, episodic/semantic memory), aiming to achieve capabilities akin to humans’ long-term, sustained interaction and learning with the real world. This paper discusses the following 3 aspects: (1) the root causes of catastrophic forgetting, quantitative evaluation metrics, classical continual learning methods, and current research progress in continual learning for large models; (2) the behavioral characteristics of human continual learning, the neural mechanisms underlying human continual learning, and the application of brain-inspired algorithms in continual learning, including the interaction between the hippocampus and neocortex in short-term/long-term memory based on the complementary learning systems theory, as well as mechanisms such as memory consolidation and replay; (3) the open challenges currently facing the field of continual learning, along with suggestions for future evaluation frameworks.

     

/

返回文章
返回