Multi-step Knowledge Retrieval and Inference over Unstructured Data

发布于 2024-06-27  2 次阅读


AI 摘要

该文章讨论了大语言模型(LLM)和生成型人工智能的出现如何革新了各个领域的自然语言应用。然而,在医学、法律和金融等领域的高风险决策任务中,需求的精确度、全面性和逻辑一致性要求纯LLM或检索增强产生(RAG)方法往往无法达到。在Elemental Cognition(EC),我们开发了一个神经符号人工智能平台来解决这些问题。该平台集成了经过微调的LLM用于知识提取和对齐,并与强大的符号推理引擎相结合,用于逻辑推断、规划和交互式约束求解。文章描述了在此类领域中固有的多步推理挑战,批评了现有基于LLM的方法的局限性,并演示了Cora的神经符号方法如何有效地解决了这些问题。文章概述了系统架构、知识提取和形式推理的关键算法,并介绍了初步评估结果,突显了Cora相对于知名的LLM和RAG基准的卓越表现。

[PDF] [Site] [Kimi]

The advent of Large Language Models (LLMs) and Generative AI has revolutionized natural language applications across various domains. However, high-stakes decision-making tasks in fields such as medical, legal and finance require a level of precision, comprehensiveness, and logical consistency that pure LLM or Retrieval-Augmented-Generation (RAG) approaches often fail to deliver. At Elemental Cognition (EC), we have developed a neuro-symbolic AI platform to tackle these problems. The platform integrates fine-tuned LLMs for knowledge extraction and alignment with a robust symbolic reasoning engine for logical inference, planning and interactive constraint solving. We describe Cora, a Collaborative Research Assistant built on this platform, that is designed to perform complex research and discovery tasks in high-stakes domains. This paper discusses the multi-step inference challenges inherent in such domains, critiques the limitations of existing LLM-based methods, and demonstrates how Cora's neuro-symbolic approach effectively addresses these issues. We provide an overview of the system architecture, key algorithms for knowledge extraction and formal reasoning, and present preliminary evaluation results that highlight Cora's superior performance compared to well-known LLM and RAG baselines.

Hello
最后更新于 2024-08-02