网络和人工智能技术,特别是语义网和知识图谱,最近在教育领域引起了极大关注。然而,从知识和数据的角度来看,用于K-12 教育的特定学科知识图谱仍然缺乏充分性和可持续性。为了解决这些问题,我们提出了一个异构的可持续K-12教育知识图(EDUKG)。我们首先设计了一个跨学科的细粒度本体,用于统一建模K-12 教育中的知识和资源,共定义了635 个类、445 个对象属性和1314 个数据类型属性。在本体的指导下,我们提出了一种灵活的方法,用于交互式地从教科书中提取事实知识。此外,我们还基于我们提出的通用实体链接系统建立了一套通用机制,用于EDUKG 的可持续维护,该机制可动态索引EDUKG 中具有知识主题的众多异构资源和数据。我们进一步评估了EDUKG,以说明其充分性、丰富性和可变性。我们发布的EDUKG 包含超过2.52 亿个实体和38.6 亿个三元组。
Web and artificial intelligence technologies, especially semantic web and knowledge graph (KG), have recently raised significant attention in educational scenarios. Nevertheless, subject-specific KGs for K-12 education still lack sufficiency and sustainability from knowledge and data perspectives. To tackle these issues, we propose EDUKG, a heterogeneous sustainable K-12 Educational Knowledge Graph. We first design an interdisciplinary and fine-grained ontology for uniformly modeling knowledge and resource in K-12 education, where we define 635 classes, 445 object properties, and 1314 datatype properties in total. Guided by this ontology, we propose a flexible methodology for interactively extracting factual knowledge from textbooks. Furthermore, we establish a general mechanism based on our proposed generalized entity linking system for EDUKG’s sustainable maintenance, which can dynamically index numerous heterogeneous resources and data with knowledge topics in EDUKG. We further evaluate EDUKG to illustrate its sufficiency, richness, and variability. We publish EDUKG with more than 252 million entities and 3.86 billion triplets.