TY - GEN
T1 - Constructing Low-Redundant and High-Accuracy Knowledge Graphs for Education
AU - Li, Qing
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023/5
Y1 - 2023/5
N2 - Motivated by the successful applications of commonsense knowledge graphs (KGs) and encyclopedia KGs, many KG-based applications have been developed in education, such as course content visualization and learning path/material recommendations. While KGs for education are often constructed manually, attempts have been made to leverage machine learning algorithms to extract triples from teaching materials. However, education-related KGs learned by existing algorithms contain significant amounts of redundancy and noise. It is because the entities and relations in teaching materials are often instructional, abstract, and implicit, while textbooks often contain detailed explanations, examples, and illustrations. Off-the-shelf KG construction algorithms are designed for concrete entities. To this end, we propose an effective framework to construct low-redundant and high-accuracy KGs for education. First, we design an ontology that is tailored for education. By choosing related Wikidata items, we construct an instructional entity set. We avoid using traditional methods such as named-entity recognition to extract entities from textbooks, aiming to reduce redundancy. Then, we add subtopic relations among our selected instructional entities based on the corresponding hierarchy in Wikidata, and form a backbone. Second, we design a machine reading comprehension model with pre-defined questions to extract other types of relations, such as equivalent to, applied to, and inventor of. Third, we apply active KG error detection to further refine the KG with minimal human effort. In the experiments, we take the artificial intelligence domain as an example and demonstrate the effectiveness of the proposed framework. Our KG achieves an accuracy of around 80% scored by domain experts.
AB - Motivated by the successful applications of commonsense knowledge graphs (KGs) and encyclopedia KGs, many KG-based applications have been developed in education, such as course content visualization and learning path/material recommendations. While KGs for education are often constructed manually, attempts have been made to leverage machine learning algorithms to extract triples from teaching materials. However, education-related KGs learned by existing algorithms contain significant amounts of redundancy and noise. It is because the entities and relations in teaching materials are often instructional, abstract, and implicit, while textbooks often contain detailed explanations, examples, and illustrations. Off-the-shelf KG construction algorithms are designed for concrete entities. To this end, we propose an effective framework to construct low-redundant and high-accuracy KGs for education. First, we design an ontology that is tailored for education. By choosing related Wikidata items, we construct an instructional entity set. We avoid using traditional methods such as named-entity recognition to extract entities from textbooks, aiming to reduce redundancy. Then, we add subtopic relations among our selected instructional entities based on the corresponding hierarchy in Wikidata, and form a backbone. Second, we design a machine reading comprehension model with pre-defined questions to extract other types of relations, such as equivalent to, applied to, and inventor of. Third, we apply active KG error detection to further refine the KG with minimal human effort. In the experiments, we take the artificial intelligence domain as an example and demonstrate the effectiveness of the proposed framework. Our KG achieves an accuracy of around 80% scored by domain experts.
KW - Educational knowledge graphs
KW - Knowledge graph construction
UR - http://www.scopus.com/inward/record.url?scp=85163398833&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-33023-0_13
DO - 10.1007/978-3-031-33023-0_13
M3 - Conference article published in proceeding or book
SN - 9783031330223
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 148
EP - 160
BT - Learning Technologies and Systems - 21st International Conference on Web-Based Learning, ICWL 2022, and 7th International Symposium on Emerging Technologies for Education, SETE 2022, Revised Selected Papers
A2 - González-González, Carina S.
A2 - Area-Moreira, Manuel
A2 - Fernández-Manjón, Baltasar
A2 - Li, Frederick
A2 - García-Peñalvo, Francisco José
A2 - Sciarrone, Filippo
A2 - Spaniol, Marc
A2 - García-Holgado, Alicia
A2 - Hemmje, Matthias
A2 - Hao, Tianyong
ER -