Entity Linking among Categorized Knowledge Resources for Computer Science Curricula

Authors

  • Michiko Yasukawa Gunma University
  • Koichi Yamazaki Tokyo Denki University

DOI:

https://doi.org/10.52731/lir.v003.152

Abstract

In educational information processing, both humans and computers require more effective information access than ever. However, there is one serious issue in text processing in higher education. The quantity of digitized text resources is not sufficient to develop large language models in specialized areas. General purpose language models often create unreliable results when the training data do not cover the target domain. To deal with this problem, our proposed method is designed to establish links between lecture course information and entities for real world knowledge. Owing to this entity linking, the meaning of text data can be more easily understood by humans and computers. Our method employs curriculum standards and university syllabus information to associate important keywords with text entities defined in Wikipedia articles. The results of evaluation experiments suggest the effectiveness of the proposed method in feature selection and entity linking for educational information. The contributions of this study include detailed comparison among dictionaries in Japanese morphological analysis. Our findings are expected to provide useful insights for researchers engaging in educational data analysis.

References

I. Yamada, A. Asai, H. Shindo, H. Takeda, and Y. Matsumoto, “LUKE: Deep Contextualized

Entity Representations with Entity-aware Self-attention,” in Proceedings

of the 2020 Conference on Empirical Methods in Natural Language Processing

(EMNLP). ACM, 2020, pp. 6442–6454.

M. Yasukawa and K. Yamazaki, “Feature Selection by Thematic and Temporal Distinction

in Research Grant Applications,” IIAI Letters on Institutional Research, vol.

, no. LIR019, pp. 1–13, 2022.

G. K. Zipf, Human Behavior and the Principle of Least Effort: An Introduction to

Human Ecology. Addison-Wesley, 1949.

C. D. Manning, P. Raghavan, and H. Sch¨utze, Introduction to Information Retrieval.

Cambridge University Press, 2008.

Information Processing Society of Japan (IPSJ), “Computing Curriculum Standard

J17,” 2018. [Online]. Available: https://www.ipsj.or.jp/annai/committee/education/

j07/curriculum j17.html

T. Kudo, “MeCab: Yet Another Part-of-Speech and Morphological Analyzer.”

[Online]. Available: https://taku910.github.io/mecab/

M. Asahara and Y. Matsumoto, “ipadic version 2.7.0 User’s Manual.” [Online].

Available: https://ja.osdn.net/projects/ipadic/

Center for Language Resource Development, NINJAL, “Electronic Dictionary with

Uniformity and Identity.” [Online]. Available: https://clrd.ninjal.ac.jp/unidic/

S. Toshinori, “Neologism dictionary based on the language resources on

the Web for Mecab,” 2015. [Online]. Available: https://github.com/neologd/

mecab-ipadic-neologd

C. Bishop, Pattern Recognition and Machine Learning. Springer, 2006.

A. Savitzky and M. J. Golay, “Smoothing and differentiation of data by simplified

least squares procedures.” Analytical chemistry, vol. 36, no. 8, pp. 1627–1639, 1964.

S. Aihara, M. Mori, S. Hirokawa, K. Kanekawa, and T. Sugihara, “Text Analysis to

the Preambles of the 4th Medium-term Goals / Plans of National University Corporations,”

IIAI Letters on Institutional Research, vol. 001, no. LIR024, pp. 1–6, 2022.

T. Tsumagari, N. Nakazato, and T. Tsumagari, “Student’s Interests and Career Understanding:

A Topic Analysis of First-year Career Courses,” IIAI Letters on Institutional

Research, vol. 001, no. LIR013, pp. 1–8, 2022.

T. Oishi, “What is the Essential Curriculum for IR in Japan?” IIAI Letters on Institutional

Research, vol. 001, no. LIR009, pp. 1–5, 2022.

Association for Computing Machinery (ACM), “Curricula Recommendations,” 2020.

[Online]. Available: https://www.acm.org/education/curricula-recommendations

C. Peng, F. Xia, M. Naseriparsa, and F. Osborne, “Knowledge Graphs: Opportunities

and Challenges,” Artificial Intelligence Review, pp. 1–32, 2023.

N. Kondo, T. Hatanaka, and T. Matsuda, “Evaluation of Predictive Models in Institutional

Research Based on Multi-Objective Optimization,” IIAI Letters on Institutional

Research, vol. 001, no. LIR018, pp. 1–9, 2022.

S. K. Mohamad and Z. Tasir, “Educational data mining: A review,” Procedia - Social

and Behavioral Sciences, vol. 97, pp. 320–324, 2013.

R. Asif, A. Merceron, S. A. Ali, and N. G. Haider, “Analyzing undergraduate students’

performance using educational data mining,” Computers & Education, vol. 113, pp.

–194, 2017.

E. B. Costa, B. Fonseca, M. A. Santana, F. F. de Ara´ujo, and J. Rego, “Evaluating

the effectiveness of educational data mining techniques for early prediction of students’

academic failure in introductory programming courses,” Computers in Human

Behavior, vol. 73, pp. 247–256, 2017.

C. Angeli, S. K. Howard, J. Ma, J. Yang, and P. A. Kirschner, “Data mining in educational

technology classroom research: Can it make a contribution?” Computers &

Education, vol. 113, pp. 226–242, 2017.

M. Misuraca, G. Scepi, and M. Spano, “Using Opinion Mining as an educational

analytic: An integrated strategy for the analysis of students ’feedback,” Studies in

Educational Evaluation, vol. 68, pp. 100 979.1–100 979.9, 2021.

J. Akoka, I. Comyn-Wattiau, N. Prat, and V. C. Storey, “Knowledge contributions in

design science research: Paths of knowledge types,” Decision Support Systems, vol.

, pp. 113 898.1–113 898.14, 2023.

A. A. Mukherjee, A. Raj, and S. Aggarwal, “Identification of barriers and their mitigation

strategies for industry 5.0 implementation in emerging economies,” International

Journal of Production Economics, vol. 257, pp. 108 770.1–108 770.15, 2023.

Downloads

Published

2023-08-30