Entity Linking among Categorized Knowledge Resources for Computer Science Curricula
DOI:
https://doi.org/10.52731/lir.v003.152Abstract
In educational information processing, both humans and computers require more effective information access than ever. However, there is one serious issue in text processing in higher education. The quantity of digitized text resources is not sufficient to develop large language models in specialized areas. General purpose language models often create unreliable results when the training data do not cover the target domain. To deal with this problem, our proposed method is designed to establish links between lecture course information and entities for real world knowledge. Owing to this entity linking, the meaning of text data can be more easily understood by humans and computers. Our method employs curriculum standards and university syllabus information to associate important keywords with text entities defined in Wikipedia articles. The results of evaluation experiments suggest the effectiveness of the proposed method in feature selection and entity linking for educational information. The contributions of this study include detailed comparison among dictionaries in Japanese morphological analysis. Our findings are expected to provide useful insights for researchers engaging in educational data analysis.
References
I. Yamada, A. Asai, H. Shindo, H. Takeda, and Y. Matsumoto, “LUKE: Deep Contextualized
Entity Representations with Entity-aware Self-attention,” in Proceedings
of the 2020 Conference on Empirical Methods in Natural Language Processing
(EMNLP). ACM, 2020, pp. 6442–6454.
M. Yasukawa and K. Yamazaki, “Feature Selection by Thematic and Temporal Distinction
in Research Grant Applications,” IIAI Letters on Institutional Research, vol.
, no. LIR019, pp. 1–13, 2022.
G. K. Zipf, Human Behavior and the Principle of Least Effort: An Introduction to
Human Ecology. Addison-Wesley, 1949.
C. D. Manning, P. Raghavan, and H. Sch¨utze, Introduction to Information Retrieval.
Cambridge University Press, 2008.
Information Processing Society of Japan (IPSJ), “Computing Curriculum Standard
J17,” 2018. [Online]. Available: https://www.ipsj.or.jp/annai/committee/education/
j07/curriculum j17.html
T. Kudo, “MeCab: Yet Another Part-of-Speech and Morphological Analyzer.”
[Online]. Available: https://taku910.github.io/mecab/
M. Asahara and Y. Matsumoto, “ipadic version 2.7.0 User’s Manual.” [Online].
Available: https://ja.osdn.net/projects/ipadic/
Center for Language Resource Development, NINJAL, “Electronic Dictionary with
Uniformity and Identity.” [Online]. Available: https://clrd.ninjal.ac.jp/unidic/
S. Toshinori, “Neologism dictionary based on the language resources on
the Web for Mecab,” 2015. [Online]. Available: https://github.com/neologd/
mecab-ipadic-neologd
C. Bishop, Pattern Recognition and Machine Learning. Springer, 2006.
A. Savitzky and M. J. Golay, “Smoothing and differentiation of data by simplified
least squares procedures.” Analytical chemistry, vol. 36, no. 8, pp. 1627–1639, 1964.
S. Aihara, M. Mori, S. Hirokawa, K. Kanekawa, and T. Sugihara, “Text Analysis to
the Preambles of the 4th Medium-term Goals / Plans of National University Corporations,”
IIAI Letters on Institutional Research, vol. 001, no. LIR024, pp. 1–6, 2022.
T. Tsumagari, N. Nakazato, and T. Tsumagari, “Student’s Interests and Career Understanding:
A Topic Analysis of First-year Career Courses,” IIAI Letters on Institutional
Research, vol. 001, no. LIR013, pp. 1–8, 2022.
T. Oishi, “What is the Essential Curriculum for IR in Japan?” IIAI Letters on Institutional
Research, vol. 001, no. LIR009, pp. 1–5, 2022.
Association for Computing Machinery (ACM), “Curricula Recommendations,” 2020.
[Online]. Available: https://www.acm.org/education/curricula-recommendations
C. Peng, F. Xia, M. Naseriparsa, and F. Osborne, “Knowledge Graphs: Opportunities
and Challenges,” Artificial Intelligence Review, pp. 1–32, 2023.
N. Kondo, T. Hatanaka, and T. Matsuda, “Evaluation of Predictive Models in Institutional
Research Based on Multi-Objective Optimization,” IIAI Letters on Institutional
Research, vol. 001, no. LIR018, pp. 1–9, 2022.
S. K. Mohamad and Z. Tasir, “Educational data mining: A review,” Procedia - Social
and Behavioral Sciences, vol. 97, pp. 320–324, 2013.
R. Asif, A. Merceron, S. A. Ali, and N. G. Haider, “Analyzing undergraduate students’
performance using educational data mining,” Computers & Education, vol. 113, pp.
–194, 2017.
E. B. Costa, B. Fonseca, M. A. Santana, F. F. de Ara´ujo, and J. Rego, “Evaluating
the effectiveness of educational data mining techniques for early prediction of students’
academic failure in introductory programming courses,” Computers in Human
Behavior, vol. 73, pp. 247–256, 2017.
C. Angeli, S. K. Howard, J. Ma, J. Yang, and P. A. Kirschner, “Data mining in educational
technology classroom research: Can it make a contribution?” Computers &
Education, vol. 113, pp. 226–242, 2017.
M. Misuraca, G. Scepi, and M. Spano, “Using Opinion Mining as an educational
analytic: An integrated strategy for the analysis of students ’feedback,” Studies in
Educational Evaluation, vol. 68, pp. 100 979.1–100 979.9, 2021.
J. Akoka, I. Comyn-Wattiau, N. Prat, and V. C. Storey, “Knowledge contributions in
design science research: Paths of knowledge types,” Decision Support Systems, vol.
, pp. 113 898.1–113 898.14, 2023.
A. A. Mukherjee, A. Raj, and S. Aggarwal, “Identification of barriers and their mitigation
strategies for industry 5.0 implementation in emerging economies,” International
Journal of Production Economics, vol. 257, pp. 108 770.1–108 770.15, 2023.