Proposed Analytical Process for More Convenient Utilization of Open Data

Verification Using Tourist Number Data

Authors

  • Soh SAKURAI Chiba University of Commerce
  • Noriko Shiabata Yokohama City University
  • Akira Nagamatsu Graduate School of Engineering, Tohoku University

DOI:

https://doi.org/10.52731/lir.v004.281

Keywords:

open data, tourism, regression analysis, principal component analysis

Abstract

This study proposes a statistical analysis process for the effective and convenient utilization of open data. Considering the current state of open data use in Japan and other countries, it focuses on the challenge where maximizing data utility is heavily dependent on user skills. The process is validated using easily accessible prefectural tourist data, which is rich in academic research. By applying principal component analysis and regression analysis, the study defines a specific model and proposes a process aimed at enabling more practical and straightforward applications of open data.

References

WHITE HOUSE, “Promoting Transparency in Government,” 8 Dec. 2009; https:// obamawhitehouse.archives.gov/blog/2009/12/08/promoting-transparency-government.

S. Miyagawa, M. Endo, M. Urata, and T. Yasuda, “Development of data utilization methods for objective analysis in municipal operations: Effectiveness and visualization through in-formation linkage between local events and operations,” Studies in Science and Technology, vol. 12, no. 2, 2023, pp. 131-136.

C. Batini, A. Rula, M. Scannapieco, and G. Viscusi, “From data quality to big data quality, ” Journal of Database Management, vol. 26, no. 1, 2015, pp. 60-82.

S. Sadiq, and M. Indulska, "Open data: Quality over quantity,” International Journal of In-formation Management, vol. 37, no. 3, 2017, pp. 150-154.

R.Y. Wang and D.M. Strong, “Beyond accuracy: What data quality means to data consum-ers,” Journal of Management Information Systems, vol. 12, no. 4, 1996, pp. 533.

R.Y. Wang, “A product perspective on total data quality management,” Communications of the ACM, vol. 41, no. 2, 1998, pp. 58-65.

T. Koltay, “Quality of Open Research Data: Values, Convergences and Governance,” In-formation, vol. 11, no. 175, 2020; DOI:10.3390/info11040175.

M. Archie, S. Gershon, A. Katcoff, and A. Zeng, “Who’s Watching?,” MIT Open-CourseWare, 2018, Spring; https://courses.csail.mit.edu/6.857/2018/project/Archie-Gershon- Katchoff-Zeng-Netflix.pdf.

US-CERT, “Protecting Aggregated Data,” 2005; https://www.cisa.gov/sites/default/files/ publications/Data-Agg-120605.pdf

J. Koo, G.G. Kang, and Y.G. Kim, “Security and Privacy in Big Data Life Cycle: A Survey and Open Challenges,” Sustainability, vol. 12, no. 24, 2020, 10571; DOI: 10.3390/ su122410571.

A. Burdick, J. Drucker, P. Lunenfeld, T. Presner, and J. Schnapp, Digital Humanities, The MIT Press, 2012.

Digital Agency, “About the Municipal Standard Open Data Set (Former Standard Data Set),” March 31, 2023. https://www.digital.go.jp/resources/open_data/municipal-standard- data-set-test

German National Tourist Board, “Open Data Destination Germany,” 17 Feb. 2020; https://open-data-germany.org/en/how-the-largest-digital-infrastructure-project-in-tourism-is-taking-shape/.

European Union, “Open Data in tourism,” 24 Oct. 2018; https://data.europa.eu/en/ publications/datastories/open-data-tourism.

García-Madurga, Miguel-Ángel, and Ana-Julia Grilló-Méndez. "Artificial Intelligence in the tourism industry: An overview of reviews." Administrative Sciences 13, no. 8, 172, 2023; doi: 10.3390/admsci13080172.

S.I. Stewart and C.A. Vogt, “Multi-destination trip patterns,” Annals of Tourism Research, vol. 24, no. 2, 1997, pp.458-461; DOI:10.1016/S0160-7383(97)80017-5.

C. Tideswell and B. Faulkner, “Multidestination Travel Patterns of International Visitors to Queensland,” Journal of Travel Research, vol. 37, no. 4, 1999, pp. 364-374; DOI:10.1177/004728759903700406.

C.C. Lue, J.L. Crompton, and D.R. Fesenmaier, “Conceptualization of multi-destination pleasure trips,” Annals of Tourism Research, vol. 20, no. 2, 1993, pp.289-301; DOI: 10.1016/0160-7383.

T.H. Wonnacott and R.J. Wonnacott, Regression, John Wiley & Sons, Inc., 1981, Trans-lated by Y. Tabata and T. Ohta as "Kaikibunseki to sono ouyou" (Regression Analysis and Its Applications), Gendai Sugaku-sha, Japan, 1998.

Honda, Masami, "Declaration to be the World's Most Advanced IT Nation and E-government Policy," IPSJ-SIG Security Psychology and Trust, vol. 2015, no. 10, 2015, pp. 1-7. [in Japanese].

Ministry of Land, Infrastructure, Transport and Tourism, Japan Tourism Agency, "Common Standards for Tourism Statistics Survey Guidelines, Revised March 2013," Kanko Iri-komikyaku Tokei ni kansuru Kyotsu Kijun Chosa Yoryo, Heisei 25-nen 3-gatsu kaitei [in Japanese], 2013; https://www.mlit.go.jp/kankocho/content/810003353.pdf

Malhotra, Naresh K. Marketing Research: An Applied Orientation. 4th ed., Prentice Hall, 2004. Translated by Kazuo Kobayashi, supervised by the Japan Marketing Research Asso-ciation. Japanese title: "Marketing Risachi no Riron to Jissai – Riron Hen." Doyukan, 2006.

Weisberg, Sanford. Applied Linear Regression. 4th ed., John Wiley & Sons, Inc., 2014. Japanese edition supervised by Etsuo Miyaoka, translated as "Deta Kaiseki no tame no Senkei Kaiki [Gencho Daiyonhan]", Kyoritsu Shuppan, 2024.

A.E. Hoerl and R.W. Kennard, “Ridge Regression: Biased Estimation for Nonorthogonal Problems,” Technometrics, vol. 42, no. 1, 2000, pp.80-86; DOI:/10.2307/1271436.

R. Tibshirani, “Regression Shrinkage and Selection via the Lasso,” Journal of the Royal Statistical Society. Series B (Methodological), vol. 58, no. 1 1996, pp. 267-288; https://www.jstor.org/stable/2346178

H. Wold, “Soft Modelling by Latent Variables: The Non-Linear Iterative Partial Least Squares (NIPALS) Approach”, Journal of Applied Probability, vol. 12, Issue S1, 1975, pp. 117-142.

Downloads

Published

2024-09-15