Data science: opportunities to transform education

Authors

DOI:

https://doi.org/10.55056/cte.368

Keywords:

data science, high education, clustering, natural language processing, text mining

Abstract

The article concerns the issue of data science tools implementation, including the text mining and natural language processing algorithms for increasing the value of high education for development modern and technologically flexible society. Data science is the field of study that involves tools, algorithms, and knowledge of math and statistics to discover knowledge from the raw data. Data science is developing fast and penetrating all spheres of life. More people understand the importance of the science of data and the need for implementation in everyday life. Data science is used in business for business analytics and production, in sales for offerings and, for sales forecasting, in marketing for customizing customers, and recommendations on purchasing, digital marketing, in banking and insurance for risk assessment, fraud detection, scoring, and in medicine for disease forecasting, process automation and patient health monitoring, in tourism in the field of price analysis, flight safety, opinion mining etc. However, data science applications in education have been relatively limited, and many opportunities for advancing the fields still unexplored.

Downloads

Download data is not yet available.
Abstract views: 460 / PDF views: 168

References

Blei, D.M. Probabilistic Topic Models. Communications of the ACM 55(4), 77–84 (2012). doi:10.1145/2133806.2133826 DOI: https://doi.org/10.1145/2133806.2133826

Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent Dirichlet Allocation. Journal of Machine Learning Research 3, 993–1022. http://jmlr.org/papers/v3/blei03a.html (2003)

Brunner, R.J., Kim, E.J. Teaching data science. Procedia Computer Science 80, 1947–1956 (2016). doi:10.1016/j.procs.2016.05 DOI: https://doi.org/10.1016/j.procs.2016.05.513

Chen, H., Chiang, R.H.L., Storey, V.C.: Business intelligence and analytics: from big data to big impact. MIS quarterly: Management Information Systems 36(4), 1165–1188 (2012) DOI: https://doi.org/10.2307/41703503

Deerwester, S., Dumais, S.T., Furnas, G.W., Landauer, T.K., Harshman, R.: Indexing by Latent Semantic Analysis. Journal of the American Society for Information Science 41(6), 391–407 (1990). doi:10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9 DOI: https://doi.org/10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO;2-9

Deerwester, S.C., Dumais, S.T., Furnas, G.W., Harshman, R.A., Landauer, T.K., Lochbaum, K.E., Streeter, L.A.: Computer information retrieval using latent semantic structure. US Patent 4,839,853, 13 June 1989

Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society. Series B (Methodological) 39(1), 1–38 (1977) DOI: https://doi.org/10.1111/j.2517-6161.1977.tb01600.x

George, G., Osinga, E.C., Lavie, D., Scott, B.A.: Big data and data science methods for management research: From the Editors. Academy of Management Journal 59(5), 1493–1507 (2016). doi:10.5465/amj.2016.4005 DOI: https://doi.org/10.5465/amj.2016.4005

Hnot, T.V., Nehrey, M.V.: Alhorytmy Data Science u modeliuvanni biznes-protsesiv (Data Science Algorithms in Modeling Business Processes). Ekonomika i suspilstvo 12, 743–751 (2017)

James, G., Witten, D., Hastie, T., Tibshirani, R.: An Introduction to Statistical Learning: with Application in R. Springer, New York (2013) DOI: https://doi.org/10.1007/978-1-4614-7138-7

Kiv, A., Semerikov, S., Soloviev, V., Kibalnyk, L., Danylchuk, H., Matviychuk, A.: Experimental Economics and Machine Learning for Prediction of Emergent Economy Dynamics. In: Kiv, A., Semerikov, S., Soloviev, V., Kibalnyk, L., Danylchuk, H., Matviychuk, A. (eds.) Experimental Economics and Machine Learning for Prediction of Emergent Economy Dynamics, Proceedings of the Selected Papers of the 8th International Conference on Monitoring, Modeling & Management of Emergent Economy (M3E2 2019), Odessa, Ukraine, May 22-24, 2019. CEUR Workshop Proceedings 2422, 1–4. http://ceur-ws.org/Vol-2422/paper00.pdf (2019). Accessed 1 Aug 2019

Kucherov, D.P.: The Synthesis of Adaptive Terminal Control Algorithm for Inertial Secondary Order System with Bounded Noises. Journal of Automation and Information Sciences 39(9), 16–25 (2007). doi:10.1615/JAutomatInfScien.v39.i9.20 DOI: https://doi.org/10.1615/JAutomatInfScien.v39.i9.20

Nehrey, M., Hnot, T.: Data Science Tools Application for Business Processes Modelling in Aviation. In: Shmelova, T., Sikirda, Yu., Rizun, N., Kucherov, D. (eds.) Cases on Modern Computer Systems in Aviation, 176-190. IGI Global, Hershey (2019). doi:10.4018/978-1-5225-7588-7.ch006 DOI: https://doi.org/10.4018/978-1-5225-7588-7.ch006

Nehrey, M., Hnot, T.: Using recommendation approaches for ratings matrixes in online marketing. Studia Ekonomiczne 342, 115–130 (2017)

Nokel, M.A., Loukachevitch, N.V.: Tematicheskie modeli: dobavlenie bigramm i uchet skhodstva mezhdu unigrammami i bigrammami (Topic models: adding bigrams and taking account of the similarity between unigrams and bigrams). Computational methods and programming 16(2), 215–234 (2015). doi:10.26089/NumMet.v16r222 DOI: https://doi.org/10.26089/NumMet.v16r222

Parish, E.J., Duraisamy, K.: A paradigm for data-driven predictive modeling using field inversion and machine learning. Journal of Computational Physics 305, 758–774 (2016). doi:10.1016/j.jcp.2015.11.012 DOI: https://doi.org/10.1016/j.jcp.2015.11.012

Patriarca, R., Di Gravio, G., Costantino, F.: A Monte Carlo evolution of the Functional Resonance Analysis Method (FRAM) to assess performance variability in complex systems. Safety science 91, 49-60 (2017). doi:10.1016/j.ssci.2016.07.016 DOI: https://doi.org/10.1016/j.ssci.2016.07.016

Périaux, J., Chen, H.Q., Mantel, B., Sefrioui, M., Sui, H.T.: Combining game theory and genetic algorithms with application to DDM-nozzle optimization problems. Finite elements in analysis and design 37(5), 417–429 (2001). doi:10.1016/S0168-874X(00)00055-X DOI: https://doi.org/10.1016/S0168-874X(00)00055-X

Rizun, N., Shmelova, T.: Decision-Making Models of the Human-Operator as an Element of the Socio-Technical Systems. In: Batko, R., Szopa, A. (eds.) Strategic Imperatives and Core Competencies in the Era of Robotics and Artificial Intelligence, pp. 167–204. IGI Global, Hershey (2017). doi:10.4018/978-1-5225-1656-9.ch009 DOI: https://doi.org/10.4018/978-1-5225-1656-9.ch009

Salton, G., Wong, A., Yang, C.S.: A Vector Space Model for Automatic Indexing. Communications of the ACM 18(11), 613–620 (1975) DOI: https://doi.org/10.1145/361219.361220

Semerikov, S.O., Teplytskyi, I.O., Yechkalo, Yu.V., Kiv, A.E.: Computer Simulation of Neural Networks Using Spreadsheets: The Dawn of the Age of Camelot. In: Kiv, A.E., Soloviev, V.N. (eds.) Proceedings of the 1st International Workshop on Augmented Reality in Education (AREdu 2018), Kryvyi Rih, Ukraine, October 2, 2018. CEUR Workshop Proceedings 2257, 122–147. http://ceur-ws.org/Vol-2257/paper14.pdf (2018). Accessed 30 Nov 2018 DOI: https://doi.org/10.31812/123456789/2648

Semerikov, S.O., Teplytskyi, I.O., Yechkalo, Yu.V., Markova, O.M., Soloviev, V.N., Kiv, A.E.: Computer Simulation of Neural Networks Using Spreadsheets: Dr. Anderson, Welcome Back. In: Ermolayev, V., Mallet, F., Yakovyna, V., Kharchenko, V., Kobets, V., Korniłowicz, A., Kravtsov, H., Nikitchenko, M., Semerikov, S., Spivakovsky, A. (eds.) Proceedings of the 15th International Conference on ICT in Education, Research and Industrial Applications. Integration, Harmonization and Knowledge Transfer (ICTERI, 2019), Kherson, Ukraine, June 12-15 2019, vol. II: Workshops. CEUR Workshop Proceedings 2393, 833–848. http://ceur-ws.org/Vol-2393/paper_348.pdf (2019). Accessed 30 Jun 2019

Shoro, A.G., Soomro, T.R.: Big Data Analysis: Apache Spark Perspective. Global Journal of Computer Science and Technology 15(1-C). https://computerresearch.org/index.php/computer/article/view/1137 (2015)

Vorontsov, K.V., Potapenko, A.A.: Modifikatcii EM-algoritma dlia veroiatnostnogo tematicheskogo modelirovaniia (EM-like algorithms for probabilistic topic modeling). Machine Learning and Data Analysis 1(6), 657–686 (2013)

Xiong, J., Yu, G., Zhang, X.: Research on Governance Structure of Big Data of Civil Aviation. Journal of Computer and Communications 5(5), 112–118 (2017). doi:10.4236/jcc.2017.55009 DOI: https://doi.org/10.4236/jcc.2017.55009

Downloads

Published

2019-03-21

Issue

Section

Digital transformation of learning

How to Cite

Volkova, N.P., Rizun, N.O. and Nehrey, M.V., 2019. Data science: opportunities to transform education. CTE Workshop Proceedings [Online], 6, pp.48–73. Available from: https://doi.org/10.55056/cte.368 [Accessed 28 April 2025].
Received 2018-08-04
Accepted 2018-12-21
Published 2019-03-21

Most read articles by the same author(s)