Section
|
Computer science
|
Title
|
Online web navigation assistant
|
Author(-s)
|
Ali N.M.abc,
Gadallah A.M.a,
Hefny H.A.a,
Novikov B.A.d
|
Affiliations
|
Cairo Universitya,
Port Said Universityb,
Saint Petersburg State Universityc,
Higher School of Economics, Saint Petersburgd
|
Abstract
|
The problem of finding relevant data while searching the internet represents a big challenge for web users due to the enormous amounts of available information on the web. These difficulties are related to the well-known problem of information overload. In this work, we propose an online web assistant called OWNA. We developed a fully integrated framework for making recommendations in real-time based on web usage mining techniques. Our work starts with preparing raw data, then extracting useful information that helps build a knowledge base as well as assigns a specific weight for certain factors. The experiments show the advantages of the proposed model against alternative approaches.
|
Keywords
|
web mining, web personalization, link prediction, web usage mining, recommender systems, web log, web navigation assistant
|
UDC
|
004.048, 004.622, 004.657
|
MSC
|
68T10, 68U35
|
DOI
|
10.35634/vm210109
|
Received
|
8 July 2020
|
Language
|
English
|
Citation
|
Ali N.M., Gadallah A.M., Hefny H.A., Novikov B.A. Online web navigation assistant, Vestnik Udmurtskogo Universiteta. Matematika. Mekhanika. Komp'yuternye Nauki, 2021, vol. 31, issue 1, pp. 116-131.
|
References
|
- Cooley R., Mobasher B., Srivastava J. Web mining: information and pattern discovery on the World Wide Web, Proceedings Ninth IEEE International Conference on Tools with Artificial Intelligence, IEEE, 1997, pp. 558-567. https://doi.org/10.1109/tai.1997.632303
- Resnick P., Varian H.R. Recommender systems, Communications of the ACM, 1997, vol. 40, no. 3, pp. 56-58. https://doi.org/10.1145/245108.245121
- Burke R. Hybrid recommender systems: survey and experiments, User Modeling and User-Adapted Interaction, 2002, vol. 12, no. 4, pp. 331-370. https://doi.org/10.1023/A:1021240730564
- Al-Yazeed N.M.A., Gadallah A.M., Hefny H.A. A hybrid recommendation model for web navigation, 2015 IEEE Seventh International Conference on Intelligent Computing and Information Systems (ICICIS), IEEE, 2015, pp. 552-560. https://doi.org/10.1109/IntelCIS.2015.7397276
- Herlocker J.L., Konstan J.A., Borchers A., Ried J. An algorithmic framework for performing collaborative filtering, Proceedings of the 22nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '99), ACM, 1999, pp. 230-237. https://doi.org/10.1145/312624.312682
- Deshpande M., Karypis G. Item-based top-N recommendation algorithms, ACM Transactions on Information Systems (TOIS), 2004, vol. 22, no. 1, pp. 143-177. https://doi.org/10.1145/963770.963776
- Jafari M., Sabzchi F.S., Irani A.J. Applying web usage mining techniques to design effective web recommendation systems: a case study, Advances in Computer Science: International Journal (ACSIJ), 2014, vol. 3, no. 2, pp. 78-90. http://www.acsij.org/acsij/article/view/216
- Sarria M.D.D., Guzman E.L. A recommendation-based web usage mining model for a university community, 2012 Eighth Latin American Web Congress, IEEE, 2012, pp. 71-78. https://doi.org/10.1109/la-web.2012.23
- Fu X., Budzik J., Hammond K.J. Mining navigation history for recommendation, Proceedings of the 5th International Conference on Intelligent User Interfaces, ACM, 2000, pp. 106-112. https://doi.org/10.1145/325737.325796
- Wu Y.-H., Chen Y.-C., Chen A.L.P. Enabling personalized recommendation on the Web based on user interests and behaviors, Proceedings Eleventh International Workshop on Research Issues in Data Engineering. Document Management for Data Intensive Business and Scientific Applications. RIDE 2001, IEEE, 2001, pp. 17-24. https://doi.org/10.1109/ride.2001.916487
- Singh M.P. The practical handbook of internet computing, New York: Chapman and Hall/CRC, 2004. https://doi.org/10.1201/9780203507223
- Anitha A., Nallaperumal K. A web usage mining based recommendation model for learning management systems, 2010 IEEE International Conference on Computational Intelligence and Computing Research, IEEE, 2010, pp. 1-4. https://doi.org/10.1109/iccic.2010.5705888
- Nigam B., Tokekar S., Jain S. Evaluation of models for predicting user's next request in web usage mining, International Journal on Cybernetics and Informatics (IJCI), 2015, vol. 4, no. 1, pp. 1-13. https://doi.org/10.5121/ijci.2015.4101
- Forsati R., Meybodi M.R. Effective page recommendation algorithms based on distributed learning automata and weighted association rules, Expert Systems with Applications, 2010, vol. 37, no. 2, pp. 1316-1330. https://doi.org/10.1016/j.eswa.2009.06.010
- Mobasher B., Dai H., Luo T., Nakagawa M. Effective personalization based on association rule discovery from web usage data, Proceedings of the 3rd International Workshop on Web Information and Data Management (WIDM '01), ACM, 2001, pp. 9-15. https://doi.org/10.1145/502932.502935
- Lin W., Alvarez S.A., Ruiz C. Collaborative recommendation via adaptive association rule mining, Proceedings of the International Workshop on Web Mining for E-Commerce- Challenges and Opportunities (WebKDD '2000), Citeseer, 2000. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.29.7811
- Langhnoja S.G., Barot M.P., Mehta D.B. Web usage mining using association rule mining on clustered data for pattern discovery, International Journal of Data Mining Techniques and Applications, 2013, vol. 2, no. 1, pp. 141-150. http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.678.4780&rep=rep1&type=pdf
- Lakshminarayan C., Kosuru R., Hsu M. Modeling complex clickstream data by stochastic models: theory and methods, Proceedings of the 25th International Conference Companion on World Wide Web, International World Wide Web Conferences Steering Committee, 2016, pp. 879-884. https://doi.org/10.1145/2872518.2891070
- Vellingiri J., Pandian S.C. A survey on web usage mining, Global Journal of Computer Science and Technology, 2011, vol. 11, no. 4, pp. 66-72. https://computerresearch.org/index.php/computer/article/view/710
- Géry M., Haddad H. Evaluation of web usage mining approaches for user's next request prediction, Proceedings of the 5th ACM International Workshop on Web Information and Data Management (WIDM '03), ACM, 2003, pp. 74-81. https://doi.org/10.1145/956699.956716
- Bhushan R., Nath R. Recommendation of optimized web pages to users using Web Log mining techniques, 2013 3rd IEEE International Advance Computing Conference (IACC), IEEE, 2013, pp. 1030-1033. https://doi.org/10.1109/IAdCC.2013.6514368
- Dhyani D., Bhowmick S.S., Ng W.-K. Modelling and predicting Web page accesses using Markov processes, Proceedings of the 14th International Workshop on Database and Expert Systems Applications, IEEE, 2003, pp. 332-336. https://doi.org/10.1109/dexa.2003.1232044
- Ko H.-G., Kim E., Ko I.-Y., Chang D. Semantically-based recommendation by using semantic clusters of users viewing history, Proceedings of the International Conference on Big Data and Smart Computing (BIGCOMP), 2014, pp. 83-87. https://doi.org/10.1109/bigcomp.2014.6741412
- Chimphlee S., Salim N., Ngadiman M.S., Chimphlee W., Srinoy S. Rough sets clustering and Markov model for web access prediction, Proceedings of the Postgraduate Annual Research Seminar, 2006, pp. 470-475. http://eprints.utm.my/id/eprint/3370/
- Khalil F., Li J., Wang H. A framework of combining Markov model with association rules for predicting web page accesses, Proceedings of the 5th Australasian Conference on Data Mining and Analystics (AusDM '06), Australian Computer Society, 2006, pp. 177-184. https://dl.acm.org/doi/10.5555/1273808.1273832
- Maratea A., Petrosino A. An heuristic approach to page recommendation in web usage mining, Proceedings of the 9th International Conference on Intelligent Systems Design and Applications (ISDA), 2009, pp. 1043-1048. https://doi.org/10.1109/isda.2009.252
- Rao V.V.R.M., Kumari V.V. An efficient hybrid successive Markov model for predicting web user usage behavior using web usage mining, International Journal of Data Engineering (IJDE), 2010, vol. 1, no. 5, pp. 43-62. http://www.cscjournals.org/library/manuscriptinfo.php?mc=IJDE-25
- Anitha A. A new web usage mining approach for next page access prediction, International Journal of Computer Applications (IJCA), 2010, vol. 8, no. 11, pp. 7-10. https://www.ijcaonline.org/archives/volume8/number11/1252-1700
- Jalali M., Mustapha N., Sulaiman M.N., Mamat A. WebPUM: A Web-based recommendation system to predict user future movements, Expert Systems with Applications, 2010, vol. 37, no. 9, pp. 6201-6212. https://doi.org/10.1016/j.eswa.2010.02.105
- AlMurtadha Y., Sulaiman M.N.B., Mustapha N., Udzir N.I. IPACT: Improved web page recommendation system using profile aggregation based on clustering of transactions, American Journal of Applied Sciences, 2011, vol. 8, no. 3, pp. 277-283. https://doi.org/10.3844/ajassp.2011.277.283
- Mishra R., Kumar P., Bhasker B. A web recommendation system considering sequential information, Decision Support Systems, 2015, vol. 75, no. 1, pp. 1-10. https://doi.org/10.1016/j.dss.2015.04.004
- Ali N.M., Gadallah A.M., Hefny H.A., Novikov B. An integrated framework for web data preprocessing towards modeling user behavior, Proceedings of the 2020 International Multi-Conference on Industrial Engineering and Modern Technologies (FarEastCon), IEEE, 2020, pp. 1-8. https://doi.org/10.1109/FarEastCon50210.2020.9271467
- Iliou C., Kostoulas T., Tsikrika T., Katos V., Vrochidis S., Kompatsiaris Y. Towards a framework for detecting advanced web bots, Proceedings of the 14th International Conference on Availability, Reliability and Security, ACM, 2019, no. 18. https://doi.org/10.1145/3339252.3339267
- Patel P., Parmar M. Improve heuristics for user session identification through web server log in web usage mining, International Journal of Computer Science and Information Technologies, 2014, vol. 5, no. 3, pp. 3562-3565. http://ijcsit.com/docs/Volume%205/vol5issue03/ijcsit20140503201.pdf
- Ganibardi A., Ali C.A. Weblog data structuration: A stream-centric approach for improving session reconstruction quality, Proceedings of the 20th International Conference on Information Integration and Web-based Applications and Services, ACM, 2018, pp. 263-271. https://doi.org/10.1145/3282373.3282379
- Leoni M.D., Dündar S. Event-log abstraction using batch session identification and clustering, Proceedings of the 35th Annual ACM Symposium on Applied Computing, ACM, 2020, pp. 36-44. https://doi.org/10.1145/3341105.3373861
- Chitraa V., Thanamani A.S. A novel technique for sessions identification in web usage mining preprocessing, International Journal of Computer Applications, 2011, vol. 34, no. 9, pp. 23-27. https://www.ijcaonline.org/archives/volume34/number9/4127-5958
- Markov Z., Larose D.T. Data mining the web: uncovering patterns in web content, structure, and usage, John Wiley and Sons, 2007. https://doi.org/10.1002/0470108096
- Microsoft Docs: The Modern Documentation Service for Microsoft, IIS Logging, 2018, Available: https://docs.microsoft.com/en-us/windows/win32/http/iis-logging, last access (01 Feb., 2020).
|
Full text
|
|