Bachelor's Thesis

Classification of Web Pages Using Machine Learning Methods

Thesis Info

Supervisor

doc. RNDr. Ľubomír Antoni, PhD.

Consultant

RNDr. Šimon Horvát

🎯 Objectives

  • 1

    Process an overview of machine learning methods in the field of text analysis.

  • 2

    Propose and extract suitable attributes for web page classification using web scraping.

  • 3

    Implement machine learning methods to classify web pages according to defined categories and compare the achieved results.

📚 Recommended Literature

  • Kazemian, H. B., & Ahmed, S. (2015). Comparisons of machine learning techniques for detecting malicious webpages. Expert Systems with Applications, 42(3), 1166-1177.
  • Chen, H., & Chau, M. (2003). Web Mining: Machine Learning for Web. Annual Review of Information Science and Technology 2004, 38, 289.
  • Raschka, S., & Mirjalili, V. (2019). Python machine learning: Machine learning and deep learning with Python, scikit-learn, and TensorFlow 2. Packt Publishing Ltd.