Following ICT bachelor programme offerings (studyinfo.fi) using web scraping with Python
Kisseljov, Andrei (2023)
Kisseljov, Andrei
2023
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:amk-202302172516
https://urn.fi/URN:NBN:fi:amk-202302172516
Tiivistelmä
The aim of this thesis was to develop a web scraping tool that can help in
finding and extracting information on ICT bachelor programs from the website studyinfo.fi. The problem addressed is the difficulty of finding information on ICT bachelor programs offered by universities. The solution is a web scraping tool that can be used to collect data on ICT bachelor programs from the website studyinfo.fi. This tool helped in extracting data on the different ICT programs offered in different universities in Finland. The collected data included the university name, study program, teaching mode, and application type. The collected data was stored in an Excel file and was displayed in a tabular format. The user was able to search for a specific program using the program name. In addition, the user was able to filter the search results by the provider, teaching, or the type of the application. The thesis was implemented as a web scraping tool. The tool was developed using the Python programming language with the use of Beautiful Soup4, Selenium, and Pandas as key libraries. A tool was created that collected data specified above and was able to store it in Excel file for further analysis. In the end, tool met all expectations and requirements.
finding and extracting information on ICT bachelor programs from the website studyinfo.fi. The problem addressed is the difficulty of finding information on ICT bachelor programs offered by universities. The solution is a web scraping tool that can be used to collect data on ICT bachelor programs from the website studyinfo.fi. This tool helped in extracting data on the different ICT programs offered in different universities in Finland. The collected data included the university name, study program, teaching mode, and application type. The collected data was stored in an Excel file and was displayed in a tabular format. The user was able to search for a specific program using the program name. In addition, the user was able to filter the search results by the provider, teaching, or the type of the application. The thesis was implemented as a web scraping tool. The tool was developed using the Python programming language with the use of Beautiful Soup4, Selenium, and Pandas as key libraries. A tool was created that collected data specified above and was able to store it in Excel file for further analysis. In the end, tool met all expectations and requirements.