Web/HTML Screen Scraping Using Python ONLY.
Go to the TOP500 supercomputer website at https://top500.org/lists/2019/06/
View the Web page source and inspect the HTML table code for the entries.
Note the fields in the table: Rank, System, Cores, RMax, RPeak, Power.
a) Using Python with BeautifulSoup, extract the table data from the website
and create a CSV dataset containing data on all 500 systems.
b) Clean and explore the dataset, producing summary statistics
and visualizations for Cores, RMax, RPeak, and Power.
c) Display and explain the relationship between Cores and RPeak,
and between Cores and Power. Consider transforming the data
(for example, by taking logarithms) to reduce the range of the data values.
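
One possible starting point for parts a) to c) is sketched below. It is a
minimal sketch, not a reference solution. It assumes each system is one
<tr> row with six <td> cells in the order Rank, System, Cores, RMax, RPeak,
Power, that numbers use commas as thousands separators, and that Power may
be blank; check all of this against the actual page source. SAMPLE_HTML and
its two rows are made up for illustration, and the helper names
(to_number, parse_rows, write_csv, log10_pairs) are hypothetical. Fetching
the page is left out, and the 500 entries may be spread over more than one
page.

```python
import csv
import io
import math

from bs4 import BeautifulSoup

# Made-up sample markup for illustration only; the real table layout is an
# assumption and must be checked in the page source.
SAMPLE_HTML = """
<table>
  <thead>
    <tr><th>Rank</th><th>System</th><th>Cores</th>
        <th>Rmax</th><th>Rpeak</th><th>Power</th></tr>
  </thead>
  <tbody>
    <tr><td>1</td><td>Example System A</td><td>2,000,000</td>
        <td>100,000.0</td><td>150,000.0</td><td>10,000</td></tr>
    <tr><td>2</td><td>Example System B</td><td>50,000</td>
        <td>2,000.5</td><td>3,000.0</td><td></td></tr>
  </tbody>
</table>
"""

FIELDS = ["Rank", "System", "Cores", "RMax", "RPeak", "Power"]
INT_FIELDS = ("Rank", "Cores")
FLOAT_FIELDS = ("RMax", "RPeak", "Power")


def to_number(text, kind):
    """Remove thousands separators; return None for an empty cell."""
    cleaned = text.replace(",", "").strip()
    if not cleaned:
        return None
    return kind(cleaned)


def parse_rows(html):
    """Return one dict per data row, keyed by FIELDS."""
    soup = BeautifulSoup(html, "html.parser")
    rows = []
    for tr in soup.find_all("tr"):
        cells = [td.get_text(" ", strip=True) for td in tr.find_all("td")]
        if len(cells) != len(FIELDS):
            continue  # header rows have <th> cells only, so they are skipped
        row = dict(zip(FIELDS, cells))
        for name in INT_FIELDS:
            row[name] = to_number(row[name], int)
        for name in FLOAT_FIELDS:
            row[name] = to_number(row[name], float)
        rows.append(row)
    return rows


def write_csv(rows, handle):
    """Write the parsed rows as CSV; missing values become empty fields."""
    writer = csv.DictWriter(handle, fieldnames=FIELDS)
    writer.writeheader()
    writer.writerows(rows)


def log10_pairs(rows, x_name, y_name):
    """log10-transform two columns, skipping missing or non-positive values."""
    pairs = []
    for row in rows:
        x, y = row[x_name], row[y_name]
        if x is not None and y is not None and x > 0 and y > 0:
            pairs.append((math.log10(x), math.log10(y)))
    return pairs


rows = parse_rows(SAMPLE_HTML)
buffer = io.StringIO()  # stands in for open("top500.csv", "w", newline="")
write_csv(rows, buffer)
print(buffer.getvalue())
print(log10_pairs(rows, "Cores", "RPeak"))
print(log10_pairs(rows, "Cores", "Power"))
```

The log10 step is one way to handle part c): Cores, RPeak, and Power can
differ by several orders of magnitude between the top and the bottom of the
list, so plotting the transformed pairs tends to spread the points more
evenly than plotting the raw values.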