Web_and_HTML_Screen_Scraping

data scraping with BeautifulSoup, data manipulation and visualization

0
0
Python

Web_and_HTML_Screen_Scraping

Data scraping and visualization with Python.

Data source: Top500 SuperComputer Web site at https://www.top500.org/list/2018/06/

View the page source and inspect the HTML table code for the entries.

Note the fields in the table: Rank, System, Cores, RMax, RPeak, Power.

Using Python with BeautifulSoup, extract the data from the Web site and create a csv dataset containing that data.

  • Clean & explore the dataset, producing summary statistics for Cores, RMax, RPeak, and Power.
  • Display and explain the relationship between
    • Cores and RPeak
    • Cores and Power

[Challenge problem]:
Display summary statistics and visualizations for the Country data.