BeautifulSoup

CBC news article comments analysis and visualization

This post is focuses on a project of the analysis of scaped comments from CBC News website articles. (BTW, there are lots of “CBC’s” in the world. The one I am referring to is the Canadian Broadcasting Corporation.) Another post details web scraping method used to get the articles and their comments. As a Canadian […]

CBC news article comments analysis and visualization Read More »

CBC news comment and replies interaction network visualization

The CBC (Canadian Broadcasting Corporation) news website articles often have a comments section. It would be interesting to see the interactions between comments and replies, and to understand which person makes the most comments, and frequently used words and phrases. See the results: https://sitrucp.github.io/cbc_comments/image_grid.html Another post details a method to retrieve the comments. Comments include

CBC news comment and replies interaction network visualization Read More »

CBC news article comments web scraping

This post is focuses on a project where I scaped comments from CBC News website articles. (BTW, there are lots of “CBC’s” in the world. The one I am referring to is the Canadian Broadcasting Corporation.) There is another post that focuses on analysis and visualization of the scraped data. As a Canadian often travelling

CBC news article comments web scraping Read More »

Scraping public health web page using Python, BeautifulSoup4, Requests and Pandas

During the 2020 COVID-19 pandemic in Canada I wanted to get COVID-19 confirmed cases counts data for the city of Montreal. The data I wanted was made freely available by the Quebec Government’s Health Montreal website in a tabular format that was updated regularly.  EDIT: Health Montreal stopped publishing data so the link no longer

Scraping public health web page using Python, BeautifulSoup4, Requests and Pandas Read More »

Scroll to Top