Mars web scraping and visualization project using Python, Beautiful Soup, and HTML to get the latest news about Mars.

sherirosalia/Web_Scraping

Web Scraping Mars

Dynamic, real-time browser automation and data collection

Latest news about Mars in a user interface.

About

NASA publicises data on a variety of platforms, and this project uses the Python Beautiful Soup library to pull information from several different websites.

Data is collected from these links:

https://mars.nasa.gov/news/
https://www.jpl.nasa.gov/spaceimages/?search=&category=Mars
https://www.jpl.nasa.gov/
https://twitter.com/marswxreport?lang=en
https://space-facts.com/mars/
https://astrogeology.usgs.gov/search/results?q=hemisphere+enhanced&k1=target&v1=Mars
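
As an illustration of the approach described above, here is a minimal sketch of pulling a headline and teaser out of a page with Beautiful Soup. It is not the project's actual script: the inline HTML, the class names (`content_title`, `article_teaser_body`), and the function name `parse_latest_news` are assumptions made for the example, and the real markup on the scraped sites may differ.

```python
from bs4 import BeautifulSoup

# Hypothetical HTML standing in for a downloaded news page.
# The real markup on the scraped sites may differ and changes over time.
SAMPLE_HTML = """
<ul>
  <li class="slide">
    <div class="content_title"><a href="/news/1">Sample Mars headline</a></div>
    <div class="article_teaser_body">Sample teaser paragraph.</div>
  </li>
</ul>
"""


def parse_latest_news(html):
    """Return the first headline and teaser found in the given HTML."""
    soup = BeautifulSoup(html, "html.parser")
    # find() returns None when no matching element exists, so guard each lookup.
    title = soup.find("div", class_="content_title")
    teaser = soup.find("div", class_="article_teaser_body")
    return {
        "title": title.get_text(strip=True) if title else None,
        "teaser": teaser.get_text(strip=True) if teaser else None,
    }


news = parse_latest_news(SAMPLE_HTML)
print(news)
```

In a real run the HTML would come from the live page rather than an inline string; the parsing step stays the same.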

Screencast

Screencast of web app here: Web Scraping Mars

Deployment

Localhost

Run the app locally by entering "python app.py" in bash or another terminal.

This was hosted externally for a while, but that proved problematic because the sites being scraped keep changing. For example, Twitter frequently changed the CSS class names of its tweets, and each time it did, the script broke.
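
One way to soften this kind of breakage, sketched here as an assumption rather than as what the project does, is to try several candidate CSS selectors in order and return nothing instead of crashing when none match. The selectors, the sample markup, and the name `first_match_text` are all hypothetical.

```python
from bs4 import BeautifulSoup

# Hypothetical selectors, listed from most to least specific.
# None of these are guaranteed to match the real site's markup.
CANDIDATE_SELECTORS = ["p.tweet-text", "p.TweetTextSize", "article span"]


def first_match_text(html, selectors=CANDIDATE_SELECTORS):
    """Return the text of the first element matched by any selector, else None."""
    soup = BeautifulSoup(html, "html.parser")
    for selector in selectors:
        node = soup.select_one(selector)
        if node is not None:
            text = node.get_text(strip=True)
            if text:
                return text
    # No selector matched: report the miss instead of raising an exception.
    return None


old_markup = '<p class="tweet-text">Sol 100 weather report</p>'
new_markup = '<article><span>Sol 101 weather report</span></article>'
print(first_match_text(old_markup))
print(first_match_text(new_markup))
```

Fallback selectors only delay the problem, since a large enough redesign will defeat every entry in the list, but returning `None` lets the page render with a placeholder instead of failing outright.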

Frameworks

Python, Flask, Bootstrap 4

File Structure

The "mars_webscrape" directory contains the scripts and templates.

Landing Page

[Image: landing page]

Info Table

[Image: table of results]

Large Image

[Image: first large image]

Mars Image

[Image: Mars sphere]
