Skip to content

arjayjean/superhero_power_stat

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SuperHeroAPI ETL

🧰 Languages and Tools


I have created an ETL that:

  • Extract: Data is extracted from a SuperHero API by Yoann Cribier
  • Transform: It will be transformed in Python from its JSON format into a form that fits the MySQL database
  • Load: Then loaded into a MySQL database

⚙ ETL Process

Notes:

  • Before I execute the ETL, I need to take more time transforming the data. The data was transformed but I need to replace the redundant data that was loaded. There were entities that were empty values, that needed to be NULL, and names that were repeated in the "alias" and "name" column. This will take time away from analysis that needs to be made. Instead of making those analysis, data cleaning would have to be done before; which is what I have done in MySQL.