Skip to content
View BobbyAxelrods's full-sized avatar
🎯
Focusing
🎯
Focusing
Block or Report

Block or report BobbyAxelrods

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
BobbyAxelrods/README.md

Hi there, I'm Shafiq 👋

I'm a Data engineer | Builder & Designer at core💻

Connect with me on linkedin : Yu Shi | LinkedIn

With almost 4 of experience as a data engineer, I have honed my skills in designing, constructing, and managing robust data pipelines and infrastructure. My expertise caters to a wide range of analytical and business intelligence requirements. Proficient in various technologies and tools, including SQL, Python, ETL frameworks, and big data technologies like Hadoop and Spark, I ensure efficient and effective data processing.

To showcase my proficiency in data engineering, my GitHub account houses a diverse collection of projects and code samples. These projects not only highlight my ability to extract, transform, and load data from multiple sources but also exemplify my expertise in data modeling, visualization, and reporting.

Committed to continuous learning and personal growth, I am eager to collaborate with other professionals in the field. I warmly invite you to connect with me on GitHub, where we can explore my work, exchange ideas, and share knowledge.

Let's connect and make meaningful contributions together!


���� I'm currently working on

  • My old projects

🌱 I'm currently learning

  • Machine Learning
  • Data Engineering
  • Kubernetes

💼 Technical Skills



Tools Coverages

Nifi | Airflow | Postgres | SQL | Python | Google Cloud : Bigquery & Instances | AWS : S3, Redshift & EC2 | Hadoop | Streamlit | Pandas | Kafka | Spark | Shell | Terraform | Docker | Selenium Scrapper | dbt | | No SQL : Mongo Db | Data Modelling | Data Integration | Data Governance : Open MetaData | Zookeeper | API | Git source control | Data Warehouse | Data Lake

Popular repositories Loading

  1. 0103_Streaming_Scheduler_Template 0103_Streaming_Scheduler_Template Public

    Creating a template for airflow code block & structure to expedite process of building scheduler

    Python 2

  2. spark-ETL-component-library spark-ETL-component-library Public

    Forked from claimed-framework/component-library

    The goal of CLAIMED is to enable low-code/no-code rapid prototyping style programming to seamlessly CI/CD into production.

    Jupyter Notebook 1

  3. xlsx_to_json xlsx_to_json Public

    Simple apps to convert xlsx files to json with option nrows for speeds , dtypes as string to avoid missing leading 000 when conversion

    Python 1

  4. streamlit-converter-deploy-instances streamlit-converter-deploy-instances Public

    Deploy live streamlit for analyst to utilize specific to maintain leading 000 and control read how many rows.

    Python 1

  5. analyst-ingestor-postgres analyst-ingestor-postgres Public

    A tools for analyst to ingest data dedicated for vizualization , they dont have any access to other schema which avoid the danger for deleting other important data) . They dont have to access to db…

    1

  6. analyst-tools-v2 analyst-tools-v2 Public

    Python 1