Rudresh99/README.md

Hi there,

I'm Rudresh Joshi 👋

Data Engineer

Python | SQL | PySpark | AWS | dbt | Snowflake | Databricks

I'm a Data Engineer with 5+ years of experience designing and building scalable data pipelines, modern data platforms, and analytics solutions. I enjoy transforming raw data into reliable, business-ready insights by leveraging cloud technologies, distributed computing, and best practices in data engineering.

I actively build real-world projects to strengthen my expertise in ETL development, data warehousing, orchestration, cloud platforms, and big data technologies.


🚀 About Me

  • 💼 Data Engineer with 5+ years of industry experience
  • 🐍 Strong in Python, SQL, and PySpark
  • ☁️ Experienced with AWS, Snowflake, Redshift, and Airflow
  • 📊 Passionate about Data Engineering, Data Warehousing, and Analytics
  • 🌱 Continuously learning modern Data Engineering tools and architectures
  • 🎯 Currently building production-style Data Engineering projects

🛠️ Tech Stack

Programming Languages

  • Python
  • SQL

Data Engineering

  • ETL Development
  • Data Warehousing
  • Data Modeling
  • Apache Spark
  • PySpark
  • dbt
  • Apache Airflow

Cloud & Databases

  • AWS S3
  • AWS Glue
  • Amazon Redshift
  • Snowflake
  • SQLite
  • MySQL

Libraries & Tools

  • Pandas
  • BeautifulSoup
  • SQLAlchemy
  • Git
  • GitHub
  • VS Code

AI Tools

  • Cursor
  • GitHub Copilot
  • Claude Opus 4.0
  • ChatGPT

📂 Featured Projects

📦 SQL Data Warehouse

A modern Data Warehouse built using Medallion Architecture with Bronze, Silver, and Gold layers.

Tech Stack: SQL • Data Modeling • ETL • Analytics
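
As a rough illustration of the Bronze, Silver, and Gold layering described above, here is a minimal sketch. It uses Python's built-in `sqlite3` as a stand-in database so that it runs anywhere; the table names, columns, and sample rows are hypothetical and are not taken from the actual project.

```python
import sqlite3


def build_medallion(raw_rows):
    """Load raw rows into bronze, clean them into silver, aggregate into gold."""
    conn = sqlite3.connect(":memory:")

    # Bronze: data stored exactly as received (all text, no cleaning).
    conn.execute(
        "CREATE TABLE bronze_orders (order_id TEXT, customer TEXT, amount TEXT)"
    )
    conn.executemany("INSERT INTO bronze_orders VALUES (?, ?, ?)", raw_rows)

    # Silver: trimmed, typed, de-duplicated; rows without an amount are dropped.
    conn.execute("""
        CREATE TABLE silver_orders AS
        SELECT DISTINCT
            CAST(order_id AS INTEGER) AS order_id,
            UPPER(TRIM(customer))     AS customer,
            CAST(amount AS REAL)      AS amount
        FROM bronze_orders
        WHERE amount IS NOT NULL AND TRIM(amount) <> ''
    """)

    # Gold: business-ready aggregate, one row per customer.
    conn.execute("""
        CREATE TABLE gold_customer_sales AS
        SELECT customer, COUNT(*) AS orders, SUM(amount) AS total_amount
        FROM silver_orders
        GROUP BY customer
    """)
    return conn


# Hypothetical raw input with a duplicate row and a missing amount.
RAW = [
    ("1", " alice ", "10.5"),
    ("1", " alice ", "10.5"),  # exact duplicate, removed in silver
    ("2", "Bob", "20"),
    ("3", "alice", "4.5"),
    ("4", "Bob", ""),          # missing amount, removed in silver
]

conn = build_medallion(RAW)
gold = conn.execute(
    "SELECT customer, orders, total_amount "
    "FROM gold_customer_sales ORDER BY customer"
).fetchall()
print(gold)
```

Each layer reads only from the one before it, so a cleaning rule can be changed in silver without touching the raw bronze data.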


📊 SQL Analytics Repository

A comprehensive collection of SQL scripts covering:

  • Database Exploration
  • Measures & KPIs
  • Time-Series Analysis
  • Window Functions
  • Segmentation
  • Cumulative Analytics
  • Part-to-Whole Analysis
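
Two of the topics above, cumulative analytics with window functions and part-to-whole analysis, can be sketched as follows. This is only an illustration: the `monthly_sales` table and its rows are hypothetical, and SQLite (3.25 or newer, for window function support) is used through Python's `sqlite3` so the queries run without any setup.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE monthly_sales (month TEXT, category TEXT, sales REAL)")
conn.executemany(
    "INSERT INTO monthly_sales VALUES (?, ?, ?)",
    [
        ("2024-01", "Bikes", 100.0),
        ("2024-02", "Bikes", 150.0),
        ("2024-03", "Bikes", 250.0),
        ("2024-01", "Helmets", 50.0),
        ("2024-02", "Helmets", 50.0),
    ],
)

# Cumulative analytics: running total of sales across months.
running_total = conn.execute("""
    WITH by_month AS (
        SELECT month, SUM(sales) AS month_sales
        FROM monthly_sales
        GROUP BY month
    )
    SELECT month,
           month_sales,
           SUM(month_sales) OVER (ORDER BY month) AS running_total
    FROM by_month
    ORDER BY month
""").fetchall()

# Part-to-whole: each category's share of overall sales, in percent.
share_of_total = conn.execute("""
    SELECT category,
           SUM(sales) AS category_sales,
           ROUND(100.0 * SUM(sales) / (SELECT SUM(sales) FROM monthly_sales), 1)
               AS pct_of_total
    FROM monthly_sales
    GROUP BY category
    ORDER BY category
""").fetchall()

for row in running_total:
    print(row)
for row in share_of_total:
    print(row)
```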

📚 Books ETL Pipeline

An end-to-end ETL pipeline that:

  • Scrapes book data
  • Cleans and transforms datasets
  • Loads data into SQLite & Parquet
  • Visualizes insights with Streamlit

Tech Stack: Python • BeautifulSoup • Pandas • SQLite • Streamlit
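
The extract, transform, and load steps listed above can be sketched roughly as below. To stay dependency-free, this sketch swaps in the standard library's `html.parser` for BeautifulSoup, parses an inline HTML sample instead of fetching a live page, and leaves out the Parquet and Streamlit steps. The HTML structure, class names, and `books` table are all hypothetical and not taken from the project.

```python
import sqlite3
from html.parser import HTMLParser

# Hypothetical markup standing in for a scraped book-listing page.
SAMPLE_HTML = """
<ul>
  <li class="book"><span class="title"> Dune </span><span class="price">£9.99</span></li>
  <li class="book"><span class="title">Emma</span><span class="price">£5.00</span></li>
  <li class="book"><span class="title">Untitled</span><span class="price">n/a</span></li>
</ul>
"""


class BookParser(HTMLParser):
    """Collect the title and price text of every <li class="book"> element."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._field = None  # name of the field whose text we expect next

    def handle_starttag(self, tag, attrs):
        css_class = dict(attrs).get("class")
        if tag == "li" and css_class == "book":
            self.rows.append({})
        elif tag == "span" and css_class in ("title", "price") and self.rows:
            self._field = css_class

    def handle_data(self, data):
        if self._field:
            self.rows[-1][self._field] = data
            self._field = None


def extract(html):
    parser = BookParser()
    parser.feed(html)
    return parser.rows


def transform(rows):
    """Strip whitespace, parse prices, and drop rows with unusable values."""
    clean = []
    for row in rows:
        title = row.get("title", "").strip()
        try:
            price = float(row.get("price", "").strip().lstrip("£"))
        except ValueError:
            continue  # price could not be parsed, skip the row
        if title:
            clean.append((title, price))
    return clean


def load(rows, conn):
    conn.execute("CREATE TABLE IF NOT EXISTS books (title TEXT, price REAL)")
    conn.executemany("INSERT INTO books VALUES (?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(transform(extract(SAMPLE_HTML)), conn)
books = conn.execute("SELECT title, price FROM books ORDER BY title").fetchall()
print(books)
```

Keeping the three steps as separate functions makes each one testable on its own, and the load step could be pointed at a file-based SQLite database instead of `:memory:`.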


🎯 Currently Learning

  • Apache Kafka
  • Docker
  • Kubernetes
  • CI/CD for Data Engineering
  • Apache Iceberg
  • Delta Lake
  • Databricks
  • Advanced AWS Data Services

📈 GitHub Goals

  • Build production-ready Data Engineering projects
  • Share reusable SQL solutions
  • Explore modern Data Stack tools
  • Contribute to open-source projects
  • Document best practices for ETL and Analytics Engineering

📫 Connect With Me


"Turning raw data into reliable insights through scalable data engineering solutions."

Pinned

  1. SQL_DataWarehouse_Analytics_Project (Public)

    A modern data warehouse built using Medallion Architecture with MySQL Workbench, including ETL processes, data modeling, and analytics.

    TSQL

  2. ETL_Book_Data_Scrape_Project (Public)

    End-to-end ETL pipeline built with Python, featuring web scraping, data processing, database storage, and an interactive Streamlit dashboard for book data analysis.

    Python