Tucson Rudresh99

Hi there,

I'm Rudresh Joshi 👋

Data Engineer

Python | SQL | PySpark | AWS | DBT | Snowflake | Databricks

I'm a Data Engineer with 5+ years of experience designing and building scalable data pipelines, modern data platforms, and analytics solutions. I enjoy transforming raw data into reliable, business-ready insights by leveraging cloud technologies, distributed computing, and best practices in data engineering.

I actively build real-world projects to strengthen my expertise in ETL development, data warehousing, orchestration, cloud platforms, and big data technologies.

🚀 About Me

💼 Data Engineer with 5+ years of industry experience
🐍 Strong in Python, SQL, and PySpark
☁️ Experienced with AWS, Snowflake, Redshift, and Airflow
📊 Passionate about Data Engineering, Data Warehousing, and Analytics
🌱 Continuously learning modern Data Engineering tools and architectures
🎯 Currently building production-style Data Engineering projects

🛠️ Tech Stack

Programming Languages

Python
SQL

Data Engineering

ETL Development
Data Warehousing
Data Modeling
Apache Spark
PySpark
dbt
Apache Airflow

Cloud & Databases

AWS S3
AWS Glue
Amazon Redshift
Snowflake
SQLite
MySQL

Libraries & Tools

Pandas
BeautifulSoup
SQLAlchemy
Git
GitHub
VS Code

AI Tools

Cursor
github co-pilot
Claude Opus 4.0
Chat GPT

📂 Featured Projects

📦 SQL Data Warehouse

A modern Data Warehouse built using Medallion Architecture with Bronze, Silver, and Gold layers.

Tech Stack: SQL • Data Modeling • ETL • Analytics

📊 SQL Analytics Repository

A comprehensive collection of SQL scripts covering:

Database Exploration
Measures & KPIs
Time-Series Analysis
Window Functions
Segmentation
Cumulative Analytics
Part-to-Whole Analysis

📚 Books ETL Pipeline

An end-to-end ETL pipeline that:

Scrapes book data
Cleans and transforms datasets
Loads data into SQLite & Parquet
Visualizes insights with Streamlit

Tech Stack: Python • BeautifulSoup • Pandas • SQLite • Streamlit

🎯 Currently Learning

Apache Kafka
Docker
Kubernetes
CI/CD for Data Engineering
Apache Iceberg
Delta Lake
Databricks
Advanced AWS Data Services

📈 GitHub Goals

Build production-ready Data Engineering projects
Share reusable SQL solutions
Explore modern Data Stack tools
Contribute to open-source projects
Document best practices for ETL and Analytics Engineering

📫 Connect With Me

💼 LinkedIn: (Add your LinkedIn URL)
📧 Email: rudreshjoshi99@gmail.com
🌐 GitHub: https://github.com/Rudresh99

"Turning raw data into reliable insights through scalable data engineering solutions."

Provide feedback

Saved searches

Use saved searches to filter your results more quickly