HarshithNR02/README.md

Hi, I'm Harshith Nerlikere Ramesh 👋

🎓 MS in Data Science, UMass Dartmouth (August 2026) | 🔭 Open to Data Scientist / ML Engineer / Data Analyst roles

I build end-to-end machine learning systems, from raw data and feature engineering through deployment, serving, and monitoring. I care about models that are honest, reproducible, and defensible, not just high-scoring.


πŸ› οΈ Tech Stack

Languages: Python, SQL
ML / Modeling: LightGBM, XGBoost, scikit-learn, SHAP, Optuna, Isolation Forest
LLM / AI: OpenAI API, LangChain, RAG, FAISS, Text-to-SQL
MLOps: MLflow, Docker, FastAPI, GitHub Actions (CI/CD), Evidently (drift monitoring)
Data: PostgreSQL, Kafka, DuckDB, Parquet, Azure Blob Storage
Apps: Streamlit


🔭 Featured Projects

End-to-end platform on 6M+ real deliveries (Cainiao LaDe, 5 cities). Two models running side by side: a supervised LightGBM disruption classifier and an unsupervised Isolation Forest anomaly detector. Includes a Kafka streaming pipeline, a FastAPI serving layer with a feature store, a hybrid Text-to-SQL + FAISS RAG assistant, a simulated A/B test (two-proportion z-test), Evidently drift monitoring, and a live operational dashboard deployed on AWS EC2.
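The two-proportion z-test mentioned above can be sketched as follows. This is a minimal, standard-library illustration of the pooled two-sided test, not the project's actual code; the function name and the example counts are hypothetical.

```python
import math


def two_proportion_ztest(successes_a, n_a, successes_b, n_b):
    """Pooled, two-sided two-proportion z-test. Returns (z, p_value)."""
    p_a = successes_a / n_a
    p_b = successes_b / n_b
    # Pooled proportion under the null hypothesis p_a == p_b.
    pooled = (successes_a + successes_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_a - p_b) / se
    # Two-sided p-value for a standard normal: 2 * (1 - Phi(|z|)) = erfc(|z| / sqrt(2)).
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


# Hypothetical counts: disrupted deliveries out of total, control vs. treatment.
z, p = two_proportion_ztest(120, 1000, 150, 1000)
```

A small p-value suggests the two arms differ in disruption rate; with simulated traffic, the result only validates the test procedure, not a real-world effect.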

Key finding: 30 couriers (0.6% of the network) generated 20% of all anomalies, turning 66K raw alerts into a 30-courier action list.

LightGBM churn model on 970K users × 110 features (time-based validation, AUC 0.9481). CLV-based segmentation into 5 actionable cohorts, deployed on Hugging Face Spaces with CI/CD.

LightGBM delay classifier on 18.2M US domestic flights with strict temporal validation (AUC 0.8629), SHAP analysis showing real_time_turn_gap as the #1 predictor, cascade/route-risk scoring, and a $2.78B annualized savings estimate. FastAPI + Docker + GitHub Actions CI/CD.
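As an illustration of what strict temporal validation means here, the sketch below splits records at a cutoff date so that no test row predates any training row. It assumes each record carries a date field; the field name `flight_date`, the cutoff, and the sample rows are hypothetical and not taken from the project.

```python
from datetime import date


def temporal_split(rows, cutoff, date_key="flight_date"):
    """Split records so training data strictly precedes the cutoff date."""
    train = [r for r in rows if r[date_key] < cutoff]
    test = [r for r in rows if r[date_key] >= cutoff]
    return train, test


# Hypothetical records; a real pipeline would read these from Parquet or SQL.
rows = [
    {"flight_date": date(2023, 11, 30), "delayed": 0},
    {"flight_date": date(2024, 1, 5), "delayed": 1},
    {"flight_date": date(2024, 2, 10), "delayed": 0},
]
train, test = temporal_split(rows, cutoff=date(2024, 1, 1))
```

Unlike a random split, this keeps future information out of training, which is why temporally validated AUC figures tend to be lower but closer to production behavior.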

LightGBM global model (WRMSSE 0.4681) across 30,490 item-store series, beating AutoARIMA and 7 other baselines. PostgreSQL business insights and a LangChain + GPT-4o-mini natural language query interface.
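For readers unfamiliar with the metric, here is a simplified sketch of RMSSE and its weighted sum. It scales forecast error by the in-sample error of a one-step naive forecast. The official M5 WRMSSE also trims leading zeros before the first sale, derives weights from recent dollar sales, and aggregates over multiple hierarchy levels; all of that is omitted here. Function names and inputs are hypothetical.

```python
import math


def rmsse(train, actual, forecast):
    """Root mean squared scaled error for a single series."""
    n = len(train)
    h = len(actual)
    # Scale: mean squared error of the one-step naive forecast on the training data.
    scale = sum((train[t] - train[t - 1]) ** 2 for t in range(1, n)) / (n - 1)
    mse = sum((a - f) ** 2 for a, f in zip(actual, forecast)) / h
    return math.sqrt(mse / scale)


def wrmsse(series, weights):
    """Weighted sum of per-series RMSSE; weights are assumed to sum to 1."""
    return sum(w * rmsse(*s) for s, w in zip(series, weights))


# Hypothetical toy series: (train history, actuals, forecasts).
s = ([1, 2, 3, 4], [5, 6], [5, 8])
score = wrmsse([s, s], [0.5, 0.5])
```

An RMSSE below 1 means the forecast beats the naive baseline's in-sample error for that series, so lower is better.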


🌐 Connect

Popular repositories

  1. DataScinceNotebooks (Public)

    Jupyter Notebook

  2. jjj (Public)

  3. Customer-Satisfaction-Prediction (Public)

  4. medichat (Public)

    Python

  5. walmart_M5_Demand_Forecasting (Public)

    End-to-end Walmart sales forecasting on the M5 dataset: LightGBM global model (WRMSSE 0.4681) across 30,490 item-store series, PostgreSQL business insights, and a LangChain + Streamlit natural lan…

    Jupyter Notebook

  6. FLIGHT_DELAY_INTELLIGENCE_PLATFORM (Public)

    Predicts US domestic flight delays on 18.2M flights: LightGBM AUC 0.8629, SHAP explainability, cascade propagation, $2.78B cost-benefit analysis, deployed on Hugging Face Spaces.

    Jupyter Notebook