Build software better, together

imakash45 / rossmann-sales-forecasting

End-to-end Data Science project — Sales Analytics + LightGBM forecasting model + Interactive Plotly Dash dashboard for Rossmann Store Sales.

python data machine-learning dashboard eda kaggle data-analysis sales-forecasting data-science-projects lightgbm-models retail-analytics feature-engineering-ml plotly-dashboard

Updated May 21, 2026
Jupyter Notebook

Hamdaan-P / ML-Repo

Star

Comprehensive Machine Learning Portfolio: Real-world data science, classification, regression, and business analytics in Python

python machine-learning statistics scikit-learn regression eda kaggle classification data-analysis retail feature-engineering predictive-modeling business-analytics feature-engineering-ml

Updated Jun 4, 2026
Jupyter Notebook

myselfsukhendu09 / Dry-Bean-Type-Classification

Star

Automated classification of 7 different types of dry beans using machine learning techniques. This project leverages computer vision-extracted geometric and shape features (such as Area, Perimeter, and Shape Factors) to accurately identify bean varieties including Barbunya, Bombay, Cali, Dermason, Horoz, Seker, and Sira.

python scikit-learn classification-algorithm machine-learning-projects data-science-projects multiclass-classification-models exploratory-data-analysis-eda artificial-intelligence-and-machine-learning dry-bean-dataset feature-engineering-ml

Updated Jan 22, 2026
Jupyter Notebook

Akshay8087 / ScoutIQ-Football-Intelligence-Match-Prediction-Platform

Star

ScoutIQ is a football intelligence and match prediction platform that uses FIFA-style data to deliver scouting insights, team comparisons, EDA, feature engineering, ML model benchmarking, explainability, and Flask-based win probability predictions.

python flask data-science machine-learning eda xgboost lightgbm sports-analytics football-analytics match-predictions feature-engineering-ml

Updated Jun 10, 2026
Jupyter Notebook

voyager2005 / sensera-poc

Star

Open-source proof-of-concept repository for Sensera, developed as Founding ML Engineer, implementing an end-to-end pipeline from data ingestion and epoch-based feature extraction to anomaly detection, risk scoring, and explainable analysis. Built using scikit-learn and tensorflow

python machine-learning research deep-learning tensorflow scikit-learn poc predictive-modeling anomaly-detection probabilistic-modeling explainable-ai r-and-d sequence-modeling feature-engineering-ml

Updated Apr 10, 2026
Jupyter Notebook

Sparkydev007 / Credit-Risk-Prediction-System

Star

Credit Risk Prediction System is an end-to-end machine learning project that predicts loan default risk using customer financial data. It applies EDA, feature encoding, and advanced models like Random Forest and XGBoost, and is deployed via Streamlit for real-time credit risk assessment.

random-forest xgboost binary-classification feature-engineering-ml

Updated Feb 8, 2026
Jupyter Notebook

C-Saha958 / Customer-Opex-Optimization-Analysis

Star

Retail margin restoration project: Identified and eliminated 1.42% margin erosion by fixing flawed discount and retention Opex allocation.

Updated May 8, 2026
HTML

BrentOchieng / british-airways-booking-prediction-ml

Star

Predicting customer booking completion for British Airways using Random Forest & XGBoost. Built as part of the Forage Data Science Virtual Job Simulation. Covers EDA, feature engineering, class imbalance handling, threshold tuning, and ROC-AUC model comparison.

classification-algorithm random-forest-classifier machine-learning-projects sklearn-library data-science-projects xgboost-classifier imbalanced-data-handling feature-engineering-ml forage-simulation british-airways-forage

Updated Jun 4, 2026
Jupyter Notebook

TejashwiniSaravanan / Healthcare-Analytics-PySpark-ML-GCP-Strategy

Star

Two-part project combining a PySpark MLlib pipeline (83.12% accuracy) with a GCP cloud architecture proposal for real-time patient monitoring. Covers feature engineering, Random Forest classification, and HIPAA-compliant healthcare infrastructure using BigQuery, Vertex AI, and Cloud Healthcare API.

python bigquery machine-learning apache-spark random-forest pyspark predictive-modeling hipaa google-cloud-platform healthcare-analytics vertex-ai feature-engineering-ml

Updated May 6, 2026
Jupyter Notebook

mwasifanwar / automl_framework

Star

Comprehensive AutoML framework that automates data preprocessing, feature engineering, model selection, hyperparameter tuning, and deployment. Features neural architecture search and automated data cleaning pipelines.

python data-science machine-learning scikit-learn machine-learning-algorithms hyperparameter-optimization feature-engineering automl machine-learning-models mlops data-science-projects scikit-learn-python automl-algorithms feature-engineering-algorithm mlops-workflow feature-engineering-ml

Updated Nov 5, 2025
Python

mohsin1782005 / Laptop-Price-Predictor

Star

An end-to-end Machine Learning project predicting laptop prices using hardware specs. Includes advanced data cleaning, Feature Engineering (Regex for Resolution, Touchscreen extraction), and benchmarking between Linear Regression and Random Forest Regressors. Achieved a 14% improvement in MAE via ensemble modeling. Built with Python & Scikit-Learn.

python data-science machine-learning random-forest scikit-learn pandas predictive-modeling regression-models data-analysis-python feature-engineering-ml

Updated Feb 18, 2026
Jupyter Notebook

Farhood-2025 / House-Price-Prediction-Regression

Star

An end-to-end machine learning project for predicting house prices using regression models, feature engineering, and hyperparameter tuning.

python data-science machine-learning random-forest numpy scikit-learn regression pandas house-price-prediction feature-engineering-ml

Updated Jun 27, 2026
Jupyter Notebook

IhsanSA / Machine-Learning-Linear-vs-Nonlinear-Regression-for-Building-Energy-Efficiency-Prediction

Star

Energy prediction model that compares linear and nonlinear regression for building efficiency.

jupyter-notebook predictive-modeling linear-regression-models supervised-machine-learning model-comparison machine-learning-projects model-evaluation-metrics performance-metrics-calculation feature-engineering-ml nonlinear-regression-model

Updated May 8, 2026
Jupyter Notebook

ayush-gangwar-09 / Machine-Learning-Homework-Task-IIOT4

Star

This repository contains my machine learning homework tasks and their implementations. It includes data preprocessing, feature engineering, model training, evaluation, and prediction pipelines using Python and popular ML libraries.

data-science python-3 machine-learning-models model-training-and-evaluation data-preprocessing-and-cleaning feature-engineering-ml

Updated Mar 26, 2026
Jupyter Notebook

Arif-1411 / Data-analysis

Star

Exploratory data analysis projects using Python, Pandas, NumPy, Matplotlib, and Seaborn. Covers data cleaning, visualization, statistical analysis, and insight extraction from real-world datasets.

numpy pandas data-visualization seaborn matplotlib data-wrangling data-preprocessing feature-engineering feature-engineering-ml

Updated Mar 7, 2026
Jupyter Notebook

DeemonDuck / IPO-Sentinel---Listing_Gain_Prediction

Star

ML-based IPO listing gain predictor using subscription demand and market sentiment, with planned extensions including grey market premium and company fundamentals for improved investment decision insights.

data-science machine-learning scikit-learn python3 xgboost classification lightgbm predictive-modeling finance-management ipo stock-market-analysis catboost model-evaluation-metrics feature-engineering-ml

Updated Apr 23, 2026
Jupyter Notebook

DeebeshS-ML / customer-churn-prediction

Star

Customer churn prediction using machine learning classification models and feature engineering.

python machine-learning random-forest scikit-learn pandas seaborn xgboost classification logistic-regression data-science-projects matplotlib-pyplot customer-churn feature-engineering-ml

Updated Jun 11, 2026
Jupyter Notebook

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature-engineering-ml

Here are 17 public repositories matching this topic...

imakash45 / rossmann-sales-forecasting

Hamdaan-P / ML-Repo

myselfsukhendu09 / Dry-Bean-Type-Classification

Akshay8087 / ScoutIQ-Football-Intelligence-Match-Prediction-Platform

voyager2005 / sensera-poc

Sparkydev007 / Credit-Risk-Prediction-System

C-Saha958 / Customer-Opex-Optimization-Analysis

BrentOchieng / british-airways-booking-prediction-ml

TejashwiniSaravanan / Healthcare-Analytics-PySpark-ML-GCP-Strategy

mwasifanwar / automl_framework

mohsin1782005 / Laptop-Price-Predictor

Farhood-2025 / House-Price-Prediction-Regression

IhsanSA / Machine-Learning-Linear-vs-Nonlinear-Regression-for-Building-Energy-Efficiency-Prediction

ayush-gangwar-09 / Machine-Learning-Homework-Task-IIOT4

Arif-1411 / Data-analysis

DeemonDuck / IPO-Sentinel---Listing_Gain_Prediction

DeebeshS-ML / customer-churn-prediction

Improve this page

Add this topic to your repo