Metadata-Version: 2.4
Name: mlforge-py
Version: 1.0.0
Summary: End-to-end ML pipeline library — data ingestion to deployment
Home-page: https://github.com/srinagadurga3455/mlforge
Author: srinagadurga3455
Keywords: machine learning pipeline preprocessing training evaluation deployment
Classifier: Programming Language :: Python :: 3
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Topic :: Scientific/Engineering :: Artificial Intelligence
Requires-Python: >=3.9
Description-Content-Type: text/markdown
License-File: LICENSE
Requires-Dist: pandas>=1.5.0
Requires-Dist: numpy>=1.23.0
Requires-Dist: scikit-learn>=1.2.0
Requires-Dist: pyyaml>=6.0
Requires-Dist: requests>=2.28.0
Provides-Extra: kaggle
Requires-Dist: kaggle>=1.5.0; extra == "kaggle"
Provides-Extra: huggingface
Requires-Dist: datasets>=2.0.0; extra == "huggingface"
Provides-Extra: xgboost
Requires-Dist: xgboost>=1.7.0; extra == "xgboost"
Provides-Extra: lightgbm
Requires-Dist: lightgbm>=3.3.0; extra == "lightgbm"
Provides-Extra: deployment
Requires-Dist: fastapi>=0.95.0; extra == "deployment"
Requires-Dist: uvicorn>=0.20.0; extra == "deployment"
Provides-Extra: databases
Requires-Dist: psycopg2-binary>=2.9.0; extra == "databases"
Requires-Dist: pymongo>=4.0.0; extra == "databases"
Requires-Dist: sqlalchemy>=2.0.0; extra == "databases"
Provides-Extra: cloud
Requires-Dist: boto3>=1.26.0; extra == "cloud"
Provides-Extra: streaming
Requires-Dist: kafka-python>=2.0.0; extra == "streaming"
Provides-Extra: images
Requires-Dist: Pillow>=9.0.0; extra == "images"
Provides-Extra: all
Requires-Dist: kaggle>=1.5.0; extra == "all"
Requires-Dist: datasets>=2.0.0; extra == "all"
Requires-Dist: xgboost>=1.7.0; extra == "all"
Requires-Dist: lightgbm>=3.3.0; extra == "all"
Requires-Dist: fastapi>=0.95.0; extra == "all"
Requires-Dist: uvicorn>=0.20.0; extra == "all"
Requires-Dist: psycopg2-binary>=2.9.0; extra == "all"
Requires-Dist: pymongo>=4.0.0; extra == "all"
Requires-Dist: sqlalchemy>=2.0.0; extra == "all"
Requires-Dist: boto3>=1.26.0; extra == "all"
Requires-Dist: kafka-python>=2.0.0; extra == "all"
Requires-Dist: Pillow>=9.0.0; extra == "all"
Dynamic: author
Dynamic: classifier
Dynamic: description
Dynamic: description-content-type
Dynamic: home-page
Dynamic: keywords
Dynamic: license-file
Dynamic: provides-extra
Dynamic: requires-dist
Dynamic: requires-python
Dynamic: summary

# mlforge Examples

Three complete end-to-end examples showing how to use mlforge.
Each one is fully commented — every line explains what it does and why.

## Examples

### 01 — Quickstart (5 minutes)
`python examples/01_quickstart.py`

The simplest possible pipeline. Generates data, cleans it, trains a model, makes predictions.
Start here if you're new to mlforge or ML.

### 02 — Titanic Survival Prediction (Classification)
`python examples/02_titanic.py`

Predicts who survived the Titanic disaster (yes/no).
Shows how to handle real messy data — missing values, text columns, duplicates.

Requires: `titanic.csv` in the same folder.
Download: https://raw.githubusercontent.com/datasciencedojo/datasets/master/titanic.csv

### 03 — House Price Prediction (Regression)
`python examples/03_house_prices.py`

Predicts house prices from features like size, bedrooms, and location.
Shows feature engineering, cross validation, hyperparameter tuning, and drift detection.

No download needed — generates realistic data automatically.

## What you'll learn

| Concept | Where it's shown |
|---|---|
| Loading data from a file | 02_titanic.py |
| Checking data quality | 02_titanic.py, 03_house_prices.py |
| Filling missing values | 02_titanic.py |
| Encoding text columns | 02_titanic.py, 03_house_prices.py |
| Handling outliers | 02_titanic.py, 03_house_prices.py |
| Normalising features | All examples |
| Creating new features | 03_house_prices.py |
| Training multiple models | 02_titanic.py, 03_house_prices.py |
| Comparing models | 02_titanic.py, 03_house_prices.py |
| Cross validation | 03_house_prices.py |
| Hyperparameter tuning | 03_house_prices.py |
| Saving models | 02_titanic.py, 03_house_prices.py |
| Drift detection | 03_house_prices.py |
