Predicting-Insurance-Charges-Using-Machine-Learning

This project predicts individual insurance charges using a Random Forest Regressor model. The model takes features like age, BMI, number of children, smoking status, sex, and region to estimate medical insurance costs.

Features Used

Numerical: age, bmi, children
Categorical: sex, smoker, region

Dataset

The dataset used is the popular Insurance Dataset from Kaggle.

Requirements

Numpy
Pandas
scikit-learn
joblib

How to Run

Ensure insurance.csv is in the project folder.
Run the main script:
If the model does not exist, it will train the model and save it as model.pkl and pipeline.pkl.
If the model already exists, it will load the model and run inference on test.csv.
Output predictions will be saved to output.csv with columns:
predicted_charges → model predictions
actual_charges → actual charges (from test set)

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
main.py		main.py
model.pkl		model.pkl
output.csv		output.csv
pipeline.pkl		pipeline.pkl
test.csv		test.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Predicting-Insurance-Charges-Using-Machine-Learning

Features Used

Dataset

Requirements

How to Run

About

Uh oh!

Releases

Packages

Languages

Kritank07/Predicting-Insurance-Charges-Using-Machine-Learning

Folders and files

Latest commit

History

Repository files navigation

Predicting-Insurance-Charges-Using-Machine-Learning

Features Used

Dataset

Requirements

How to Run

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages