Authors:
Abhishek Deore
Daniel Shevelev
Kashyap Nadendla
Shreya Kolte
Ram Dheeraj Kamarajugadda
Gender
Smoking status and age relation
Age vs average glucose level
Heart disease and strokes
Correlation Heat Map

Approach
Logistic Regression
K-Nearest Neighbors (KNN)
Naive Bayes
Decision Tree
Random Forest
Approach
| Feature | Coefficient |
|---|---|
| age_band | 1.187519 |
| avg_glucose_level | 0.880860 |
| hypertension | 0.551390 |
| heart_diesease | 0.288187 |
| gender | 0.073007 |
| smoking_status | -0.053451 |
| bmi | -0.211967 |
Age is a big factor in stroke and has a positive coefficient above 1.
Average glucose level is the second most significant feature with a coefficient of 0.88
BMI coefficient is -0.21. Higher BMI is associated with lower likelihood of the target variable.
Class imbalance where one class (e.g stroke) significantly outnumbers the other, chances of leading to biased models
Choosing an appropriate model that balances complexity and interpretability