ML Swiss Army Knife 🛠️

An intuitive, user-friendly machine learning application that simplifies the entire ML workflow from data analysis to model deployment. Built with Streamlit, this tool provides a comprehensive suite of ML capabilities accessible through a modern web interface.

🌟 Key Features

📊 Data Analysis & Preprocessing

Interactive data upload and preview
Automated data type detection and quality analysis
Missing value visualization and handling
Feature correlation analysis
Automated data preprocessing pipeline

🤖 Model Training & Evaluation

Traditional Models
- Ridge Regression
- Lasso Regression
- Random Forest
- Gradient Boosting
- SVR (Support Vector Regression)
Advanced Models
- XGBoost
- LightGBM
- Prophet (Time Series)
- SARIMA (Time Series)
Features
- Automated feature preprocessing
- Model performance metrics
- Cross-validation support
- Feature importance analysis
- Interactive parameter tuning

📈 Visualization

Feature distribution plots
Correlation heatmaps
Model performance comparisons
Prediction vs Actual plots
Time series forecasting plots

📚 Interactive Tutorial

Step-by-step guidance
Best practices
Troubleshooting tips
Real-world examples
Advanced topics

🚀 Quick Start

Prerequisites

Python 3.8 or higher
pip package manager

Installation

Clone the repository:

git clone https://github.com/yourusername/ml-swiss-army-knife.git
cd ml-swiss-army-knife

Create and activate a virtual environment:

# Windows
python -m venv venv
venv\Scripts\activate

# macOS/Linux
python -m venv venv
source venv/bin/activate

Install dependencies:

pip install -r requirements.txt

Launch the application:

streamlit run app.py

📖 Usage Guide

Basic Workflow

Data Upload

# Example CSV structure
date,feature1,feature2,target
2024-01-01,23.5,high,100
2024-01-02,24.1,low,95

Data Preprocessing

Select features for encoding
Handle missing values
Scale numerical features

Model Training

# Example model configuration
model_params = {
    "n_estimators": 100,
    "learning_rate": 0.1,
    "max_depth": 5
}

Evaluation & Prediction

View performance metrics
Analyze feature importance
Make predictions on new data

Advanced Features

Time Series Forecasting

# Example Prophet configuration
prophet_params = {
    "yearly_seasonality": True,
    "weekly_seasonality": True,
    "daily_seasonality": False
}

Custom Model Training

Parameter tuning
Cross-validation
Feature selection

🔧 Configuration

System Requirements

RAM: 8GB minimum (16GB recommended)
Storage: 1GB free space
Processor: Multi-core processor recommended

Environment Variables

# Optional configuration
STREAMLIT_SERVER_PORT=8501
STREAMLIT_SERVER_ADDRESS=localhost

📁 Project Structure

ml-swiss-army-knife/
├── .streamlit/               # Streamlit configuration
│   └── config.toml          # Theme and settings
├── app.py                   # Main application
├── requirements.txt         # Dependencies
├── README.md               # Documentation
├── styles.py               # Custom styling
└── modules/               # Application modules
    ├── data_analysis.py    # Data analysis functionality
    ├── time_series.py      # Time series analysis
    ├── model_training.py   # Model training functionality
    ├── predictions.py      # Prediction functionality
    └── tutorial.py         # Tutorial content

💻 Usage Guide

Start the application
Upload your data (CSV format)
Explore data insights automatically
Train models with guided selection
Make predictions with trained models

🔧 Configuration

Theme Configuration

Located in .streamlit/config.toml:

[theme]
primaryColor = "#7C3AED"
backgroundColor = "#FFFFFF"
secondaryBackgroundColor = "#F3F4F6"
textColor = "#111827"
font = "sans serif"

🔍 Example Use Cases

1. Sales Forecasting

# Sample data structure
sales_data = {
    'date': ['2024-01-01', '2024-01-02'],
    'sales': [1000, 1200],
    'promotion': ['yes', 'no']
}

2. Category Prediction

# Sample categorical features
category_data = {
    'feature1': ['A', 'B', 'C'],
    'feature2': [1, 2, 3],
    'target': ['cat1', 'cat2', 'cat1']
}

🤝 Contributing

We welcome contributions! Please follow these steps:

Fork the repository
Create a feature branch:

git checkout -b feature/AmazingFeature

Commit changes:

git commit -m 'Add AmazingFeature'

Push to branch:

git push origin feature/AmazingFeature

Open a Pull Request

🐛 Troubleshooting

Common Issues

Installation Problems

# If you encounter SSL errors
pip install --trusted-host pypi.org --trusted-host files.pythonhosted.org -r requirements.txt

Memory Issues

# Reduce memory usage
import pandas as pd
pd.read_csv('large_file.csv', nrows=1000)  # Load subset for testing

📊 Performance Tips

Large Datasets

Use chunked processing
Implement memory optimization
Consider data sampling

Model Training

Start with simple models
Use cross-validation
Monitor resource usage

📝 Version History

v1.0.0 (2024-01-01)
- Initial release
- Basic model support
- Data preprocessing
v1.1.0 (2024-02-01)
- Added time series support
- Improved visualization
- Bug fixes

📫 Support

Create an issue on GitHub
Email: [email protected]
Documentation: [Wiki Link]

📜 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Streamlit team
Scikit-learn community
All contributors

Made with ❤️ by [Mlawali]

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.streamlit		.streamlit
__pycache__		__pycache__
config		config
data		data
models		models
modules		modules
utils		utils
venv		venv
Documentation.md		Documentation.md
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
styles.py		styles.py
tutorial.md		tutorial.md

bomino/ML-SWAK

Folders and files

Latest commit

History

Repository files navigation

ML Swiss Army Knife 🛠️

🌟 Key Features

📊 Data Analysis & Preprocessing

🤖 Model Training & Evaluation

📈 Visualization

📚 Interactive Tutorial

🚀 Quick Start

Prerequisites

Installation

📖 Usage Guide

Basic Workflow

Advanced Features

Time Series Forecasting

Custom Model Training

🔧 Configuration

System Requirements

Environment Variables

📁 Project Structure

💻 Usage Guide

🔧 Configuration

Theme Configuration

🔍 Example Use Cases

1. Sales Forecasting

2. Category Prediction

🤝 Contributing

🐛 Troubleshooting

Common Issues

📊 Performance Tips

📝 Version History

📫 Support

📜 License

🙏 Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages