# MLCLI Examples

This directory contains example configurations and scripts to help you get started with MLCLI.

## Directory Structure

    examples/
    ├── README.md                    # This file
    ├── configs/                     # Example configuration files
    │   ├── random_forest.json
    │   ├── xgboost.json
    │   ├── logistic_regression.json
    │   ├── svm.json
    │   ├── tensorflow_dnn.json
    │   └── tuning/
    │       ├── tune_rf.json
    │       └── tune_xgb.json
    ├── data/                        # Sample datasets
    │   └── README.md
    └── notebooks/                   # Jupyter notebooks
        └── getting_started.ipynb

## Quick Start Examples

### 1. Train a Random Forest Model

    mlcli train --config examples/configs/random_forest.json

### 2. Hyperparameter Tuning

    mlcli tune --config examples/configs/tuning/tune_rf.json --method random --n-trials 20

### 3. Model Explanation

    mlcli explain --model models/rf_model.pkl --data data/test.csv --method shap

### 4. Preprocessing Pipeline

    mlcli preprocess --data data/raw.csv --output data/processed.csv --methods standard_scaler,select_k_best
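Once training has produced a pickle artifact (e.g. `models/rf_model.pkl` in the explain example above), you can load it back in plain Python. The sketch below uses a hypothetical `StubModel` class in place of a real trained model, just to show the pickle round-trip; the actual artifact's `predict` signature depends on which backend library MLCLI used.

```python
import pickle

# Hypothetical stand-in for a trained model; the real artifact would come
# from `mlcli train` (e.g. models/rf_model.pkl).
class StubModel:
    def predict(self, rows):
        return [0 for _ in rows]

# Save an artifact, then load it back the same way you would load
# models/rf_model.pkl.
with open("stub_model.pkl", "wb") as f:
    pickle.dump(StubModel(), f)

with open("stub_model.pkl", "rb") as f:
    model = pickle.load(f)

print(model.predict([[1.0, 2.0], [3.0, 4.0]]))  # -> [0, 0]
```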

## Sample Configurations

### Random Forest

    {
      "dataset": {
        "path": "data/your_data.csv",
        "type": "csv",
        "target_column": "target"
      },
      "model": {
        "type": "random_forest",
        "params": {
          "n_estimators": 100,
          "max_depth": 10,
          "min_samples_split": 2,
          "random_state": 42
        }
      },
      "training": {
        "test_size": 0.2,
        "random_state": 42
      },
      "output": {
        "model_dir": "models",
        "save_format": ["pickle", "onnx"]
      }
    }
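The `model.params` block is plain JSON, so it is easy to inspect programmatically. A minimal sketch, parsing the model section of the random forest config above with the standard library:

```python
import json

# Parse the "model" section of the random_forest.json example.
config = json.loads("""
{
  "model": {
    "type": "random_forest",
    "params": {
      "n_estimators": 100,
      "max_depth": 10,
      "min_samples_split": 2,
      "random_state": 42
    }
  }
}
""")

params = config["model"]["params"]
# MLCLI presumably forwards these as constructor keyword arguments,
# e.g. RandomForestClassifier(**params) if scikit-learn is the backend
# (an assumption; this README does not say which library is used).
print(params["n_estimators"])  # 100
```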

### XGBoost

    {
      "dataset": {
        "path": "data/your_data.csv",
        "type": "csv",
        "target_column": "target"
      },
      "model": {
        "type": "xgboost",
        "params": {
          "n_estimators": 100,
          "max_depth": 6,
          "learning_rate": 0.1,
          "random_state": 42
        }
      },
      "training": {
        "test_size": 0.2,
        "random_state": 42
      },
      "output": {
        "model_dir": "models",
        "save_format": ["pickle"]
      }
    }

### TensorFlow DNN

    {
      "dataset": {
        "path": "data/your_data.csv",
        "type": "csv",
        "target_column": "target"
      },
      "model": {
        "type": "tf_dnn",
        "params": {
          "hidden_layers": [128, 64, 32],
          "activation": "relu",
          "dropout_rate": 0.3,
          "learning_rate": 0.001
        }
      },
      "training": {
        "epochs": 100,
        "batch_size": 32,
        "validation_split": 0.2,
        "early_stopping": true,
        "patience": 10
      },
      "output": {
        "model_dir": "models",
        "save_format": ["keras", "h5"]
      }
    }

## Using Your Own Data

  1. Place your CSV file in the `data/` directory
  2. Update `dataset.path` in any config file
  3. Set the correct `target_column` name
  4. Run training!
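Steps 2 and 3 can also be scripted instead of edited by hand. A minimal sketch with the standard library, starting from the `dataset` section of the random forest example above (`my_data.csv`, `label`, and `my_config.json` are placeholder names, not files shipped with MLCLI):

```python
import json

# Dataset section copied from the random_forest.json example.
cfg = {
    "dataset": {
        "path": "data/your_data.csv",
        "type": "csv",
        "target_column": "target",
    }
}

cfg["dataset"]["path"] = "data/my_data.csv"    # step 2: point at your CSV
cfg["dataset"]["target_column"] = "label"      # step 3: your target column

with open("my_config.json", "w") as f:         # then: mlcli train --config my_config.json
    json.dump(cfg, f, indent=2)
```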

## Need Help?