MKYZ

All train just one line

What is MKYZ?

MKYZ is a powerful Python library that simplifies machine learning workflows, combining data processing, visualization, and model training in one intuitive package.

20+
ML Models
40+
Visualizations
1000+
Users

Data Processing

Automated data cleaning, preprocessing, and feature engineering with intelligent missing value handling.

Visualization

Create stunning visualizations with over 40 customizable graph types for comprehensive data analysis.

Machine Learning

Train and optimize models automatically with state-of-the-art algorithms and hyperparameter tuning.

Key Features

MKYZ provides a comprehensive suite of tools designed to streamline your machine learning workflow, from data preparation to model deployment.

Automated Workflow

Transform your machine learning projects with automated workflows that handle everything from data preprocessing to model deployment, reducing development time by up to 80%.

Learn more about automation

Smart Preprocessing

Automatic handling of missing values, outliers, and feature scaling with intelligent defaults.

Advanced Visualization

Create publication-ready plots and interactive dashboards with minimal code.

AutoML Integration

Automated model selection and hyperparameter optimization using state-of-the-art techniques.

Easy Integration

Seamlessly integrate with existing ML workflows and popular frameworks.

High Performance

Optimized for speed and efficiency.

Time Series

Specialized tools for temporal data.

Validation

Robust model validation suite.

Customizable

Flexible configuration options.

Get Started with MKYZ

$pip install mkyz

Get up and running with MKYZ in seconds. Just run the command above and you're ready to start!

View Quick Start Guide

Comprehensive Documentation

Documentation Sections

  • Data Processing Guide
  • Visualization Reference
  • Model Training Tutorial
  • API Reference

Resources

Find everything you need to master MKYZ in our detailed documentation.

Read the Docs

How to Use MKYZ

MKYZ simplifies machine learning workflows with just a few lines of code. Here's how you can get started:

Data Preparation

Load and preprocess your dataset with automatic handling of missing values and encoding.

from mkyz import data_processing as dp

# Load and preprocess data
data = dp.prepare_data(
    'data.csv',
    target_column='target',
    handle_missing=True,
    encode_categorical=True
)

Output:

Dataset loaded successfully:
- Features: 15
- Samples: 1000
- Missing values handled: 23
- Categorical columns encoded: 4
Data Visualization

Create insightful visualizations with a single line of code.

from mkyz import visualization as viz

# Create correlation matrix and feature importance plots
viz.visualize(
    data=data,
    plot_type=['corr', 'feature_importance'],
    save_path='plots/'
)

Output:

Plots generated:
- correlation_matrix.png
- feature_importance.png
Saved to: plots/
Model Training

Train your model with automated hyperparameter optimization.

from mkyz import training as tr

# Train model with automatic optimization
model = tr.train(
    data=data,
    task='classification',
    model='rf',
    optimize_params=True
)

Output:

Model trained successfully:
Accuracy: 0.95
F1 Score: 0.94
Best parameters found:
- n_estimators: 100
- max_depth: 15
Model Evaluation

Evaluate your model's performance with comprehensive metrics.

# Make predictions and evaluate
predictions = model.predict(data.test_x)
results = tr.evaluate(
    true_values=data.test_y,
    predictions=predictions,
    task='classification'
)
print(results)

Output:

Classification Report:
              precision    recall  f1-score
Class 0       0.96        0.94    0.95
Class 1       0.93        0.95    0.94
Accuracy: 0.945

Comprehensive Capabilities

MKYZ provides a complete suite of tools for your entire machine learning workflow, from data preparation to model deployment.

Data Processing

Comprehensive data preparation and preprocessing capabilities

Missing Value Handling

Intelligent imputation strategies for both numerical and categorical data

Outlier Detection

Advanced statistical methods for identifying and handling anomalies

Feature Engineering

Automated feature creation and selection techniques

Data Validation

Schema validation and data quality checks

Type Inference

Automatic detection of data types and appropriate transformations

Data Cleaning

Automated cleanup of inconsistent or malformed data

Visualization

Rich set of visualization tools for data analysis and model evaluation

Statistical Plots

Histograms, box plots, and distribution visualizations

Correlation Analysis

Correlation matrices and relationship plots

Feature Importance

Visual representation of feature significance

Model Performance

ROC curves, confusion matrices, and learning curves

Interactive Dashboards

Dynamic visualizations for real-time analysis

Custom Styling

Flexible customization options for all plots

Machine Learning

State-of-the-art algorithms and automated model training

AutoML

Automated model selection and hyperparameter optimization

Model Zoo

Wide range of pre-configured models for different tasks

Cross-Validation

Robust model validation and performance assessment

Ensemble Methods

Advanced ensemble techniques for improved performance

Transfer Learning

Utilize pre-trained models for specific tasks

Model Interpretability

Tools for understanding model decisions and behavior

Advanced Features

Additional tools and utilities for enhanced workflow

Pipeline Management

Create and manage end-to-end ML pipelines

Experiment Tracking

Log and compare different model experiments

Model Deployment

Tools for deploying models to production

Time Series Analysis

Specialized tools for temporal data processing

GPU Acceleration

Automatic GPU utilization when available

API Integration

Easy integration with external services and APIs

Powered By

MKYZ leverages the most powerful and popular Python libraries for machine learning and data science

Python logo

Python

Core programming language

scikit-learn logo

scikit-learn

Machine learning algorithms

NumPy logo

NumPy

Numerical computing foundation

pandas logo

pandas

Data manipulation and analysis

Matplotlib logo

Matplotlib

Data visualization library

Seaborn logo

Seaborn

Statistical data visualization

SciPy logo

SciPy

Scientific computing tools

Plotly logo

Plotly

Interactive visualizations

Optuna logo

Optuna

Hyperparameter optimization

XGBoost logo

XGBoost

Gradient boosting framework

LightGBM logo

LightGBM

Gradient boosting framework

CatBoost logo

CatBoost

Gradient boosting library

Ready to Elevate Your Machine Learning Projects?