MKYZ is a powerful Python library that simplifies machine learning workflows, combining data processing, visualization, and model training in one intuitive package.
Automated data cleaning, preprocessing, and feature engineering with intelligent missing value handling.
Create stunning visualizations with over 40 customizable graph types for comprehensive data analysis.
Train and optimize models automatically with state-of-the-art algorithms and hyperparameter tuning.
MKYZ provides a comprehensive suite of tools designed to streamline your machine learning workflow, from data preparation to model deployment.
Transform your machine learning projects with automated workflows that handle everything from data preprocessing to model deployment, reducing development time by up to 80%.
Learn more about automationAutomatic handling of missing values, outliers, and feature scaling with intelligent defaults.
Create publication-ready plots and interactive dashboards with minimal code.
Automated model selection and hyperparameter optimization using state-of-the-art techniques.
Seamlessly integrate with existing ML workflows and popular frameworks.
Optimized for speed and efficiency.
Specialized tools for temporal data.
Robust model validation suite.
Flexible configuration options.
Get up and running with MKYZ in seconds. Just run the command above and you're ready to start!
View Quick Start GuideMKYZ simplifies machine learning workflows with just a few lines of code. Here's how you can get started:
Load and preprocess your dataset with automatic handling of missing values and encoding.
from mkyz import data_processing as dp
# Load and preprocess data
data = dp.prepare_data(
'data.csv',
target_column='target',
handle_missing=True,
encode_categorical=True
)
Output:
Dataset loaded successfully:
- Features: 15
- Samples: 1000
- Missing values handled: 23
- Categorical columns encoded: 4
Create insightful visualizations with a single line of code.
from mkyz import visualization as viz
# Create correlation matrix and feature importance plots
viz.visualize(
data=data,
plot_type=['corr', 'feature_importance'],
save_path='plots/'
)
Output:
Plots generated:
- correlation_matrix.png
- feature_importance.png
Saved to: plots/
Train your model with automated hyperparameter optimization.
from mkyz import training as tr
# Train model with automatic optimization
model = tr.train(
data=data,
task='classification',
model='rf',
optimize_params=True
)
Output:
Model trained successfully:
Accuracy: 0.95
F1 Score: 0.94
Best parameters found:
- n_estimators: 100
- max_depth: 15
Evaluate your model's performance with comprehensive metrics.
# Make predictions and evaluate
predictions = model.predict(data.test_x)
results = tr.evaluate(
true_values=data.test_y,
predictions=predictions,
task='classification'
)
print(results)
Output:
Classification Report:
precision recall f1-score
Class 0 0.96 0.94 0.95
Class 1 0.93 0.95 0.94
Accuracy: 0.945
MKYZ provides a complete suite of tools for your entire machine learning workflow, from data preparation to model deployment.
Comprehensive data preparation and preprocessing capabilities
Intelligent imputation strategies for both numerical and categorical data
Advanced statistical methods for identifying and handling anomalies
Automated feature creation and selection techniques
Schema validation and data quality checks
Automatic detection of data types and appropriate transformations
Automated cleanup of inconsistent or malformed data
Rich set of visualization tools for data analysis and model evaluation
Histograms, box plots, and distribution visualizations
Correlation matrices and relationship plots
Visual representation of feature significance
ROC curves, confusion matrices, and learning curves
Dynamic visualizations for real-time analysis
Flexible customization options for all plots
State-of-the-art algorithms and automated model training
Automated model selection and hyperparameter optimization
Wide range of pre-configured models for different tasks
Robust model validation and performance assessment
Advanced ensemble techniques for improved performance
Utilize pre-trained models for specific tasks
Tools for understanding model decisions and behavior
Additional tools and utilities for enhanced workflow
Create and manage end-to-end ML pipelines
Log and compare different model experiments
Tools for deploying models to production
Specialized tools for temporal data processing
Automatic GPU utilization when available
Easy integration with external services and APIs
MKYZ leverages the most powerful and popular Python libraries for machine learning and data science
Core programming language
Machine learning algorithms
Numerical computing foundation
Data manipulation and analysis
Data visualization library
Statistical data visualization
Scientific computing tools
Interactive visualizations
Hyperparameter optimization
Gradient boosting framework
Gradient boosting framework
Gradient boosting library