benchm-ml

This repository is designed to provide a minimal benchmark framework comparing commonly used machine learning libraries in terms of scalability, speed, and classification accuracy. The focus is on binary classification tasks without missing data, where inputs can be numeric or categorical (after one-hot encoding). It targets large scale settings by varying the number of observations (n) up to millions and the number of features (after expansion) to about a thousand, to stress test different implementations. The benchmarks cover algorithms like logistic regression, random forest, gradient boosting, and deep neural networks, and they compare across toolkits such as scikit-learn, R packages, xgboost, H2O, Spark MLlib, etc. The repository is structured in logical folders, each corresponding to algorithm categories.

Features

Comparative benchmarks across ML toolkits (scikit-learn, R, H2O, xgboost, Spark MLlib)
Algorithm coverage: logistic regression, random forests, boosting, deep neural nets
Scalable testing with large n (e.g. 10K → 10M) and p (~1K)
Synthetic data generation and real dataset integration (e.g. Higgs)
Structured folder organization by algorithm type
Runtime, memory, and accuracy measurement tools to compare implementations

Project Samples

Project Activity

See All Activity >

License

MIT License

Follow benchm-ml

benchm-ml Web Site

Other Useful Business Software

Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.

Try Free

Rate This Project

User Reviews

Be the first to post a review of benchm-ml!

Additional Project Details

Programming Language

Related Categories

R Libraries

Registered

2025-10-01

Similar Business Software

DHTMLX

DHTMLX is a JavaScript UI library that provides a set of highly customizable and flexible components for building modern and responsive web applications. The library includes more than 30 UI components, such as Gantt, Scheduler, Kanban, diagrams, charts, grids, spreadsheets, calendars, trees,...

See Software
Webix

JavaScript UI library and framework for speeding up web development. JS Framework for cross-platform web Apps development 102 UI widgets and feature-rich CSS / HTML5 JavaScript controls. Save at least 3000+ development hours by using ready-made widgets and UI controls. Develop Web UI 30% faster....

See Software
JsPlumb

JsPlumb is a Javascript diagramming library that allows you to rapidly create complex applications featuring visual connectivity without having to build any of the boring stuff: it provides pan/zoom, a minimap widget, automatic layouts, data binding, path finding, and much, much more. JsPlumb...

See Software
Kendo UI

Kendo UI is the ultimate collection of JavaScript UI components with libraries for jQuery, Angular, React, and Vue. Quickly build eye-catching, high-performance, responsive web applications—regardless of your JavaScript framework choice. Easily add advanced JavaScript components into your...

See Software
Bryntum

Bryntum is a leading provider of high-performance scheduling solutions for the web. Our suite of JavaScript components—including Gantt, Scheduler, Task Board, and Calendar—enables developers to build modern project management applications with features like drag-and-drop scheduling, resource...

See Software
Voca

The Voca library offers helpful functions to make string manipulations comfortable: change case, trim, pad, slugify, latinise, sprintf'y, truncate, escape and much more. The modular design allows to load the entire library, or individual functions to minimize the application builds. The library...

See Software