The power of machine learning comes from its ability to learn patterns from large amounts of data. Understanding your data is critical to building a powerful machine learning system. Facets contains two robust visualizations to aid in understanding and analyzing machine learning datasets. Get a sense of the shape of each feature of your dataset using Facets Overview, or explore individual observations using Facets Dive. Explore Facets Overview and Facets Dive on the UCI Census Income dataset, used for predicting whether an individual’s income exceeds $50K/yr based on their census data. The census data contains features such as age, education level, and occupation for each individual. Overview gives users a quick understanding of the distribution of values across the features of their dataset(s). Uncover several uncommon and common issues such as unexpected feature values, missing feature values for a large number of observation, training/serving skew and train/test/validation set skew.
Features
- Overview takes input feature data from any number of datasets, analyzes them feature by feature and visualizes the analysis
- Dive is a tool for interactively exploring large numbers of data points at once
- Dive provides an interactive interface for exploring the relationship between data points across all of the different features of a dataset
- Each individual item in the visualization represents a data point
- Better data leads to better models
- Uncover several uncommon and common issues such as unexpected feature values