Code Documentation: Plot Helper

mastml.plot_helper Module

This module contains a collection of functions which make plots (saved as png files) using matplotlib, generated from some model fits and cross-validation evaluation within a MAST-ML run.

This module also contains a method to create python notebooks containing plotted data and the relevant source code from this module, to enable the user to make their own modifications to the created plots in a straightforward way (useful for tweaking plots for a presentation or publication).

Functions

auc(x, y) Compute Area Under the Curve (AUC) using the trapezoidal rule.
ceil Return the ceiling of x as an Integral.
confusion_matrix(y_true, y_pred, *[, …]) Compute confusion matrix to evaluate the accuracy of a classification.
figaspect(arg) Calculate the width and height for a figure with a specified aspect ratio.
floor Return the floor of x as an Integral.
get_divisor(high, low) Method to obtain a sensible divisor based on range of two values
get_histogram_bins(y_df) Method to obtain the number of bins to use when plotting a histogram
ipynb_maker(plot_func) This method creates Jupyter Notebooks so user can modify and regenerate the plots produced by MAST-ML.
join(a, *p) Join two or more pathname components, inserting ‘/’ as needed.
log(x, [base=math.e]) Return the logarithm of x to the given base.
make_axes_locatable(axes)
make_axis_same(ax, max1, min1) Method to make the x and y ticks for each axis the same.
make_error_plots(run, path, …[, groups])
make_fig_ax([aspect_ratio, x_align, left]) Method to make matplotlib figure and axes objects.
make_fig_ax_square([aspect, aspect_ratio]) Method to make square shaped matplotlib figure and axes objects.
make_train_test_plots(run, path, …[, groups]) General plotting method used to execute sequence of specific plots of train-test data analysis
mark_inset(parent_axes, inset_axes, loc1, …) Draw a box to mark the location of an area represented by an inset axes.
nice_mean(ls) Method to return mean of a list or equivalent array with NaN values
nice_names()
nice_range(lower, upper) Method to create a range of values, including the specified start and end points, with nicely spaced intervals
nice_std(ls) Method to return standard deviation of a list or equivalent array with NaN values
parse_error_data(dataset_stdev, …)
plot_1d_heatmap(xs, heats, savepath[, …]) Method to plot a heatmap for values of a single variable; used for plotting GridSearch results in hyperparameter optimization.
plot_2d_heatmap(xs, ys, heats, savepath[, …]) Method to plot a heatmap for values of two variables; used for plotting GridSearch results in hyperparameter optimization.
plot_3d_heatmap(xs, ys, zs, heats, savepath) Method to plot a heatmap for values of three variables; used for plotting GridSearch results in hyperparameter optimization.
plot_average_cumulative_normalized_error(…) Method to plot the cumulative normalized residual errors of a model prediction
plot_average_normalized_error(y_true, …[, …]) Method to plot the normalized residual errors of a model prediction
plot_best_worst_per_point(y_true, …[, …]) Method to create a parity plot (predicted vs.
plot_best_worst_split(y_true, best_run, …) Method to create a parity plot (predicted vs.
plot_confusion_matrix(y_true, y_pred, …[, …]) Method used to generate a confusion matrix for a classification run.
plot_cumulative_normalized_error(y_true, …) Method to plot the cumulative normalized residual errors of a model prediction
plot_keras_history(model_history, savepath, …)
plot_learning_curve(train_sizes, train_mean, …) Method used to plot both data and feature learning curves
plot_learning_curve_convergence(train_sizes, …) Method used to plot both the convergence of data and feature learning curves as a function of amount of data or features
plot_metric_vs_group(metric, groups, stats, …) Method to plot the value of a particular calculated metric (e.g.
plot_metric_vs_group_size(metric, groups, …) Method to plot the value of a particular calculated metric (e.g.
plot_normalized_error(y_true, y_pred, …[, …]) Method to plot the normalized residual errors of a model prediction
plot_precision_recall_curve(y_true, y_pred, …) Method to calculate and plot the precision-recall curve for classification model results
plot_predicted_vs_true(train_quad, …) Method to create a parity plot (predicted vs.
plot_predicted_vs_true_bars(y_true, …[, …]) Method to calculate parity plot (predicted vs.
plot_real_vs_predicted_error(y_true, …)
plot_residuals_histogram(y_true, y_pred, …) Method to calculate and plot the histogram of residuals from regression model
plot_roc_curve(y_true, y_pred, savepath) Method to calculate and plot the receiver-operator characteristic curve for classification model results
plot_scatter(x, y, savepath[, groups, …]) Method to create a general scatter plot
plot_stats(fig, stats[, x_align, y_align, …]) Method that prints stats onto the plot.
plot_target_histogram(y_df, savepath[, …]) Method to plot the histogram of true y values
precision_recall_curve(y_true, probas_pred, *) Compute precision-recall pairs for different probability thresholds.
prediction_intervals(model, X, …) Method to calculate prediction intervals when using Random Forest and Gaussian Process regression models.
r2_score(y_true, y_pred, *[, sample_weight, …]) R^2 (coefficient of determination) regression score function.
recursive_max(arr) Method to recursively find the max value of an array of iterables.
recursive_max_and_min(arr) Method to recursively return max and min of values or iterables in array
recursive_min(arr) Method to recursively find the min value of an array of iterables.
roc_curve(y_true, y_score, *[, pos_label, …]) Compute Receiver operating characteristic (ROC).
round_down(num, divisor) Method to return a rounded down number
round_up(num, divisor) Method to return a rounded up number
rounder(delta) Method to obtain number of decimal places to report on plots
stat_to_string(name, value, nice_names) Method that converts a metric object into a string for displaying on a plot
trim_array(arr_list) Method used to trim a set of arrays to make all arrays the same shape
wraps(wrapped[, assigned, updated]) Decorator factory to apply update_wrapper() to a wrapper function
zoomed_inset_axes(parent_axes, zoom[, loc, …]) Create an anchored inset axes by scaling a parent axes.