Reference

Functions

Utility functions

clean_columns()

Clean messy column names in a pandas dataframe.
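As a rough illustration of what "cleaning" a messy column name can involve, the sketch below converts a few names to snake case using only the standard library. The helper `to_snake_case` and the sample names are hypothetical and are not part of this package; the rules `clean_columns()` actually applies, and the case styles it supports, may differ.

```python
import re


def to_snake_case(name: str) -> str:
    """Hypothetical helper: normalise one messy column name to snake_case."""
    # Insert an underscore at camelCase boundaries: "lastName" -> "last_Name"
    name = re.sub(r"(?<=[a-z0-9])(?=[A-Z])", "_", name.strip())
    # Collapse every run of non-alphanumeric characters into one underscore
    name = re.sub(r"[^0-9a-zA-Z]+", "_", name)
    # Drop leading/trailing underscores and lower-case the result
    return name.strip("_").lower()


# Illustrative inputs only (not taken from the library's documentation)
messy = ["First Name", " lastName", "Total ($)", "date-of-birth"]
cleaned = [to_snake_case(column) for column in messy]
print(cleaned)
```

With a dataframe, the same idea would be applied to every entry in its column index instead of to a plain list.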

generate_test_data()

Generate a pandas dataframe with columns of several different data types.

skim()

Skim a pandas or polars dataframe and return visual summary statistics on it.

skim_get_data()

Skim a pandas or polars dataframe and return the summary statistics as a dictionary, without printing anything to the console.
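To give a feel for "summary statistics as a dictionary", here is a minimal standard-library sketch that summarises one numeric column. The helper `summarise_column`, the key names, and the sample values are assumptions made for illustration; they do not describe the structure that `skim_get_data()` returns.

```python
import statistics


def summarise_column(values):
    """Hypothetical helper: summary statistics for one numeric column."""
    # Treat None as a missing value
    present = [v for v in values if v is not None]
    n_missing = len(values) - len(present)
    # quantiles(n=4) returns three cut points: the 25%, 50% and 75% quantiles
    q25, q50, q75 = statistics.quantiles(present, n=4, method="inclusive")
    return {
        "missing": n_missing,
        "complete_rate": len(present) / len(values),
        "mean": statistics.mean(present),
        "p25": q25,
        "p50": q50,
        "p75": q75,
    }


# One entry per column; "length" is an illustrative column name
summary = {"length": summarise_column([1.0, 2.0, None, 4.0, 5.0])}
print(summary["length"]["mean"])
```

Returning a plain dictionary instead of printing makes the numbers easy to assert on in tests or to pass to other code.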

skim_get_figure()

Skim a pandas or polars dataframe, print the stats to the console, and save a version of the table as an SVG, HTML, or text file.

Constants

Module-level constants and data

CASE_STYLES
COMPLETE_COL
DATE_COL_FIRST
DATE_COL_LAST
HIST_BINS
MAX_COL_WIDTH
MIN_COL_WIDTH
MISSING_COL
NULL_VALUES
NUM_COL_MEAN
QUANTILES