Int Column Profile

class dataprofiler.profilers.int_column_profile.IntColumn(name, options=None)

Bases: dataprofiler.profilers.numerical_column_stats.NumericStatsMixin, dataprofiler.profilers.base_column_profilers.BaseColumnPrimitiveTypeProfiler

Integer column profile mixin with of numerical stats. Represents a column in the dataset which is an integer column.

Initialization of column base properties and itself.

Parameters
  • name (String) – Name of the data

  • options (IntOptions) – Options for the integer column

type = 'int'
property profile

Property for profile. Returns the profile of the column.

Returns

property data_type_ratio

Calculates the ratio of samples which match this data type.

Returns

ratio of data type

Return type

float

update(df_series)

Updates the column profile.

Parameters

df_series (pandas.core.series.Series) – df series

Returns

None

col_type = None
diff(other_profile, options=None)

Finds the differences for several numerical stats.

Parameters

other_profile (NumericStatsMixin Profile) – profile to find the difference with

Returns

the numerical stats differences

Return type

dict

static is_float(x)

For “0.80” this function returns True For “1.00” this function returns True For “1” this function returns True

Parameters

x (str) – string to test

Returns

if is float or not

Return type

bool

static is_int(x)

For “0.80” This function returns False For “1.00” This function returns True For “1” this function returns True

Parameters

x (str) – string to test

Returns

if is integer or not

Return type

bool

property kurtosis
property mean
static np_type_to_type(val)

Converts numpy variables to base python type variables

Parameters

val (numpy type or base type) – value to check & change

Return val

base python type

Rtype val

int or float

property skewness
property stddev
property variance