Base Column Profilers

Profiles the data.

class dataprofiler.profilers.base_column_profilers.BaseColumnProfiler(name)

Bases: object

Abstract class for profiling a column of data.

Initialization of base class properties for the subclass.

Parameters

name (String) – Name of the dataset

col_type = None
diff(other_profile, options=None)

Finds the differences for columns.

Parameters

other_profile (BaseColumnProfiler) – profile to find the difference with

Returns

the stat differences

Return type

dict

abstract update(df_series)

Abstract method for updating the profile. Subclasses must implement it.

Parameters

df_series (pandas DataFrame) – Data to profile.

abstract property profile

Property for profile. Returns the profile of the column.
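
The abstract update and profile members above define the contract a concrete column profiler has to fulfil. The following is a minimal sketch of that pattern, not the library's implementation: ColumnProfilerSketch and CountProfiler are hypothetical stand-ins, the sample_size statistic is invented for illustration, and plain lists stand in for pandas data so the example runs with the standard library alone.

```python
from abc import ABC, abstractmethod


class ColumnProfilerSketch(ABC):
    """Hypothetical stand-in that mirrors the documented interface."""

    col_type = None

    def __init__(self, name):
        self.name = name

    @abstractmethod
    def update(self, df_series):
        """Update the profile with a new batch of column values."""

    @property
    @abstractmethod
    def profile(self):
        """Return the profile of the column."""


class CountProfiler(ColumnProfilerSketch):
    """Hypothetical subclass that only counts the values it has seen."""

    col_type = "count"

    def __init__(self, name):
        super().__init__(name)
        self.sample_size = 0

    def update(self, df_series):
        # Accumulate across calls so the data can arrive in batches.
        self.sample_size += len(df_series)
        return self

    @property
    def profile(self):
        return {"sample_size": self.sample_size}


profiler = CountProfiler("age")
profiler.update([31, 45, 27])
profiler.update([52])
print(profiler.profile)
```

Because both members are abstract, instantiating the base class directly raises TypeError; only a subclass that implements update and profile can be created.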

class dataprofiler.profilers.base_column_profilers.BaseColumnPrimitiveTypeProfiler(name)

Bases: dataprofiler.profilers.base_column_profilers.BaseColumnProfiler

Abstract class for profiling the primitive data type for a column of data.

Initialization of base class properties for the subclass.

Parameters

name (String) – Name of the data

col_type = None
diff(other_profile, options=None)

Finds the differences for columns.

Parameters

other_profile (BaseColumnProfiler) – profile to find the difference with

Returns

the stat differences

Return type

dict

abstract property profile

Property for profile. Returns the profile of the column.

abstract update(df_series)

Abstract method for updating the profile. Subclasses must implement it.

Parameters

df_series (pandas DataFrame) – Data to profile.
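
The diff(other_profile, options=None) method documented for both classes returns a dict of stat differences. The sketch below illustrates one way such a method could behave. It is an assumption-laden example rather than the library's logic: StatsProfileSketch is a hypothetical class, the "self minus other" convention and the min/max statistics are invented for illustration, and options is accepted only to match the documented signature.

```python
class StatsProfileSketch:
    """Hypothetical profile holder; not part of dataprofiler."""

    def __init__(self, name, stats):
        self.name = name
        self.stats = dict(stats)

    def diff(self, other_profile, options=None):
        # options mirrors the documented signature but is ignored here.
        if not isinstance(other_profile, type(self)):
            raise TypeError(
                "Cannot diff %s with %s"
                % (type(self).__name__, type(other_profile).__name__)
            )
        # Assumed convention: this profile's value minus the other's,
        # computed only for statistics that both profiles contain.
        shared = self.stats.keys() & other_profile.stats.keys()
        return {
            key: self.stats[key] - other_profile.stats[key]
            for key in sorted(shared)
        }


before = StatsProfileSketch("age", {"min": 18, "max": 70})
after = StatsProfileSketch("age", {"min": 21, "max": 65})
differences = before.diff(after)
print(differences)
```

Checking the type of other_profile first keeps the comparison meaningful: differences between profiles of unrelated column types would not describe the same statistics.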