Base Column Profilers¶
coding=utf-8
Profiles the data.
- class dataprofiler.profilers.base_column_profilers.BaseColumnProfiler(name)¶
Bases:
object
Abstract class for profiling a column of data.
Initialization of base class properties for the subclass.
- Parameters
name (String) – Name of the dataset
- col_type = None¶
- diff(other_profile, options=None)¶
Finds the differences for columns.
- Parameters
other_profile (BaseColumnProfiler) – profile to find the difference with
- Returns
the stat differences
- Return type
dict
- abstract update(df_series)¶
Private abstract method for updating the profile.
- Parameters
df_series (Pandas Dataframe) – Data to profile.
- abstract property profile¶
Property for profile. Returns the profile of the column.
- abstract report(remove_disabled_flag=False)¶
Private abstract method for returning report.
- Parameters
remove_disabled_flag (boolean) – flag to determine if disabled options should be excluded in the report.
- class dataprofiler.profilers.base_column_profilers.BaseColumnPrimitiveTypeProfiler(name)¶
Bases:
dataprofiler.profilers.base_column_profilers.BaseColumnProfiler
Abstract class for profiling the primative data type for a column of data.
Initialization of base class properties for the subclass.
- Parameters
name (String) – Name of the data
- col_type = None¶
- diff(other_profile, options=None)¶
Finds the differences for columns.
- Parameters
other_profile (BaseColumnProfiler) – profile to find the difference with
- Returns
the stat differences
- Return type
dict
- abstract property profile¶
Property for profile. Returns the profile of the column.
- abstract report(remove_disabled_flag=False)¶
Private abstract method for returning report.
- Parameters
remove_disabled_flag (boolean) – flag to determine if disabled options should be excluded in the report.
- abstract update(df_series)¶
Private abstract method for updating the profile.
- Parameters
df_series (Pandas Dataframe) – Data to profile.