Numerical Column Stats¶
coding=utf-8 Build model for a dataset by identifying type of column along with its respective parameters.
-
class
dataprofiler.profilers.numerical_column_stats.
abstractstaticmethod
(function)¶ Bases:
staticmethod
-
class
dataprofiler.profilers.numerical_column_stats.
NumericStatsMixin
(options=None)¶ Bases:
object
Abstract numerical column profile subclass of BaseColumnProfiler. Represents a column in the dataset which is a text column. Has Subclasses itself.
Initialization of column base properties and itself.
- Parameters
options (NumericalOptions) – Options for the numerical stats.
-
type
= None¶
-
diff
(other_profile, options=None)¶ Finds the differences for several numerical stats.
- Parameters
other_profile (NumericStatsMixin Profile) – profile to find the difference with
- Returns
the numerical stats differences
- Return type
dict
-
property
mean
¶
-
property
variance
¶
-
property
stddev
¶
-
property
skewness
¶
-
property
kurtosis
¶
-
abstract
update
(df_series)¶ Abstract Method for updating the numerical profile properties with an uncleaned dataset.
- Parameters
df_series (pandas.core.series.Series) – df series with nulls removed
- Returns
None
-
static
is_float
(x)¶ For “0.80” this function returns True For “1.00” this function returns True For “1” this function returns True
- Parameters
x (str) – string to test
- Returns
if is float or not
- Return type
bool
-
static
is_int
(x)¶ For “0.80” This function returns False For “1.00” This function returns True For “1” this function returns True
- Parameters
x (str) – string to test
- Returns
if is integer or not
- Return type
bool
-
static
np_type_to_type
(val)¶ Converts numpy variables to base python type variables
- Parameters
val (numpy type or base type) – value to check & change
- Return val
base python type
- Rtype val
int or float