dataprofiler.profilers package
Subpackages
Submodules
- dataprofiler.profilers.base_column_profilers module
  - BaseColumnProfiler
  - BaseColumnPrimitiveTypeProfiler
    - BaseColumnPrimitiveTypeProfiler.sample_size
    - BaseColumnPrimitiveTypeProfiler.col_type
    - BaseColumnPrimitiveTypeProfiler.diff()
    - BaseColumnPrimitiveTypeProfiler.load_from_dict()
    - BaseColumnPrimitiveTypeProfiler.profile
    - BaseColumnPrimitiveTypeProfiler.report()
    - BaseColumnPrimitiveTypeProfiler.update()
    - BaseColumnPrimitiveTypeProfiler.name
    - BaseColumnPrimitiveTypeProfiler.metadata
    - BaseColumnPrimitiveTypeProfiler.times
    - BaseColumnPrimitiveTypeProfiler.thread_safe
- dataprofiler.profilers.categorical_column_profile module
  - CategoricalColumn
    - CategoricalColumn.type
    - CategoricalColumn.gini_impurity
    - CategoricalColumn.unalikeability
    - CategoricalColumn.diff()
    - CategoricalColumn.report()
    - CategoricalColumn.load_from_dict()
    - CategoricalColumn.profile
    - CategoricalColumn.categories
    - CategoricalColumn.categorical_counts
    - CategoricalColumn.unique_ratio
    - CategoricalColumn.unique_count
    - CategoricalColumn.is_match
    - CategoricalColumn.col_type
    - CategoricalColumn.name
    - CategoricalColumn.sample_size
    - CategoricalColumn.metadata
    - CategoricalColumn.times
    - CategoricalColumn.thread_safe
    - CategoricalColumn.update()
- dataprofiler.profilers.column_profile_compilers module
- dataprofiler.profilers.data_labeler_column_profile module
  - DataLabelerColumn
    - DataLabelerColumn.type
    - DataLabelerColumn.thread_safe
    - DataLabelerColumn.assert_equal_conditions()
    - DataLabelerColumn.reverse_label_mapping
    - DataLabelerColumn.possible_data_labels
    - DataLabelerColumn.rank_distribution
    - DataLabelerColumn.sum_predictions
    - DataLabelerColumn.data_label
    - DataLabelerColumn.avg_predictions
    - DataLabelerColumn.label_representation
    - DataLabelerColumn.profile
    - DataLabelerColumn.load_from_dict()
    - DataLabelerColumn.report()
    - DataLabelerColumn.col_type
    - DataLabelerColumn.diff()
    - DataLabelerColumn.name
    - DataLabelerColumn.sample_size
    - DataLabelerColumn.metadata
    - DataLabelerColumn.times
    - DataLabelerColumn.update()
- dataprofiler.profilers.datetime_column_profile module
  - DateTimeColumn
    - DateTimeColumn.type
    - DateTimeColumn.report()
    - DateTimeColumn.load_from_dict()
    - DateTimeColumn.profile
    - DateTimeColumn.data_type_ratio
    - DateTimeColumn.diff()
    - DateTimeColumn.update()
    - DateTimeColumn.col_type
    - DateTimeColumn.match_count
    - DateTimeColumn.sample_size
    - DateTimeColumn.name
    - DateTimeColumn.metadata
    - DateTimeColumn.times
    - DateTimeColumn.thread_safe
- dataprofiler.profilers.float_column_profile module
  - FloatColumn
    - FloatColumn.type
    - FloatColumn.diff()
    - FloatColumn.report()
    - FloatColumn.load_from_dict()
    - FloatColumn.profile
    - FloatColumn.precision
    - FloatColumn.data_type_ratio
    - FloatColumn.col_type
    - FloatColumn.is_float()
    - FloatColumn.is_int()
    - FloatColumn.kurtosis
    - FloatColumn.mean
    - FloatColumn.median
    - FloatColumn.median_abs_deviation
    - FloatColumn.mode
    - FloatColumn.np_type_to_type()
    - FloatColumn.skewness
    - FloatColumn.stddev
    - FloatColumn.update()
    - FloatColumn.variance
    - FloatColumn.match_count
    - FloatColumn.sample_size
    - FloatColumn.name
    - FloatColumn.metadata
    - FloatColumn.times
    - FloatColumn.thread_safe
- dataprofiler.profilers.graph_profiler module
- dataprofiler.profilers.histogram_utils module
- dataprofiler.profilers.int_column_profile module
  - IntColumn
    - IntColumn.type
    - IntColumn.report()
    - IntColumn.load_from_dict()
    - IntColumn.profile
    - IntColumn.data_type_ratio
    - IntColumn.update()
    - IntColumn.col_type
    - IntColumn.diff()
    - IntColumn.is_float()
    - IntColumn.is_int()
    - IntColumn.kurtosis
    - IntColumn.mean
    - IntColumn.median
    - IntColumn.median_abs_deviation
    - IntColumn.mode
    - IntColumn.np_type_to_type()
    - IntColumn.skewness
    - IntColumn.stddev
    - IntColumn.variance
    - IntColumn.match_count
    - IntColumn.sample_size
    - IntColumn.name
    - IntColumn.metadata
    - IntColumn.times
    - IntColumn.thread_safe
- dataprofiler.profilers.json_decoder module
- dataprofiler.profilers.json_encoder module
- dataprofiler.profilers.numerical_column_stats module
  - abstractstaticmethod
  - NumericStatsMixin
    - NumericStatsMixin.type
    - NumericStatsMixin.profile()
    - NumericStatsMixin.report()
    - NumericStatsMixin.diff()
    - NumericStatsMixin.mean
    - NumericStatsMixin.mode
    - NumericStatsMixin.median
    - NumericStatsMixin.variance
    - NumericStatsMixin.stddev
    - NumericStatsMixin.skewness
    - NumericStatsMixin.kurtosis
    - NumericStatsMixin.median_abs_deviation
    - NumericStatsMixin.col_type
    - NumericStatsMixin.load_from_dict()
    - NumericStatsMixin.name
    - NumericStatsMixin.sample_size
    - NumericStatsMixin.metadata
    - NumericStatsMixin.times
    - NumericStatsMixin.thread_safe
    - NumericStatsMixin.update()
    - NumericStatsMixin.is_float()
    - NumericStatsMixin.is_int()
    - NumericStatsMixin.np_type_to_type()
- dataprofiler.profilers.order_column_profile module
- dataprofiler.profilers.profile_builder module
- dataprofiler.profilers.profiler_options module
  - BaseOption
  - BooleanOption
  - HistogramAndQuantilesOption
  - ModeOption
  - BaseInspectorOptions
  - NumericalOptions
  - IntOptions
  - PrecisionOptions
  - FloatOptions
  - TextOptions
  - DateTimeOptions
  - OrderOptions
  - CategoricalOptions
  - CorrelationOptions
  - HyperLogLogOptions
  - UniqueCountOptions
  - RowStatisticsOptions
  - DataLabelerOptions
  - TextProfilerOptions
  - StructuredOptions
  - UnstructuredOptions
  - ProfilerOptions
- dataprofiler.profilers.profiler_utils module
  - recursive_dict_update()
  - KeyDict
  - shuffle_in_chunks()
  - warn_on_profile()
  - partition()
  - auto_multiprocess_toggle()
  - suggest_pool_size()
  - generate_pool()
  - overlap()
  - add_nested_dictionaries()
  - biased_skew()
  - biased_kurt()
  - Subtractable
  - find_diff_of_numbers()
  - find_diff_of_strings_and_bools()
  - find_diff_of_lists_and_sets()
  - find_diff_of_dates()
  - find_diff_of_dicts()
  - find_diff_of_matrices()
  - find_diff_of_dicts_with_diff_keys()
  - get_memory_size()
  - method_timeit()
  - perform_chi_squared_test_for_homogeneity()
  - chunk()
  - merge()
  - merge_profile_list()
  - reload_labeler_from_options_or_get_new()
- dataprofiler.profilers.text_column_profile module
  - TextColumn
    - TextColumn.type
    - TextColumn.report()
    - TextColumn.profile
    - TextColumn.diff()
    - TextColumn.data_type_ratio
    - TextColumn.update()
    - TextColumn.load_from_dict()
    - TextColumn.col_type
    - TextColumn.is_float()
    - TextColumn.is_int()
    - TextColumn.kurtosis
    - TextColumn.mean
    - TextColumn.median
    - TextColumn.median_abs_deviation
    - TextColumn.mode
    - TextColumn.np_type_to_type()
    - TextColumn.skewness
    - TextColumn.stddev
    - TextColumn.variance
    - TextColumn.min
    - TextColumn.max
    - TextColumn.sum
    - TextColumn.max_histogram_bin
    - TextColumn.min_histogram_bin
    - TextColumn.histogram_bin_method_names
    - TextColumn.histogram_selection
    - TextColumn.user_set_histogram_bin
    - TextColumn.bias_correction
    - TextColumn.num_zeros
    - TextColumn.num_negatives
    - TextColumn.histogram_methods
    - TextColumn.quantiles
    - TextColumn.match_count
    - TextColumn.name
    - TextColumn.sample_size
    - TextColumn.metadata
    - TextColumn.times
    - TextColumn.thread_safe
- dataprofiler.profilers.unstructured_labeler_profile module
- dataprofiler.profilers.unstructured_text_profile module
Module contents
Package for providing statistics and predictions for a given dataset.