dataprofiler package
Subpackages
- dataprofiler.data_readers package
- Submodules
- dataprofiler.data_readers.avro_data module
- dataprofiler.data_readers.base_data module
- dataprofiler.data_readers.csv_data module
- dataprofiler.data_readers.data module
- dataprofiler.data_readers.data_utils module
data_generator()
generator_on_file()
convert_int_to_string()
unicode_to_str()
json_to_dataframe()
read_json_df()
read_json()
reservoir()
rsample()
read_csv_df()
convert_unicode_col_to_utf8()
sample_parquet()
read_parquet_df()
read_text_as_list_of_strs()
detect_file_encoding()
detect_cell_type()
get_delimiter_regex()
find_nth_loc()
load_as_str_from_file()
is_valid_url()
url_to_bytes()
S3Helper
- dataprofiler.data_readers.filepath_or_buffer module
- dataprofiler.data_readers.graph_data module
- dataprofiler.data_readers.json_data module
- dataprofiler.data_readers.parquet_data module
- dataprofiler.data_readers.structured_mixins module
- dataprofiler.data_readers.text_data module
- Module contents
- dataprofiler.labelers package
- Submodules
- dataprofiler.labelers.base_data_labeler module
- dataprofiler.labelers.base_model module
- dataprofiler.labelers.char_load_tf_model module
- dataprofiler.labelers.character_level_cnn_model module
- dataprofiler.labelers.classification_report_utils module
- dataprofiler.labelers.column_name_model module
- dataprofiler.labelers.data_labelers module
- dataprofiler.labelers.data_processing module
- dataprofiler.labelers.labeler_utils module
- dataprofiler.labelers.regex_model module
- dataprofiler.labelers.utils module
- Module contents
- dataprofiler.plugins package
- dataprofiler.profilers package
- Subpackages
- Submodules
- dataprofiler.profilers.base_column_profilers module
- dataprofiler.profilers.categorical_column_profile module
- dataprofiler.profilers.column_profile_compilers module
- dataprofiler.profilers.data_labeler_column_profile module
- dataprofiler.profilers.datetime_column_profile module
- dataprofiler.profilers.float_column_profile module
- dataprofiler.profilers.graph_profiler module
- dataprofiler.profilers.histogram_utils module
- dataprofiler.profilers.int_column_profile module
- dataprofiler.profilers.json_decoder module
- dataprofiler.profilers.json_encoder module
- dataprofiler.profilers.numerical_column_stats module
- dataprofiler.profilers.order_column_profile module
- dataprofiler.profilers.profile_builder module
- dataprofiler.profilers.profiler_options module
BaseOption
BooleanOption
HistogramAndQuantilesOption
ModeOption
BaseInspectorOptions
NumericalOptions
IntOptions
PrecisionOptions
FloatOptions
TextOptions
DateTimeOptions
OrderOptions
CategoricalOptions
CorrelationOptions
HyperLogLogOptions
UniqueCountOptions
RowStatisticsOptions
DataLabelerOptions
TextProfilerOptions
StructuredOptions
UnstructuredOptions
ProfilerOptions
- dataprofiler.profilers.profiler_utils module
recursive_dict_update()
KeyDict
shuffle_in_chunks()
warn_on_profile()
partition()
auto_multiprocess_toggle()
suggest_pool_size()
generate_pool()
overlap()
add_nested_dictionaries()
biased_skew()
biased_kurt()
Subtractable
find_diff_of_numbers()
find_diff_of_strings_and_bools()
find_diff_of_lists_and_sets()
find_diff_of_dates()
find_diff_of_dicts()
find_diff_of_matrices()
find_diff_of_dicts_with_diff_keys()
get_memory_size()
method_timeit()
perform_chi_squared_test_for_homogeneity()
chunk()
merge()
merge_profile_list()
reload_labeler_from_options_or_get_new()
- dataprofiler.profilers.text_column_profile module
- dataprofiler.profilers.unstructured_labeler_profile module
- dataprofiler.profilers.unstructured_text_profile module
- Module contents
- dataprofiler.reports package
- dataprofiler.validators package
Submodules
Module contents
Package for dataprofiler.
- dataprofiler.set_seed(seed=None)