Base Data

class dataprofiler.data_readers.base_data.BaseData(input_file_path, data, options)

Bases: object

Abstract class for data loading and saving

Base class for loading a dataset. Options can be specified and maybe more specific to the subclasses.

Parameters
  • input_file_path (str) – path to the file being loaded or None

  • data (multiple types) – data being loaded into the class instead of an input file

  • options (dict) – options pertaining to the data type

Returns

None

data_type = None
info = None
property data
property is_structured

Determines compatibility with StructuredProfiler

property data_format
property file_encoding
get_batch_generator(batch_size)
classmethod is_match(input_file_path, options)
reload(input_file_path, data, options)

Reload the data class with a new dataset. This erases all existing data/options and replaces it with the input data/options.

Parameters
  • input_file_path (str) – path to the file being loaded or None

  • data (multiple types) – data being loaded into the class instead of an input file

  • options (dict) – options pertaining to the data type

Returns

None

property length

Returns the length of the dataset which is loaded.

Returns

length of the dataset