Unstructured Labeler Profile

Profile analysis for applying labels within unstructured profiling.

class dataprofiler.profilers.unstructured_labeler_profile.UnstructuredLabelerProfile(data_labeler_dirpath=None, options=None)

Bases: object

Profiles and labels unstructured data.

Initialize of Data Label profiling for unstructured datasets.

Parameters
  • data_labeler_dirpath (String) – Directory path to the data labeler

  • options (DataLabelerOptions) – Options for the data labeler column

type = 'data_labeler'
report(remove_disabled_flag=False)

Return profile object.

Parameters

remove_disabled_flag (boolean) – flag to determine if disabled options should be excluded in report.

diff(other_profile, options=None)

Find the differences for two unstructured labeler profiles.

Parameters
Returns

the difference between entity counts/percentages

Return type

dict

property label_encoding

Return list of labels.

update(df_series)

Update profile.

property profile

Return a profile.