Unstructured Labeler Profile

Profile analysis for applying labels within unstructured profiling.

class dataprofiler.profilers.unstructured_labeler_profile.UnstructuredLabelerProfile(data_labeler_dirpath: Optional[str] = None, options: Optional[dataprofiler.profilers.profiler_options.DataLabelerOptions] = None)

Bases: object

Profiles and labels unstructured data.

Initialize of Data Label profiling for unstructured datasets.

Parameters
  • data_labeler_dirpath (String) – Directory path to the data labeler

  • options (DataLabelerOptions) – Options for the data labeler column

type = 'data_labeler'
report(remove_disabled_flag: bool = False) dict

Return profile object.

Parameters

remove_disabled_flag (boolean) – flag to determine if disabled options should be excluded in report.

diff(other_profile: dataprofiler.profilers.unstructured_labeler_profile.UnstructuredLabelerProfile, options: Optional[dict] = None) dict

Find the differences for two unstructured labeler profiles.

Parameters
Returns

the difference between entity counts/percentages

Return type

dict

property label_encoding: list

Return list of labels.

update(df_series: pandas.core.series.Series) None

Update profile.

property profile: dict

Return a profile.