Unstructured Labeler Profile¶
Profile analysis for applying labels within unstructured profiling.
- class dataprofiler.profilers.unstructured_labeler_profile.UnstructuredLabelerProfile(data_labeler_dirpath: Optional[str] = None, options: Optional[dataprofiler.profilers.profiler_options.DataLabelerOptions] = None)¶
Bases:
object
Profiles and labels unstructured data.
Initialize of Data Label profiling for unstructured datasets.
- Parameters
data_labeler_dirpath (String) – Directory path to the data labeler
options (DataLabelerOptions) – Options for the data labeler column
- type = 'data_labeler'¶
- report(remove_disabled_flag: bool = False) dict ¶
Return profile object.
- Parameters
remove_disabled_flag (boolean) – flag to determine if disabled options should be excluded in report.
- diff(other_profile: dataprofiler.profilers.unstructured_labeler_profile.UnstructuredLabelerProfile, options: Optional[dict] = None) dict ¶
Find the differences for two unstructured labeler profiles.
- Parameters
other_profile (UnstructuredLabelerProfile) – profile to find the difference with
options (dict) – options for diff output
- Returns
the difference between entity counts/percentages
- Return type
dict
- property label_encoding: list[str]¶
Return list of labels.
- update(df_series: pandas.core.series.Series) None ¶
Update profile.
- property profile: dict¶
Return a profile.