Unstructured Text Profile

For profiling unstructured text data.

class dataprofiler.profilers.unstructured_text_profile.TextProfiler(name, options=None)

Bases: object

Profiles text data.

Initialize TextProfiler object.

Parameters
  • name (String) – Name of the data

  • options (UnstructuredTextOptions) – Options for the Text Profiler

type = 'text'

diff(other_profile, options=None)

Find the differences for two unstructured text profiles.

Parameters
  • other_profile (TextProfiler) – profile to find the difference with

  • options (dict) – options for diff output

Returns

the difference between profiles

Return type

dict
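A minimal sketch of how `diff` might be wrapped, assuming only the interface documented above (`diff(other_profile, options=None)` returning a dict). The helper name `compare_profilers` and its type check are illustrative assumptions, not part of the library. The commented usage assumes `pandas` and `dataprofiler` are installed.

```python
from typing import Any, Optional


def compare_profilers(baseline: Any, candidate: Any,
                      options: Optional[dict] = None) -> dict:
    """Return baseline.diff(candidate, options=options).

    Hypothetical helper: both arguments are expected to be profilers of the
    same class (for example, two TextProfiler objects), since diff() is
    documented as taking another TextProfiler.
    """
    if type(baseline) is not type(candidate):
        raise TypeError("both profilers must be of the same class")
    return baseline.diff(candidate, options=options)


# Usage sketch (requires pandas and dataprofiler, not executed here):
#   import pandas as pd
#   from dataprofiler.profilers.unstructured_text_profile import TextProfiler
#   old = TextProfiler("reviews_v1")
#   new = TextProfiler("reviews_v2")
#   old.update(pd.Series(["some earlier text"]))
#   new.update(pd.Series(["some later text"]))
#   differences = compare_profilers(old, new)
```

The type check is a design choice for this sketch only; it fails early with a clear message instead of relying on whatever error the profiler itself would raise.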

report(remove_disabled_flag=False)

Report the profile attribute of the class, potentially popping values from self.profile.

property profile

Return the profile of the column.

Returns

update(data)

Update the column profile.

Parameters

  • data (pandas.core.series.Series) – pandas Series of text data used to update the profile

Returns

None
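A minimal sketch of incremental profiling, assuming only the interface documented above (`update(data)` returning `None`, and the `profile` property). The helper name `profile_in_batches` and its arguments are hypothetical and not part of the library. The commented usage assumes `pandas` and `dataprofiler` are installed.

```python
from typing import Any, Iterable


def profile_in_batches(profiler: Any, batches: Iterable[Any]) -> Any:
    """Feed each batch to profiler.update() and return profiler.profile.

    Hypothetical helper: `profiler` is any object exposing the documented
    update(data) method and profile property, such as a TextProfiler.
    """
    for batch in batches:
        # update() is documented as returning None, so the profiler is
        # expected to accumulate state in place.
        profiler.update(batch)
    return profiler.profile


# Usage sketch (requires pandas and dataprofiler, not executed here):
#   import pandas as pd
#   from dataprofiler.profilers.unstructured_text_profile import TextProfiler
#   profiler = TextProfiler("reviews")
#   batches = [pd.Series(["first text"]), pd.Series(["second text"])]
#   result = profile_in_batches(profiler, batches)
```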