dataprofiler.labelers package¶
Submodules¶
- dataprofiler.labelers.base_data_labeler module
BaseDataLabelerBaseDataLabeler.help()BaseDataLabeler.label_mappingBaseDataLabeler.reverse_label_mappingBaseDataLabeler.labelsBaseDataLabeler.preprocessorBaseDataLabeler.modelBaseDataLabeler.postprocessorBaseDataLabeler.set_params()BaseDataLabeler.add_label()BaseDataLabeler.set_labels()BaseDataLabeler.predict()BaseDataLabeler.set_preprocessor()BaseDataLabeler.set_model()BaseDataLabeler.set_postprocessor()BaseDataLabeler.check_pipeline()BaseDataLabeler.load_from_library()BaseDataLabeler.load_from_disk()BaseDataLabeler.load_with_components()BaseDataLabeler.save_to_disk()
TrainableDataLabelerTrainableDataLabeler.fit()TrainableDataLabeler.set_model()TrainableDataLabeler.load_with_components()TrainableDataLabeler.add_label()TrainableDataLabeler.check_pipeline()TrainableDataLabeler.help()TrainableDataLabeler.label_mappingTrainableDataLabeler.labelsTrainableDataLabeler.load_from_disk()TrainableDataLabeler.load_from_library()TrainableDataLabeler.modelTrainableDataLabeler.postprocessorTrainableDataLabeler.predict()TrainableDataLabeler.preprocessorTrainableDataLabeler.reverse_label_mappingTrainableDataLabeler.save_to_disk()TrainableDataLabeler.set_labels()TrainableDataLabeler.set_params()TrainableDataLabeler.set_postprocessor()TrainableDataLabeler.set_preprocessor()
- dataprofiler.labelers.base_model module
AutoSubRegistrationMetaBaseModelBaseModel.requires_zero_mappingBaseModel.label_mappingBaseModel.reverse_label_mappingBaseModel.labelsBaseModel.num_labelsBaseModel.get_class()BaseModel.get_parameters()BaseModel.set_params()BaseModel.add_label()BaseModel.set_label_mapping()BaseModel.help()BaseModel.reset_weights()BaseModel.predict()BaseModel.load_from_disk()BaseModel.save_to_disk()
BaseTrainableModelBaseTrainableModel.fit()BaseTrainableModel.add_label()BaseTrainableModel.get_class()BaseTrainableModel.get_parameters()BaseTrainableModel.help()BaseTrainableModel.label_mappingBaseTrainableModel.labelsBaseTrainableModel.load_from_disk()BaseTrainableModel.num_labelsBaseTrainableModel.predict()BaseTrainableModel.requires_zero_mappingBaseTrainableModel.reset_weights()BaseTrainableModel.reverse_label_mappingBaseTrainableModel.save_to_disk()BaseTrainableModel.set_label_mapping()BaseTrainableModel.set_params()
- dataprofiler.labelers.char_load_tf_model module
CharLoadTFModelCharLoadTFModel.requires_zero_mappingCharLoadTFModel.set_label_mapping()CharLoadTFModel.save_to_disk()CharLoadTFModel.load_from_disk()CharLoadTFModel.reset_weights()CharLoadTFModel.fit()CharLoadTFModel.predict()CharLoadTFModel.details()CharLoadTFModel.add_label()CharLoadTFModel.get_class()CharLoadTFModel.get_parameters()CharLoadTFModel.help()CharLoadTFModel.label_mappingCharLoadTFModel.labelsCharLoadTFModel.num_labelsCharLoadTFModel.reverse_label_mappingCharLoadTFModel.set_params()
- dataprofiler.labelers.character_level_cnn_model module
build_embd_dictionary()create_glove_char()ThreshArgMaxLayerThreshArgMaxLayer.get_config()ThreshArgMaxLayer.call()ThreshArgMaxLayer.add_loss()ThreshArgMaxLayer.add_metric()ThreshArgMaxLayer.add_variable()ThreshArgMaxLayer.add_weight()ThreshArgMaxLayer.build()ThreshArgMaxLayer.build_from_config()ThreshArgMaxLayer.compute_dtypeThreshArgMaxLayer.compute_mask()ThreshArgMaxLayer.compute_output_shape()ThreshArgMaxLayer.compute_output_spec()ThreshArgMaxLayer.count_params()ThreshArgMaxLayer.dtypeThreshArgMaxLayer.dtype_policyThreshArgMaxLayer.from_config()ThreshArgMaxLayer.get_build_config()ThreshArgMaxLayer.get_weights()ThreshArgMaxLayer.inputThreshArgMaxLayer.input_dtypeThreshArgMaxLayer.input_specThreshArgMaxLayer.load_own_variables()ThreshArgMaxLayer.lossesThreshArgMaxLayer.metricsThreshArgMaxLayer.metrics_variablesThreshArgMaxLayer.non_trainable_variablesThreshArgMaxLayer.non_trainable_weightsThreshArgMaxLayer.outputThreshArgMaxLayer.pathThreshArgMaxLayer.quantization_modeThreshArgMaxLayer.quantize()ThreshArgMaxLayer.quantized_call()ThreshArgMaxLayer.save_own_variables()ThreshArgMaxLayer.set_weights()ThreshArgMaxLayer.stateless_call()ThreshArgMaxLayer.supports_maskingThreshArgMaxLayer.symbolic_call()ThreshArgMaxLayer.trainableThreshArgMaxLayer.trainable_variablesThreshArgMaxLayer.trainable_weightsThreshArgMaxLayer.variable_dtypeThreshArgMaxLayer.variablesThreshArgMaxLayer.weights
EncodingLayerEncodingLayer.get_config()EncodingLayer.call()EncodingLayer.add_loss()EncodingLayer.add_metric()EncodingLayer.add_variable()EncodingLayer.add_weight()EncodingLayer.build()EncodingLayer.build_from_config()EncodingLayer.compute_dtypeEncodingLayer.compute_mask()EncodingLayer.compute_output_shape()EncodingLayer.compute_output_spec()EncodingLayer.count_params()EncodingLayer.dtypeEncodingLayer.dtype_policyEncodingLayer.from_config()EncodingLayer.get_build_config()EncodingLayer.get_weights()EncodingLayer.inputEncodingLayer.input_dtypeEncodingLayer.input_specEncodingLayer.load_own_variables()EncodingLayer.lossesEncodingLayer.metricsEncodingLayer.metrics_variablesEncodingLayer.non_trainable_variablesEncodingLayer.non_trainable_weightsEncodingLayer.outputEncodingLayer.pathEncodingLayer.quantization_modeEncodingLayer.quantize()EncodingLayer.quantized_call()EncodingLayer.save_own_variables()EncodingLayer.set_weights()EncodingLayer.stateless_call()EncodingLayer.supports_maskingEncodingLayer.symbolic_call()EncodingLayer.trainableEncodingLayer.trainable_variablesEncodingLayer.trainable_weightsEncodingLayer.variable_dtypeEncodingLayer.variablesEncodingLayer.weights
CharacterLevelCnnModelCharacterLevelCnnModel.requires_zero_mappingCharacterLevelCnnModel.set_label_mapping()CharacterLevelCnnModel.save_to_disk()CharacterLevelCnnModel.load_from_disk()CharacterLevelCnnModel.reset_weights()CharacterLevelCnnModel.fit()CharacterLevelCnnModel.predict()CharacterLevelCnnModel.details()CharacterLevelCnnModel.add_label()CharacterLevelCnnModel.get_class()CharacterLevelCnnModel.get_parameters()CharacterLevelCnnModel.help()CharacterLevelCnnModel.label_mappingCharacterLevelCnnModel.labelsCharacterLevelCnnModel.num_labelsCharacterLevelCnnModel.reverse_label_mappingCharacterLevelCnnModel.set_params()
- dataprofiler.labelers.classification_report_utils module
- dataprofiler.labelers.column_name_model module
ColumnNameModelColumnNameModel.reset_weights()ColumnNameModel.predict()ColumnNameModel.load_from_disk()ColumnNameModel.save_to_disk()ColumnNameModel.add_label()ColumnNameModel.get_class()ColumnNameModel.get_parameters()ColumnNameModel.help()ColumnNameModel.label_mappingColumnNameModel.labelsColumnNameModel.num_labelsColumnNameModel.requires_zero_mappingColumnNameModel.reverse_label_mappingColumnNameModel.set_label_mapping()ColumnNameModel.set_params()
- dataprofiler.labelers.data_labelers module
train_structured_labeler()UnstructuredDataLabelerUnstructuredDataLabeler.add_label()UnstructuredDataLabeler.check_pipeline()UnstructuredDataLabeler.help()UnstructuredDataLabeler.label_mappingUnstructuredDataLabeler.labelsUnstructuredDataLabeler.load_from_disk()UnstructuredDataLabeler.load_from_library()UnstructuredDataLabeler.load_with_components()UnstructuredDataLabeler.modelUnstructuredDataLabeler.postprocessorUnstructuredDataLabeler.predict()UnstructuredDataLabeler.preprocessorUnstructuredDataLabeler.reverse_label_mappingUnstructuredDataLabeler.save_to_disk()UnstructuredDataLabeler.set_labels()UnstructuredDataLabeler.set_model()UnstructuredDataLabeler.set_params()UnstructuredDataLabeler.set_postprocessor()UnstructuredDataLabeler.set_preprocessor()
StructuredDataLabelerStructuredDataLabeler.add_label()StructuredDataLabeler.check_pipeline()StructuredDataLabeler.help()StructuredDataLabeler.label_mappingStructuredDataLabeler.labelsStructuredDataLabeler.load_from_disk()StructuredDataLabeler.load_from_library()StructuredDataLabeler.load_with_components()StructuredDataLabeler.modelStructuredDataLabeler.postprocessorStructuredDataLabeler.predict()StructuredDataLabeler.preprocessorStructuredDataLabeler.reverse_label_mappingStructuredDataLabeler.save_to_disk()StructuredDataLabeler.set_labels()StructuredDataLabeler.set_model()StructuredDataLabeler.set_params()StructuredDataLabeler.set_postprocessor()StructuredDataLabeler.set_preprocessor()
DataLabeler
- dataprofiler.labelers.data_processing module
AutoSubRegistrationMetaBaseDataProcessorBaseDataPreprocessorBaseDataPreprocessor.processor_typeBaseDataPreprocessor.process()BaseDataPreprocessor.get_class()BaseDataPreprocessor.get_parameters()BaseDataPreprocessor.help()BaseDataPreprocessor.load_from_disk()BaseDataPreprocessor.load_from_library()BaseDataPreprocessor.save_to_disk()BaseDataPreprocessor.set_params()
BaseDataPostprocessorBaseDataPostprocessor.processor_typeBaseDataPostprocessor.process()BaseDataPostprocessor.get_class()BaseDataPostprocessor.get_parameters()BaseDataPostprocessor.help()BaseDataPostprocessor.load_from_disk()BaseDataPostprocessor.load_from_library()BaseDataPostprocessor.save_to_disk()BaseDataPostprocessor.set_params()
DirectPassPreprocessorDirectPassPreprocessor.help()DirectPassPreprocessor.process()DirectPassPreprocessor.get_class()DirectPassPreprocessor.get_parameters()DirectPassPreprocessor.load_from_disk()DirectPassPreprocessor.load_from_library()DirectPassPreprocessor.processor_typeDirectPassPreprocessor.save_to_disk()DirectPassPreprocessor.set_params()
CharPreprocessorCharEncodedPreprocessorCharEncodedPreprocessor.process()CharEncodedPreprocessor.get_class()CharEncodedPreprocessor.get_parameters()CharEncodedPreprocessor.help()CharEncodedPreprocessor.load_from_disk()CharEncodedPreprocessor.load_from_library()CharEncodedPreprocessor.processor_typeCharEncodedPreprocessor.save_to_disk()CharEncodedPreprocessor.set_params()
CharPostprocessorCharPostprocessor.help()CharPostprocessor.convert_to_NER_format()CharPostprocessor.match_sentence_lengths()CharPostprocessor.process()CharPostprocessor.get_class()CharPostprocessor.get_parameters()CharPostprocessor.load_from_disk()CharPostprocessor.load_from_library()CharPostprocessor.processor_typeCharPostprocessor.save_to_disk()CharPostprocessor.set_params()
StructCharPreprocessorStructCharPreprocessor.help()StructCharPreprocessor.get_parameters()StructCharPreprocessor.convert_to_unstructured_format()StructCharPreprocessor.process()StructCharPreprocessor.get_class()StructCharPreprocessor.load_from_disk()StructCharPreprocessor.load_from_library()StructCharPreprocessor.processor_typeStructCharPreprocessor.save_to_disk()StructCharPreprocessor.set_params()
StructCharPostprocessorStructCharPostprocessor.help()StructCharPostprocessor.match_sentence_lengths()StructCharPostprocessor.convert_to_structured_analysis()StructCharPostprocessor.process()StructCharPostprocessor.get_class()StructCharPostprocessor.get_parameters()StructCharPostprocessor.load_from_disk()StructCharPostprocessor.load_from_library()StructCharPostprocessor.processor_typeStructCharPostprocessor.save_to_disk()StructCharPostprocessor.set_params()
RegexPostProcessorRegexPostProcessor.help()RegexPostProcessor.priority_prediction()RegexPostProcessor.split_prediction()RegexPostProcessor.process()RegexPostProcessor.get_class()RegexPostProcessor.get_parameters()RegexPostProcessor.load_from_disk()RegexPostProcessor.load_from_library()RegexPostProcessor.processor_typeRegexPostProcessor.save_to_disk()RegexPostProcessor.set_params()
StructRegexPostProcessorStructRegexPostProcessor.set_params()StructRegexPostProcessor.help()StructRegexPostProcessor.process()StructRegexPostProcessor.get_class()StructRegexPostProcessor.get_parameters()StructRegexPostProcessor.load_from_disk()StructRegexPostProcessor.load_from_library()StructRegexPostProcessor.processor_typeStructRegexPostProcessor.save_to_disk()
ColumnNameModelPostprocessorColumnNameModelPostprocessor.help()ColumnNameModelPostprocessor.process()ColumnNameModelPostprocessor.get_class()ColumnNameModelPostprocessor.get_parameters()ColumnNameModelPostprocessor.load_from_disk()ColumnNameModelPostprocessor.load_from_library()ColumnNameModelPostprocessor.processor_typeColumnNameModelPostprocessor.save_to_disk()ColumnNameModelPostprocessor.set_params()
- dataprofiler.labelers.labeler_utils module
f1_report_dict_to_str()evaluate_accuracy()get_tf_layer_index_from_name()hide_tf_logger_warnings()protected_register_keras_serializable()FBetaScoreFBetaScore.update_state()FBetaScore.result()FBetaScore.get_config()FBetaScore.add_variable()FBetaScore.add_weight()FBetaScore.dtypeFBetaScore.from_config()FBetaScore.reset_state()FBetaScore.stateless_reset_state()FBetaScore.stateless_result()FBetaScore.stateless_update_state()FBetaScore.variables
F1Score
- dataprofiler.labelers.regex_model module
RegexModelRegexModel.reset_weights()RegexModel.predict()RegexModel.load_from_disk()RegexModel.save_to_disk()RegexModel.add_label()RegexModel.get_class()RegexModel.get_parameters()RegexModel.help()RegexModel.label_mappingRegexModel.labelsRegexModel.num_labelsRegexModel.requires_zero_mappingRegexModel.reverse_label_mappingRegexModel.set_label_mapping()RegexModel.set_params()
- dataprofiler.labelers.utils module
Module contents¶
The following will list the built-in models, processors, and data labelers.
- Models:
CharacterLevelCnnModel - character classification of text.
RegexModel - character classification of text.
- Processors:
- Preprocessors
CharPreprocessor
StructCharPreprocessor
DirectPassPreprocessor
- PostProcessors
CharPreprocessor
StructCharPostprocessor
RegexPostProcessor
- Data Labelers:
- Classes
UnstructuredDataLabeler
StructuredDataLabeler
- Files to load from disk using BaseDataLabeler.load_from_library(<NAME>)
unstructured_model
structured_model
regex_model