create_text_model_readerT_create_text_model_readerCreateTextModelReaderCreateTextModelReadercreate_text_model_reader创建文本模型阅读器(算子)

名称

create_text_model_readerT_create_text_model_readerCreateTextModelReaderCreateTextModelReadercreate_text_model_reader — 创建文本模型。

签名

create_text_model_reader( : : Mode, OCRClassifier : TextModel)

Herror T_create_text_model_reader(const Htuple Mode, const Htuple OCRClassifier, Htuple* TextModel)

void CreateTextModelReader(const HTuple& Mode, const HTuple& OCRClassifier, HTuple* TextModel)

void HTextModel::HTextModel(const HString& Mode, const HTuple& OCRClassifier)

void HTextModel::HTextModel(const HString& Mode, const HString& OCRClassifier)

void HTextModel::HTextModel(const char* Mode, const char* OCRClassifier)

void HTextModel::HTextModel(const wchar_t* Mode, const wchar_t* OCRClassifier)   ( Windows only)

void HTextModel::CreateTextModelReader(const HString& Mode, const HTuple& OCRClassifier)

void HTextModel::CreateTextModelReader(const HString& Mode, const HString& OCRClassifier)

void HTextModel::CreateTextModelReader(const char* Mode, const char* OCRClassifier)

void HTextModel::CreateTextModelReader(const wchar_t* Mode, const wchar_t* OCRClassifier)   ( Windows only)

static void HOperatorSet.CreateTextModelReader(HTuple mode, HTuple OCRClassifier, out HTuple textModel)

public HTextModel(string mode, HTuple OCRClassifier)

public HTextModel(string mode, string OCRClassifier)

void HTextModel.CreateTextModelReader(string mode, HTuple OCRClassifier)

void HTextModel.CreateTextModelReader(string mode, string OCRClassifier)

def create_text_model_reader(mode: str, ocrclassifier: Union[int, str]) -> HHandle

描述

create_text_model_readercreate_text_model_readerCreateTextModelReaderCreateTextModelReaderCreateTextModelReadercreate_text_model_reader 会创建一个 TextModelTextModelTextModelTextModeltextModeltext_model,该模型描述了将通过 find_textfind_textFindTextFindTextFindTextfind_text 进行分割的文本。

ModeModeModeModemodemode 参数的值决定了使用哪种文本分割方法。可能的取值为 'auto'"auto""auto""auto""auto""auto"'manual'"manual""manual""manual""manual""manual"

通常,应将 ModeModeModeModemodemode 参数设置为 'auto'"auto""auto""auto""auto""auto",因为该模式更稳定,且配置工作量更少。请注意,在此情况下,还必须通过 OCRClassifierOCRClassifierOCRClassifierOCRClassifierOCRClassifierocrclassifier 参数传入一个 OCR 分类器。仅当满足以下任一限制条件时,才需将 ModeModeModeModemodemode 设置为 'manual'"manual""manual""manual""manual""manual"

如果 ModeModeModeModemodemode = 'auto'"auto""auto""auto""auto""auto"find_textfind_textFindTextFindTextFindTextfind_text 能够提取任意大小的文本。可以将搜索范围限制为具有特定属性的字符,详情请参阅 set_text_model_paramset_text_model_paramSetTextModelParamSetTextModelParamSetTextModelParamset_text_model_param。特别是,如果待分割的文本包含点印字符,则必须将文本模型参数 'dot_print'"dot_print""dot_print""dot_print""dot_print""dot_print" 设置为 'true'"true""true""true""true""true"。此外,必须在 OCRClassifierOCRClassifierOCRClassifierOCRClassifierOCRClassifierocrclassifier 中传入一个 OCR 分类器。该 OCR 分类器必须基于卷积神经网络 (CNN) 或多层感知器 (MLP)。此外,强烈建议使用具有拒绝类的基于 CNN 的 OCR 分类器,或使用已通过正则化参数训练的基于 MLP 的分类器(参见 set_regularization_params_ocr_class_mlpset_regularization_params_ocr_class_mlpSetRegularizationParamsOcrClassMlpSetRegularizationParamsOcrClassMlpSetRegularizationParamsOcrClassMlpset_regularization_params_ocr_class_mlp)并提供拒绝类 (参见 set_rejection_params_ocr_class_mlpset_rejection_params_ocr_class_mlpSetRejectionParamsOcrClassMlpSetRejectionParamsOcrClassMlpSetRejectionParamsOcrClassMlpset_rejection_params_ocr_class_mlp)。合适的 OCR 分类器既可以通过 read_ocr_class_cnnread_ocr_class_cnnReadOcrClassCnnReadOcrClassCnnReadOcrClassCnnread_ocr_class_cnnread_ocr_class_mlpread_ocr_class_mlpReadOcrClassMlpReadOcrClassMlpReadOcrClassMlpread_ocr_class_mlp 读取,也可以通过 create_ocr_class_mlpcreate_ocr_class_mlpCreateOcrClassMlpCreateOcrClassMlpCreateOcrClassMlpcreate_ocr_class_mlp 创建。此外,还可以传入一个字符串,其中包含预训练 OCR 分类器的路径,或通过 write_ocr_class_mlpwrite_ocr_class_mlpWriteOcrClassMlpWriteOcrClassMlpWriteOcrClassMlpwrite_ocr_class_mlp 存储的 OCR 分类器。

若在 ModeModeModeModemodemode = 'manual'"manual""manual""manual""manual""manual" 时启用文本分割,必须使用 set_text_model_paramset_text_model_paramSetTextModelParamSetTextModelParamSetTextModelParamset_text_model_param 设置文本模型的合理参数,包括预期的字符高度和宽度。在此情况下,OCRClassifierOCRClassifierOCRClassifierOCRClassifierOCRClassifierocrclassifier 的值将被忽略。

可通过 set_text_model_paramset_text_model_paramSetTextModelParamSetTextModelParamSetTextModelParamset_text_model_paramget_text_model_paramget_text_model_paramGetTextModelParamGetTextModelParamGetTextModelParamget_text_model_param 设置和查询 TextModelTextModelTextModelTextModeltextModeltext_model 的参数。

由于在调用 create_text_model_readercreate_text_model_readerCreateTextModelReaderCreateTextModelReaderCreateTextModelReadercreate_text_model_reader 以及随后的操作过程中会为文本模型分配内存,因此一旦不再使用该模型,应立即通过 clear_text_modelclear_text_modelClearTextModelClearTextModelClearTextModelclear_text_model 算子显式释放内存。

执行信息

此算子返回一个句柄。请注意,即使该句柄被用作特定算子的输入参数,这些算子仍可能改变此句柄类型的实例状态。

参数

ModeModeModeModemodemode (输入控制)  string HTuplestrHTupleHtuple (string) (string) (HString) (char*)

文本模型的模式。

默认值: 'auto' "auto" "auto" "auto" "auto" "auto"

值列表: 'auto'"auto""auto""auto""auto""auto", 'manual'"manual""manual""manual""manual""manual"

OCRClassifierOCRClassifierOCRClassifierOCRClassifierOCRClassifierocrclassifier (输入控制)  string HTupleUnion[int, str]HTupleHtuple (string / integer) (string / int / long) (HString / Hlong) (char* / Hlong)

OCR 分类器。

默认值: 'Universal_Rej.occ' "Universal_Rej.occ" "Universal_Rej.occ" "Universal_Rej.occ" "Universal_Rej.occ" "Universal_Rej.occ"

建议值: 'Document_Rej.omc'"Document_Rej.omc""Document_Rej.omc""Document_Rej.omc""Document_Rej.omc""Document_Rej.omc", 'Document_0-9_Rej.omc'"Document_0-9_Rej.omc""Document_0-9_Rej.omc""Document_0-9_Rej.omc""Document_0-9_Rej.omc""Document_0-9_Rej.omc", 'Document_0-9A-Z_Rej.omc'"Document_0-9A-Z_Rej.omc""Document_0-9A-Z_Rej.omc""Document_0-9A-Z_Rej.omc""Document_0-9A-Z_Rej.omc""Document_0-9A-Z_Rej.omc", 'Document_A-Z+_Rej.omc'"Document_A-Z+_Rej.omc""Document_A-Z+_Rej.omc""Document_A-Z+_Rej.omc""Document_A-Z+_Rej.omc""Document_A-Z+_Rej.omc", 'DotPrint_Rej.omc'"DotPrint_Rej.omc""DotPrint_Rej.omc""DotPrint_Rej.omc""DotPrint_Rej.omc""DotPrint_Rej.omc", 'DotPrint_0-9_Rej.omc'"DotPrint_0-9_Rej.omc""DotPrint_0-9_Rej.omc""DotPrint_0-9_Rej.omc""DotPrint_0-9_Rej.omc""DotPrint_0-9_Rej.omc", 'DotPrint_0-9+_Rej.omc'"DotPrint_0-9+_Rej.omc""DotPrint_0-9+_Rej.omc""DotPrint_0-9+_Rej.omc""DotPrint_0-9+_Rej.omc""DotPrint_0-9+_Rej.omc", 'DotPrint_0-9A-Z_Rej.omc'"DotPrint_0-9A-Z_Rej.omc""DotPrint_0-9A-Z_Rej.omc""DotPrint_0-9A-Z_Rej.omc""DotPrint_0-9A-Z_Rej.omc""DotPrint_0-9A-Z_Rej.omc", 'DotPrint_A-Z+_Rej.omc'"DotPrint_A-Z+_Rej.omc""DotPrint_A-Z+_Rej.omc""DotPrint_A-Z+_Rej.omc""DotPrint_A-Z+_Rej.omc""DotPrint_A-Z+_Rej.omc", 'HandWritten_0-9_Rej.omc'"HandWritten_0-9_Rej.omc""HandWritten_0-9_Rej.omc""HandWritten_0-9_Rej.omc""HandWritten_0-9_Rej.omc""HandWritten_0-9_Rej.omc", 'Industrial_Rej.omc'"Industrial_Rej.omc""Industrial_Rej.omc""Industrial_Rej.omc""Industrial_Rej.omc""Industrial_Rej.omc", 'Industrial_0-9_Rej.omc'"Industrial_0-9_Rej.omc""Industrial_0-9_Rej.omc""Industrial_0-9_Rej.omc""Industrial_0-9_Rej.omc""Industrial_0-9_Rej.omc", 'Industrial_0-9+_Rej.omc'"Industrial_0-9+_Rej.omc""Industrial_0-9+_Rej.omc""Industrial_0-9+_Rej.omc""Industrial_0-9+_Rej.omc""Industrial_0-9+_Rej.omc", 'Industrial_0-9A-Z_Rej.omc'"Industrial_0-9A-Z_Rej.omc""Industrial_0-9A-Z_Rej.omc""Industrial_0-9A-Z_Rej.omc""Industrial_0-9A-Z_Rej.omc""Industrial_0-9A-Z_Rej.omc", 'Industrial_A-Z+_Rej.omc'"Industrial_A-Z+_Rej.omc""Industrial_A-Z+_Rej.omc""Industrial_A-Z+_Rej.omc""Industrial_A-Z+_Rej.omc""Industrial_A-Z+_Rej.omc", 'OCRA_Rej.omc'"OCRA_Rej.omc""OCRA_Rej.omc""OCRA_Rej.omc""OCRA_Rej.omc""OCRA_Rej.omc", 'OCRA_0-9_Rej.omc'"OCRA_0-9_Rej.omc""OCRA_0-9_Rej.omc""OCRA_0-9_Rej.omc""OCRA_0-9_Rej.omc""OCRA_0-9_Rej.omc", 'OCRA_0-9A-Z_Rej.omc'"OCRA_0-9A-Z_Rej.omc""OCRA_0-9A-Z_Rej.omc""OCRA_0-9A-Z_Rej.omc""OCRA_0-9A-Z_Rej.omc""OCRA_0-9A-Z_Rej.omc", 'OCRA_A-Z+_Rej.omc'"OCRA_A-Z+_Rej.omc""OCRA_A-Z+_Rej.omc""OCRA_A-Z+_Rej.omc""OCRA_A-Z+_Rej.omc""OCRA_A-Z+_Rej.omc", 'OCRB_Rej.omc'"OCRB_Rej.omc""OCRB_Rej.omc""OCRB_Rej.omc""OCRB_Rej.omc""OCRB_Rej.omc", 'OCRB_0-9_Rej.omc'"OCRB_0-9_Rej.omc""OCRB_0-9_Rej.omc""OCRB_0-9_Rej.omc""OCRB_0-9_Rej.omc""OCRB_0-9_Rej.omc", 'OCRB_0-9A-Z_Rej.omc'"OCRB_0-9A-Z_Rej.omc""OCRB_0-9A-Z_Rej.omc""OCRB_0-9A-Z_Rej.omc""OCRB_0-9A-Z_Rej.omc""OCRB_0-9A-Z_Rej.omc", 'OCRB_A-Z+_Rej.omc'"OCRB_A-Z+_Rej.omc""OCRB_A-Z+_Rej.omc""OCRB_A-Z+_Rej.omc""OCRB_A-Z+_Rej.omc""OCRB_A-Z+_Rej.omc", 'OCRB_passport_Rej.omc'"OCRB_passport_Rej.omc""OCRB_passport_Rej.omc""OCRB_passport_Rej.omc""OCRB_passport_Rej.omc""OCRB_passport_Rej.omc", 'Pharma_Rej.omc'"Pharma_Rej.omc""Pharma_Rej.omc""Pharma_Rej.omc""Pharma_Rej.omc""Pharma_Rej.omc", 'Pharma_0-9_Rej.omc'"Pharma_0-9_Rej.omc""Pharma_0-9_Rej.omc""Pharma_0-9_Rej.omc""Pharma_0-9_Rej.omc""Pharma_0-9_Rej.omc", 'Pharma_0-9+_Rej.omc'"Pharma_0-9+_Rej.omc""Pharma_0-9+_Rej.omc""Pharma_0-9+_Rej.omc""Pharma_0-9+_Rej.omc""Pharma_0-9+_Rej.omc", 'Pharma_0-9A-Z_Rej.omc'"Pharma_0-9A-Z_Rej.omc""Pharma_0-9A-Z_Rej.omc""Pharma_0-9A-Z_Rej.omc""Pharma_0-9A-Z_Rej.omc""Pharma_0-9A-Z_Rej.omc", 'SEMI_Rej.omc'"SEMI_Rej.omc""SEMI_Rej.omc""SEMI_Rej.omc""SEMI_Rej.omc""SEMI_Rej.omc", 'Universal_Rej.occ'"Universal_Rej.occ""Universal_Rej.occ""Universal_Rej.occ""Universal_Rej.occ""Universal_Rej.occ", 'Universal_0-9_Rej.occ'"Universal_0-9_Rej.occ""Universal_0-9_Rej.occ""Universal_0-9_Rej.occ""Universal_0-9_Rej.occ""Universal_0-9_Rej.occ", 'Universal_0-9+_Rej.occ'"Universal_0-9+_Rej.occ""Universal_0-9+_Rej.occ""Universal_0-9+_Rej.occ""Universal_0-9+_Rej.occ""Universal_0-9+_Rej.occ", 'Universal_0-9A-Z_Rej.occ'"Universal_0-9A-Z_Rej.occ""Universal_0-9A-Z_Rej.occ""Universal_0-9A-Z_Rej.occ""Universal_0-9A-Z_Rej.occ""Universal_0-9A-Z_Rej.occ", 'Universal_0-9A-Z+_Rej.occ'"Universal_0-9A-Z+_Rej.occ""Universal_0-9A-Z+_Rej.occ""Universal_0-9A-Z+_Rej.occ""Universal_0-9A-Z+_Rej.occ""Universal_0-9A-Z+_Rej.occ", 'Universal_A-Z+_Rej.occ'"Universal_A-Z+_Rej.occ""Universal_A-Z+_Rej.occ""Universal_A-Z+_Rej.occ""Universal_A-Z+_Rej.occ""Universal_A-Z+_Rej.occ"

TextModelTextModelTextModelTextModeltextModeltext_model (输出控制)  text_model HTextModel, HTupleHHandleHTupleHtuple (handle) (IntPtr) (HHandle) (handle)

新的文本模型。

示例(HDevelop)

read_image (Image, 'numbers_scale')
create_text_model_reader ('auto', 'Document_Rej.omc', TextModel)
* Optionally specify text properties
set_text_model_param (TextModel, 'min_char_height', 20)
find_text (Image, TextModel, TextResultID)
* Return character regions and corresponding classification results
get_text_object (Characters, TextResultID, 'all_lines')
get_text_result (TextResultID, 'class', Class)

结果

create_text_model_readercreate_text_model_readerCreateTextModelReaderCreateTextModelReaderCreateTextModelReadercreate_text_model_reader 返回值 2 (H_MSG_TRUE)。

可能的后继

set_text_model_paramset_text_model_paramSetTextModelParamSetTextModelParamSetTextModelParamset_text_model_param, get_text_model_paramget_text_model_paramGetTextModelParamGetTextModelParamGetTextModelParamget_text_model_param, find_textfind_textFindTextFindTextFindTextfind_text

另见

clear_text_modelclear_text_modelClearTextModelClearTextModelClearTextModelclear_text_model

模块

光学字符识别/光学字符验证