Deploying an OCR Pipeline (DBNet + CRNN) on the RK3588 with RKNN
Hardware platform:
1. A Firefly RK3588 board running Ubuntu;
2. A Windows PC running an Ubuntu 18.04 virtual machine.

Reference manuals: 《00-Rockchip_RKNPU_User_Guide_RKNN_API_V1.3.0_CN》 and 《RKNN Toolkit Lite2 用户使用指南》 (the RKNN Toolkit Lite2 user guide).

1. Text detection

Project: GitHub - WenmuZhou/PytorchOCR: a PyTorch-based OCR toolbox supporting the common text detection and recognition algorithms.

DBNet (Differentiable Binarization Network) is a deep-learning model for text detection, proposed in 2019 by Minghui Liao et al., and it has produced strong results in the field. Its design goal is to raise detection efficiency while keeping high accuracy. Segmentation-based text detectors traditionally binarize the predicted probability map with a fixed threshold in a separate post-processing step; that hard binarization is not differentiable, so the threshold cannot be trained jointly with the network, which limits both detection quality and speed.

To solve this, DBNet moves binarization into the network itself. It is built from two key parts:

(1) A segmentation branch over fused multi-scale features. A backbone with a feature-pyramid neck extracts and fuses features at several scales, and the head predicts two maps for each image: a text probability map P and a learned per-pixel threshold map T.

(2) A differentiable binarization (DB) module. In place of a hard step function, DB combines the two maps through a steep sigmoid into an approximate binary map, so binarization is differentiable and the thresholds are learned adaptively during training. At inference time the threshold branch can even be dropped and boxes recovered from the probability map alone, which keeps the detector fast.

Training proceeds by forward and backward propagation: an image is passed through the convolutions, feature fusion, and the DB module to obtain the detection result; the loss between the predictions and the ground-truth labels (with supervision on the probability, threshold, and approximate binary maps) is back-propagated to update the network parameters.

DBNet performs very well on text detection. Learned adaptive binarization improves accuracy and robustness, and because the post-processing is so light, DBNet is substantially faster than earlier segmentation-based detectors at comparable accuracy, which makes it practical in applications such as document text detection, office automation, and general image recognition. Paper: https://arxiv.org/abs/1911.08947

Figure 1. DBNet network architecture
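For reference, the differentiable binarization that gives the model its name replaces the hard step function with a steep sigmoid over the probability map P and the learned threshold map T, as given in the paper (k is an amplification factor, set to 50 there):

```latex
\hat{B}_{i,j} = \frac{1}{1 + e^{-k\,(P_{i,j} - T_{i,j})}}
```

Because this function is differentiable, the detection loss can flow through the binarization step, so the per-pixel thresholds are learned rather than hand-tuned.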
2. Text recognition

Project: GitHub - WenmuZhou/PytorchOCR: a PyTorch-based OCR toolbox supporting the common text detection and recognition algorithms.

CRNN (Convolutional Recurrent Neural Network) is a deep-learning model that combines the strengths of convolutional neural networks (CNNs) and recurrent neural networks (RNNs) and is widely used for image-based text recognition (OCR). It was first proposed in 2015 by Baoguang Shi et al. and marked a significant breakthrough for OCR.

The idea behind CRNN is to use a CNN for image feature extraction and an RNN for sequence modeling, so that the model learns end to end, directly from the image to the character sequence.

A CRNN model usually consists of three parts:

(1) Convolutional layers. Several convolutional layers extract local features from the image, learning representations from low level (edges, textures) to high level (shapes, patterns).

(2) Recurrent layers. The CNN features are read as a horizontal sequence and fed to recurrent layers. RNNs capture contextual information along the sequence, which lets the model handle text of varying length.

(3) Transcription layer. A final projection maps each timestep of the RNN output onto the character set; it is trained with the CTC (Connectionist Temporal Classification) loss, which aligns the variable-length prediction with the label without per-character annotations.

Training again consists of forward and backward propagation: the image passes through the convolutional and recurrent layers to produce a predicted sequence, and the loss between the prediction and the ground-truth label is back-propagated to update the network parameters.

CRNN is used widely in OCR and can recognize text of different sizes, fonts, colors, and backgrounds. It performs well on long sequences, and its end-to-end design avoids the complex multi-stage pipelines of traditional OCR systems, so it has worked well in practice for license-plate recognition, text detection, handwriting recognition, and similar tasks.

In short, CRNN couples a CNN with an RNN for image-based text recognition; its end-to-end design and strong sequence modeling have made it one of the standard OCR models. Paper: https://arxiv.org/abs/1507.05717

Figure 2. CRNN architecture
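To make the data flow concrete, here is a minimal, self-contained CRNN-style skeleton (an illustration only: the model actually exported below is torchocr's MobileNetV3-based CRNN, and every layer size here is made up):

```python
import torch
import torch.nn as nn

class TinyCRNN(nn.Module):
    def __init__(self, num_classes):
        super().__init__()
        # (1) CNN: extract features and collapse the height dimension
        self.cnn = nn.Sequential(
            nn.Conv2d(3, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, 128, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d((1, None)),          # -> (N, 128, 1, W')
        )
        # (2) RNN: model the horizontal feature sequence in both directions
        self.rnn = nn.LSTM(128, 96, bidirectional=True, batch_first=True)
        # (3) Transcription head: per-timestep class logits (index 0 = CTC blank)
        self.fc = nn.Linear(192, num_classes)

    def forward(self, x):                              # x: (N, 3, 32, 224)
        feat = self.cnn(x).squeeze(2)                  # (N, 128, W')
        feat = feat.permute(0, 2, 1)                   # (N, W', 128): one step per image column
        seq, _ = self.rnn(feat)                        # (N, W', 192)
        return self.fc(seq)                            # (N, W', num_classes)

logits = TinyCRNN(num_classes=100)(torch.randn(1, 3, 32, 224))
print(logits.shape)  # torch.Size([1, 56, 100])
```

The input shape (1, 3, 32, 224) matches the dummy tensor used for the ONNX export below.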
Environment setup

For setting up rknn-toolkit and the rknpu SDK, see: 手把手rknn-toolkit以及rknpu_sdk环境搭建--以rk3588为例_warren伟_的博客-CSDN博客 (a step-by-step environment setup guide, using the RK3588 as the example).

Model export and verification

Text detection

Export the ONNX model:

```python
#!/usr/bin/env python3
'''
Author: warren
Date: 2023-06-07 14:52:27
LastEditors: warren
LastEditTime: 2023-06-12 15:20:28
FilePath: /warren/VanillaNet1/export_onnx.py
Description: export onnx model
Copyright (c) 2023 by ${git_name_email}, All Rights Reserved.
'''
import torch
from torchocr.networks import build_model

MODEL_PATH = './model/det_db_mbv3_new.pth'
DEVICE = 'cuda:0' if torch.cuda.is_available() else 'cpu'
print('-----------------------devices', DEVICE)

class DetInfer:
    def __init__(self, model_path):
        ckpt = torch.load(model_path, map_location=DEVICE)
        cfg = ckpt['cfg']
        self.model = build_model(cfg['model'])
        state_dict = {}
        for k, v in ckpt['state_dict'].items():
            state_dict[k.replace('module.', '')] = v
        self.model.load_state_dict(state_dict)
        self.device = torch.device(DEVICE)
        self.model.to(self.device)
        self.model.eval()
        checkpoint = torch.load(MODEL_PATH, map_location=DEVICE)  # kept from the original; unused

        # Prepare the dummy input tensor
        input = torch.randn(1, 3, 640, 640, requires_grad=False).float().to(torch.device(DEVICE))
        # Export the torch model as onnx
        print('-------------------export')
        torch.onnx.export(self.model,
                          input,
                          'detect_model_small.onnx',  # name of the exported onnx model
                          export_params=True,
                          opset_version=12,
                          do_constant_folding=False)

# Load the pretrained model and export it as onnx
model = DetInfer(MODEL_PATH)
```

Verification:

```python
import numpy as np
import cv2
import torch
from torchvision import transforms
import pyclipper
from shapely.geometry import Polygon
import onnxruntime


class DBPostProcess():
    def __init__(self, thresh=0.3, box_thresh=0.7, max_candidates=1000, unclip_ratio=2):
        self.min_size = 3
        self.thresh = thresh
        self.box_thresh = box_thresh
        self.max_candidates = max_candidates
        self.unclip_ratio = unclip_ratio

    def __call__(self, pred, h_w_list, is_output_polygon=False):
        '''
        h_w_list: array of [h, w] pairs for the input images
        pred: binary text-region segmentation map, with shape (N, 1, H, W)
        '''
        pred = pred[:, 0, :, :]
        segmentation = self.binarize(pred)
        boxes_batch = []
        scores_batch = []
        for batch_index in range(pred.shape[0]):
            height, width = h_w_list[batch_index]
            boxes, scores = self.post_p(pred[batch_index], segmentation[batch_index], width, height,
                                        is_output_polygon=is_output_polygon)
            boxes_batch.append(boxes)
            scores_batch.append(scores)
        return boxes_batch, scores_batch

    def binarize(self, pred):
        return pred > self.thresh

    def post_p(self, pred, bitmap, dest_width, dest_height, is_output_polygon=False):
        '''
        bitmap: single map with shape (H, W), whose values are binarized as {0, 1}
        '''
        height, width = pred.shape
        boxes = []
        new_scores = []
        if cv2.__version__.startswith('3'):
            _, contours, _ = cv2.findContours((bitmap * 255).astype(np.uint8), cv2.RETR_LIST, cv2.CHAIN_APPROX_SIMPLE)
        if cv2.__version__.startswith('4'):
            contours, _ = cv2.findContours((bitmap * 255).astype(np.uint8), cv2.RETR_LIST, cv2.CHAIN_APPROX_SIMPLE)
        for contour in contours[:self.max_candidates]:
            epsilon = 0.005 * cv2.arcLength(contour, True)
            approx = cv2.approxPolyDP(contour, epsilon, True)
            points = approx.reshape((-1, 2))
            if points.shape[0] < 4:
                continue
            score = self.box_score_fast(pred, contour.squeeze(1))
            if self.box_thresh > score:
                continue
            if points.shape[0] > 2:
                box = self.unclip(points, unclip_ratio=self.unclip_ratio)
                if len(box) > 1:
                    continue
            else:
                continue
            four_point_box, sside = self.get_mini_boxes(box.reshape((-1, 1, 2)))
            if sside < self.min_size + 2:
                continue
            if not isinstance(dest_width, int):
                dest_width = dest_width.item()
                dest_height = dest_height.item()
            if not is_output_polygon:
                box = np.array(four_point_box)
            else:
                box = box.reshape(-1, 2)
            # scale the box back to the destination image size
            box[:, 0] = np.clip(np.round(box[:, 0] / width * dest_width), 0, dest_width)
            box[:, 1] = np.clip(np.round(box[:, 1] / height * dest_height), 0, dest_height)
            boxes.append(box)
            new_scores.append(score)
        return boxes, new_scores

    def unclip(self, box, unclip_ratio=1.5):
        poly = Polygon(box)
        distance = poly.area * unclip_ratio / poly.length
        offset = pyclipper.PyclipperOffset()
        offset.AddPath(box, pyclipper.JT_ROUND, pyclipper.ET_CLOSEDPOLYGON)
        expanded = np.array(offset.Execute(distance))
        return expanded

    def get_mini_boxes(self, contour):
        bounding_box = cv2.minAreaRect(contour)
        points = sorted(list(cv2.boxPoints(bounding_box)), key=lambda x: x[0])
        index_1, index_2, index_3, index_4 = 0, 1, 2, 3
        if points[1][1] > points[0][1]:
            index_1 = 0
            index_4 = 1
        else:
            index_1 = 1
            index_4 = 0
        if points[3][1] > points[2][1]:
            index_2 = 2
            index_3 = 3
        else:
            index_2 = 3
            index_3 = 2
        box = [points[index_1], points[index_2], points[index_3], points[index_4]]
        return box, min(bounding_box[1])

    def box_score_fast(self, bitmap, _box):
        h, w = bitmap.shape[:2]
        box = _box.copy()
        # np.int was removed in recent NumPy; plain int behaves the same here
        xmin = np.clip(np.floor(box[:, 0].min()).astype(int), 0, w - 1)
        xmax = np.clip(np.ceil(box[:, 0].max()).astype(int), 0, w - 1)
        ymin = np.clip(np.floor(box[:, 1].min()).astype(int), 0, h - 1)
        ymax = np.clip(np.ceil(box[:, 1].max()).astype(int), 0, h - 1)
        mask = np.zeros((ymax - ymin + 1, xmax - xmin + 1), dtype=np.uint8)
        box[:, 0] = box[:, 0] - xmin
        box[:, 1] = box[:, 1] - ymin
        cv2.fillPoly(mask, box.reshape(1, -1, 2).astype(np.int32), 1)
        return cv2.mean(bitmap[ymin:ymax + 1, xmin:xmax + 1], mask)[0]


def narrow_224_32(image, expected_size=(224, 32)):
    # resize while keeping the aspect ratio, then pad to expected_size
    ih, iw = image.shape[0:2]
    ew, eh = expected_size
    scale = min((eh / ih), (ew / iw))
    nh = int(ih * scale)
    nw = int(iw * scale)
    image = cv2.resize(image, (nw, nh), interpolation=cv2.INTER_CUBIC)
    top = 0
    bottom = eh - nh
    left = 0
    right = ew - nw
    new_img = cv2.copyMakeBorder(image, top, bottom, left, right, cv2.BORDER_CONSTANT, value=(114, 114, 114))
    return image, new_img


def draw_bbox(img_path, result, color=(0, 0, 255), thickness=2):
    if isinstance(img_path, str):
        img_path = cv2.imread(img_path)
    img_path = img_path.copy()
    for point in result:
        point = point.astype(int)
        cv2.polylines(img_path, [point], True, color, thickness)
    return img_path


if __name__ == '__main__':
    onnx_model = onnxruntime.InferenceSession('detect_model_small.onnx')
    input_name = onnx_model.get_inputs()[0].name
    # Set inputs
    img = cv2.imread('./pic/6.jpg')
    img0, image = narrow_224_32(img, expected_size=(640, 640))
    transform_totensor = transforms.ToTensor()
    tensor = transform_totensor(image)
    tensor_nor = transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225])
    tensor = tensor_nor(tensor)
    tensor = np.array(tensor, dtype=np.float32).reshape(1, 3, 640, 640)
    post_proess = DBPostProcess()
    is_output_polygon = False
    # run
    outputs = onnx_model.run(None, {input_name: tensor})
    # post process
    feat_2 = torch.from_numpy(outputs[0])
    print(feat_2.size())
    box_list, score_list = post_proess(outputs[0], [image.shape[:2]], is_output_polygon=is_output_polygon)
    box_list, score_list = box_list[0], score_list[0]
    if len(box_list) > 0:
        idx = [x.sum() > 0 for x in box_list]
        box_list = [box_list[i] for i, v in enumerate(idx) if v]
        score_list = [score_list[i] for i, v in enumerate(idx) if v]
    else:
        box_list, score_list = [], []
    print('-----------------box list', box_list)
    img = draw_bbox(image, box_list)
    img = img[0:img0.shape[0], 0:img0.shape[1]]  # crop back to the unpadded size
    print('save pic')
    img1 = np.ascontiguousarray(img, dtype=np.uint8)  # the original reshape(640, 640, 3) assumed an uncropped image
    cv2.imwrite('img.jpg', img1)
    cv2.waitKey()
```
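One detail in DBPostProcess worth calling out: unclip() dilates each shrunk text kernel before the final box is fitted. The offset distance follows the DB paper, computed from the polygon area A, its perimeter L, and the unclip ratio r:

```latex
D = \frac{A \times r}{L}
```

This is exactly the `poly.area * unclip_ratio / poly.length` line in the code above.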
Text recognition

Export the ONNX model:

```python
#!/usr/bin/env python3
import os
import sys
import pathlib

# Add the torchocr path to the Python path
__dir__ = pathlib.Path(os.path.abspath(__file__))
import numpy as np
sys.path.append(str(__dir__))
sys.path.append(str(__dir__.parent.parent))
import torch
from torchocr.networks import build_model

MODEL_PATH = './model/ch_rec_moblie_crnn_mbv3.pth'
DEVICE = 'cuda:0' if torch.cuda.is_available() else 'cpu'
print('-----------------------devices', DEVICE)

class RecInfer:
    def __init__(self, model_path, batch_size=1):
        ckpt = torch.load(model_path, map_location=DEVICE)
        cfg = ckpt['cfg']
        self.model = build_model(cfg['model'])
        state_dict = {}
        for k, v in ckpt['state_dict'].items():
            state_dict[k.replace('module.', '')] = v
        self.model.load_state_dict(state_dict)
        self.batch_size = batch_size
        self.device = torch.device(DEVICE)
        self.model.to(self.device)
        self.model.eval()

        # Prepare the dummy input tensor
        input = torch.randn(1, 3, 32, 224, requires_grad=False).float().to(torch.device(DEVICE))
        # Export the torch model as onnx
        print('-------------------export')
        torch.onnx.export(self.model,
                          input,
                          'rego_model_small.onnx',
                          export_params=True,
                          opset_version=12,
                          do_constant_folding=False)

# Load the pretrained model and export it as onnx
model = RecInfer(MODEL_PATH)
```

Verification:

```python
import onnxruntime
import numpy as np
import cv2
import torch

DEVICE = 'cuda:0' if torch.cuda.is_available() else 'cpu'
IMG_WIDTH = 448
ONNX_MODEL = './onnx_model/repvgg_s.onnx'
LABEL_FILE = '/root/autodl-tmp/warren/PytorchOCR_OLD/torchocr/datasets/alphabets/dict_text.txt'
# ONNX_MODEL = './onnx_model/rego_model_small.onnx'
# LABEL_FILE = '/root/autodl-tmp/warren/PytorchOCR_OLD/torchocr/datasets/alphabets/ppocr_keys_v1.txt'
PIC = './pic/img.jpg'


class CTCLabelConverter(object):
    '''Convert between text-label and text-index'''

    def __init__(self, character):
        # character (str): path of a file listing the possible characters
        dict_character = []
        with open(character, 'rb') as fin:
            lines = fin.readlines()
            for line in lines:
                line = line.decode('utf-8').strip('\n').strip('\r\n')
                dict_character += list(line)
        self.dict = {}
        for i, char in enumerate(dict_character):
            # NOTE: 0 is reserved for the blank token required by CTCLoss
            self.dict[char] = i + 1
        # TODO: replace ' ' with a special symbol
        self.character = ['[blank]'] + dict_character + [' ']  # dummy '[blank]' token for CTCLoss (index 0)

    def decode(self, preds, raw=False):
        '''Convert text-index into text-label.'''
        preds_idx = preds.argmax(axis=2)
        preds_prob = preds.max(axis=2)
        result_list = []
        for word, prob in zip(preds_idx, preds_prob):
            if raw:
                result_list.append((''.join([self.character[int(i)] for i in word]), prob))
            else:
                result = []
                conf = []
                for i, index in enumerate(word):
                    if word[i] != 0 and (not (i > 0 and word[i - 1] == word[i])):
                        result.append(self.character[int(index)])
                        conf.append(prob[i])
                result_list.append((''.join(result), conf))
        return result_list


def decode(preds, raw=False):
    '''Convert text-index into text-label (standalone version of the logic above).'''
    dict_character = []
    dict = {}
    character = LABEL_FILE
    with open(character, 'rb') as fin:
        lines = fin.readlines()
        for line in lines:
            line = line.decode('utf-8').strip('\n').strip('\r\n')
            dict_character += list(line)
    for i, char in enumerate(dict_character):
        # NOTE: 0 is reserved for the blank token required by CTCLoss
        dict[char] = i + 1
    # TODO: replace ' ' with a special symbol
    character = ['[blank]'] + dict_character + [' ']  # dummy '[blank]' token for CTCLoss (index 0)
    preds_idx = preds.argmax(axis=2)
    preds_prob = preds.max(axis=2)
    result_list = []
    for word, prob in zip(preds_idx, preds_prob):
        if raw:
            result_list.append((''.join([character[int(i)] for i in word]), prob))
        else:
            result = []
            conf = []
            for i, index in enumerate(word):
                if word[i] != 0 and (not (i > 0 and word[i - 1] == word[i])):
                    result.append(character[int(index)])
                    conf.append(prob[i])
            result_list.append((''.join(result), conf))
    return result_list


def width_pad_img(_img, _target_width, _pad_value=0):
    _height, _width, _channels = _img.shape
    to_return_img = np.ones([_height, _target_width, _channels], dtype=_img.dtype) * _pad_value
    to_return_img[:_height, :_width, :] = _img
    return to_return_img


def resize_with_specific_height(_img):
    resize_ratio = 32 / _img.shape[0]
    return cv2.resize(_img, (0, 0), fx=resize_ratio, fy=resize_ratio, interpolation=cv2.INTER_LINEAR)


def normalize_img(_img):
    return (_img.astype(np.float32) / 255 - 0.5) / 0.5


if __name__ == '__main__':
    onnx_model = onnxruntime.InferenceSession(ONNX_MODEL)
    input_name = onnx_model.get_inputs()[0].name
    # Set inputs
    imgs = cv2.imread(PIC)
    if not isinstance(imgs, list):
        imgs = [imgs]
    imgs = [normalize_img(resize_with_specific_height(img)) for img in imgs]
    widths = np.array([img.shape[1] for img in imgs])
    idxs = np.argsort(widths)
    txts = []
    label_convert = CTCLabelConverter(LABEL_FILE)
    for idx in range(len(imgs)):
        batch_idxs = idxs[idx:min(len(imgs), idx + 1)]
        batch_imgs = [width_pad_img(imgs[idx], IMG_WIDTH) for idx in batch_idxs]
        batch_imgs = np.stack(batch_imgs)
        print(batch_imgs.shape)
        tensor = batch_imgs.transpose([0, 3, 1, 2]).astype(np.float32)
        out = onnx_model.run(None, {input_name: tensor})
        tensor_out = torch.tensor(out)
        tensor_out = torch.squeeze(tensor_out, dim=1)
        softmax_output = tensor_out.softmax(dim=2)
        print('---------------out shape is', softmax_output.shape)
        txts.extend([label_convert.decode(np.expand_dims(txt, 0)) for txt in softmax_output])
    idxs = np.argsort(idxs)
    out_txts = [txts[idx] for idx in idxs]
    import sys
    import codecs
    sys.stdout = codecs.getwriter('utf-8')(sys.stdout.detach())
    print(out_txts)
```

At this point, both models export to ONNX and verify successfully.
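As a quick illustration of the rule implemented in decode() above (merge consecutive repeats first, then drop blanks at index 0), here is a toy greedy path over a hypothetical five-symbol alphabet:

```python
# Greedy CTC decoding on a made-up index path.
alphabet = ['[blank]', 'h', 'e', 'l', 'o']
path = [1, 1, 0, 2, 3, 0, 3, 4]   # argmax index per timestep

chars = [alphabet[c] for i, c in enumerate(path)
         if c != 0 and not (i > 0 and path[i - 1] == c)]
print(''.join(chars))  # -> hello
```

Note that the blank between the two 3s is what lets the double "l" survive the repeat-merging step.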
Deployment on the RK3588 board

Convert the ONNX model to an RKNN model (run once per exported model, filling in the paths):

```python
from rknn.api import RKNN

ONNX_MODEL = 'xxx.onnx'
RKNN_MODEL = 'xxxx.rknn'
DATASET = './dataset.txt'

if __name__ == '__main__':
    # Create RKNN object
    rknn = RKNN(verbose=True)

    # pre-process config
    print('--> Config model')
    # NOTE: std_values=[[0, 0, 0]] is kept from the original post but looks like a typo;
    # RKNN divides by std, so [[1, 1, 1]] would be the identity normalization.
    ret = rknn.config(mean_values=[[0, 0, 0]], std_values=[[0, 0, 0]], target_platform='rk3588')
    if ret != 0:
        print('Config model failed!')
        exit(ret)
    print('done')

    # Load ONNX model
    print('--> Loading model')
    ret = rknn.load_onnx(model=ONNX_MODEL, outputs=['output', '345', '346'])  # output tensor names from this ONNX graph
    if ret != 0:
        print('Load model failed!')
        exit(ret)
    print('done')

    # Build model
    print('--> Building model')
    ret = rknn.build(do_quantization=True, dataset=DATASET)
    # ret = rknn.build(do_quantization=False)
    if ret != 0:
        print('Build model failed!')
        exit(ret)
    print('done')

    # Export RKNN model
    print('--> Export rknn model')
    ret = rknn.export_rknn(RKNN_MODEL)
    if ret != 0:
        print('Export rknn model failed!')
        exit(ret)
    print('done')

    # release rknn
    rknn.release()
```
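The dataset.txt passed to rknn.build() is a plain-text quantization-calibration list with one image path per line. Before wiring everything into a GUI, it can help to smoke-test the converted model directly on the board; a minimal sketch with RKNN Toolkit Lite2 (the model path and input size below are assumptions matching the detector used in this post):

```python
import cv2
import numpy as np
from rknnlite.api import RKNNLite

rknn_lite = RKNNLite()
assert rknn_lite.load_rknn('./model/model_small.rknn') == 0   # converted detection model
# Pin the runtime to a single NPU core, as the PyQt app below also does
assert rknn_lite.init_runtime(core_mask=RKNNLite.NPU_CORE_0) == 0

img = cv2.imread('./pic/6.jpg')
img = cv2.resize(img, (640, 640))   # match the detector's export shape
outputs = rknn_lite.inference(inputs=[img])
print([np.asarray(o).shape for o in outputs])
rknn_lite.release()
```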
PyQt software design

The UI developed with PyQt is shown below.

Figure 6. PyQt-based UI

The window contains three function buttons: one to choose a static image, one to use the camera, and one to run detection. A QTextEdit displays the recognition results, and a QLabel shows the processed image.

The software flow chart is as shown, with the overall directory layout for reference. The code for image detection and recognition is walked through below.

```python
import sys
import os
import glob
import time
import threading

import cv2
import numpy as np
import torch
import pyclipper
from shapely.geometry import Polygon
from PyQt5.QtGui import *
from PyQt5.QtWidgets import *
from PyQt5.QtCore import *
from rknnlite.api import RKNNLite

# OpenCV ships its own Qt plugins, which conflict with PyQt5's
os.environ.pop('QT_QPA_PLATFORM_PLUGIN_PATH', None)

DETECT_MODEL = './model/model_small.rknn'
REGO_MODEL = './model/repvgg_s.rknn'
LABEL_FILE = './dict/dict_text.txt'
LABEL_SIZE_PRIVIOUS = 0
LABEL_SIZE_LATTER = 0

# Folder that holds the cropped text regions
folder_path = './crop_pic'
# Use glob to collect every image file in it
image_files = glob.glob(os.path.join(folder_path, '*.png')) + glob.glob(os.path.join(folder_path, '*.jpg'))


def resize_img_self(image, reszie_size=(0, 0)):
    # resize to the target height, then pad the width
    ih, iw = image.shape[0:2]
    ew, eh = reszie_size
    scale = eh / ih
    width = int(iw * scale)
    height = int(ih * scale)
    if height != eh:
        height = eh
    image = cv2.resize(image, (width, height), interpolation=cv2.INTER_LINEAR)
    top = 0
    bottom = 0
    left = 0
    right = ew - width
    new_img = cv2.copyMakeBorder(image, top, bottom, left, right, cv2.BORDER_CONSTANT, value=(114, 114, 114))
    return new_img


def narrow_224_32(image, expected_size=(224, 32)):
    ih, iw = image.shape[0:2]
    ew, eh = expected_size
    scale = min((eh / ih), (ew / iw))
    nh = int(ih * scale)
    nw = int(iw * scale)
    image = cv2.resize(image, (nw, nh), interpolation=cv2.INTER_CUBIC)
    top = 0
    bottom = eh - nh
    left = 0
    right = ew - nw
    new_img = cv2.copyMakeBorder(image, top, bottom, left, right, cv2.BORDER_CONSTANT, value=(114, 114, 114))
    return image, new_img


def draw_bbox(img_path, result, color=(0, 0, 255), thickness=2):
    if isinstance(img_path, str):
        img_path = cv2.imread(img_path)
    img_path = img_path.copy()
    for point in result:
        point = point.astype(int)
        cv2.polylines(img_path, [point], True, color, thickness)
    return img_path


def delay_milliseconds(milliseconds):
    seconds = milliseconds / 1000.0
    time.sleep(seconds)


class CTCLabelConverter(object):
    '''Convert between text-label and text-index'''

    def __init__(self, character):
        # character (str): path of a file listing the possible characters
        dict_character = []
        with open(character, 'rb') as fin:
            lines = fin.readlines()
            for line in lines:
                line = line.decode('utf-8').strip('\n').strip('\r\n')
                dict_character += list(line)
        self.dict = {}
        for i, char in enumerate(dict_character):
            # NOTE: 0 is reserved for the blank token required by CTCLoss
            self.dict[char] = i + 1
        self.character = ['[blank]'] + dict_character + [' ']  # dummy '[blank]' token for CTCLoss (index 0)

    def decode(self, preds, raw=False):
        '''Convert text-index into text-label.'''
        preds_idx = preds.argmax(axis=2)
        preds_prob = preds.max(axis=2)
        result_list = []
        for word, prob in zip(preds_idx, preds_prob):
            if raw:
                result_list.append((''.join([self.character[int(i)] for i in word]), prob))
            else:
                result = []
                for i, index in enumerate(word):
                    if word[i] != 0 and (not (i > 0 and word[i - 1] == word[i])):
                        result.append(self.character[int(index)])
                        # conf.append(prob[i])
                # result_list.append((''.join(result), conf))
                result_list.append(''.join(result))
        return result_list

class DBPostProcess():
    def __init__(self, thresh=0.3, box_thresh=0.7, max_candidates=1000, unclip_ratio=2):
        self.min_size = 3
        self.thresh = thresh
        self.box_thresh = box_thresh
        self.max_candidates = max_candidates
        self.unclip_ratio = unclip_ratio

    def __call__(self, pred, h_w_list, is_output_polygon=False):
        pred = pred[:, 0, :, :]
        segmentation = self.binarize(pred)
        boxes_batch = []
        scores_batch = []
        for batch_index in range(pred.shape[0]):
            height, width = h_w_list[batch_index]
            boxes, scores = self.post_p(pred[batch_index], segmentation[batch_index], width, height,
                                        is_output_polygon=is_output_polygon)
            boxes_batch.append(boxes)
            scores_batch.append(scores)
        return boxes_batch, scores_batch

    def binarize(self, pred):
        return pred > self.thresh

    def post_p(self, pred, bitmap, dest_width, dest_height, is_output_polygon=False):
        '''
        bitmap: single map with shape (H, W), whose values are binarized as {0, 1}
        '''
        height, width = pred.shape
        boxes = []
        new_scores = []
        if cv2.__version__.startswith('3'):
            _, contours, _ = cv2.findContours((bitmap * 255).astype(np.uint8), cv2.RETR_LIST, cv2.CHAIN_APPROX_SIMPLE)
        if cv2.__version__.startswith('4'):
            contours, _ = cv2.findContours((bitmap * 255).astype(np.uint8), cv2.RETR_LIST, cv2.CHAIN_APPROX_SIMPLE)
        for contour in contours[:self.max_candidates]:
            epsilon = 0.005 * cv2.arcLength(contour, True)
            approx = cv2.approxPolyDP(contour, epsilon, True)
            points = approx.reshape((-1, 2))
            if points.shape[0] < 4:
                continue
            score = self.box_score_fast(pred, contour.squeeze(1))
            if self.box_thresh > score:
                continue
            if points.shape[0] > 2:
                box = self.unclip(points, unclip_ratio=self.unclip_ratio)
                if len(box) > 1:
                    continue
            else:
                continue
            four_point_box, sside = self.get_mini_boxes(box.reshape((-1, 1, 2)))
            if sside < self.min_size + 2:
                continue
            if not isinstance(dest_width, int):
                dest_width = dest_width.item()
                dest_height = dest_height.item()
            if not is_output_polygon:
                box = np.array(four_point_box)
            else:
                box = box.reshape(-1, 2)
            box[:, 0] = np.clip(np.round(box[:, 0] / width * dest_width), 0, dest_width)
            box[:, 1] = np.clip(np.round(box[:, 1] / height * dest_height), 0, dest_height)
            boxes.append(box)
            new_scores.append(score)
        return boxes, new_scores

    def unclip(self, box, unclip_ratio=1.5):
        poly = Polygon(box)
        distance = poly.area * unclip_ratio / poly.length
        offset = pyclipper.PyclipperOffset()
        offset.AddPath(box, pyclipper.JT_ROUND, pyclipper.ET_CLOSEDPOLYGON)
        expanded = np.array(offset.Execute(distance))
        return expanded

    def get_mini_boxes(self, contour):
        bounding_box = cv2.minAreaRect(contour)
        points = sorted(list(cv2.boxPoints(bounding_box)), key=lambda x: x[0])
        index_1, index_2, index_3, index_4 = 0, 1, 2, 3
        if points[1][1] > points[0][1]:
            index_1 = 0
            index_4 = 1
        else:
            index_1 = 1
            index_4 = 0
        if points[3][1] > points[2][1]:
            index_2 = 2
            index_3 = 3
        else:
            index_2 = 3
            index_3 = 2
        box = [points[index_1], points[index_2], points[index_3], points[index_4]]
        return box, min(bounding_box[1])

    def box_score_fast(self, bitmap, _box):
        h, w = bitmap.shape[:2]
        box = _box.copy()
        xmin = np.clip(np.floor(box[:, 0].min()).astype(int), 0, w - 1)
        xmax = np.clip(np.ceil(box[:, 0].max()).astype(int), 0, w - 1)
        ymin = np.clip(np.floor(box[:, 1].min()).astype(int), 0, h - 1)
        ymax = np.clip(np.ceil(box[:, 1].max()).astype(int), 0, h - 1)
        mask = np.zeros((ymax - ymin + 1, xmax - xmin + 1), dtype=np.uint8)
        box[:, 0] = box[:, 0] - xmin
        box[:, 1] = box[:, 1] - ymin
        cv2.fillPoly(mask, box.reshape(1, -1, 2).astype(np.int32), 1)
        return cv2.mean(bitmap[ymin:ymax + 1, xmin:xmax + 1], mask)[0]

class Process_Class(QWidget):
    detect_end = pyqtSignal(str)
    clear_text = pyqtSignal()

    def __init__(self):
        super().__init__()
        self.image = None
        self.img = None
        self.camera_status = False
        self.result_string = None
        self.cap = cv2.VideoCapture()
        # detection model
        rknn_model_detect = DETECT_MODEL
        self.rknn_lite_detect = RKNNLite()
        self.rknn_lite_detect.load_rknn(rknn_model_detect)                  # load RKNN model
        self.rknn_lite_detect.init_runtime(core_mask=RKNNLite.NPU_CORE_2)   # init runtime environment
        # recognition model
        rknn_model_rego = REGO_MODEL
        self.rknn_lite_rego = RKNNLite()
        self.rknn_lite_rego.load_rknn(rknn_model_rego)                      # load RKNN model
        self.rknn_lite_rego.init_runtime(core_mask=RKNNLite.NPU_CORE_0_1)   # init runtime environment
        self.detect_end.connect(self.update_text_box)
        self.clear_text.connect(self.clear_text_box)

    def cv2_to_qpixmap(self, cv_image):
        height, width, channel = cv_image.shape
        bytes_per_line = 3 * width
        q_image = QImage(cv_image.data, width, height, bytes_per_line, QImage.Format_RGB888).rgbSwapped()
        return QPixmap.fromImage(q_image)

    def show_pic(self, cv_image):
        pixmap = self.cv2_to_qpixmap(cv_image)
        if MainWindow.pic_label is not None:
            MainWindow.pic_label.setPixmap(pixmap)
            QApplication.processEvents()
        else:
            print('wrong!!!!!!!')

    def camera_open(self):
        self.camera_status = not self.camera_status
        print('------------camera status is', self.camera_status)
        if self.camera_status:
            self.cap.open(12)  # camera node on this board
            if self.cap.isOpened():
                print('run camera')
                while True:
                    frame = self.cap.read()
                    if not frame[0]:
                        print('read frame failed!!!!')
                        exit()
                    self.image = frame[1]
                    self.detect_pic()
                    if not self.camera_status:
                        break
            else:
                print('Cannot open camera')
                exit()
        else:
            self.release_camera()

    def release_camera(self):
        if self.cap.isOpened():
            self.cap.release()
        self.camera_status = False
        print('camera released')

    def open_file(self):
        # Get the image path
        img_path, _ = QFileDialog.getOpenFileName()
        if img_path != '':
            self.image = cv2.imread(img_path)
            self.show_pic(self.image)

    def crop_and_save_image(self, image, box_points):
        global LABEL_SIZE_PRIVIOUS
        global LABEL_SIZE_LATTER
        i = -1
        # Convert box_points to an integer NumPy array
        box_points = np.array(box_points, dtype=np.int32)
        mask = np.zeros_like(image)  # all-black image of the same size as the input
        print('LABEL_SIZE_PRIVIOUS =', LABEL_SIZE_PRIVIOUS, 'LABEL_SIZE_LATTER =', LABEL_SIZE_LATTER)
        # The comparison operator here was lost in the original transcription; == is a guess
        if LABEL_SIZE_PRIVIOUS == LABEL_SIZE_LATTER:
            LABEL_SIZE_PRIVIOUS = len(box_points)
            for box_point in box_points:
                i = i + 1
                cropped_image = image.copy()
                # Crop the bounding rectangle of the polygon with OpenCV
                x, y, w, h = cv2.boundingRect(box_point)
                cropped_image = image[y:y + h, x:x + w]
                # Build an all-black mask of the same size as the crop
                mask = np.zeros_like(cropped_image)
                # Draw the polygon on the mask
                cv2.fillPoly(mask, [box_point - (x, y)], (255, 255, 255))
                # Use bitwise_and to keep only the polygon region
                masked_cropped_image = cv2.bitwise_and(cropped_image, mask)
                # Save the cropped image
                output_path = f'./crop_pic/img_{i}.jpg'
                cv2.imwrite(output_path, masked_cropped_image)
        else:
            # self.clear_text.emit()
            LABEL_SIZE_LATTER = LABEL_SIZE_PRIVIOUS
            current_directory = os.getcwd() + '/crop_pic'  # Get the crop directory
            for filename in os.listdir(current_directory):
                if filename.endswith('.jpg'):
                    file_path = os.path.join(current_directory, filename)
                    os.remove(file_path)
                    print(f'Deleted: {file_path}')

    def detect_thread(self):
        # detection inference
        img0, image = narrow_224_32(self.image, expected_size=(640, 640))
        outputs = self.rknn_lite_detect.inference(inputs=[image])
        post_proess = DBPostProcess()
        is_output_polygon = False
        box_list, score_list = post_proess(outputs[0], [image.shape[:2]], is_output_polygon=is_output_polygon)
        box_list, score_list = box_list[0], score_list[0]
        if len(box_list) > 0:
            idx = [x.sum() > 0 for x in box_list]
            box_list = [box_list[i] for i, v in enumerate(idx) if v]
            score_list = [score_list[i] for i, v in enumerate(idx) if v]
        else:
            box_list, score_list = [], []
        self.image = draw_bbox(image, box_list)
        self.crop_and_save_image(image, box_list)
        self.image = self.image[0:img0.shape[0], 0:img0.shape[1]]
        self.show_pic(self.image)

    def rego_thread(self):
        label_convert = CTCLabelConverter(LABEL_FILE)
        self.clear_text.emit()
        for image_file in image_files:
            if os.path.exists(image_file):
                print('-----------image file', image_file, len(image_files))
                self.img = cv2.imread(image_file)
                image = resize_img_self(self.img, reszie_size=(448, 32))
                # Inference
                outputs = self.rknn_lite_rego.inference(inputs=[image])
                # post process
                feat_2 = torch.tensor(outputs[0], dtype=torch.float32)
                txt = label_convert.decode(feat_2.detach().numpy())
                self.result_string = ''.join(txt)
                print(self.result_string)
                self.detect_end.emit(self.result_string)
            else:
                print('-----------no crop image!!!')

    def detect_pic(self):
        self.detect_thread()
        my_thread = threading.Thread(target=self.rego_thread)
        # Start the recognition thread
        my_thread.start()
        # Wait for it to finish
        my_thread.join()

    def update_text_box(self, text):
        # Update the text box from the main thread
        MainWindow.text_box.append(text)

    def clear_text_box(self):
        print('clear--------------------------------')
        # Clear the text box from the main thread
        MainWindow.text_box.clear()


class MainWindow(QMainWindow):
    pic_label = None
    text_box = None

    def __init__(self):
        super().__init__()
        self.process_functions = Process_Class()
        self.window = QWidget()
        # Create the widgets
        self.pic_label = QLabel('Show Window!', parent=self.window)
        self.pic_label.setMinimumHeight(500)  # minimum height
        self.pic_label.setMaximumHeight(500)  # maximum height
        self.pic_button = QPushButton('Picture', parent=self.window)
        self.pic_button.clicked.connect(self.process_functions.open_file)
        self.camera_button = QPushButton('Camera', parent=self.window)
        self.camera_button.clicked.connect(self.process_functions.camera_open)
        self.detect_button = QPushButton('Detect', parent=self.window)
        self.detect_button.clicked.connect(self.process_functions.detect_pic)
        self.text_box = QTextEdit()
        # Create the layout managers and add the widgets
        self.left_layout = QVBoxLayout()
        self.right_layout = QVBoxLayout()
        self.layout = QHBoxLayout()
        self.create_ui()
        self.window.closeEvent = self.closeEvent

    def create_ui(self):
        self.window.setWindowTitle('Scene_text_rego')
        self.window.setGeometry(0, 0, 800, 600)  # window position and size
        # Lay out the main window
        self.pic_label.setStyleSheet('border: 2px solid black; padding: 10px;')
        self.left_layout.addWidget(self.pic_label)
        self.left_layout.addWidget(self.text_box)
        self.right_layout.addWidget(self.pic_button)
        self.right_layout.addWidget(self.camera_button)
        self.right_layout.addWidget(self.detect_button)
        self.layout.addLayout(self.left_layout)
        self.layout.addLayout(self.right_layout)
        self.window.setLayout(self.layout)
        self.window.show()

    def closeEvent(self, event):
        # Release the camera before closing
        self.process_functions.release_camera()
        event.accept()


def main():
    # Create the application object
    app = QApplication(sys.argv)
    win = MainWindow()
    MainWindow.pic_label = win.pic_label  # point the class attribute at this window's pic_label
    MainWindow.text_box = win.text_box    # point the class attribute at this window's text_box
    # Run the application
    sys.exit(app.exec_())


if __name__ == '__main__':
    main()
```

Run result

References

Blog post: 【工程部署】手把手教你在RKNN上部署OCR服务上_rknn ocr_三叔家的猫的博客-CSDN博客